Course: Introduction to applied bioinformatics

General

- Announcements Forum
- Introduction to applied bioinformatics
  
  elective course
  
  Assoc. Prof. Ing. Petra Matoušková, Ph.D.
  contact: matousp7@faf.cuni.cz
- Course aims
  
  The aim of this course is to learn the basics of working with bioinformatics data, mainly gene and protein sequences: their formats, retrieval from databases, comparison, similarity searches, etc.
  
  Lectures take place in the computer study room (S2250).
- Organisation
  
  Credit:
  
  attendance (8/10) + homework: one "HW" from each lecture.
  
  Exam:
  
  "written" on a computer, 2 sets of 5 tasks (1 hour, submitted via Moodle)
- Example of all HWs (in Czech) File
- Literature
  
  Applied bioinformatics - An introduction, P. Selzer, 2018
  
  Bioinformatics for dummies, J.M. Claverie, C. Notredame, 2007
  
  Knowledge Discovery in Bioinformatics, X. Hu, Y. Pan, 2007
- Schedule 2025 Page
- Topics
  
  1. Literature search
  
  2. Protein bioinformatics I - sequences, features, digestions
  
  3. Protein bioinformatics II - domains, transmembrane helices, BLAST
  
  4. Protein bioinformatics III + sum up I - sequence comparison, multiple alignment, 3D
  
  5. Nucleotide bioinformatics I - sequences, features
  
  6. Nucleotide bioinformatics II - translation, identification, sequencing
  
  7. Nucleotide bioinformatics III - RE digestion, primers
  
  8. Nucleotide bioinformatics IV - cloning, specific primers
  
  9. Nucleotide bioinformatics V - qPCR primers, DNA/RNA secondary structure, mutagenesis primers
  
  10. Summary, examples
  
  11. Exam 2021

Topic 1_Literature search
- HUGO
  Human Gene nomenclature
- PubMed
  
  Search the MEDLINE database. Full texts are available through the FAF login.
- Web of Science
  
  The Web of Science database (Clarivate Analytics) includes bibliographic records from leading scientific journals in all fields. References can be managed through "WebEndNote".
- Scopus
  
  Abstract and citation database. Reports a scientist's h-index.
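
The h-index mentioned above can be computed by hand from a list of citation counts. The following is a minimal sketch of the usual definition (the largest h such that h papers have at least h citations each); the function name and the sample counts are made up for illustration and do not come from Scopus.

```python
def h_index(citations):
    """Return the largest h such that h papers have at least h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank  # the paper at this rank still has enough citations
        else:
            break
    return h


# Hypothetical citation counts for five papers
print(h_index([10, 8, 5, 4, 3]))
```

Databases may report different h-index values for the same person because they index different sets of publications.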
- Lesson 1 File
- HW1: Literature search Assignment

Topic 2_Protein bioinformatics I
- Protein databases / sequence retrieval
  
  Expasy / UniProt
  
  Expasy is the Swiss bioinformatics resource portal, providing access to databases and software tools from a range of life sciences, including genomics, proteomics, systems biology, etc.
  
  UniProt is a high-quality, freely accessible resource of protein sequence and functional information.
  Detailed tutorial:
  
  NCBI Protein
  
  The National Center for Biotechnology Information (NCBI) Protein database is a collection of sequences from several sources, including translations from annotated coding regions.
- Example of FASTA format Page
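
As a small illustration of the FASTA format, the sketch below reads FASTA text into a dictionary. It assumes the common layout of a header line starting with ">" followed by one or more sequence lines; the record shown is a made-up example, not a real database entry.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a dict {header: sequence}."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip empty lines
        if line.startswith(">"):
            header = line[1:]  # header without the leading ">"
            records[header] = []
        elif header is not None:
            records[header].append(line)  # sequence may span several lines
    return {name: "".join(parts) for name, parts in records.items()}


# Hypothetical record used only for demonstration
example = """>demo_protein hypothetical example record
MKTAYIAKQR
QISFVKSHFS
"""
print(parse_fasta(example))
```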
- Protein sequence analyses
- SMS: The Sequence Manipulation Suite - a collection of small JavaScript programs for various sequence manipulations (molecular weight, isoelectric point, statistics, range extractor, etc.)
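
To show what such a tool calculates, here is a rough sketch of two of the statistics listed above: amino acid composition and molecular weight. It assumes commonly tabulated average residue masses and adds one water molecule for the free termini; the exact values and rounding used by SMS may differ, and the function names are illustrative only.

```python
from collections import Counter

# Average residue masses in daltons (amino acid minus one water)
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.1086, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the N- and C-terminal groups


def molecular_weight(protein):
    """Approximate average molecular weight of an unmodified protein chain."""
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER


def composition(protein):
    """Count how often each amino acid occurs."""
    return Counter(protein.upper())


# Hypothetical short peptide
peptide = "MKTAYIAK"
print(round(molecular_weight(peptide), 2))
print(composition(peptide).most_common(3))
```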
- Lesson 2 File
- HW2: Proteins I Assignment

Topic 3_Protein bioinformatics II
- Simulation of protease cleavage
  
  PeptideCutter: predicts potential sites cleaved by proteases or chemicals in a given protein sequence.
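
The idea of a simulated digestion can be sketched for one protease. The example below assumes the simplified textbook rule for trypsin (cleavage after K or R unless the next residue is P); PeptideCutter applies more detailed rules, and the input sequence is made up.

```python
def trypsin_digest(protein):
    """Split a protein after K or R, except when the next residue is P."""
    protein = protein.upper()
    peptides = []
    start = 0
    for i, aa in enumerate(protein):
        next_aa = protein[i + 1] if i + 1 < len(protein) else ""
        # Cleave only inside the chain and never before proline
        if aa in ("K", "R") and next_aa and next_aa != "P":
            peptides.append(protein[start:i + 1])
            start = i + 1
    if start < len(protein):
        peptides.append(protein[start:])  # remaining C-terminal peptide
    return peptides


# Hypothetical sequence: the R followed by P is not cleaved
print(trypsin_digest("AKRPGKC"))
```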
- Searching for protein motifs and domains
  
  Searching databases for typical protein motifs/conserved domains enables the annotation of functional units in proteins, providing insights into sequence/structure/function relationships.
  
  Conserved domain database: NCBI/CD
  
  Other databases for domain search: SMART, InterPro
- Signal peptides
  
  Prediction of protein localization by recognizing a signal peptide at the protein N-terminus: SignalP
- Prediction of transmembrane helices
  
  Prediction is based on amino acid hydrophobicity and probability.
  
  Hydrophobicity profile: Expasy/ProtScale
  
  Transmembrane helices prediction: TMHMM, Phobius, TopCons (multiple programs consensus), CCTOP
  
  Figures: PROTTER
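
A hydrophobicity profile of the kind mentioned above can be approximated with a sliding window. This sketch assumes the Kyte-Doolittle hydropathy scale and the classic heuristic of a 19-residue window with a threshold of about 1.6; dedicated predictors such as those listed above use more elaborate models, so treat the result only as a rough indication.

```python
# Kyte-Doolittle hydropathy values
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
    "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
    "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein, window=19):
    """Average hydropathy of every full window along the sequence."""
    scores = [KD[aa] for aa in protein.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def candidate_tm_windows(protein, window=19, threshold=1.6):
    """Start positions (0-based) of windows above the threshold."""
    profile = hydropathy_profile(protein, window)
    return [i for i, score in enumerate(profile) if score >= threshold]


# Hypothetical sequence: a hydrophobic stretch followed by a charged one
demo = "L" * 19 + "D" * 19
print(candidate_tm_windows(demo))
```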
- Example of substitution matrix Page
- Lesson 3 -pdf File
- HW3: Protein analyses Assignment

Topic 4_Protein bioinformatics III
- BLAST - searching for similarity
  
  (Basic Local Alignment Search Tool)
  
  Starting from short stretches of the query sequence, the program searches for similar sequences using a "substitution matrix", which defines the score of potential alignments.
  
  NCBI/BLAST tutorial:
- Pairwise and multiple comparisons of protein sequences - (multiple) alignment
  
  Comparison is based on a substitution matrix.
  
  Pairwise global comparison: Needle (compares sequences over their full length)
  
  Pairwise local comparison: LALIGN (finds the most similar parts of two sequences)
  
  Multiple alignments:
  
  Multalin - a simple tool for comparing two or more sequences
  
  Clustal Omega - can also display a phylogenetic tree
  
  Phylogenetic tree
  
  Advanced phylogeny here.
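
The global comparison described above can be sketched with the Needleman-Wunsch dynamic-programming scheme. To stay short, this example assumes a simple match/mismatch score and a linear gap penalty instead of a real substitution matrix such as BLOSUM62; the parameter values are arbitrary and the function returns only the optimal score, not the alignment itself.

```python
def needleman_wunsch_score(a, b, match=1, mismatch=-1, gap=-2):
    """Optimal global alignment score of sequences a and b."""
    # prev[j] holds the best score for aligning the processed part of a with b[:j]
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # aligning a[:i] against an empty prefix of b
        for j in range(1, len(b) + 1):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            curr.append(max(
                prev[j - 1] + pair,  # align a[i-1] with b[j-1]
                prev[j] + gap,       # gap in b
                curr[j - 1] + gap,   # gap in a
            ))
        prev = curr
    return prev[-1]


# Hypothetical short sequences
print(needleman_wunsch_score("GATTACA", "GATTACA"))
print(needleman_wunsch_score("AC", "AGC"))
```

A local comparison differs mainly in that scores are never allowed to drop below zero and the best cell anywhere in the table is reported.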
- 3-D Structure
  
  PDB (Protein Data Bank)
- Specific databases:
  
  Enzymes (BRENDA), interactions (STRING)
- Examples of typical exam tasks:
- Ex1: Find two human DHRS7 sequences: DHRS7B (AAH09679.1) and DHRS7C (AAI47025.1). Run a pairwise alignment. How identical are these two proteins? Hint. Solution.
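
The identity asked for in Ex1 is reported directly by the alignment tool, but the calculation is easy to reproduce. The sketch below assumes identity is defined as identical columns divided by the total alignment length, gaps included; tools differ in the denominator they use, and the aligned strings are invented rather than taken from the DHRS7 sequences.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns holding the same residue in both rows."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = len(aligned_a)
    if columns == 0:
        return 0.0
    identical = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * identical / columns


# Hypothetical aligned pair; "-" marks a gap
print(percent_identity("MKT-A", "MKSWA"))
```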
- Ex2: Find the sequences of human NQO1 isoforms in UniProt and align them. How many isoforms are there? Compare the alignment output with the description of each isoform: is it correct? Hint. Solution.
- Ex3: Download the sequence of the "unknown protein" (here). Using domain prediction, guess the function of the protein. Verify the guess with BLAST. Which organism does it come from? Does it have any transmembrane helices?
  
  Hint. Solution.
- Ex3_sequence identification File
- Lesson 4 - pdf File
- HW4: Protein comparison, 3D structure Assignment

Topic 5_Nucleotide bioinformatics I
- Searching for nucleotide sequences
  
  NCBI/Nucleotide database
  
  Specialized databases
  
  Focused on genes: Gene
  
  GeneCards (human), Atlas (oncology), Ensembl (vertebrate genomes), TCGA (cancer)
  
  Sequence analysis
  
  SMS: DNA stats, Filter DNA, Range Extractor DNA, Reverse complement
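
Two of the operations listed above, reverse complement and basic DNA statistics, are simple enough to sketch directly. The function names are illustrative and only the four unambiguous bases are handled; the SMS tools also deal with ambiguity codes.

```python
# Translation table mapping each base to its complement
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Complement every base, then reverse the strand."""
    return dna.translate(COMPLEMENT)[::-1]


def gc_content(dna):
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    dna = dna.upper()
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# Hypothetical sequence
seq = "ATGCCGTA"
print(reverse_complement(seq))
print(gc_content(seq))
```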
- Lesson 5 -pdf File
- HW5: Searching for nucleotide sequence Assignment

Topic 6_Nucleotide bioinformatics II
Sequence comparison and translation
- Comparisons of nucleotide sequences - (multiple) alignment
  
  Comparison is analogous to that of proteins. It is recommended to change the substitution matrix.
  
  Multiple (or pairwise) alignments: Multalin
- Examples Page
- Translation
  
  = translation of a nucleotide sequence into amino acids (protein) based on the genetic code
  
  SMS suite/Translate → suitable only for a full CDS (or when the ORF is known)
  
  NCBI/ORFfinder → suitable for translating any nucleotide sequence; looks for ORFs
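
The difference between translating a known reading frame and searching for ORFs can be sketched as follows. This example assumes the standard genetic code, scans only the three forward frames, and defines an ORF as a stretch from the first ATG to a stop codon; ORFfinder also checks the reverse strand and offers alternative codes. The input sequence is made up.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna, frame=0):
    """Translate one reading frame; unknown codons become X, stops become *."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X")
        for i in range(frame, len(dna) - 2, 3)
    )


def longest_orf(dna):
    """Longest M...stop protein in the three forward frames (stop excluded)."""
    best = ""
    for frame in range(3):
        # The last piece after split has no stop codon, so it is dropped
        for segment in translate(dna, frame).split("*")[:-1]:
            start = segment.find("M")
            if start != -1 and len(segment) - start > len(best):
                best = segment[start:]
    return best


# Hypothetical sequence with a short ORF in the third frame
print(translate("ATGGCCTAA"))
print(longest_orf("CCATGGCCTAAGG"))
```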
- Ex1: unknown sequence (ORF) Page
- Ex1-solution Page
- Ex2: unknown sequence Page
- Ex2-solution Page
- Unknown sequence identification
  
  BLASTn (searching nucleotide databases for similar nucleotide sequences)
- DNA sequencing
  
  "Classic" Sanger sequencing (.scf, .abi, .ab1)
- chromas File
- Ex3: unknown sequence.ab1 File
  
  This format (.ab1) is "unsupported"; the file needs to be saved and then opened in Chromas.
- Detection of "vector contamination" in an unknown sequence
  
  VecScreen
  
  Removal of "vector contamination" from an unknown sequence
  
  SMS/Range Extractor DNA
- HW6 Assignment
- HW6-unknown sequence.ab1 File
- Lesson 6 -pdf File

Topic 7_Nucleotide bioinformatics III
- Unknown sequence File
- Primer design
  
  PCR primer design
  
  OligoCalc - calculator of primer properties
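
One primer property that is often estimated by hand is the melting temperature. The sketch below assumes the simple Wallace rule of thumb, Tm = 2(A+T) + 4(G+C) in degrees Celsius, which is only meaningful for short oligonucleotides; calculators such as OligoCalc also offer salt-adjusted and nearest-neighbour estimates that give different numbers. The primer is a made-up example.

```python
def wallace_tm(primer):
    """Rule-of-thumb melting temperature in degrees Celsius."""
    primer = primer.upper()
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2 * at + 4 * gc


def gc_percent(primer):
    """Percentage of G and C bases in the primer."""
    primer = primer.upper()
    if not primer:
        return 0.0
    return 100.0 * (primer.count("G") + primer.count("C")) / len(primer)


# Hypothetical 12-nt primer
primer = "ATGCATGCATGC"
print(wallace_tm(primer), gc_percent(primer))
```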
- Obtaining nucleotide sequence: NCBI
  Reverse complement: SMS
  
  Checking primer positions: multiple (or pairwise) alignment: Multalin
- Lesson 7 -pdf File
- HW7-primers Assignment

Topic 8_Nucleotide bioinformatics IV
- Primer design for gene detection
  
  Primer-BLAST (or "Pick primers" for each gene in NCBI).
- Ex: Primer verification Page
- Restriction analysis
  
  SMS Suite: Restriction Summary (prediction of restriction sites)
  
  SMS Suite: Restriction Digest (simulation of digestion)
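
The two steps above, locating restriction sites and simulating the digest, can be sketched for a single enzyme. The example assumes EcoRI (recognition site GAATTC, cut after the first G) acting on a linear molecule and reports top-strand fragments only; the function names and the input sequence are illustrative.

```python
def find_sites(dna, site):
    """0-based start positions of every occurrence of the recognition site."""
    dna, site = dna.upper(), site.upper()
    positions = []
    pos = dna.find(site)
    while pos != -1:
        positions.append(pos)
        pos = dna.find(site, pos + 1)
    return positions


def digest(dna, site="GAATTC", cut_offset=1):
    """Cut a linear sequence at each site; cut_offset counts bases into the site."""
    dna = dna.upper()
    fragments = []
    start = 0
    for pos in find_sites(dna, site):
        cut = pos + cut_offset
        fragments.append(dna[start:cut])
        start = cut
    fragments.append(dna[start:])  # piece after the last cut
    return fragments


# Hypothetical sequence with one EcoRI site
print(find_sites("AAGAATTCTT", "GAATTC"))
print(digest("AAGAATTCTT"))
```

Fragment lengths from such a digest correspond to the band sizes expected on a gel.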
- HW8-primers for detection and RE analysis Assignment
- Lesson 8 -pdf File

Topic 9

Not available

Summary, examples
- Lesson 9 File
- Exam 2024 sequence Page
- Lesson_2024+solutions File
- Exam_19052023 File
- Exam_19052023-solution File
- Exam_05042022 File
- Exam_05042022_solution File
- Exam_07042022 File
- Exam_07042022-solution File
- Previous exams_2020_Lesson 10_versions 2020+solution File
- Exam_27-05-2021 File
- Exam_27-05-2021_solution File

Exam 2025
- Exam 6.5.2025 File
- Exam 6.5.2025_solution File
- Exam 6.5.2025 Assignment
- Exam_12052025 File
- Exam 12052025 Assignment

7 May - 13 May

Not available
