Algorithms in Bioinformatics - #22125
Information for participants
GENERAL SCHEDULE
Lectures will be in the morning from 9.00 - 12.00, and exercises in the afternoon from 13.00 - 17.00.
The morning sessions will consist of lectures and small
practical exercises introducing the different algorithms, and the afternoon sessions
will consist of programming exercises where
the algorithms will be implemented.
The main programming language will be Python
and all program templates provided in the course will be written in Python.
Prior detailed knowlegde of Python programming is NOT required but will make
following the course very much easier. However, basic programming skills are required to follow the course.
PROGRAMS AND TOOLS
Course Programme
Please note that the programme is updated on a regular basis - click the 'refresh' button once in a while to make sure that you have the most updated information
LITERATURE:
-
The course curriculum consists of review paper and selected chapters from Immunological Bioinformatics,
Lund et al., MIT Press, 2005. All course material will be made available online during the course.
All course material is available either as open access or here Course material
Wednesday, 4. June
Introduction to course. PSSM construction from pre-aligned sequences including pseudo count. Python recap
Morten Nielsen
BACKGROUND TEXTS
Essentials:
Additionals:
- 9.00 - 9.15
- Introduction to course
- Introduction to course [PDF]
(MP4).
- 9.15 - 9.30
- Introduction to the immune system [PDF] (MP4).
- 9.30 - 11.20 (coffee break included)
- Weight matrix construction[PDF]. [PPTX] Weight matrix construction (MP4).
- Logo Handout
- Handout. Estimation of pseudo counts
- 11.20 - 12.00
- Questions to the mornings lectures and other general issues
- Checking that we all have python and jupyter-notebook installed and running
- 12.00 - 13.00
- Lunch
- 13.00 - 13.30
- Some notes on sequence alignment [PDF] (MP4)
- 13.30 - 17.00
- A brief introduction to Python programming and Jupyter-notebooks
- Python intro
- Implementation of PSSM construction from pre-aligned sequences including pseudo count
correction for low counts and sequence clustering
- PSSM construction and evaluation
Thursday, 5. June. Holiday
Friday, 6. June
Sequence alignment, Dynamic programming, and Psi-Blast
Morten Nielsen
BACKGROUND TEXTS
Essentials:
Additionals:
- 9.00 - 9.30
- Questions to yesterdays lectures and exercises
- 9.30 - 11.00
- Blosum matrices [PDF] (MP4)
- Sequence alignment [PDF] . [PPTX] . (MP4)
- Handout (O3)
- Handout answers
- 11.00 - 12.00
- Blast alignment heuristics, Psi-Blast, and sequence profiles [PDF] . [PPTX] .
- Psi-Blast handout.
- 12.00 - 13.00
- Lunch
- 13.00 - 17.00
- Implementation of the Smith-Waterman Dynamic
programming algorithm
- Matrix dumps from alignment programs (to be used for debugging)
Monday, 9. June. Holiday
Tuesday, 10. June
Data redundancy reduction algortihms
Optimizations methods
Gibbs sampling
Morten Nielsen
BACKGROUND TEXTS - Data redundancy reduction and Gibbs sampling
Essentials:
Additionals:
BACKGROUND TEXTS - Hidden Markov Models
- 9.00 - 9.30
- Questions to yesterdays lectures and exercises
- 9.30 - 10.00
- Data redundancy reduction algorithms (Hobohm1 and Hobohm2) [PDF]. [PPTX]. (MP4).
- 10.00 - 10.45
- Optimization procedures - Gradient decent, Monte Carlo
- Optimization procedures [PDF] [PPTX] (MP4)
- GD handout
- 10.45 - 11.00
- Break
- 11.00 - 12.00
- Gibbs sampling and Gibbs clustering
- Gibbs sampling [PDF] . [PPTX] . (MP4).
- 12.00 - 13.00
- Lunch
- 13.00 - 17.00
- Hobohm redundancy reduction algorithms
- Implementating of a Gibbs sampling algorithm for prediction of MHC class II binding
Wednesday, 11. June
Hidden Markov Models
Morten Nielsen
BACKGROUND TEXTS
- 9.00 - 9.30
- Questions to yesterday lectures and exercise
- 9.30 - 12.00
- Hidden Markov models
- Viterbi decoding, Forward/Backward algorithm, Posterior decoding, Baum-Welsh learning
- [PDF]. [PPTX] HMM (MP4)
- Viterbi Handout
- Forward Handout
- 11.30 - 12.00
- Profile Hidden Markov Models
- 12.00 - 13.00
- Lunch
- 13.00 - 17.00
- Implementation of Viterbi and posterior decoding.
- Hidden Markov exercises
Thursday, 12. June
Cross validation and training of data driven prediction methods. Stabilization matrix method (SMM)
Morten Nielsen
BACKGROUND TEXTS
- 9.00 - 9.30
- Questions to yesterdays lectures and exercises
- 9.30 - 10.00
- Cross validation and training of data driven prediction methods
- [PDF] . [PPTX] . [MP4].
- 10.00 - 10.30
- Stabilization matrix method (SMM) background
- [PDF] . [PPTX] . [MP4].
- SMM handout
- 10.30 - 10.45
- Break
- 10.45 - 12.00
- Implementing and evaluating SMM algorithms using cross-validation
- 12.00 - 13.00
- Lunch
- 13.00 - 13.30
- Quiz with questions capturing essential parts of course up to now Quiz. We will go over the answers to the quiz tomorrow morning.
- Description of potential projects and formation of groups
- Project suggestions, and descriptions.
- Document for project signup.
- 14.00 - 17.00
- Continuation of exercise
Friday, 13. June
Artificial neural networks. Sequence encoding, feedforward and backpropagation algorithm
Morten Nielsen
BACKGROUND TEXTS
- Immunological Bioinformatics. MIT Press. Chapter 4.
- Background
- Sequence encoding
- Feed forward algorithm
- Back-propagation and neural network training
- 9.00 - 9.30
- Questions to yesterdays lectures and exercises
- 9.30 - 10.30
- Artificial neural networks[PDF] . [PPTX] . Artificial neural networks Part 1 (MP4). Artificial neural networks Part 2 (MP4).
- Handout
- 10.30 - 10.40
- Break
- 10.40 - 12.00
- Network training - backpropagation
- Training of artificial neural networks [PDF] . [PPTX] . (MP4)..
- Handout
- 12.00 - 13.00
- Lunch
- 13.00 - 17.00
- Artificial neural networks (Feedforward and Backpropagation)
Monday, 16. June
An introduction to Deep neural network architectures
Morten Nielsen
BACKGROUND TEXTS
- 9.00 - 9.30
- Questions to yesterdays lectures and exercises
- 9.30 - 10.00
- Trick for ANN training [PDF].
- 10.00 - 10.30
- NNAlign, alignment using ANN's [PDF]
- 10.30 - 12.00
- Deep Learning using FFNN and CNNs (PPTX) [PDF]
- 11.00 - 12.00
- Exercise: Constructing and training Deep ANN methods (Joakim Noeddeskov Glifford && Jonas Birkelund Nilsson)
- NNdeep exercise
- 12.00 - 13.00
- Lunch
- 13.00 -17.00
- Exercise: Constructing and training Deep ANN methods (Joakim Noeddeskov Glifford && Jonas Birkelund Nilsson) cont.
- Answer Deep FFNN
Tuesday, 17. June
An introduction to Deep neural network architectures
Introduction to the project work
Morten Nielsen
BACKGROUND TEXTS
- 9.00 - 9.30
- Questions to yesterdays lectures and exercises
- 9.30 - 10.00
- What goes on inside a CNN? (Morten Nielsen) [PDF]
- 10.00 - 10.15
- Break
- 10.15 - 11.00
- Doing things in C - A few examples: Alignment and ANN training
- Some code examples
- 11.00 - 12.00
- Selection of projects, formation of project groups and start of project work Document for project signup.
- 12.00 - 13.00
- Lunch
- 13.00 - 16.00
- Work on project (on your own)
Wednesday 18. - Tuesday 25. June. Project work
- No lectures. Project work
- Projects must be submitted (in PDF format) via campusnet Tuesday 24. of June 11.59 (just before lunch)
Wedensday, 25 - Thursday 26th June, Project evaluation and Exam
Each group has 15 minutes to present their project including 5 minutes for questions. Note, only the group will be present for the presentation of
the individual projects. After the present, each member of the group is
evalauted in a oral exam covering the complete course curriculum.
Go to