Algorithms in Bioinformatics - #22125
Information for participants
GENERAL SCHEDULE
Lectures will be in the morning from 9.00 - 12.00, and exercises in the afternoon from 13.00 - 17.00.
The morning sessions will consist of lectures and small
practical exercises introducing the different algorithms, and the afternoon sessions
will consist of programming exercises where
the algorithms will be implemented.
The main programming language will be Python
and all program templates provided in the course will be written in Python.
Prior detailed knowlegde of Python programming is NOT required but will make
following the course very much easier. However, basic programming skills are required to follow the course.
PROGRAMS AND TOOLS
Course Programme
Please note that the programme is updated on a regular basis - click the 'refresh' button once in a while to make sure that you have the most updated information
LITERATURE:
-
The course curriculum consists of review paper and selected chapters from Immunological Bioinformatics,
Lund et al., MIT Press, 2005. All course material will be made available online during the course.
All course material is available either as open access or here Course material
Monday, 8. June
Introduction to course. PSSM construction from pre-aligned sequences including pseudo count. Python recap
Morten Nielsen
BACKGROUND TEXTS
Essentials:
Additionals:
- 9.00 - 9.15
- Introduction to course
- Introduction to course [PDF]
(MP4).
- 9.15 - 9.30
- Introduction to the immune system [PDF] (MP4).
- 9.30 - 11.20 (coffee break included)
- Weight matrix construction[PDF]. [PPTX] Weight matrix construction (MP4).
- Logo Handout
- Answers
- Handout. Estimation of pseudo counts
- 11.20 - 11.40
- Some notes on sequence alignment [PDF] (MP4)
- 11.40 - 12.00
- Questions to the mornings lectures and other general issues
- Checking that we all have python and jupyter-notebook installed and running
- 12.00 - 13.00
- Lunch
- 13.00 - 17.00
- A brief introduction to Python programming and Jupyter-notebooks
- Python intro
- Python Answers
- Implementation of PSSM construction from pre-aligned sequences including pseudo count
correction for low counts and sequence clustering
- PSSM construction and evaluation
- PSSM answers
Tuesday, 9. June
Sequence alignment, Dynamic programming, and Psi-Blast
Morten Nielsen
BACKGROUND TEXTS
Essentials:
Additionals:
- 9.00 - 9.30
- Questions to yesterdays lectures and exercises
- 9.30 - 11.00
- Blosum matrices [PDF] (MP4)
- Sequence alignment [PDF] . [PPTX] . (MP4)
- Handout (O3)
- Handout (O2)
- Handout answers
- 11.00 - 12.00
- Blast alignment heuristics, Psi-Blast, and sequence profiles [PDF] . [PPTX] .
- Psi-Blast handout.
- 12.00 - 13.00
- Lunch
- 13.00 - 17.00
- Implementation of the Smith-Waterman Dynamic
programming algorithm
- Matrix dumps from alignment programs (to be used for debugging)
- Answers to sequence alignment exercise
Wednesday, 10. June
Data redundancy reduction algortihms
Optimizations methods
Gibbs sampling
Morten Nielsen
BACKGROUND TEXTS - Data redundancy reduction and Gibbs sampling
Essentials:
Additionals:
- 9.00 - 9.30
- Questions to yesterdays lectures and exercises
- 9.30 - 10.00
- Data redundancy reduction algorithms (Hobohm1 and Hobohm2) [PDF]. [PPTX]. (MP4).
- 10.00 - 10.45
- Optimization procedures - Gradient decent, Monte Carlo
- Optimization procedures [PDF] [PPTX] (MP4)
- GD handout
- 10.45 - 11.00
- Break
- 11.00 - 12.00
- Gibbs sampling and Gibbs clustering
- Gibbs sampling [PDF] . [PPTX] . (MP4).
- 12.00 - 13.00
- Lunch
- 13.00 - 17.00
- Hobohm data redundancy reduction algorithms
- Implementation of a Gibbs sampling algorithm for prediction of MHC class II binding
Thursday, 11. June
Hidden Markov Models
Morten Nielsen
BACKGROUND TEXTS
- 9.00 - 9.30
- Questions to yesterday lectures and exercise
- 9.30 - 12.00
- Hidden Markov models
- Viterbi decoding, Forward/Backward algorithm, Posterior decoding, Baum-Welsh learning
- [PDF]. [PPTX] HMM (MP4)
- Viterbi Handout
- Forward Handout
- 11.30 - 12.00
- Profile Hidden Markov Models
- 12.00 - 13.00
- Lunch
- 13.00 - 17.00
- Implementation of Viterbi and posterior decoding.
- Hidden Markov exercises
Friday, 12. June
Cross validation and training of data driven prediction methods. Stabilization matrix method (SMM)
Morten Nielsen
BACKGROUND TEXTS
- 9.00 - 9.30
- Questions to yesterdays lectures and exercises
- 9.30 - 10.00
- Cross validation and training of data driven prediction methods
- [PDF] . [PPTX] . [MP4].
- 10.00 - 10.45
- Stabilization matrix method (SMM) background
- [PDF] . [PPTX] . [MP4].
- SMM handout
- 10.45 - 11.00
- Break
- 11.15 - 11.30
- Description of potential projects and formation of groups
- Project suggestions, and descriptions.
- Document for project signup.
- 11.30 - 12.00
- Quiz with questions capturing essential parts of course up to now Quiz. We will go over the answers to the quiz tomorrow morning.
- 12.00 - 13.00
- Lunch
- 13.00 - 17.00
- Implementing and evaluating SMM algorithms using cross-validation
Monday, 15. June
Artificial neural networks. Sequence encoding, feedforward and backpropagation algorithm
Morten Nielsen
BACKGROUND TEXTS
- Immunological Bioinformatics. MIT Press. Chapter 4, pages 89-96.
- Background
- Sequence encoding
- Feed forward algorithm
- Back-propagation and neural network training
- 9.00 - 9.30
- Questions to yesterdays lectures and exercises
- 9.30 - 10.30
- Artificial neural networks[PDF] . [PPTX] . Artificial neural networks Part 1 (MP4). Artificial neural networks Part 2 (MP4).
- Handout
- 10.30 - 10.40
- Break
- 10.40 - 12.00
- Network training - backpropagation
- Training of artificial neural networks [PDF] . [PPTX] . (MP4)..
- Handout
- 12.00 - 13.00
- Lunch
- 13.00 - 17.00
- Artificial neural networks (Feedforward and Backpropagation)
Tuesday, 16. June
An introduction to Deep neural network architectures
Morten Nielsen
BACKGROUND TEXTS
- 9.00 - 9.30
- Questions to yesterdays lectures and exercises
- 9.30 - 10.00
- Trick for ANN training [PDF].
- 10.00 - 10.30
- NNAlign, alignment using ANN's [PDF]
- 10.30 - 12.00
- Deep Learning using FFNN and NNAlign (PPTX) [PDF]
- 11.00 - 12.00
- Exercise: Constructing and training Deep ANN methods (Joakim Noeddeskov Glifford && Jonas Birkelund Nilsson)
- NNdeep exercise
- 12.00 - 13.00
- Lunch
- 13.00 -17.00
- Exercise: Constructing and training Deep ANN methods (Joakim Noeddeskov Glifford && Jonas Birkelund Nilsson) cont.
- NNdeep exercise
Wednesday, 17. June
An introduction to Deep neural network architectures
Introduction to the project work
Morten Nielsen
BACKGROUND TEXTS
- 9.00 - 9.30
- Questions to yesterdays lectures and exercises
- 9.30 - 10.00
- Doing things in C - A few examples: Alignment and ANN training
- Some code examples
- 10.00 - 12.00
- Selection of projects, formation of project groups and start of project work Document for project signup.
- 11.00 - 17.00
- Work on project (on your own)
Thursday 18. - Wednesday 24. June. Project work
- No lectures. Project work
- Projects must be submitted (in PDF format) via campusnet Wedensday 24. of June 11.59 (just before lunch)
Thursday, 25 - Friday 26th June, Project evaluation and Exam
Each group has 15 minutes to present their project including 5 minutes for questions. Note, only the group will be present for the presentation of
the individual projects. After the present, each member of the group is
evaluated in a oral exam covering the complete course curriculum.
The exam will take place in the usual class room, building 210, room 042/048
Go to