Algorithms in Bioinformatics - #22125

Information for participants

GENERAL SCHEDULE
Lectures will be in the morning from 9.00 - 12.00, and exercises in the afternoon from 13.00 - 17.00.

The morning sessions will consist of lectures and small practical exercises introducing the different algorithms, and the afternoon sessions will consist of programming exercises where the algorithms will be implemented.

Note, that due to the Covid19 situation, this years course will be given in an online version. Further, the theroretical part of the course will in most cases be given as recorded lectures. Recorded lectures are marked as "Recorded" and online sessions as "Online" in the course program. Links to recorded lectures is provide in the program below. The online session will be via Zoom, and links provided on the program next to the session.

The main programming language will be Python (yes, this has been changed compared to earlier years), and all program templates provided in the course will be written in Python. Prior knowlegde of Python programming is NOT required. However, basic programming skills are required to follow the course.

LITERATURE: The course curriculum consists of review paper and selected chapters from Immunological Bioinformatics, Lund et al., MIT Press, 2005. All course material will be made available online during the course. All course material is available either as open access or here Course material.

PROGRAMS AND TOOLS


Course Programme
Please note that the programme is updated on a regular basis - click the 'refresh' button once in a while to make sure that you have the most updated information

LITERATURE:

O Friday, 4. June

Introduction to course, UNIX and Python programming crash course 101
Morten Nielsen

  • BACKGROUND TEXTS
    9.00 - 9.20 "Recorded"
    Introduction to the immune system
    Introduction to the immune system (MP4). [PDF] .
    9.20 - 9.35 "Recorded"
    Performance measures
    Performance measures (MP4). [PDF] .
    9.35 - 9.50
    Coffe break
    9.50 - 10.00 "Online"
    Introduction to course
    Introduction to course (MP4). [PDF] .
    10.00 - 12.00 "Online"
    Unix crash course
    A UNIX/Linux crash course
    Information about breakout rooms
    Signup sheet for help/questions (add your breakout room number and name)
    12.00 - 13.00
    Lunch
    13.00 - 13.30 "Online"
    Questions to the mornings lectures and other general questions - Cancelled
    13.30 - 17.00 "Recorded"
    Weight matrix construction (MP4). [PDF]. [PPTX].
    Logo Handout
    Handout. Estimation of pseudo counts


    O Monday, 7. June
    Weight matrix (PSSM) construction
    Morten Nielsen

  • BACKGROUND TEXTS
    9.00 - 9.30 "Recorded"
    Some notes on sequence alignment (MP4) [PDF] .
    9.30 - 10.00 "Online"
    Checking that we all have python and jupyter-notebook installed and running
    10.00 - 12.00 "Online"
    A brief introduction to Python programming and Jupyter-notebooks
    Python intro
    Signup sheet for help/questions (add your breakout room number and name)
    12.00 - 13.00
    Lunch
    13.00 - 13.30 "Online"
    Questions to mornings lecture and yesterdays exercise (MP4) (2020 MP4)
    13.30 - 17.00 "Online"
    Implementation of PSSM construction from pre-aligned sequences including pseudo count correction for low counts and sequence clustering
    PSSM construction and evaluation
    Signup sheet for help/questions (add your breakout room number and name)


    O Tuesday, 8. June

    Sequence alignment, Dynamic programming, and Psi-Blast
    Morten Nielsen

    BACKGROUND TEXTS
    Essentials: Additionals:
    9.00 - 11.00 "Recorded"
    Blosum matrices (MP4) [PDF]
    Sequence alignment (MP4) [PDF] . [PPTX] .
    Handout (O3)
    Handout (O2)
    11.00 - 12.00 "Recorded"
    Blast alignment heuristics, Psi-Blast, and sequence profiles [PDF] . [PPTX] .
    Psi-Blast handout.
    12.00 - 13.00
    Lunch
    13.00 - 13.30 "Online"
    Questions to mornings lectures and yesterdays exercises (MP4) (MP4 2020 class)
    13.30 - 17.00 "Online"
    Implementation of the Smith-Waterman Dynamic programming algorithm
    Matrix dumps from alignment programs (to be used for debugging)
    Signup sheet for help/questions (add your breakout room number and name)

    O Wednesday, 9. June
    Data redundancy reduction algortihms
    Optimizations methods
    Gibbs sampling
    Morten Nielsen

    BACKGROUND TEXTS
    Essentials: Additionals:
    9.00 - 10.00 "Recorded"
    Data redundancy reduction algorithms (Hobohm1 and Hobohm2) (MP4). [ PDF]. [ PPTX].
    10.00 - 10.45 "Recorded"
    Optimization procedures - Gradient decent, Monte Carlo
    Optimization procedures (MP4) [PDF] [PPTX]
    GD handout
    10.45 - 11.00
    Break
    11.00 - 12.00 "Recorded"
    Gibbs sampling and Gibbs clustering
    Gibbs sampling (MP4). [PDF] . [PPTX] .
    12.00 - 13.00
    Lunch
    13.00 - 13.30 "Online"
    Questions to mornings lectures and yesterdays exercise (MP4) (MP4 2020)
    13.30 - 17.00 "Online"
    Hobohm redundancy reduction algorithms
    Implementating of a Gibbs sampling algorithm for prediction of MHC class II binding
    Signup sheet for help/questions (add your breakout room number and name)

    O Thursday, 10. June
    Hidden Markov Models
    Morten Nielsen

  • BACKGROUND TEXTS
    9.00 - 11.00 "Recorded"
    Hidden Markov models (with a break around 10.30)
    Viterbi decoding, Forward/Backward algorithm, Posterior decoding, Baum-Welsh learning
    HMM (MP4) [ PDF]. [ PPTX]
    Viterbi Handout
    Forward Handout
    11.00 - 12.00
    Profile Hidden Markov Models. "Recorded" (part of the HMM talk above)
    12.00 - 13.00
    Lunch
    13.00 - 13.30
    Questions to mornings lecture and yesterdays exercise (MP4) (MP4 2020)
    13.30 - 17.00 "Online"
    Implementation of Viterbi and posterior decoding.
    Hidden Markov exercises
    Signup sheet for help/questions (add your breakout room number and name)

    O Friday, 11. June
    Cross validation and training of data driven prediction methods. Stabilization matrix method (SMM)
    Morten Nielsen

  • BACKGROUND TEXTS
    9.00 - 9.45 "Recorded"
    Cross validation and training of data driven prediction methods
    Cross-validation, overfitting and method evaluation (MP4). [PDF] . [PPTX] .
    9.45 - 10.15 "Recorded"
    Stabilization matrix method (SMM) background
    SMM background (MP4). [PDF] . [PPTX] .
    SMM handout
    10.15 - 10.30
    Break
    10.30 - 12.00 "Online"
    Implementing and evaluating SMM algorithms using cross-validation
    12.00 - 13.00
    Lunch
    13.00 - 14.00 "Online"
    Quiz with questions capturing essential parts of course up to now Quiz
    Questions to mornings lecture and yesterdays exercise (MP4) (MP4 2020)
    Description of potential projects and formation of groups
    Project suggestions, and descriptions.
    Document for project signup.
    14.00 - 17.00
    Continuation of exercise
    Answers
    Signup sheet for help/questions (add your breakout room number and name)

    O Monday, 14. June
    Artificial neural networks. Sequence encoding, feedforward and backpropagation algorithm
    Morten Nielsen

  • BACKGROUND TEXTS
    9.00 - 10.30 "Recorded"
    Artificial neural networks Part 1 (MP4). Artificial neural networks Part 2 (MP4). [PDF] . [PPTX] .
    Handout
    Answers
    10.30 - 10.40
    Break
    10.40 - 12.00 "Recorded"
    Network training - backpropagation
    Training of artificial neural networks (MP4).. [PDF] . [PPTX] .
    Handout
    12.00 - 13.00
    Lunch
    13.00 - 13.30 "Online"
    Questions to mornings lecture and yesterdays exercise (MP4) (MP4 2020)
    13.30 - 17.00 "Online"
    Artificial neural networks (Feedforward and Backpropagation)
    Signup sheet for help/questions (add your breakout room number and name)

    O Tuesday, 15. June
    An introduction to Deep neural network architectures
    Introduction to the project work
    Morten Nielsen

  • BACKGROUND TEXTS
    9.00 - 9.30 Online
    NNAlign, alignment using ANN's [PDF] NNAlign (MP4).
    9.30 - 10.00 Online
    Trick for ANN training [PDF]. NNtricks (MP4).
    10.00 - 12.00 Online
    Exercise: Constructing and training Deep ANN methods (Helle Rus Povlsen && Alessandro Montemurro)
    FNN lecture slides.pdf.
    deep_neural_nets slides.pdf.
    code.tar.gz.
    data.tar.gz.
    12.00 - 13.00
    Lunch
    13.00 - 13.30 Online
    Questions to mornings lecture and yesterdays exercise (MP4 2020)
    13.30 - 14.00 Online
    Selection of projects and formation of project groups Document for project signup.
    14.00 -17.00 Online
    Exercise: Constructing and training Deep ANN methods (Helle Rus Povlsen && Alessandro Montemurro) cont.


    O Wednesday 16. - Tuesday 22. June. Project work
    No lectures. Project work
    Projects must be submitted (in PDF format) via campusnet Tuesday 22. of June 11.59 (Lunch time) at the latest (to give me time to read and evaluate the reports before Wednesday).

    O Wednesday, 23. June

    Project evaluation

    Each group has 15 minutes to present their project including 5 minutes for questions. The 5 person group has a little longer and the 1 person group a little shorter.


    O Thursday, 24. June, Exam

    The written exam will be available at the course page at DTU Inside Thursday 24th of June at 8.59, and need to be handed in via DTU Inside Thursday 24th of June before 13.00. The exam is open book, meaning that you can seek information at the internet. You however cannot consult other students taken the exam, and plagiarism is seriously condemned.
    Go to