22111:Course plan spring 2020

From 22111
Jump to navigation Jump to search

General information

Teachers

  • Henrik Nielsen — Associate professor, course responsible.
  • Rasmus Wernersson — External associate professor, course responsible.
  • Paolo Marcatili — Associate professor, guest lecturer. Topic: Protein structure.
  • Anders Gorm Pedersen — Professor, guest lecturer. Topic: Phylogenetic trees.
  • Rasmus Weisel Jensen — PhD student (University of Copenhagen), guest lecturer. Topic: Malaria vaccines.

Teaching assistants

Course content

In this course, a large emphasis is placed on the practical usage of bioinformatics databases and tools. A typical lecture will present the theoretical aspects of the topics of the day — sometimes including a small group exercise using pen and paper — and last about an hour. The rest of the time will be spent on practical computer exercises, where the teachers and teaching assistants will be ready to help.

See also the course base about 22111.

Curriculum

There is no formal textbook. The curriculum consists of the exercise guides, supplemented with various papers and chapters which will be made available on this homepage or on DTU Inside. Please note that all exercise guides are mandatory curriculum — including the answers to the exercises which will be made available on this homepage after each exercise.

Computers

You must bring your own laptop to the exercises, and it must be able to connect to DTU's wireless network. The type of computer / operating system is not important; Windows, Mac or Linux will all work fine. An iPad or an Android tablet, on the other hand, will not be good enough. A Chromebook will also not be enough (unless you have succeeded in installing a Linux distribution on it, but in that case we assume you know what you're doing).

In some of the exercises ("PDB/PyMOL", "Malaria vaccine", and "Old exam questions"), you will work with the molecular visualization program PyMOL. This is rather difficult to control by a touchpad, so please remember to bring a mouse. The mouse should have two buttons plus a scroll-wheel.

Software:

  1. Most importantly: an updated internet browser (e.g. Google Chrome, FireFox, Opera, Safari for Mac or Edge for Windows). NB: You must have more than one browser installed; Safari for Mac or Edge for Windows may have glitches with some bioinformatics websites, and in those cases it is important to be able to switch to an alternative browser.
  2. Java runtime engine is needed for running some of the software we use in the course, including jEdit (see below). If it is not already installed on your computer, there are two options for downloading it:
    1. Oracle java can be downloaded from http://www.java.com. Note: it is not truly free anymore; if you want to use it for any commercial activities, you need to pay a license fee.
    2. A truly free alternative is AdoptOpenJDK which can be downloaded from https://adoptopenjdk.net/. Windows users: Choose "OpenJDK 8" and "HotSpot", and remember to enable all subfeatures when installing, otherwise it will not work with the software we are going to use. Mac users: we currently don't recommend this solution for Mac, since jEdit (see below) for Mac does not play nice with AdoptOpenJDK.
  3. A plain text editor for working with, e.g., sequence files. We recommend jEdit, which you can download for free from http://www.jedit.org. If you experience unsolvable problems installing or running jEdit, there are alternatives, e.g. Geany.

Other software will be installed during the exercises.

Where and when

Lectures plus subsequent exercises will take place every Tuesday afternoon during the semester, starting Tuesday Feb 4. Lectures will be from 13:00 to approx. 14 in building 208, auditorium 53, and the exercises will then take place in building 210, rooms 142+148+118.

Hand-ins

As preparation for the computer-based exam, each participant or group must write a "logbook" with answers to the questions posed in the exercise guides. After the exercise, you should upload the logbook to DTU Inside (Course 22111 → Assignments).

NB: It is possible to hand in as a group. We would much rather receive a group hand-in than a number of identical logbooks.

You decide which software you prefer for writing the logbook — e.g. Microsoft Word, LibreOffice (free), Apache OpenOffice (free), Pages for Mac, Google Docs or similar. You should be able to insert screenshots in the logbooks for documentation purposes. Microsoft Word has a built-in screenshot tool. For Windows users, however, we recommend the free program Greenshot which can not only take screenshots and copy them to the clipboard, but also make simple edits and annotations in the screenshots.

Regardless of your choice of writing software, the result must be handed in as a PDF file. MacOS X and Windows 10 have built-in functions for converting any printable file to PDF. Users of earlier versions of Windows must install a separate program. Several free alternatives exist, e.g. PrimoPDF. (It can be a good idea to install PrimoPDF even for Windows 10 users, it provides some extra options and the resulting files take up less space).

Please do not copy the questions from the exercise guide to your logbook. The hand-in module on DTU Inside has a system for plagiarism detection, which will raise an alarm if significant portions of your hand-in are identical to documents found on the internet — and that includes the exercise guides.

NB: The hand-ins do not affect your grade — they are mainly meant as a preparation for the exam. They are also a means for us to check the understanding of the teaching; if we can see that many participants have made the same mistake, we will try to explain the issue better at the next lecture.

Exam

The 22111 exam is electronic; i.e. you must bring your own computer, and you will not get a paper copy of the questions. The questions will be made available as a PDF file on DTU Inside. Hand-in also takes place on DTU Inside, and the procedure is the same as in the exercises. The only accepted hand-in format is PDF.

All aids are allowed at the exam; you can bring any books, papers or notes. You will have open access to the internet which includes all the materials and websites we have used during the course. You are also allowed to search information on Google, Wikipedia, etc., but you are not allowed to communicate with others through e-mail, Facebook, chat, or file sharing websites. The internet traffic will be logged during the exam to ensure that these restrictions are kept.

Just like in the weekly hand-ins, we kindly ask you: Please don't copy the questions in your answer document — that might result in the answer being flagged as plagiarism.

When you hand in the exam assignment in DTU Inside, you get a hand-in code that must be written on the hand-in envelope. It is very important that the code is handed in correctly, otherwise your answer cannot be assessed. The code serves as a control that the hand-in has not been modified since you left the examination room.

DTU Inside

Link to this year's DTU Inside group: https://cn.inside.dtu.dk/cnnet/element/612495/

Evaluation and feedback

We will be very happy to receive comments, suggestions, criticisms, or praise at any time during the semester. You can send these by email to the teachers or write them on "Forum" in the DTU Inside group. If somebody writes a message on the Forum, you can comment on it. If you see a message you agree on, please comment "Agree!" so that we can see that it is not just one person's opinion.

In addition, we will conduct a mid-term evaluation in DTU Inside.

Lecture & exercise plan

Note: This is a preliminary plan, changes may occur!

Tuesday Feb 4 — Introduction & taxonomy

Lectures:
  • Introduction to the course, bioinformatics, and computers — Henrik Nielsen.
  • Evolution and taxonomy — Rasmus Wernersson.
Slides: will be made available on DTU Inside File sharing.
Curriculum: Brief Introduction to Evolutionary Theory — Written by Anders Gorm Pedersen.
Test of prior knowledge: Go to https://evaluering.dtu.dk/, click "Test of prior knowledge" under 22111, and fill out the form (it's anonymous). Spend max. 10 minutes on it.
Exercises:
  1. Plain text files and jEdit — (Answers)
  2. Taxonomy databases — (Answers)
Extra material
"Bioinformatics" — Encyclopedia entry from 2009.

Tuesday Feb 11 — GenBank

Lecture: DNA as Biological Information — Rasmus Wernersson
Curriculum: DNA sequencing tutorial — source: IDT Tech Vault
Handout for the lecture: "Base-calling" exercise (for printing) [PDF] - "Base-calling exercise (version for on-screen viewing) [PDF] (NEW, Feb 2020).
Slides: on DTU Inside File sharing.
Exercise: Using the GenBank database — (Answers)
Reference material for the exercise: GenBank + FASTA format [PDF] (UPDATED, Feb 11th 2020)
Background material (supposedly known):
mRNA splicing (YouTube).
Overview of eukaryotic gene structure (PDF).
Extra material:
Entrez Sequences Quick Start (NCBI)
"GenBank" — article from the annual database issue of Nucleic Acids Research, 2019.
"GenBank" — article from the annual database issue of Nucleic Acids Research, 2020.

Tuesday Feb 18 — Translation & UniProt

Lecture: Protein databases — Henrik Nielsen
Curriculum: Virtual Ribosome — software article (PDF).
Slides: on DTU Inside File sharing.
Exercises:
  1. Exercise: Translation - Virtual Ribosome — (Answers)
  2. Exercise: The protein database UniProt — (Answers)
Background material (supposedly known):
Levels of protein structure [PDF]
Overview of eukaryotic gene structure (PDF).
Extra material:
"UniProt: a worldwide hub of protein knowledge" — article from the annual database issue of Nucleic Acids Research, 2019.

Tuesday Feb 25 — Pairwise alignment

Lecture: Pairwise alignment — Henrik Nielsen.
Curriculum: Page 35-55 in Immunological Bioinformatics (PDF: on DTU Inside File sharing → Textbook excerpt).
Handout for the lecture: Alignment scores
Slides: on DTU Inside File sharing.
Exercise: Pairwise alignment — (Answers)

Tuesday Mar 3 — Protein structure, PDB & PyMOL

Remember to bring a mouse for this day's exercise. The mouse should have two buttons and a scroll wheel.
Lecture: Protein 3D structure — Paolo Marcatili
Curriculum: Protein Structure (Wikipedia)
Slides: on DTU Inside File sharing.
Link to advanced course:
Software for installation: PyMOL
Note: you will need the license file found at the file sharing under this week's folder.
Exercises:
  1. PyMol tutorial (PDF) — basic usage of PyMOL.
  2. Protein Structure and Visualization — (Answers — NB: the last section labelled "PyMOL magic" is NOT curriculum, just a tip!)
Extra material:

Tuesday Mar 10 — BLAST

Lecture: Introduction to BLAST — Rasmus Wernersson.
Curriculum: section 3.2.5 → 3.3 (i.e. pages 47-52) in Immunological Bioinformatics (PDF: on DTU Inside File sharing → Textbook excerpt).
Slides: on DTU Inside File sharing.
Exercise: BLAST — (Answers)
Extra material:
Videos about BLAST from NCBI: (Video introduction to NCBI's web interface and Expect Values) NCBI's YouTube channel

Tuesday Mar 17 — Case: Malaria vaccine

Lecture: Malaria and vaccines — Rasmus Weisel Jensen (KU).
Curriculum: Malaria — Causal Agents / Life Cycle
Slides: on DTU Inside File sharing.
Exercise: Malaria vaccine — (Answers)

Tuesday Mar 24 — Sequence information & logo-plots

Lecture: Sequence information & logo-plots — Rasmus Wernersson
Curriculum:
  1. Pages 68-80 in Immunological Bioinformatics (PDF: on DTU Inside File sharing → Textbook excerpt).
  2. Pages 1-8 of "Information theory primer" (PDF)
    • Read also the appendix on logarithms (especially Log2) if needed!
Slides: on DTU Inside File sharing.
Handout for the lecture: How to construct sequence logos (PDF)
Mid-term evaluation: Go to https://evaluering.dtu.dk/ and click "Mid-term evaluation" under 22111
Exercise: DNA and Peptide Logos — (Answers)

Tuesday Mar 31 — Weight matrices and other prediction methods

Lecture: Introduction to prediction methods, especially Weight Matrices — Henrik Nielsen
Curriculum: Same as last week!
Slides: on DTU Inside File sharing.
Handouts for the lecture: How to estimate pseudo frequencies (Answer)
Exercise: Construction of sequence logos and weight matrices - (Answers)
Link to advanced course:
22125: Algorithms in bioinformatics

File:Easter-egg-free-to-use-cliparts.png Easter holidays File:Easter-egg-free-to-use-cliparts.png


Tuesday Apr 14 — PSI-BLAST

Lecture: PSI-BLAST — Rasmus Wernersson
Curriculum:
Slides: on DTU Inside File sharing.
Exercise: PSI-BLAST — (Answers)

Tuesday Apr 21 — Multiple alignments

Lecture: Multiple alignment — Henrik Nielsen
Curriculum: RevTrans (article)
Slides: on DTU Inside File sharing.
Exercise: Multiple Alignments — (Answers)

Tuesday Apr 28 — Phylogenetic trees

Lecture: Phylogenetic Reconstruction: Distance Matrix Methods — Anders Gorm Pedersen
Curriculum:
  1. Introduction to Tree Building, PDF on Inside File sharing → Slides etc → Lecture12
  2. Evolutionary trees (minus the section "How to reconstruct an evolutionary tree")
  3. Understanding Evolutionary Trees, PDF.
Slides: on DTU Inside File sharing.
Handout for lecture: Reconstructing a distance tree (Answer)
Software for installation: FigTree tree-viewer
Exercise: Phylogeny — (Answers)
Link to advanced course:

Tuesday May 5 — Bioinformatics in practice + old exam questions

Lecture: "Real life case": Bioinformatics and Systems Biology in precision medicine - Rasmus Wernersson
Curriculum: (None - lean back and enjoy)
Slides: on DTU Inside File sharing.
Exercise: We train on the old exam set from 2017 - available on the file share.

Exam

Tuesday May 26

Summer exam 2020: Go to http://onlineeksamen.dtu.dk/ and find 22111.

The assignment will be accessible from 9:00 on Tuesday May 26.

Checklist for computers

Check here whether your computer has all the software needed for the exam: Checklist for computers

Link collection

A quick overview of the websites we have used in the course: Link collection

FAQ

Questions we have received and answered: FAQ