When: April 20, 10.00-17.00
Where: Building 202, room R1020 how to get there?
Program:
10.00 - 10.15: Intro to the course
(Carolina Barra Quaglia)
10.15 - 11.00: Bioinformatics in Genomic
Medicine (Frederik Otzen Bagger)
11.00 - 11.15: Coffee Break
11.15 - 12.00: Somatic variant calling and RNA sequencing (Frederik
Otzen Bagger)
12.00 - 13.00: Lunch
13.00 - 13.30: Introduction
to germline variant characterization (Carolina Barra Quaglia)
13.30
- 17.00: Exercises in germline variant classification (Carolina Barra
Quaglia)
15.00 - 15.15: Coffee Break
Preparation:
Whole
genome sequencing in clinical practice
Slides:
Welcome
Bioinformatics
pipelines + Burrows Wheeler transformation
Variant
Not Found
Material:
Burrows-Wheeler
transformation explained (youtube video)
Extra Material on Protein Structure
Visualisation:
Pymol
software
Pymol basics
tutorial
Pymol wiki
Pymol
license file
Bioinformatics resources and databases:
ClinVar
Virtual
Ribosome
UniProt
Variant effect
predictor
AlphaFold
String
Exercise:
Burrows-Wheeler
Transformation and GATK - answers
Variant
not found - answers
Curriculum summary
Out of the material covered
today, the following is in the curriculum for the exam:
* The
following file formats: fastq, bam, sam, cram, vcf, vcf.gz (what’s in
them)
* Of the file formats above, which should you store for your
own data and why?
* The bioinformatics database and web services
listed above (from a basic usage perspective)
When: April 21, 9.00-17.00
Where: Building 202, room R1014
Program:
9.30 - 10.00: Exercise summary
10.00 - 10.15: Coffee Break
10.15 - 11.15: Clustering +
Classification algorithms (Carolina Barra Quaglia)
11.15 - 12.00:
Microbial genomics in personalized medicine (Rasmus Lykke Marvig)
12.00 - 13.00: Lunch
13.00 - 17.00: Exercises in variant
interpretation, clustering and classification (Carolina Barra
Quaglia)
15.00 - 15.15: Coffee Break
Social Dinner Event
Tuesday 21 April, 19hs
RizRaz, Store Kannikestræde 19, 1169 København
Slides:
Klinisk
variant klassificering
Clustering
Classification
Talk:
Microbial Genomics
Material:
Principal
component analysis explained
Paper describing
bioinformatics pipelines for molecular subtyping of cancer
Paper:
“Delivering precision oncology to patients with cancer”
Paper
describing machine learning algorithms for breast cancer classification
(very technical paper, but it provides a good overview - voluntary
reading)
Exercise:
Clustering
and classification
R
script
R
data
Curriculum summary
Out of the material covered
today, the following is in the curriculum for the exam:
* Euclidean
distance (and knowing that there are other distance metrics out
there)
* Basic concept of data distributions
* Hierarchical
agglomerative clustering
* knn classification
* Distance to
centroid classification
* The fact that classifiers have tweakable
parameters (e.g. the “k” in knn) and how to decide on the best
settings
* Principal component analysis (in broad strokes, not the
math of it)
When: April 22, 9.00-17.00
Where: Building 202, room R1014
Program:
9.00 - 9.30: Exercise summary
9.30
- 10.00: Immunoinformatics and CAR T cells
10.00 - 10.15: Coffee
Break
10.15 - 11.15: Bioinformatics tools for assessing CAR targets
(Lars Rønn Olsen)
11.15 - 12.00: Exercises in CAR target assessment
(Carolina Barra Quaglia)
12.00 - 13.00: Lunch
13.00 - 17.00:
Exercises in CAR target assessment (Carolina Barra Quaglia)
15.00 -
15.15: Coffee Break
Slides:
Online
bioinformatics tools for assessing CAR therapy targets
Material:
Blog
post describing the prediction of signal peptides
Overview
of post-translational modifications
Paper
describing the importance of target isoforms in CAR cell therapy
Introduction
to Xenabrowser
Bioinformatics resources and databases:
Xenabrowser
Clustal Omega
(Protein) BLAST
DeepLoc
SignalP
TopCons
BepiPred
NetPhosP
UniProt
Exercise:
Evaluating
targets for chimeric antigen receptor therapy
Curriculum summary
Out of the material covered
today, the following is in the curriculum for the exam:
* The FASTA
format and why we use it
* The purpose of the web servers and
databases listed above (if you use any of them for your final project,
you will also need to be able to provide a high level explanation of how
it works)
When: April 23, 9.00-15.00
Where: Zoom link
Program:
9.00 - 9.30: MHC binding and T cell
epitopes (Carolina Barra Quaglia)
Exercise: Making Sequence
logos
10.00 - 10.30: MHC binding predictions (Carolina
Barra Quaglia)
Exercise: Calculate the weights of a neural
network
11.00 - 11.30: MHC binding predictions and medical
applications (Carolina Barra Quaglia)
Exercise: IEDB
database search
12.00 - 14.00: Lunch + Exercise T cell
epitope prediction (off-line time)
14.00 - 15.00:
Walk-through T cell Epitope prediction Exercise (Carolina Barra
Quaglia)
Slides:
Predicting
T cell epitopes
Material:
T
cell epitope prediction book chapter
IEDB
Lecture
Bioinformatics resources and databases:
Sequence
Logo generator
Predictor
of MHC-I presented peptides
Predictor
of MHC-II presented peptides
Allele
Frequencies
Exercise:
Making
sequence logo
Neural
network exercise
Immune
Epitope Database exercise
Prediction
of T cell epitopes - answers
Curriculum summary
Out of the material covered
today, the following is in the curriculum for the exam:
* What is
visualized using logo plots and what do the axes of the plot mean
*
How are HLA binders predicted? (General method and training data)
*
What is the content of the immune epitope database and what is the
source of these data?
When: April 27, 9.00-14.00
Where: Self study and Zoom
Program:
9.00 - 10.00: Video lecture: Pairwise
alignment summary + BLAST
10.00 - 12.00: Exercise: BLAST
13.00 - 14.00: Q&A time on Zoom
Slides:
Blast
updated slides
Exercises
Pairwise
alignment - answers
BLAST
- answers
Material:
Pairwise
alignment tool on google colab
Blast
exercise in Python using google colab
Curriculum summary
Out of the material covered
today, the following is in the curriculum for the exam:
* What is
achieved using pairwise sequence alignment
* What is achieved using
BLAST?
* What is an e value?
* What is % identity?
When: April 28, 9.00-14.00
Where: Self study and Zoom
Program:
9.00 - 10.00: Video lecture: Multiple
sequence alignment + Phylogenetic trees
10.00 - 12.00: Exercise:
building phylogenetic trees
13.00 - 14.00: Q&A time on Zoom
Slides:
Phylogeny
slides
Exercises
Phylogeny
- answers
Hand-out
answers
Material:
Tree
building help manual
Understanding
evolutionary trees
Curriculum summary
Out of the material covered
today, the following is in the curriculum for the exam:
* What is
achieved using multiple sequence alignment
* What are the steps for
building a phylogenetic tree
* Trees can be used to investigate
evolutionary relationships, how does this tool relates to hierarchical
clustering seen on day2?
Introduction
to project work
When: April 29 - May 25
Where: Self-organized
Program:
Q&A times on Zoom
TBD
When: Submission of written assignment in Digital
Eksamen: 12:00 noon, Monday, May 25, 2026
When:
Oral exam May 29, 9.00-12.00 (exact program TBD)
Where: Online