Antibody Structure Prediction and Analysis

From 22145
Jump to navigation Jump to search

Antibody modeling

Requirements: Pymol

Download the version for your OS, then use the license file provided by the teachers

Modeling of CLL antibody

From the rearrangements file, select the SI35 light and heavy chains. Translate the nucleic acid sequences to protein sequences (use a service of your choice, e.g. expasy or virtual ribosome).

Go to the Lyra web page.

Lyra is a tool for predicting antibody and TCR structures. It is possible to use Lyra in automated or expert mode.

Insert the light and heavy chains in the proper text area and submit. After a few seconds you will arrive to the output page, where the alignment and all the templates are displayed.

Q1: what is the sequence similarity of the best H and L templates for the framework (FW)?

Q2: What is the light chain type? Is it the most common in humans?

Focus on the sequence alignment used to generate the model.

Q3: where are the differences between target and template located? Which are the most drastic that could affect the modeling process (e.g. G->W)?

hint: you can use the Blosum62 matrix to estimate the expected effect of a substitution: the lower the number, the larger the expected impact

Q4: Do the loops/CDR have the same length in target and template? What does it mean, in terms of expected accuracy?

Now, look at the templates automatically selected by Lyra to build the final model

Q5: How many templates were used? Which is the region for which the template is more dissimilar to the target sequence?

Download the pdb file and open it with pymol. There is a small "S" bottom right, it shows/hides the sequence. Show the whole protein as cartoon and color by chain. Select and color in red all the CDRs. (the selected residues appear as the "sele" item in the menu)

Q6: What is the general shape of the ABS: flat, a pocket, a mountain or a groove?

Q7: Which loop is the longest and protruding the most? Is this the most common case?

Use the Domain Gap Align tool identify the somatic mutations in the V and J regions. Show the mutations as sticks (main chain and side chain) and color them green

Q8: Which Mutations are located inside/near the CDRs? Which are the ones that are most likely to have an impact on the binding of the antibody?

Generate the electrostatic potentials of the Ig (action -> generate -> vacuum electrostatic -> protein contact potentials).

Q9: Identify the residues in the CDRs that mostly contribute to the charge distribution.
highlight them in the sequence and look for charged residues such as K, R, D, E

Q10: (optional) Mutate some residues in order to have a positive charge in the middle of the antigen binding site, then recalculate the electrostatic potentials

Finished!