Protein Structure: Difference between revisions

From 22111
Jump to navigation Jump to search
Line 99: Line 99:
== Structure comparisons ==
== Structure comparisons ==


Proteins exhibiting the same fold may occasionally have similar function, especially in the case of enzymes. However, when the proteins have reached the same fold by convergent evolution (or diverged a very long time ago), such similarities are not always obvious from sequence comparisons alone. Here, we will compare RGAE with platelet-activating factor acetylhydrolase (PAFA) from domestic cow (''Bos taurus'') and have a look at their active sites. These two enzymes have similar hydrolytic functions and catalytic residues but have no obvious sequence similarity (advanced alignment tools will identify approximately 20% identical residues).
Now that we have a clear idea of where the epitope of the allergen is, we would like to compare the allergen with another closely related profilin protein in mammals, for example in cow, to see what the differences are.
 
'''Question 5.''' For that purpose, we will need to search for an homologous sequence to the one in our crystal structure, and guess which software we will use for that?
 
Yes, you have guessed correctly! And to avoid you spending some time on the webserver we have performed such a search for you and found that the Uniprot ID for that one is Q2NKT1.
 
Go to the Uniprot site and check on the available structures. You will notice that there are no experimental structures determined for this cow Profilin, however there are some predictions and models.  


⇒ First, fetch the structure for PAFA called 1WAB and open it in PyMOL along with the structure of RGAE. You will notice that the two structures are not aligned. To fix this, type the following:
align 1WAB, 1K7C
This will align the structure of PAFA (1WAB) with that of RGAE by moving the former. Navigate to the active site of RGAE
found previously and identify the residues in PAFA corresponding to the active site residues in RGAE.


'''<span style="color:#FF0000">Q7</span>'''
'''<span style="color:#FF0000">Q7</span>'''
The active site residues of PAFA are: Ser_____, His_____ and Asp_____. Does this correspond to the residue numbering in RGAE? Why/why not?


<blockquote style="background-color: lavender; border: solid thin grey;">
'''Question 6.''' Can you explain what is the main difference of the crystal structures, the predicted structure and the modelled structure with SWISS-MODEL? Which one do you think would be more reliable?
'''Hint:'''  
 
# if you color the active site amino acids in RGAE (1K7C) to something that is easy to recognize, it will be much easier to spot the overlap.
Pick one and download it. Now open the structure with PyMOL, in the same window where you have the previous complex. You can see that the new Profilin is in a random position in our screen. Scroll where you can see the sequence, which name has the new profilin in PyMOL.
# Sequence viewer mode will also be a big help here - when you click a AA side chain in the other structure, the corresponding position will light up in the sequence view.
 
</blockquote>
We will now try to align the sequences, to see where they are different and maybe explain why the cow profilin is not giving allergy.
 
To align the two chains, use the command ''align''.
 
 
'''Question 7.''' Provide a screenshot of the alignment. Can you see the differences of the two Profilins?
 
'''Question 8.''' Provide an hypothesis of why the cow profilin does not cross-react with the IgE antibody giving allergy to Hev b8.


<!--
<!--
== Making pretty pictures (an example) – NOT mandatory ==
== Making pretty pictures (an example) – NOT mandatory ==
Now that you have found the active site residues, it is time for you to make a nice image of the active site. Select the three residues by clicking them (they now all become marked with pink dots). Choose cartoon representation for the whole protein and a neutral colour for the protein (some pale colour). Show just the side chains of the active site residues (they will then be sticking out from the cartoon representation). Give them individual colors to make them stand out. Show also the sulfate ions in some suitable representation. Now zoom in on the active site and find a good orientation. Before generating the final image, set the background color to white by either selecting Display – Background – White on the menu or simply typing
bg white
on the command line. Finally, simply type
ray
on the command line or press the Ray button (right side of the command window) to generate a ray-traced image. Examples of such an image could look like the ones below (or figure 3 above). Different ray trace modes are available and will change the appearance of the final image.




Line 131: Line 133:




Change between the different ray trace modes by going to Setting – Edit All... and enter a number (0, 1, 2 or 3) in the box next to ray_trace_mode. You can achieve the same result by typing (for mode 3)
set ray_trace_mode, 3
on the command line. All other settings in PyMOL can be changed in this way. Try playing with them to change the way PyMOL behaves or the way images appear in the viewer window.
-->
-->



Revision as of 21:22, 6 October 2025

By Carolina Barra Quaglia

Overview

In this exercise you will learn how to

  • Search diferent protein databases to obtain protein structures.
  • Critically choose the best structure, when more than one is available.
  • Visualize a protein structure using PyMOL
  • Highlight features of interest and perform a basic alignment.

This exercise is written a bit different. Instead of the step wise exercise we are going to try a different method called inquiry-based learning, where students explore real-world problems and questions, taking ownership of their learning by asking questions, conducting research, and developing their own solutions and understandings.

A bit of background

The question of today’s exercise is about allergens.

Allergens are fascinating because they aren’t random proteins; they cluster into certain protein families that repeatedly trigger allergic sensitization in humans.

However, one very interesting allergen protein family is the profilins. Here’s why:

They are highly conserved across species.

Profilins are small actin-binding proteins found in almost all eukaryotic cells (plants, animals, fungi).

Despite their ubiquity and structural conservation, humans typically only become allergic to plant profilins (e.g., from birch pollen, grass pollen, the rubber tree, or certain fruits/vegetables).

Today we would seek to understand:

Why only plant profilins are giving an allergenic response despite their high conservation across species?

Getting started

We know that protein functions are closely correlated to their 3D structure, so in order to understand what is different in Profilin proteins from plants and mammals we are going to gather some experimentally determined protein structures from plant and mammal proteins.

First go to the Protein Data Bank (PDB) https://www.rcsb.org/

We are looking for and antibody of the class E (those are related to allergy) in complex with a plant allergen (in this case we will use Hevea brasiliensis, the rubber tree).

And search for IgE AND Hev b 8


We are looking for a protein structure complex, so dismiss all the search results that do not contain a protein complex including both the IgE AND Hev b 8.


Question 1. Look at the method X-RAY diffraction (Å). Which of the complexes is of better quality and why?

Visualization & PyMOL

You can visualize the structure directly at the PDB website using a browser-based viewer (buttons are found below the structure image), but we will use the viewer PyMOL for our purposes. It is an excellent viewer that can also be used to prepare publication-quality images of protein structures, and it is a very valuable tool when working with protein structures.

⇒ If you have not already, download PyMol from the web site and install it on your computer.

The program has three panels:

  • The viewer panel where the molecule will be displayed,
  • an "objects panel" on the right, with a list of all your objects with the pull-down menus to show (S) or hide (H) elements,
  • and the command-line panel in the bottom, where you can type commands.


⇒ Now select the best complex and load it into PyMOL using the fetch command.

(If you cannot solve question 1 use this alternate code to continue the exercise: fetch 7SBG)

You can toggle the object on and off by clicking on its name. Try this. To the right of the object name, there are five buttons: A(ction), S(how), H(ide), L(abel) and C(olor).


⇒ Remove the water molecules and cofactors (hetatm) used for making the crystal using the command remove (these are not part of the of the protein complex). If you show the sequence, you will notice that each of the chains have a name H and L for the antibody (heavy and light chain) and C for the allergen.

⇒ Colour the different chains of the antibody (IgE) in yellow and blue and the allergen (Hev b 8) in green.


Question 2. Provide a screenshot of the complex.


Figure 3. The active site in RGAE.


Let’s try to look at the surface of the interaction now. You can create an object called “antibody”, that contains both antibody chains (chain H+L) by the command select, and show as surface.


Question 3. Provide a screenshot of the interaction surface. Which secondary structures of the allergen Is the antibody interacting with: alpha helices, beta-sheets or loops?


Now we want to colour which specific residues of the allergen are in close contact with the antibody (exactly at a distance of 4Å). We will call this object the “epitope”. To define the distance, we can use the command around.


In order to select the full residues and not just the atoms of the allergen that are in direct contact we have to do a trick and select by residue using the command byres.


Hint: byres.(XX around 6) if we want to select all residues in a distance of 6Å to XX


Question 4. Now colour the epitope in red and provide a screenshot. How many residues can you count that are in close contact with the IgE antibody?

Structure comparisons

Now that we have a clear idea of where the epitope of the allergen is, we would like to compare the allergen with another closely related profilin protein in mammals, for example in cow, to see what the differences are.

Question 5. For that purpose, we will need to search for an homologous sequence to the one in our crystal structure, and guess which software we will use for that?

Yes, you have guessed correctly! And to avoid you spending some time on the webserver we have performed such a search for you and found that the Uniprot ID for that one is Q2NKT1.

Go to the Uniprot site and check on the available structures. You will notice that there are no experimental structures determined for this cow Profilin, however there are some predictions and models.


Q7

Question 6. Can you explain what is the main difference of the crystal structures, the predicted structure and the modelled structure with SWISS-MODEL? Which one do you think would be more reliable?

Pick one and download it. Now open the structure with PyMOL, in the same window where you have the previous complex. You can see that the new Profilin is in a random position in our screen. Scroll where you can see the sequence, which name has the new profilin in PyMOL.

We will now try to align the sequences, to see where they are different and maybe explain why the cow profilin is not giving allergy.

To align the two chains, use the command align.


Question 7. Provide a screenshot of the alignment. Can you see the differences of the two Profilins?

Question 8. Provide an hypothesis of why the cow profilin does not cross-react with the IgE antibody giving allergy to Hev b8.


PyMOL links