Download LS1a Fall 2014 Lab 2: Computer Modeling of Proteins with PyMOL

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

G protein–coupled receptor wikipedia , lookup

Ancestral sequence reconstruction wikipedia , lookup

Nucleic acid analogue wikipedia , lookup

Magnesium transporter wikipedia , lookup

Interactome wikipedia , lookup

Ribosomally synthesized and post-translationally modified peptides wikipedia , lookup

Western blot wikipedia , lookup

Peptide synthesis wikipedia , lookup

Point mutation wikipedia , lookup

Metabolism wikipedia , lookup

Protein–protein interaction wikipedia , lookup

Homology modeling wikipedia , lookup

Two-hybrid screening wikipedia , lookup

Protein wikipedia , lookup

Genetic code wikipedia , lookup

Amino acid synthesis wikipedia , lookup

Metalloprotein wikipedia , lookup

Biosynthesis wikipedia , lookup

Proteolysis wikipedia , lookup

Biochemistry wikipedia , lookup

Transcript
LS1a Fall 2014
Lab 2: Computer Modeling of Proteins with PyMOL
Goals: The objectives of this lab are to provide you with a better understanding of:
1. how to use PyMOL at a basic level.
2. how to translate a two-dimensional chemical structures to a three-dimensional chemical model.
3. how to relate different three-dimensional representations used by scientists to depict biological
polymers.
4. the two types of protein secondary structure: alpha (α)-helices and beta (β)-sheets
5. how the exceptional polarity of peptide bonds contributes to the stability of secondary structure.
6. how the properties of individual amino acids influence the structure of proteins
Required Safety Regulations and Lab Etiquette
 Wear lab coat and safety glasses at all times
 Wearing appropriate clothes that protect the leg, foot, and ankle (e.g., jeans, socks, and sneakers)
 No eating or drinking in lab
[Points can be deducted from your post-lab assignment for not wearing the appropriate clothes, not
wearing the required personal protective equipment, or leaving a mess.]
This laboratory is a computer-based exercise that will help you to become more comfortable with
viewing two- and three-dimensional images of amino acids and proteins using a software program
called PyMOL (it’s also the same software that is used to make a lot of images look at throughout the
course).
You can complete this lab by reading the instructions as you progress through the different exercises.
Once you become comfortable using PyMOL, answer the different questions on the answer sheets
provided in lab as you move along.
Your answer sheet will be due a week from today’s lab, and the PyMOL files you will be using for today’s
lab will be placed online Friday after 4pm if you would like to finish your assignment after section is
over.
Introduction
As you have learned in lecture, some of the most important molecules associated with life are
“polymers.” Polymers are chains composed of many repeating units called “monomers” that are
covalently linked together. You can think of monomers as chemical building blocks. Different types of
polymers are made of different types of monomers. For example, nucleotides are the monomeric
building blocks of the polymer DNA; just as amino acids are the monomeric building blocks of
polypeptides. (The distinction between a “protein” and a “polypeptide” is largely semantics, but
occasionally a protein will consist of multiple polypeptides, as we will see in the case of hemoglobin.)
Figure 1: A polymer is a string of monomers covalently attached together.
1
Amino acids are the monomer “building blocks” of proteins; conversely, proteins are strings of amino
acids covalently attached to each other. Different proteins have different amino acid sequences and the
shape and function of a protein depends on the order and arrangement of its amino acid building
blocks.
In order to understand how proteins work, we must first understand what they look like. Using methods
such as X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy (both of which are
beyond the scope of this course), scientists are able to determine the distinct three-dimensional
arrangement of atoms within macromolecules which are otherwise waaaaaaaay too small to be seen
with even the most powerful microscopes. Scientists can then use this structural information about a
protein to make educated guesses about how the protein works, based on the relative locations of
certain molecular components. This information is vital to improving our understanding of healthy and
diseased cells at a molecular level, and for identifying potential drug targets, as we will see later in the
course using the examples of HIV and Chronic Myeloid Leukemia (“CML”).
III. Procedure
This lab is composed of three sections, each which deals with a different aspect of amino acid and
protein chemistry and structure. The lab questions starting in Section 2 are highlighted with a “►”
symbol in the left margin of the page and should be answered as you work your way through the
different exercises. Answer your questions on the question sheet that is due in section a week from
today.
Section 1.1: Introduction to PyMOL
A) Basic Manipulations
To begin using PyMOL, pair up with a buddy at a computer and do the following:
 Login to the “Students” account using the password “g3n0m3.” Shh, it’s a secret ;)
 Open the “Life Sciences Labs” folder on the bottom.
 Click on the file that says “LS1a_PyMOL_Lab2.zip.” This will automatically decompress the file and
create a “LS1a_PyMOL_Lab2” folder. This folder has all the appropriate files for us.
 Open the “LS1a_PyMOL_Lab2” folder
 Click on the “1_Amino_Acid_Basics” folder and then open the “Amino_Acids.pml” file.
After clicking on “Amino_Acids.pml,” PyMOL will automatically open have loaded up into two windows:
an “external” interface on top, and a larger “viewer” window below it. We’ll be focusing mostly on the
lower “viewer” window, which you can maximize by clicking the green “+” button at the top left corner
of the window.
The atomic structure of the amino acid (L)-valine should appear in the viewing window and look
something like this:
2
Change the orientation and zoom of the molecule by clicking on it with your mouse (playing around
with this program is the best way to use it to learn and highly encouraged). The basic mouse controls
are as follows:




Rotate (= left mouse button): To rotate the molecule, click on the left mouse button and move the
mouse left, right, up or down.
Move (= middle button): To “move” an object on the screen, you must click on the object using the
wheel as a button. Make sure to hold the button down as you move the mouse and try to not turn
the wheel. Turning the wheel simply controls the “Z-axis clipping plane” for the viewer. If you rotate
the wheel by accident and clip the image, turn it back in the other direction until the entire
molecule reappears.
Zoom (= right mouse button): Click on the right mouse button and move the mouse up or down in
order to zoom in or out on the molecule. Go easy at first! It doesn’t take much to “lose” the
molecule if you go too far.
Type “reset” in one of the text entry panels if anything ever gets too crazy, or just close and open
the .pml (or .pse) file again.
It takes a little practice to feel comfortable manipulating molecules, but it’s pretty easy once you
remember what each button does. Practice using all three functions; once you have a feel for how to
manipulate the image, move on to the next section. If you ever lose the molecule, just close the
program and click on the “Amino_Acids.pml” file again.
b. Turning the Objects On/Off
On the right side of the viewer window, you will notice a series of rectangular buttons with names in
them, as well as a series of smaller square buttons ([A], [S], [H], [L], and [C]) next to each rectangular
button:
Each of the rectangular buttons controls an “object” in the viewer. By pressing on the button, you can
make the object appear or disappear. For example, clicking on “L_Valine” will make the amino acid
disappear completely; clicking on it again will cause the amino acid to reappear.
You will not be using the small square buttons for today’s lab ([A], [S], [H], [L] or [C]). If you’d like to
learn more about using what they do, you can visit the company’s website at PyMOL.org or a helpful
PyMOL wiki page. (Briefly: “A” stands for “action,” “S” stands for “show,” “H” stands for “hide,” “L”
stands for “label,” and “C” stands for “color.” Clicking on any of them produces a drop-down menu.)
3
Section 1.2: Amino Acid Basics
Let’s begin with the basic building block of proteins: the amino acid. There are 20 standard naturally
occurring amino acids that our cells use as the monomers to construct proteins. The one we will begin
with is valine.
First, make sure that the object “Valine” is turned on and all other rectangular buttons are turned off.
The particular representation currently being used to show (L)-valine is called “sticks.” Carbon atoms
are shown in green, oxygen atoms in red, and nitrogen atoms in blue.
The alpha (α)-carbon
These 20 molecules are all called “amino acids” because all 20 of them contain an amino group and a
carboxylic acid group (nifty). A third group, called the side chain (denoted below by “R”), is different for
each of the 20 amino acids (in fact, it is the only difference between each of the 20 natural amino acids).
In amino acids, the carboxylic acid group, the amino group and the side chain group are all connected to
the Cα carbon.
Amino acid
α-carbon (“Cα”)
Carboxylic acid
Amine
Can you identify the Cα carbon in the valine structure shown on the computer? Once you think you have
identified it, do the following:
 Turn off “Valine” and turn on “Ca_Carbon” This will highlight the Cα carbon in yellow.
Hydrogens
Notice that PyMOL does not explicitly show any hydrogen atoms as part of the valine molecule. The
hydrogens are typically implied in all of the 3-D structures you will be viewing (just as they are implied
by many of the 2-D standard line drawings we look at), so it is worthwhile getting used to looking at
molecules this way. (The lack of hydrogens is because most of our molecular structures have been
determined using X-ray crystallography, a technique that can detect carbon, nitrogen, oxygen,
phosphorous, and sulfur, but not hydrogen because hydrogen is too small.)

Turn off “Ca_Carbon” and turn on “Hydrogens”
This gives you a better sense of where the hydrogen atoms are located in the structure. Notice that
three hydrogen atoms are bound to the nitrogen, while no hydrogen atoms are bound to either of the
two carboxyl-group oxygens. This is the way valine would exist in solution at cellular pH (pH
approximately 7.4):
Generic amino acid at physiological pH
Notice that because PyMOL does not include typically hydrogens, it can also not tell you anything about
whether atoms are formally charged.
4
Notice that if you rotate the molecule such that the nitrogen is on the left, the carboxyl group is on the
right, and side chain is pointing towards the top, then we can observe the amino acid’s stereochemistry:
the L-configuration of the amino acid describes the side chain pointing out towards towards you as the
hydrogen points away from you:
Figure 2: Stereochemistry of a L-valine, notice how the side chain points “out of the page”
towards you when the molecule is rotated such that the hydrogen points “into the page” away
from you. On the left is the two-dimension line drawings we usually use to represent L-valine, on
the right is a PyMOL image of the same molecule in the same orientation.
Double Bonds
You might also have noticed that double bonds are not distinguished from single bonds in the PyMOL
molecule you are viewing. Again, this will be a common theme for the structures that you will be
viewing in this lab. In certain cases it may indeed be useful to see the double bonds, and PyMOL is
capable of showing them.


Turn off “Hydrogens”
Turn on “Double_Bond”
Here you can see how PyMOL distinguishes single bonds from double bonds using the “sticks”
representation. In this case, a double bond can be seen between the carbon atom and one of the
oxygen atoms of the carboxylic acid group.
Using different representations to show structures
Although the “sticks” representation is useful for viewing the atomic connectivity and the geometry of a
molecule, it is less useful at showing the space occupied by the electrons that surround the nuclei of
different atoms within the molecule. A different type of representation (“Mesh”) can be used to show
the volume that the molecule occupies.


Turn off “Double_Bond”
Turn on “Hydrogens” and “Mesh”
This representation shows a mesh that depicts the approximate boundary of electron density around
the different atoms in the molecule. Notice how the space occupied by electrons is not at all apparent
from a simple sticks model.

Turn on “Space_Filling”
5
Here we see another, more realistic view of the relative space occupied by each atom in the molecule.
However, notice that now it becomes much more difficult to see how atoms are connected to each
other and how bonds are oriented relative to adjacent bonds.

Turn off “Mesh”
Scientists use these different representations (sticks, space filling, mesh, etc.) depending on their
particular needs when studying a molecule. Note that each representation has its own advantages and
disadvantages. For example, while the “sticks” representation provides a rather detailed look at a
macromolecule’s structure at the atomic level, it is difficult to discern the general macromolecular
shape (the structure has too much “information” in it, so to speak). This is particularly true when
viewing structures that are much larger than just a single amino acid.
Section 2: Protein Basics
In this section, we will be examining the basics of protein structure. The specific protein we will be
looking at is hemoglobin. You may be already familiar with hemoglobin; it is a protein found
prominently in red blood cells and is responsible for carrying and distributing oxygen throughout your
body.



Close your current PyMOL viewer window to exit the program.
Return to the “LS1a_PyMOL_Lab2” folder and open the “2_Protein_Basics” folder.
Click on the “Protein_Basics.pml” file. This will automatically open up the PyMOL program and load
the coordinates for hemoglobin.
Hemoglobin consists of four polypeptide chains. We will begin looking at a single amino acid of one of
the polypeptide chains and build our way up until we visualize the entire protein. In that vein, this file
begins where you left off with the amino acid valine in the now familiar “sticks” representation. In this
case, valine is simply referred to as “Amino_Acid” in the list of objects (rectangular buttons) on the
right.

Turn off “Amino_Acid” and turn on “Tripeptide.” You may need to zoom out a bit and move the
structure back to the center of the screen using the appropriate mouse buttons.
Here we can see how amino acids link together. Note how the carboxylic acid group from one amino
acid is used to link to the amine group of another to form what is called an “amide” (or a “peptide”)
bond. In this tripeptide, a lysine is bound to the N-terminal end (the side that ends with the amine) of
the valine and an alanine is bound to the C-terminal end (the side that ends with the carboxylic acid) of
the valine. Typically, we write peptide sequences from the N-terminal end to the C-terminal end, so this
example tripeptide would be written as Lys-Val-Ala.
► Question 1

Turn off “Tripeptide” and turn on “Primary.” Feel free (as always) to play around with the molecule.
You may need to readjust the frame to see the entire structure.
We are now seeing even more of one of hemoglobin’s polypeptide chains. The sequence of amino acids
in a polypeptide (or a protein) is known as either the primary sequence or the primary structure. The
primary sequence for this particular segment is:
6
Nt-Ser-Ala-Gln-Val-Lys-Gly-His-Gly-Lys-Lys-Val-Ala-Asp-Ala-Leu-Thr-Asn-Ala-Val-Ala-His-Val-Ct,
or using the one-letter code: Nt-SAQVKGHGKKVADALTNAVAHV-Ct. The terms “Nt” and “Ct” stand for
“amino-terminal end” and “carboxy-terminal end,” respectively.
[A quick note about terminology: the difference between the term “polypeptide” and “protein” is the
scale. Proteins are often much larger than polypeptides, and the term polypeptide often refers to a
small portion of a protein. Both peptides and proteins are polymers of amino acids connected by
peptide bonds. The terms “polypeptide” and “protein” are sometimes used synonymously. Some key
differences are: 1) proteins are polypeptides that perform specific functions inside cells; and 2) some
proteins, such as hemoglobin, consist of several polypeptide chains.
In addition, amino acids present within a protein are commonly referred to as residues. Thus, the
phrase “N-terminal residue” refers to the amino acid that is present at the N-terminus of the protein (or
peptide). This is because the synthesis of a peptide bond between two amino acids is a “condensation
reaction” as it releases a water molecule. The bound amino acids are therefore the “residue” left after
the loss of a water molecule.]
We can simplify this structure by removing the amino acids’ side chains but keeping the “backbone”
atoms:

Turn off “Primary” and turn on “Backbone”
You may notice that the peptide backbone is starting to take on a helical shape. The term “peptide
backbone” typically applies to the Cα carbon atoms and the atoms involved in the peptide bonds, but
excludes the atoms in the side chains. The backbone view shown here excludes the side chains (the “R”
groups) that are normally attached to the Cα carbon atoms. The cartoon representation further helps to
simplify and clearly indicate the helical structure of the backbone.

Turn on “Secondary”
This cartoon represents an alpha (“α”)-helix, an example of secondary structure. Notice how the helical
cartoon lacks any detailed depictions of the atoms that form the structure. Such simplified diagrams are
often used to show the overall shapes of proteins to avoid introducing too much detail into the picture.
We will discuss specific types of secondary structures in the next segment.
To see how the amino acid side chains are oriented along the helical backbone, do the following:

Turn off “Backbone” and turn on “Helix_Sidechains”
“Helix_Sidechains” shows the orientation of the side chains relative to the helix, which are pointed
outwards rather than inwards.

Turn off “Secondary” and “Helix_Sidechains”, then turn on “Tertiary.” Zoom out to see the whole
polypeptide chain.
If we combine all of the secondary structural elements included within a polypeptide, we get a large,
compact structure called the tertiary structure of the protein. The shape of each polypeptide is unique
and depends upon the polypeptide’s unique amino acid sequence. The protein’s shape allows it to carry
7
out its particular function with great efficiency; the shape of the protein is optimized for the protein’s
required function within (or outside of) the cell.

Turn on “Full_Backbone” (and keep “Tertiary” on)
Notice how well the backbone aligns with the “cartoon” representation of the tertiary structure. Again,
the cartoon is a convenient simplification of the overall structure of the protein. Let’s see what the
entire protein looks like without simplification:

Turn off “Full_Backbone” and “Tertiary,” turn on “Sticks_Rep”
While showing the entire polypeptide as sticks describes the location of each atom in the polypeptide,
it’s quite a lot of information and not as obvious what we’re looking at. Let’s examine another type of
representation.

Turn on “Surface_Rep”
This is called a surface representation. This representation depicts the actual volume and shape the
protein occupies using the van der Waal radii of each atom. It provides a great sense of what the actual
surface of the protein looks like.
► Question 2
Now let’s see what happens when multiple polypeptide chains combine to form a single functional
protein:


Turn off “Surface_Rep” and “Sticks_Rep”
Turn on “Quaternary”
Each of these differently-colored polypeptide chains is virtually identical in structure and sequence;
sometimes, a protein consists of multiple polypeptide chains that come together to form one functional
unit.
Many proteins consist of just a single polypeptide chain. However, for proteins that consist of several
different polypeptide chains, their “quarternary” structure is the level of structure at which multiple
polypeptide chains—each with their own primary, secondary, and tertiary structures—fold together
into one functional unit.
► Question 3
As we mentioned earlier, hemoglobin acts as an oxygen carrier. But where does it carry the oxygen?
Some proteins need additional structural components to perform their duties. In many cases, proteins
utilize a small molecule, called a cofactor, to assist.

Turn on “Cofactor”
In this particular case, the cofactor is a porphyrin ring, a large ring with a metal ion in the center (here,
iron is used as the central ion). Notice that each chain has its own cofactor.
8
Now let’s see where the oxygen gets bound to the ring.

Turn on “Oxygen” and “Surface_Rep”
► Question 4
[Note: The oxygen molecules (O2) are shown using a surface representation for clarity, which makes
them appear much larger than the surrounding atoms in the porphyrin ring (which are shown with a
“stick” representation).]
At this point, it is worth stressing that the ability of hemoglobin to bind oxygen and its cofactors is
dependent on the structure of the protein itself. As you’ve seen, this structure is determined by the
sequence of amino acids that combine in a distinct order to form the protein. Therefore, the primary
sequence of a protein determines its structure, and its structure determines the protein’s function.
Section 3: Protein Secondary Structure

Close your current viewer, open up the folder called “3_Secondary_Structure,” and click on the file
called “Secondary_Structure.pse”. This will automatically open up the PyMOL program, and load up
the coordinates for a different protein called Top7.
In this brief section, we will be examining protein secondary structure, and how cartoons are used to
simplify visualizing protein structure. Since stick representation is a bit noisy, so let’s simplify the
protein to its backbone as we’ve done previously.

Turn off “Protein_Sticks” and turn on “Backbone”
α-helices
Eliminating the side chains alone helps simplify the structure and you can begin to see some of the basic
elements of secondary structure. Let’s examine each of these elements in greater detail.

Turn off “Backbone” and turn on “Helix_Sticks”
Now, these particular amino acids form helical secondary structures, exactly like those we saw in
hemoglobin. We can therefore simplify them in cartoon mode as helices. (By playing around with these
structures in stick representation, can you see how they are helical before turning on the cartoon
representation?) These types of structure are called α-helices, and are quite common in proteins. Here,
we’ve colored them magenta for clarity.
•
Turn off “Helix_Sticks” and turn on “Helix_Cartoon”
The cartoon representation is one of the most common ways that scientists portray α-helical regions
contained within proteins. Notice that in the cartoon representation, none of the atoms that comprise
the helix are shown. Instead, the helix cartoon outlines the shape of the peptide backbone and makes it
easier to view complex structures.
So what causes these regions to take on a helical shape? Part of the answer is that the amino acids are
able to form a series of hydrogen bonds that help to stabilize the helix. Certain amino acids favor this
secondary structure more than others.
9
To take a closer look at the hydrogen bonds that are important in stabilizing helices, we will focus on
one of the two helices shown in this protein.


Turn off “Helix_Cartoon”
Turn on “Alpha_Helix”
It may help to re-center the image on your screen so that you can rotate it more easily. To do so, rightclick on the mouse and choose the “center (vis)” command.
In this image, the hydrogen atoms that are attached to the backbone nitrogen atoms are depicted in
white. Also, the image specifically shows the double bonds that form between the carbon and oxygen
atoms in the backbone.
► Question 5
Now, let’s take a look at some of the hydrogen bonds that are important in stabilizing the helix. For
simplicity, only hydrogen bonds in a short stretch of the helix are shown (the helix contains more than
just the six H-bonds that are depicted here). To see the bonds:

Successively turn on “H_bond1” through “H_bond6”
► Question 6
It is important to recognize that the network of H-bonds that stabilizes the helix stems from the
interactions between backbone amine (-NH) and backbone carbonyl (-C=O) groups. The side chains of
the amino acids are not involved in the H-bond network shown here. To help illustrate this more clearly:

Turn on “Helix_Ribbon” and “Sidechains”
Here you can see that the side chains are not forming any of the hydrogen bonds that are important in
maintaining the shape of the helix.
► Question 7
In order to get a better sense of how the side chains are distributed along a helix, it is useful to view the
helix from a different perspective. Scientists sometimes use a depiction that shows the helix as viewed
from above (rather than from the side), which can be done by rotating the helix such that one is looking
straight down its center.
 Turn off “Helix_Ribbon” and “Sidechains” and “Alpha_Helix”
 Turn off all H_bonds
 Then, turn on “Top_Down_Helix” and “Top_Down_Cartoon” (make sure to re-center and re-zoom)
Now, rotate the Top_Down helix such that you are viewing it from above looking straight down its
center. Make sure that the N-terminal end of the helix is facing you. Remember, the amine (-NH)
groups point towards the N-terminal end, while the carbonyl groups (-C=O) point towards the Cterminal end.
10
Your screen should look something like this:
Once you have the N-terminal end facing you:
•
•
Turn off “Top_Down_Helix” (but keep the Top_Down_Cartoon visible), and
Turn on “With_Sidechains”
Your helix should now look something like the one shown below. Carbon atoms in the side chain are
shown in blue, oxygen in red, and Sulfur in yellow.
Scientists sometimes use a “helical wheel” diagram to represent a helix viewed from above. Such a
diagram is shown below on the right, where the side chains that are projecting from the helix are
labeled as R1, R2, etc. starting from the N-terminus.
► Question 8
11
The ways in which side chains are positioned along the different faces of an alpha helix can significantly
influence the manner in which a helix is positioned relative to other proteins, or other parts of the same
protein.
► Question 9
Beta Sheets
To look at another type of secondary structure, first do the following:


Turn off “Top_Down_Cartoon” and “With_Sidechains”
Then turn on “Sheet_Sticks” (zoom out & re-center if necessary)
These strands take on a different shape than those seen in helices. Rather than curling, these strands of
peptides form long, straight sheets when aligned with each other. Thus, we simplify them in cartoon
mode as beta-strands, which combine to form beta-sheets. We’ve illustrated the sheets (and the
strands) here in blue.

Turn on “B_Sheet”
Notice how strands are represented in cartoon mode using long, flattened arrows; the arrows provide
directionality for the strands. Since amino acids, by convention, are written from N-terminal end to Cterminal end, the arrows point from the N-terminal to the C-terminal direction of the peptide strands.
Similarly to α-helices, the driving force for formation of beta-sheets is the formation of stabilizing
hydrogen bonds. In this case, H-bonds form between the backbone –NH and –C=O groups of
neighboring strands (see picture below). Again, certain amino acid sequences favor β-sheet formation
over helices or loops.
In order for a beta-sheet to form, it needs several flat peptide strands (beta strands) to align together to
form a flat sheet. Since aligned strands can run “in the same direction” or “in the opposite direction,”
we describe beta sheets as being either parallel or antiparallel. The image above is an example of an
antiparallel sheet.
► Question 10
► Question 11: Follow the instructions on your question sheet to take two images of the beta sheet:
one of the hydrophobic face and one of the hydrophilic face.
12
Now let’s take a look at the protein segments that connect these secondary structure elements.

Turn off “Sheet_Sticks” and then turn on “Loop_Sticks”
These segments do not exhibit a uniform structure, as seen in the previous two examples. Rather, they
serve simply to connect the adjacent strands or helices together.

Turn on “Loops” and then turn off “Loops_Sticks”
Thus, we simplify these as loops, which are depicted as simple ribbons (in this case, yellow).

Turn on “C_Term_Sticks” and “N_Term_Sticks”
Note: You will need to scroll down the menu to see the box “N_Term_Sticks”.
In addition to loops, the N-terminal and C-terminal ends of the peptide strand are often lacking in
definite structure as well. These termini are generally depicted in a similar manner as loops, though
they don’t actually connect anything. Rather, they signify the “beginning” and “end” of the protein.



Turn off “C_Term_Sticks” and “N_Term_Sticks”
Turn on “C_Term” and N_Term”
Turn on “Helix_Cartoon” and “Sheet_Sticks”
► Question 12
And here, we now have our entire protein represented in secondary structure elements, via a cartoon
representation.
To see the protein as a single cartoon, turn off all the other elements and turn on “Protein” as follows:
 Turn off all elements by hitting the “(all)” button at the top; turn on “Protein” at the bottom.
Notice how the secondary motifs combine to generate a unique tertiary structure for the protein.
Figure 3 below is a schematic in which the secondary structural motifs are mapped out along the linear,
primary amino acid sequence. The protein begins at the amino-terminus (“N-terminus”) and concludes
at the carboxy-terminus (“C-terminus”). There are 97 amino acids total in the protein, and every 10
amino acids is ticked off along the length of the primary sequence.
Figure 3: A schematic listing the linear primary amino-acid sequence of the protein annotated with
the secondary structure of each region. The numbers on top indicate the amino acid position (there
are 97 total amino acids). The green arrows indicate β-strands and the red portions represent αhelices. The black lines between each secondary structural element are loops.
13
Section 4: Bulk Water
As a final exercise, you’ll have a chance to get a better sense of what scientists believe water may look
like as a solvent.

Close your current viewer window and open the folder titled “4_Bulk_Water” and then open the file
“Water.pml”
This particular set of coordinates is not from a solved atomic structure, but rather a “simulation” of how
scientists bulk water is thought to be organized in liquid form at any given instant.
Here we see water represented in “sticks” form with the hydrogens included (without the hydrogens,
the “sticks” representation looks rather silly as just a bunch of dots where the oxygens go).
Let’s view how these molecules primarily interact with each other:

Turn on “H_Bonds”
As you may recall, the ability of water to hydrogen-bond to itself is one of the reasons it is such an
important molecule to sustain life. Hydrogen bonds are similar in nature to normal dipole-dipole
interactions, but are considerably stronger.
Atoms are understood to be nuclei surrounded by “electron clouds.” Thus, it is sometimes helpful to
view molecules from a “spheres” representation, to get a better view of how the molecules fit together.

Turn off “Sticks” and “H_Bonds,” turn on “Spheres”
Notice how closely packed together the molecules are, from this perspective.

Turn “Spheres” off and turn “Sticks” and “H_Bonds” back on
Keep in mind that in liquid form, the molecules are in constant motion, rotating freely around each
other. This “free motion” is associated with the water’s entropy.
► Question 13
14