Download The amino acids

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Fatty acid synthesis wikipedia , lookup

Ribosomally synthesized and post-translationally modified peptides wikipedia , lookup

Fatty acid metabolism wikipedia , lookup

Two-hybrid screening wikipedia , lookup

Catalytic triad wikipedia , lookup

Protein–protein interaction wikipedia , lookup

Nucleic acid analogue wikipedia , lookup

Western blot wikipedia , lookup

Point mutation wikipedia , lookup

Peptide synthesis wikipedia , lookup

Nuclear magnetic resonance spectroscopy of proteins wikipedia , lookup

Protein wikipedia , lookup

Homology modeling wikipedia , lookup

Metalloprotein wikipedia , lookup

Metabolism wikipedia , lookup

Proteolysis wikipedia , lookup

Genetic code wikipedia , lookup

Biosynthesis wikipedia , lookup

Amino acid synthesis wikipedia , lookup

Biochemistry wikipedia , lookup

Transcript
Applied Bioinformatics
The amino acids
Overview
• Proteins (sneak preview)
– Primary structure
– Secondary structure
– Tertiary structure
• The amino acids
– One amino acid
– Our first protein
– A closer look at the amino acids
– Secondary structure preferences
Our goal for today:
(a little) understanding of the
relation between amino acids and
protein structure
Proteins
• Primary structure
– A.K.A. “the sequence”
• Secondary structure
– Short stretches form distinct ‘substructures’
• Helices
• Sheets
• Turns & Loops
• Tertiary structure
– The arrangement of secondary structure elements with
respect to each other
Primary structure
Proteins are polymers.
The amino acid sequence (also called primary structure) of a protein is
the order of the amino acids (the monomers) in the protein chain.
The sequence is always read from the
the protein.
to the C-terminus of
For example:
-Lys-Val-Phe-Ala-Met-Cys-Leu-Leu-Arg-Val-COO-
Proteins
• Primary structure
– A.K.A. “the sequence”
• Secondary structure
– Short stretches form distinct ‘substructures’
• Helices
• Sheets
• Turns & Loops
• Tertiary structure
– The arrangement of secondary structure elements with
respect to each other
Secondary structure - α helix
Secondary structure – β sheets
Secondary structure – turns & loops
Proteins
• Primary structure
– A.K.A. “the sequence”
• Secondary structure
– Short stretches form distinct ‘substructures’
• Helices
• Sheets
• Turns & Loops
• Tertiary structure
– The arrangement of secondary structure elements with respect to
each other
Protein structure (secondary & tertiary) is
largely determined by its primary structure.
When you understand the amino acids, you
understand everything
The amino acids
A short introduction
One amino acid
- Cα is at the heart of the amino acid
- Cα, C N and O are called backbone atoms
- R can be any of the 20 side chains
Our first protein
The 20 amino acids
A
C
D
E
F
G
H
I
K
L
M
N
P
Q
R
S
T
V
W
Y
Ala
Cys
Asp
Glu
Phe
Gly
His
Ile
Lys
Leu
Met
Asn
Pro
Gln
Arg
Ser
Thr
Val
Trp
Tyr
Alanine
Cysteine
Aspartic acid (Aspartate)
Glutamic acid (Glutamate)
Phenylalanine
Glycine
Histidine
Isoleucine
Lysine
Leucine
Methionine
Asparagine
Proline
Glutamine
Arginine
Serine
Threonine
Valine
Tryptophan
Tyrosine
The 20 amino acids
The side chains, R, determine the differences in the structural and chemical
properties of the 20 ‘natural’ amino acids.
The 20 amino acids can, for example, be classified as follows:
Hydrophobic
Aliphatic
Aromatic
Ala, Leu, Ile, Val
Phe, Tyr, Trp, (His)
Hydrophilic
Polar
Alcoholic
Charged
Asn, Gln
Ser, Thr, (Tyr)
Arg, Lys, Asp, Glu, (His)
Inbetween:
Sulfur-containing
Special
Met, Cys
Gly (no R), Pro (cyclic)
Several amino acids belong in more than one category.
•There are many ways to characterize the properties of amino acids. The ones
most useful and most commonly used are:
•Hydrophobicity
•Size
•Charge
•Secondary structure preference
•Alcoholicity
•Aromaticity
•And on top of that there are some special characteristics like bridge forming
by cysteines, rigidity of prolines, titrating at physiological pH of histidine,
flexibility of glycines, etc.
Aromatic
Charged
Sulfur - containing
Really special
Cysteines are extra special
Key points about the character of
amino acid side chains
• amino acids don’t fall neatly into classes--they are different
combinations of small/large, charged/uncharged,
polar/nonpolar properties
• the properties of a residue type can also vary with
conditions/environment
Secondary structure preferences
Obviously, there are relations between the
physico-chemical characteristics of the amino
acids and their secondary structure
preference.
Chou Fasman parameters
• Take all protein structures
• Calculate for each secondary structure type how many amino acids
are in that structure type (in % of all amino acids)
• Calculate for each amino acid type the distribution across secondary
structure types (in % of all amino acids of that type)
• Calculate the preference score
Chou Fasman parameters
• Say your dataset is 1000 amino acids and 350 of them are in alpha-helix
conformation.
• This is 35%.
• There are 50 Alanines in your set and 25 of them are in alpha-helix
conformation.
• This is 50%.
• The helix preference parameter P for Ala is 50/35=1,43
Alanine
Arginine
Aspartic Acid
Asparagine
Cysteine
Glutamic Acid
Glutamine
Glycine
Histidine
Isoleucine
Leucine
Lysine
Methionine
Phenylalanine
Proline
Serine
Threonine
Tryptophan
Tyrosine
Valine
helix
1.42
0.98
1.01
0.67
0.70
1.39
1.11
0.57
1.00
1.08
1.41
1.14
1.45
1.13
0.57
0.77
0.83
1.08
0.69
1.06
strand
0.83
0.93
0.54
0.89
1.19
1.17
1.10
0.75
0.87
1.60
1.30
0.74
1.05
1.38
0.55
0.75
1.19
1.37
1.47
1.70
turn
0.66
0.95
1.46
1.56
1.19
0.74
0.98
1.56
0.95
0.47
0.59
1.01
0.60
0.60
1.52
1.43
0.96
0.96
1.14
0.50
Chou Fasman parameters
•
Take home message:
• Preference parameter > 1.0
 specific residue has a preference for the specific secondary structure.
• Preference parameter = 1.0
 specific residue does not have a preference for, nor dislikes the specific
secondary structure
• Preference parameter < 1.0
 specific residue dislikes the specific secondary structure.
©CMBI 2006
Secondary structure - helix
Secondary structure - helix
•
Helices pack because of the hydrogen bonds and because of the hydrophobic
packing of side chains along the length of the helix.
•
Certain residues do this hydrophobic packing better than others, and those
residues are thus good for a helix.
Remember: AMELK
Secondary structure - strands
Secondary structure - strands
•
Also strands pack because of the hydrogen bonds between the strands and
hydrophobic packing of side chains along the length of the strand.
•
Certain residues do this hydrophobic packing better than others, and those
residues are thus good for a strands. b-branched residues (Ile, Thr, Val) are very
good for strands, and so are the large hydrophobic residues.
•
Remember: VITWYF
Secondary structure - turn
Secondary structure - turns
•
To create a turn the backbone needs to be bent pretty sharply, and some residues
are really good at that.
•
Glycine is special because it is so flexible, so it can easily make the sharp turns and
bends needed in a b-turn. Proline is special because it is so rigid; you could say
that it is pre-bent for the turn. Aspartic acid, asparagine, and serine have in
common that they have short side chains that can form hydrogen bonds with the
own backbone. These hydrogen bonds compensate the energy loss caused by
bending the chain into a turn
•
Remember: PSDNG
Hydrophobicity
When hydrophobic
objects come together in
water, the number of
unhappy waters go
down, and that is good
for stability.
Free waters are happy
waters.
Hydrophobicity
Hydrophobicity is the most important characteristic of amino acids. It is the
hydrophobic effect that drives proteins towards folding.
Actually, it is all done by water. Water does not like hydrophobic surfaces. When a
protein folds, exposed hydrophobic side chains get buried, and release water of its sad
duty to sit against the hydrophobic surfaces of these side chains.
Water is very happy in bulk water because there it has on average 3.6 H-bonds and
about six degrees of freedom.
So, whenever we discuss protein structure, folding, and stability, it is all the entropy of
water, and that is called the hydrophobic effect.