Download Determination of primary structure

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Bisulfite sequencing wikipedia , lookup

RNA-Seq wikipedia , lookup

Genomic library wikipedia , lookup

Human genome wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Point mutation wikipedia , lookup

Genetic code wikipedia , lookup

Metagenomics wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Helitron (biology) wikipedia , lookup

Genome editing wikipedia , lookup

Sequence alignment wikipedia , lookup

Genomics wikipedia , lookup

Transcript
9/30/16
Determination of primary structure
1) Purify protein
2) Determine the amino-acid composition,
including stoichiometry
3) Disrupt structure (2°, 3°, 4°, and disulfides)
4) Determine the number of peptide chains by
counting number of amino terminal ends
5) Divide into fragments and determine sequence
6) Divide into different set of fragments and
determine sequence
7) Determine overlaps and piece original sequence
back together
25
Disrupt structure (2°, 3°, 4°, and disulfides)
What holds these levels of structure together? …....non-covalent bonds (H-bonds,
van der Waals, ionic, hydrophobic)
What have you used in the lab that might disrupt non-covalent bonds? …....Urea, SDS, pH extremes, heat, etc.
What about the covalent S-S bond? …....2-mercaptoethanol (b-mercaptoethanol, BME) or dithothreitol (DTT).
Page 113
To keep disulfides from reforming…..
1) keep BME at high concentration in buffers
2) alkylate the SH groups
1
9/30/16
Determination of primary structure
1) Purify protein
2) Determine the amino-acid composition,
including stoichiometry
3) Disrupt structure (2°, 3°, 4°, and disulfides)
4) Determine the number of peptide chains by
counting number of amino terminal ends
5) Divide into fragments and determine sequence
6) Divide into different set of fragments and
determine sequence
7) Determine overlaps and piece original sequence
back together
27
Determine the number of peptide chains by counting
number of amino terminal ends Example:
Frederick Sanger
(1918-2013)
Sanger’s Reagent
Box 5-1
•
•
•
•
small tripeptide
AA comp = A,K,L
C-term = K
DNP-A
Sequence is A-L-K
Absorbs at 353 nm (yellow)
2
9/30/16
Determine the number of peptide chains by counting
number of amino terminal ends
Excitation at 330 nm
Emission at 519 nm
Figure 5-14
Determination of primary structure
1) Purify protein
2) Determine the amino-acid composition,
including stoichiometry
3) Disrupt structure (2°, 3°, 4°, and disulfides)
4) Determine the number of peptide chains by
counting number of amino terminal ends
5) Divide into fragments and determine sequence
6) Divide into different set of fragments and
determine sequence
7) Determine overlaps and piece original sequence
back together
30
3
9/30/16
Determination of primary structure
1) Purify protein
2) Determine the amino-acid composition,
including stoichiometry
3) Disrupt structure (2°, 3°, 4°, and disulfides)
4) Determine the number of peptide chains by
counting number of amino terminal ends
5) Divide into fragments and determine sequence
6) Divide into different set of fragments and
determine sequence
7) Determine overlaps and piece original sequence
back together
31
Proteolytic Cleavage
Divide into fragments
“TABLE 5-5” Specificities of Exopeptidase
O–
Carboxylpeptidase A Bovine pancreas
Rn = C-terminal; Rn-1 ≠ Pro
4
9/30/16
Divide into fragments
Cyanogen Bromide Cleavage
Methionine
Similar result as an endopeptidase, but leaves a homoserine lactone as the C-terminal residue
Figure 5-15
Copyright © 2016 John Wiley & Sons, Inc. All rights reserved.
Separation and isolation of peptide
fragments
Example: paper electrophoresis
Also, TLC, silica gel, etc.
to Ninhydrin
34
5
9/30/16
Determination of primary structure
• Determine amino-acid composition
• Dansyl chloride or FDNB to determine aminotermini and number
• Proteases: Cleaves peptide bonds only after specific
residues.
• Cleave protein with 2 different proteases.
• Sequence fragments with Edman degradation. Piece
together sequence from overlapping fragments.
35
Determination of primary structure
1) Purify protein
2) Determine the amino-acid composition,
including stoichiometry
3) Disrupt structure (2°, 3°, 4°, and disulfides)
4) Determine the number of peptide chains by
counting number of amino terminal ends
5) Divide into fragments and determine sequence
6) Divide into different set of fragments and
determine sequence
7) Determine overlaps and piece original sequence
back together
36
6
9/30/16
Determine the Sequence: Edman degradation
Edman Degradation Animated Figure
(https://wileyassets.s3.amazonaws.com/Voet_Fundamentals_of_Biochemistry_5e_
ISBNEPROF12533/media/animated_figures/ch05/5-15.html)
Figure 5-16
Copyright © 2016 John Wiley & Sons, Inc. All rights reserved.
Determination of primary structure
1) Purify protein
2) Determine the amino-acid composition,
including stoichiometry
3) Disrupt structure (2°, 3°, 4°, and disulfides)
4) Determine the number of peptide chains by
counting number of amino terminal ends
5) Divide into fragments and determine sequence
6) Divide into different set of fragments and
determine sequence
7) Determine overlaps and piece original sequence
back together
38
7
9/30/16
Determine overlaps
and piece original
sequence back
together
Wiley
Guided_explorations
Determination of primary structure
THREE basic ways to know the primary structure. Only the
CHEMICAL method will give the entire covalent structure, including
any disulfide bonds. But other methods are more sensitive. One
can classify these methods by:
CHEMICAL
PHYSICAL
BIOINFOMATICAL
We just went through the CHEMICAL.
The PHYSICAL method still requires the same strategy,
including purification, fragmentation, chromatography, and
alignment.
But, instead of an Edman degradation the use of
tandem Mass Spectrometry (MS) is employed.
Lets look at the use of MS in biochemistry
•
•
•
•
•
Ions “fly” in a vacuum toward a target with a velocity ∝ z/m (charge-to-mass ratio)
Molecules with higher charge and lower mass get detected first.
Molecules with a lower charge and higher mass get detected last.
Plotted as m/z to read peaks from left to right
Instruments can distinguish molecules with same charge by < 1 Da
8
9/30/16
Determine the Sequence: Tandem MS
The major problem in
using MS for
macromolecules is
getting them to “fly” in
a vacuum with a
charge.
TWO major methods:
1) Electro-Spray
Ionization (ESI)
2) Matrix-Assisted
Laser-Desorption
Ionization (MALDI)
ESI
Mass = (m/z) x z
893.3z = 849.7(z+1)
893.3z = 849.7z-849.7
849.7 = z(43.6)
19 = z
Mass = 893x19 = 16,967
Mass = 849.7x20 = 16,994
Mass = 1696.3x10 = 16,963
Figure 5-17
Determine the Sequence: Tandem MS
Matrix-Assisted LaserDesorption Ionization (MALDI)
MALDI
How do you use MS to get protein sequences?
9
9/30/16
Determine the Sequence: Tandem MS
• Mass spectrometry uses charge-to-mass ratio of
different ions to determine mass
• Tandem MS-MS: First selects a fragment, second
determines mass. By comparing all of fragments,
compare those that different by mass of one amino acid
to determine sequence
Determine the Sequence: Tandem MS
• EXAMPLE
AVAW
c3
c2
c1
+NH
3
CH3
a3
AA MW
Ala 90
Val 109
Trp 206
CH
CH3 CH3
CH3
CH
a2
a1
AWVA
199
296
AVAW
405
495
90
90
a3 c2
c1
199
a2
c3
a1
m0
289 296
405
495
206
a3 a2 c1
a1 c2
c3
m0
There is an issue with K & Q, which have the same MW!
10
9/30/16
Determination of primary structure
THREE basic ways to know the primary structure:
Edman Degradation requires >100 pmole (1-5 µg)
✔CHEMICAL
MS/MS requires >1-10 pmole (100-500 ng)
✔PHYSICAL
BIOINFOMATICAL
We just went through the CHEMICAL and PHYSICAL.
The BIOINFORMATICAL method requires information from
chemical or physical, but only a limited amount of sequence.
Example: a sequence of 6 AA is only possible as one of 206 possible
hexa-peptide sequences (1 of 64 x 106).
There are no more than 50,000 protein-coding genes with ≤400 AA on
average. This is ~20 x 106 possible unique sequences.
So, a hexamer is not likely to appear more than once.
Once you have at least 6 AA sequence, you can compare that to all
possible proteins encoded in the entirety of the gene sequences
(genome) for a species for which the genome is known using
appropriate bioinformatic tools. This will then give you the entire
protein sequence.
There is one remaining issue: Where are the Disulfides, if any?
…...This requires chemical and/or physical methods
Determination of primary structure
THREE basic ways to know the primary structure:
Edman Degradation requires >100 pmole (1-5 µg)
✔CHEMICAL
MS/MS requires >1-10 pmole (100-500 ng)
✔PHYSICAL
BIOINFOMATICAL
We just went through the CHEMICAL and PHYSICAL.
The BIOINFORMATICAL method requires information from
chemical or physical, but only a limited amount of sequence.
Example: a sequence of 6 AA is only possible as one of 206 possible
hexa-peptide sequences (1 of 64 x 106).
There are no more than 50,000 protein-coding genes with ≤400 AA on
average. This is ~20 x 106 possible unique sequences.
So, a hexamer is not likely to appear more than once.
Once you have at least 6 AA sequence, you can compare that to all
possible proteins encoded in the entirety of the gene sequences
(genome) for a species for which the genome is known using
appropriate bioinformatic tools. This will then give you the entire
protein sequence.
There is one remaining issue: Where are the Disulfides, if any?
…...This requires chemical and/or physical methods
11
9/30/16
Determine which Cys are in
Disulfide bonds
Cys
Cys
Cys
Wiley
Guided_explorations
Cys
Which of these
Cys is bonded
to…. This Cys
(https://wileyassets.s3.amazonaws.com/Voet_Fun
damentals_of_Biochemistry_5e_ISBNEPROF12
533/media/guided_explorations/ge_04_Protein_s
equence_determination/protein_sequence.html)
Cys
Figure 5-13
Determine which Cys are in
Disulfide bonds
Recall that in the disruption of –S-S- bonds is needed to separate polypeptides for
sequence analysis
And, that prevention of their re-oxidation can be done by alkylation.
=
O
HCOOH
Performic Acid
Oxidize disulfide, which
breaks -S–S- bond
O3–
Sulfonic
acids
+
– O3
Prevention of their re-oxidation can
be done by peroxidation as well.
Separate and sequence
the polypeptides
How do we use this to find the Disulfides?
12
9/30/16
Determine which Cys are in
Disulfide bonds
We change the protection step for
sulfhydryl groups.
• Cleave/Protect
(reduction/alkylation or
oxidation) AFTER fragmentation
• Separate fragments as before,
but any linked by –S–S- will not
separate and remain together
(e.g., orange peptide).
• THEN break –S–S- bonds, and
re-separate. Wiley
Guided_explorations
More complicated actual example of ribonuclease that combines this
approach with changing up the fragmentation by employing CNBr after
separation.
(https://wileyassets.s3.amazonaws.com/Voet_Fundamentals_of_Biochemistry_5e_ISBNEPROF12
533/media/guided_explorations/ge_04_Protein_sequence_determination/protein_sequence.html)
Determine which Cys are in
Disulfide bonds
We change the protection step for
sulfhydryl groups.
• Protect (reduction/alkylation or
oxidation) AFTER fragmentation
• Separate fragments as before,
but any linked by –S–S- will not
separate and remain together
(e.g., orange peptide).
• THEN break –S–S- bonds, and
re-separate.
Determine the sequence of those peptides
that fall off the diagonal by either Edman
degradation or tandem MS/MS.
Technique is called ”2D-diagonal
electrophoresis.”
13