Download Chapter 32 - s3.amazonaws.com

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Butyric acid wikipedia , lookup

Western blot wikipedia , lookup

Ribosomally synthesized and post-translationally modified peptides wikipedia , lookup

Silencer (genetics) wikipedia , lookup

RNA-Seq wikipedia , lookup

Citric acid cycle wikipedia , lookup

Messenger RNA wikipedia , lookup

Deoxyribozyme wikipedia , lookup

List of types of proteins wikipedia , lookup

Molecular evolution wikipedia , lookup

Cell-penetrating peptide wikipedia , lookup

Protein adsorption wikipedia , lookup

Metalloprotein wikipedia , lookup

Gene expression wikipedia , lookup

Protein wikipedia , lookup

Non-coding RNA wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Peptide synthesis wikipedia , lookup

Protein (nutrient) wikipedia , lookup

Bottromycin wikipedia , lookup

Epitranscriptome wikipedia , lookup

Proteolysis wikipedia , lookup

Metabolism wikipedia , lookup

Point mutation wikipedia , lookup

Nucleic acid analogue wikipedia , lookup

Transfer RNA wikipedia , lookup

Ribosome wikipedia , lookup

Protein structure prediction wikipedia , lookup

Biochemistry wikipedia , lookup

Expanded genetic code wikipedia , lookup

Genetic code wikipedia , lookup

Transcript
Chapter 32
The Genetic Code
Including a review of tRNA structure
from Chapter 12 section 7
Review of Steps in Gene Expression
RNA = nucleotide sequence
Adaptor molecule
protein = amino acid sequence
From Access Excellence: http://www.accessexcellence.org/AB/GG/steps_to_Prot.html
Translating the Message
•
•
•
•
How does the sequence of mRNA translate into the
sequence of a protein?
What is the genetic code?
How do you translate the "four-letter code" of mRNA
into the "20-letter code" of proteins?
And what are the mechanics like? There is no
obvious chemical affinity between the purine and
pyrimidine bases and the amino acids that make
protein.
Three major advances gave the clues to solving this
dilemma
Clue #1
The Discovery that Proteins are made on
ribosomes
• By Paul Zamecnik (early 1950s)
• He asked where in the cell are proteins
synthesized?
• Injected rats with radioactive amino acids
• A short time after injection (when the amino acids
should be incorporated into newly-synthesized
proteins) he killed the rats, harvested their livers,
ground them up and divided the cell components
into “subcellular fractions” by centrifugation
Results
• Radioactivity was found in small ribonucleoprotein
particles visible by electron microscopy.
• These were later characterized and called
“ribosomes” (since they had RNA as a major
component)
From Lehninger “Principles of Biochemistry” p 1021
Clue #2
The Discovery that amino acids are
“Activated”
• By Hoagland and Zamecnik
• They incubated amino acids with the cytosolic fraction
of liver cells, and with ATP
• They found the amino acids became “activated” during
the incubation
• Activation consists of attaching the amino acids to a
heat-stable soluble RNA (which we now know is tRNA)
• Activated amino acids are called aminoacyl- tRNAs
• The enzymes that do the activation are called
aminoacyl-tRNA synthetases (next class!)
Clue #3
Crick’s Adaptor Hypothesis
• Francis Crick thought about the problem
• He reasoned that a small nucleic acid could serve as
an adaptor between RNA and protein synthesis if it
could bind both RNA and an amino acid
• His idea was that one end of the adaptor would bind a
specific amino acid and the other would bind to a
specific sequence in the RNA that coded for that amino
acid
Crick’s Adaptor Hypothesis
•Must have one adaptor
per amino acid
•Therefore there must be
a family of adaptors:
•These are the tRNAs
• each tRNA can
recognize specific
sequences in the RNA
transcript
•Each is “charged” with
the amino acid that is
specified by that sequence
From Lehninger “Principles of Biochemistry” p 1021
Review of tRNA Structure
• tRNAs are the “adaptors” in protein synthesis
• There are many different tRNAs, each has a distinct
sequence
• However, all tRNA have several conserved features
1) small (73-93 nucleotides long)
2) they have a conserved secondary structure - 4
stems and 4 loops with important functions
3) they contain many unusual bases
Inosine (I), pseudouridine (y), dihydrouridine (D),
ribothymidine (T), and methylated bases (mG, mI)
Unusual bases in tRNAs
Invariant bases
Amino acid
addition site
Interacts with
the ribosome
Varies in size
Base pairs with the codon
in the mRNA transcript
Example of a
specific tRNA
The yeast
alanyl-tRNA
The cloverleaf tRNA
folds into an L-shape by
interactions between
conserved bases in the
stems and loops
tRNA tertiary structure: banana or L-shape
Noncanonical base pairs stabilize the 3o structure
Animation of tRNA secondary and
tertiary structure
Elucidation of the Genetic Code
4 major advances helped figure out the code
1) The demonstration of colinearity
between genes and protein
2) The idea of triplet codons
3) Deciphering the first word (UUU= Phe)
4) Deciphering the rest of the code
The Colinearity of Gene and
Protein Structures
• Yanofsky provided evidence in 1964: he showed
that the relative distances between mutations in
DNA were proportional to the distances between
amino acid substitutions in E. coli tryptophan
synthase
Charles Yanofsky - 1964
(trp operon)
• Had a collection of mutants in the trpA gene (coding for
tryptophan synthase)
• He made 2 determinations using these mutants:
1) determined the position of the mutation in the gene
2) determined the position of the mutant amino acid in the
protein encoded by each mutant gene
Mutant 1
Mutant 3
Mutant 2
gene
protein
Therefore gene sequence is colinear with protein sequence.
What is the nature of the Code
mRNA (nucleotides)
4 different nucleotides
protein (amino acids)
20 different amino acids
• 1:1 correspondence can’t work
• Therefore nucleotides must be read in combinations
• Is 2 enough? 4X4 = 16 different combinations possible
- not enough
• But 3 would give 4X4X4 = 64 combinations
• This would be enough to code for 20 amino acids
• Therefore the concept of the triplet codon was born
What is the nature of the Code
•
•
•
•
Is the code overlapping or non-overlapping?
Is the code punctuated or non-punctuated?
These details were worked out by Crick
Crick used mutagens to introduce changes
into a coding sequence
• Used mutagens that either induced point
mutations in the DNA or insertions.
• After generating mutants he checked the
proteins coded by the mutant sequences
Changes 3 amino
acids
X
X
Changes 1 amino acid
Changes all following
amino acids
following amino acids
are unchanged
The Nature of the Genetic Code
summary of results
• A group of three bases codes for one
amino acid
• The code is not overlapping
• The base sequence is read from a fixed
starting point, with no punctuation
• The code is degenerate (in most cases,
each amino acid can be designated by
any of several triplets)
Biochemists Break the Code
Assignment of "codons" to their respective amino acids
• Marshall Nirenberg and Heinrich Matthaei
Worked with an in vitro translation system from E. coli
Cell-free extract
•Ribosomes
•tRNAs
•Amino acids
•Enzymes
•ATP, GTP
+ mRNA = protein
Deciphering the first word
They generated RNAs that are homopolymers
• using the enzyme polynucleotide
phosphorylase (catalyzes random synthesis of
RNA chains)
i.e. nNDPs
polynucleotide phosphorylase
(NMP)n + nPi
Using this system they made a polyU mRNA by
programming their reaction with UDP
When this was put into the cell-free extract it
should be translated into a protein made up of
amino acids coded by the codon UUU
Deciphering the first word
(continued)
Experiment:
• They set up 20 different test tube reactions
• Each one was spiked with a different
radioactive amino acid
• They programmed each with the polyU RNA
• Then recovered the proteins by acid
precipitation
• Under these conditions the proteins precipitate
but the free amino acids do not
• Then they asked which reaction (out of the 20)
has radioactivity in the protein pellet?
Biochemists Break the Code
Results
• Marshall Nirenberg and Heinrich Matthaei showed
that poly-U produced polyphenylalanine in a cellfree solution from E. coli. In other words, only the
test tube reaction spiked with radioactive Phe
generated a radioactive pellet
They repeated the experiment with other synthetic
homopolymer RNAs
• Poly-A gave polylysine (AAA = Lys)
• Poly-C gave polyproline (CCC = Pro)
• Poly-G gave polyglycine (GGG = Gly)
Getting at the Rest of the Code
• Work with nucleotide copolymers (poly (A,C), etc.),
revealed some of the codes
• Gobind Khorana (organic chemist)
• -synthesized DNA composed of alternating
copolymers eg: ACACACACACAC…..
• Then used RNAP to make RNA from the DNA
template eg: UGUGUGUGUGUGU……
• This RNA transcript has two possible alternating
codons: UGU GUG UGU GUG
• In a translation extract you should get a protein with
2 alternating amino acids
Nirenberg and Leder
• Took a cell-free translation extract (ribosomes and
tRNAs charged with their specific amino acid)
• Added a synthetic triplet RNA (a codon) eg UUU
• They found that addition of that simple triplet RNA to
the cell-free extract could stimulate the binding of the
tRNA that recognized that codon to a ribosome
• Since the tRNA is covalently linked to the amino acid
that is coded for by the codon, therefore that amino
acid gets localized to the ribosome
• If they collect the ribosomes from the experiment
they can identify which amino acid was brought to
the ribosome by that triplet codon
Nirenberg and Leder
Phe
Ribosome
•Ternary
complex
•Very large
•Can be
captured
on a filter
AAA
UUU
Triplet RNA
Experiment: for each triplet RNA set up 20 reactions, each one
spiked with a different radioactive amino acid. Ask which
reaction generates radioactivity on the filter. That’s the amino
acid coded for by the triplet codon!
Getting at the Rest of the Code
• Finally Marshall Nirenberg and Philip Leder cracked
the entire code in 1964
• They showed that trinucleotides bound to ribosomes
could direct the binding of specific aminoacyl-tRNAs
(See Figure 31.6)
• By using C-14 labeled amino acids with all the
possible trinucleotide codes, they elucidated all 64
correspondences in the code
• Found that all the codons (except the 3 stop codons)
specified an amino acid
• There are 64 codons and 20 amino acids
• Therefore amino acids can be encoded by >1 codon
Features of the Genetic Code
• All the codons have meaning: 61 specify amino
acids, and the other 3 are "nonsense" or "stop"
codons
• The code is unambiguous - only one amino acid is
indicated by each of the 61 codons
• The code is degenerate - except for Trp and Met,
each amino acid is coded by two or more codons
• Codons representing the same or similar amino
acids are similar in sequence
• 2nd base pyrimidine: usually nonpolar amino acid
• 2nd base purine: usually polar or charged aa