Download 1 CHAPTER 3- DNA FUNCTION – THE EXPRESSION OF GENETIC

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of genetic engineering wikipedia , lookup

Cre-Lox recombination wikipedia , lookup

Epigenetics of neurodegenerative diseases wikipedia , lookup

Mutation wikipedia , lookup

Protein moonlighting wikipedia , lookup

Short interspersed nuclear elements (SINEs) wikipedia , lookup

Designer baby wikipedia , lookup

RNA interference wikipedia , lookup

Genomics wikipedia , lookup

Non-coding DNA wikipedia , lookup

NEDD9 wikipedia , lookup

RNA world wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Frameshift mutation wikipedia , lookup

Polyadenylation wikipedia , lookup

Microevolution wikipedia , lookup

RNA silencing wikipedia , lookup

Nucleic acid tertiary structure wikipedia , lookup

Helitron (biology) wikipedia , lookup

Messenger RNA wikipedia , lookup

RNA wikipedia , lookup

Gene wikipedia , lookup

Deoxyribozyme wikipedia , lookup

RNA-Seq wikipedia , lookup

Nucleic acid analogue wikipedia , lookup

History of RNA biology wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Transfer RNA wikipedia , lookup

Expanded genetic code wikipedia , lookup

Non-coding RNA wikipedia , lookup

Point mutation wikipedia , lookup

Primary transcript wikipedia , lookup

Genetic code wikipedia , lookup

Epitranscriptome wikipedia , lookup

Transcript
1
CHAPTER 3- DNA FUNCTION – THE EXPRESSION OF GENETIC
INFORMATION
Questions to be addressed:
1. How is information in nucleus (DNA) transmitted to the cytoplasm (site of protein
synthesis)
2. How is mRNA made from DNA
3. How does linear DNA sequence specify the linear amino acid sequence?
4. How is the nucleic acid “code” translated into a protein sequence?
5. How is protein made?
6. How may we relate genes to biological processes?
Terminology (see also Glossary pages 655-681 and web site
http://helios.bto.ed.ac.uk/bto/glossary/)
Informational RNA provide a template for protein synthesis (mRNA)
Functional RNA function as an RNA molecule (e.g. tRNA, rRNA, snRNA)
Transcription – production of RNA from a DNA template
RNA polymerase – the enzyme which transcribes DNA into RNA
Promoter – a set of DNA sequences to which RNA polymerase binds
Repressor – a protein that binds to a DNA element and prevents transcription
Activator – a protein that binds to a DNA element and activates transcription
Codon – 3 nucleotides in mRNA that encodes an amino acid
Anticodon – 3 nucleotides in tRNA that from complementary base pairs with the codon
Wobble – the ability of the 3rd nucleotide of the anticodon to pair imprecisely, so that
the anticodon can align with several codons
Active site – the part of the protein that is required for protein function
Wild type – the standard form of a gene or characteristic
Mutation –DNA or chromosome that is different from wild type
allele: a different form of a gene caused by alteration to the DNA sequence
mutant – an organism or cell containing a mutation
2
frameshift – the insertion or deletion of nucleotides results in altered translational
reading frame
genotype: the alleles contained by an individual
phenotype: the form taken by a character in a specific individual
e.g. characteristic = flower colour; phenotype = purple
dominant: an allele whose phenotype is expressed when heterozygous with another
allele
recessive: an allele whose phenotype is not expressed when heterozygous with another
allele
haplo-sufficiency – one copy of the normal gene is able to confer a normal phenotype
haplo-insufficiency – one copy of the normal gene is unable to confer the normal
phenotype
prototrophic – an organism that will survive on minimal medium (carbon source,
inorganic salts, water)
auxotrophic – an organism that will not survive on minimal medium, but whose
growth depends on supplementation of medium with a specific substance
1: How is information in nucleus (DNA) transmitted to the cytoplasm (site of protein
synthesis)
observations: nucleic acid RNA is synthesized in the nucleus, and moves into the
cytoplasm
note: RNA is different from DNA, since it:
1) contains ribose instead of deoxyribose
2) is single-stranded, but can form duplexes through complementary bonds with RNA
or DNA
3) contains uracil in place of thymine (bases are A, C, G and U)
hypothesis: RNA = intermediate between DNA and protein
hypothesis 1: RNA will move from nucleus to cytoplasm.
experiment 1) labeled uracil with radioactive 32P isotope
3
expected result if hypothesis is correct?:
conclusion:
hypothesis 2: increase in a specific RNA will precede an increase in a specific protein
experiment 2: infected a bacterial cell with a virus
•
virus injects genetic information (DNA) into the cell and uses bacterial biosynthetic
machinery to make protein
expected result if hypothesis is correct:
conclusion: RNA is the precursor to protein
Overall conclusion: DNA acts as a template for the production of RNA, which then
moves into the cytoplasm and acts as a template for protein synthesis. This RNA is
molecule is called messenger RNA = mRNA.
Classes of RNA (page 59-60)
1) Informational RNAs – provide information template for protein synthesis
Messenger RNAs
2) Functional RNAs = RNA that is functional as an RNA molecule and is
not translated into a polypeptide.
Transfer RNAs – tRNAs – transport amino acids to RNA during protein synthesis
Ribosomal RNAs – rRNAs – component of the ribosome
Small nuclear RNAs – snRNAs – involved in RNA processing in eukaryotes
Small cytoplasmic RNAs – scRNAs – protein trafficking in eukaryotes
2. How is RNA made from DNA?
TRANSCRIPTION = Production of mRNA from DNA template (pg. 60-64)
-in each gene, one DNA strand is used as a template for complementary base pairing
4
-other strand = non-template stand
-in each gene, template strand may be different
-transcribed RNA is complementary to the template
-labeling of gene is based on direction of RNA transcript
-
(figures 3-4, 3-5, 3-7, 3-8)
enzymes which drive transcription = RNA polymerase
RNA polymerase I (pol I) – transcribes rRNA genes
RNA polymerase II (pol II) – transcribes protein coding genes
RNA polymerase III (pol III) – transcribes tRNA, snRNA, scRNA
3 steps in transcription =
•
initiation
•
elongation
•
termination
-we will concentrate on what is known about E. coli (prokaryotic) transcription, similar
mechanisms are used in eukaryotes
RNA polymerase in E. coli:
-consists of several different subunits which have different roles during the
transcription process
initiation - holoenzyme consists of 4 subunits: α, β, β‘, σ
elongation and termination - core enzyme consists of only 3 subunits: α, β, β‘
1) initiation (figure 3-9):
-determines where a mRNA molecule will start
a set of DNA sequences (= promoter) are required to initiate transcription (note - =
upstream, + = downstream)
consensus sequences:
•
-35 region = TTGACAT - recognized by σ-factor
5
•
-10 region = TATAAT - helix opens (note A-T joined by only 2 H-bonds)
•
+1 often A in CAT - beginning of transcription
2) elongation
•
within a few bases of the initiation site, σ-factor dissociates, leaving the core enzyme
to continue polymerization of RNA
•
reads the DNA template to create an mRNA molecule
•
identity of bases added is based on complementarity to the DNA
•
mRNA is complementary to the DNA from which it is synthesized
•
new bases are added to the 3’ OH, thus the RNA is synthesized 5’ ---> 3’
3) termination
-
determines where the mRNA molecule will end (3’ end)
-
2 mechanisms are used:
a) intrinsic (figure 3-10)
-
in DNA template- GC rich region followed by As
-
in mRNA - hairpin loop followed by Us
-
hairpin loop is a structure which causes the RNA polymerase to pause
-
U-A have a weak association, and as the polymerase pauses, the RNA-DNA
heteroduplex dissociates
b) rho dependent
-
requires rho protein
-
rho binds to RNA at rut site, and moves along RNA towards RNA polymerase
-
RNA polymerase pauses at a site within the RNA
-
rho catches up, and causes RNA polymerase to dissociate from the RNA
Control of transcription
-
not all genes are transcribed in all cells at all times
-
numerous points at which gene expression can be controlled
-
most common is at transcription initiation
6
-
activators help RNA polymerase to initiate (positive regulation), repressors keep
RNA polymerase from initiating (negatie regulation)
-
in E. coli, -35, -10 and +1 sequences are consensus sequences
-
present in many but not all promoters
-
a promoter which does not have the ideal consensus sequence needs help from
other proteins = activators
-
access to promoter may be prevented by other proteins = repressors
RNA processing = post transcriptional modification (Figures 3-11, 3-12, 3-16)
-occurs in eukaryotes, before pre-mRNA enters the cytoplasm
1) cap of 7-methylguanosine to 5’ end by Guanyltransferase
2) endonuclease cuts 3’ end, Poly A polymerase adds poly A tail to 3’ end
3) removal of introns (intervening sequences) and splicing together of exons
(expressed sequences, those used to make protein)
RNA splicing (Figures 3-11, 3-12, 3-13, 3-14, 3-15)
•
conserved sequences at exon-intron boundary are recognized and aligned by
snRNPs (protein + snRNAs) to form a splicesome
•
OH group in intron attacks the 5’ splice site
•
5’ exon is cleaved, and intron forms a lariat
•
3’ OH cleaves the 3’ splice site, the lariat is released, and the two exons ligated
Following RNA processing, mature RNA is transported to the cytoplasm
Protein Structure
protein = a polymer of amino acids joined by peptide bonds = polypeptide
•
20 amino acids, each has an amino (NH3) and a carboxy (COOH) group, differ in R
group
•
peptide bond forms between amino and carboxy groups (Figure 3-17)
•
polypeptide has an amino end (NH2) and a carboxyl end (COOH)
7
Levels of organization (Figure 3.18)
primary structure - amino acid sequence linked by peptide bonds
secondary structure - weak bonds (e.g. e.g. H-bonds) between neighbouring amino
acids
tertiary structure - bonds (e.g. sulfide bonds) between distant amino acids
quaternary structure - interactions between different polypeptides to form a multimeric
protein
•
now known that an enzyme may contain multiple polypeptides (subunits),
each encoded by a different gene
3. How is the linear nucleic acid molecule, DNA, used to generate the linear
polypetide molecules?
Hypothesis: a particular DNA sequence serves as a template for each polypeptide
expectations based on hypothesis:
1) each protein has a unique amino acid sequence
2) mutation in gene causes change in amino acid sequence
3) sequence of nucleotides relates to sequence of amino acids
Question 1: does each protein have a unique sequence?
Experiment: amino acid sequences of different polypeptides deduced through Sanger
sequencing
-
Proteolytic enzymes cut between specific aa to produce specific fragments
-
Movement through electorphoretic field depends on size and charge of
fragments, allowing separation of different fragments
-
Movement in different buffer allows further separation
Result: each polypeptide produces a unique fragment profile (fingerprint)
8
Question 2: Do mutations alter protein sequence?
Experiment: compare sequence of amino acids in protein from patients with Sickle
Cell Anemia to the sequence from normal individuals
•
disease results from a mutation in a single gene
result: Sanger sequencing shows that hemoglobin fingerprint of normal people is
different from that of sickle cell people
•
one amino acid difference in B subunit
conclusion: mutation ----> changes amino acid sequence
Question 3: does the sequence of nucleotides relate to sequence of amino acids –
Charles Yanofsky, Box 3-1
TRPA - gene required for tryptophan biosynthesis in E.coli
•
a number of different mutations in the TRPA gene are known to result from single
nucleotide changes (these are called different alleles of the TRPA gene)
Experiment: compare position of changed nucleotide (mutation) with position of
changed amino acid
nucleotide sequence -------X--------Y----------Z-------W-------wild type protein
------tyr------leu-------thr-----gly-------
mutant x protein
------cys------leu-------thr-----gly-------
mutant y protein
------tyr-------arg-------thr-----gly-------
result:
conclusion:
9
4. How do genes relate to biological processes?
Hypothesis: biological processes (biosynthetic, degradation, developmental) can be
thought of as pathways, where each pathway consists of a series of steps
Beadle and Tatum, 1942 (Box 3-1)
-
choose to work on a very simple amino acid biosynthetic pathway in the fungus
Neurospora
-
three mutant strains each carrying a mutant allele of 3 different genes (arg-1, arg-2,
arg-3) were available
-
these failed to make the final amino acid product
-
the wild type fungus is a prototroph (self-feeder, that can survive on minimal
medium containing only inorganic salts, water and a carbon source), while the
mutants are auxotrophic mutants, since they require an outside source of amino
acid
-
by a series of simple and elegant experiments, Beadle and Tatum formulated the
“one gene - one enzyme theory”, which has 3 fundamental components:
-
biosynthetic pathways = steps
-
steps controlled by enzymes
-
one gene produces one enzyme
Hypothesis 1: biosynthetic pathways = series of steps
consider a pathway by which A is converted to D through 2 intermediates, B and C:
A
B
(upstream)
C
D
(downstream)
- expect that if a defect occurs in one step, subsequent steps will not occur
Experimental observations: Mutations in 3 different Neurospora genes unable to make
arginine
= auxotrophic mutants (able to grow only if supplied with a particular substance,
i.e. arginine)
10
conclusion:
-
pathway is a series of 3 steps
-
one gene controls step
-
mutation in a gene = a defective step
How might mutant strains be rescued?
Hypothesis 2: If the stepwise pathway is correct…
Results:
Table 1: 3 auxotrophic arginine mutants were tested for growth on medium containing
several compounds. Growth (+) or no growth (-) was scored
mutant
arg-1
+
+
+
arg-2
-
+
+
arg-3
-
-
+
conclusion: compounds can rescue mutants, indicating that 1) compounds =
intermediates in pathway and 2) mutants = defects in steps
Based on the data presented in Table 1, what is the pathway for arginine biosynthesis?
Alternative Experiment:
similar information can be gained from cross-feeding experiments, in which pairs of
mutants are grown side-by-side on medium containing enough arginine to support
11
weak growth. Mutant strains are scored for their ability to increase growth of (“feed”)
the neighbouring strain.
according to our pathway:
•
mutant arg-3 - accumulates
•
mutant arg-2 -accumulates
•
mutant arg-1 - accumulates ?
question: which mutant should be able to feed another mutant?
Results:
arg-1
arg-2
arg-3
Conclusions:
•
each mutant strain has a unique ability to feed other mutant strains
•
each mutant represents a defect in a specific step in the biosynthetic pathway
•
each mutant strain carries a mutation in a different gene
•
therefore, each gene controls a different step in the biosynthetic pathway
RULE - downstream mutants feed upstream mutants
5. How is the nucleotide code translated into amino acids?
= The Genetic Code (Figure 3-20)
Question: How many nucleotides must be in each “word”?
12
-
20 amino acids, therefore must be at least 20 words
-
42 = 16
-
43 = 64
-
must be at lest 3 nucleotides in each word
Question: Which word = which amino acid?
experiment: synthesize an mRNA containing only Us
-add mRNA to an in vitro translation mix to produce a protein
result: protein contains only Phenylalanine
conclusion:
experiment: synthesize an mRNA containing 1/4 G and 3/4 U
p(GGG) = 1/4 x 1/4 x 1/4 = 1/64
p of any of (GUG, GGU or UGG) = 1/4 x 1/4 x 3/4 = 3/64
•
compare frequency of amino acids to known frequency of codons
this sort of experiment generated the Codon Table (figure 3-20)
•
now known that codon usage is universal, with the same code being recognized in
all organisms
6. How is Protein Synthesized?
•
Translation = the process in which amino acid polymer of specific sequence is made
using mRNA as an information template
steps:
1) initiation (start)
2) elongation (polymerization)
3) termination (end)
components:
1) mRNA - template for protein synthesis
13
2) tRNA - carries the amino acids
3) synthetase - attaches a specific amino acid to the tRNA
4) ribosome - provides machinery
-consists of rRNA + protein
5) Protein factors (IF) – factors necessary for initiation (Initiation Factors), elongation
(Elongation Factors) and termination (Release Factors)
tRNA (figures 3-21, 3-22)
•
matches a codon in the mRNA to an amino acid
•
each tRNA is specific to a particular amino acid
•
some amino acids can be carried by tRNAs having different anticodons (accounts for
some of the degeneracy in the code)
tRNA consists of:
a) anticodon loop
•
anticodon loop contains 3 bases (anticodon) which are complementary to a codon
•
anticodon of tRNA binds to codon through complementary base pairing
wobble position (figure 3-22, Tables 3-2 and 3-3)
•
flexibility in 3rd nucleotide (5’) position of anticodon, allows this nucleotide to pair
with nucleotides other than its complementary nucleotide
•
anticodon pairs with more than 1 codon
•
accounts for degeneracy of code
b) amino acid acceptance site
•
3’ OH becomes linked to a specific amino acid, through the action of synthetase
Charging the tRNA
•
each tRNA is recognized by a specific synthetase enzyme, which joins the
appropriate amino acid to the tRNA, to produce a charged tRNA = amino-acyl
tRNA
synthetase1
aa1 + tRNA1 -------------------> aa1-tRNA1
14
+ ATP
+ AMP + Pi
uncharged tRNA
= amino-acyl tRNA
4. Ribosome
•
structure which positions the mRNA, charged tRNA and appropriate factors so that
peptide bonds can be made between sequential amino acids
•
consists of both RNA and protein (Figure 3-23)
•
RNA in ribosome interacts with mRNA through complementary base pairing
•
in prokaryotes, 2 subunits, 50S, 30S = 70S
•
in eukaryotes, 2 subunits, 60S, 40S = 80S
within the ribosome, the following reactions polymerize the polypeptide (figure 324):
1) formation of peptide bond between first two amino acids
aa1-tRNA1 + aa2-tRNA2 -------> aa1-aa2-tRNA2 + tRNA1
amino-acyl tRNA
peptidyl tRNA + uncharged tRNA
2) addition of sequential amino acids
aa1-aa2-tRNA2 + aa3-tRNA3 -------------> aa1-aa2-aa3-tRNA3
peptidyl tRNA
amino-acyl tRNA
peptidyl tRNA
•
A site = holds amino-acyl tRNA
•
P site = holds peptidyl tRNA during elongation, holds an amino-acyl tRNA only
during initiation
Initiation of protein synthesis:
1) -binding of ribosome to mRNA
15
2) -entry of first tRNA
•
30S (small) subunit of ribosome + IF- translocates along mRNA
•
binds to ribosome binding site (GGAGG) = Shine-Dalgarno sequence
•
positions the ribosome so that the first codon of the mRNA (initiation codon = AUG)
is in P site of ribosome
•
AUG in P accepts a specific amino-acyl tRNA
3) IF + GTP + fMet-tRNA -----> enters P site
•
IFs released, 50S (large) subunit joins
•
at the completion of initiation, both subunits of ribosome are attached to the mRNA,
with the initiation codon and fMet-tRNA in the P site
Elongation
1. entry of aa2-tRNA2 into A site
•
requires EF + GTP
•
aa2-tRNA2 in A site and f-met-tRNA in P-site
2) formation of peptide bond, translocation
•
peptide bond forms between aa2 and f-Met
•
releases f-Met from tRNA1
•
fMet-aa2-tRNA2 (peptidyl tRNA) in A site, empty tRNA in P-site
•
empty tRNA is released
•
ribosome moves to position peptidyl-tRNA2 in P site
•
translocation requires EF
•
aa3-tRNA3 enters A site (requires EF + GTP)
•
many cell toxic compounds inhibit polypeptide elongation
•
antibiotic fusidic acid - mimics EF, allowing it to bind to ribosome, but cannot
translocate ribosome
•
diphtheria toxin - inhibits EF, therefore stops translocation
16
termination
•
ribosome encounters a STOP codon (UAA, UGA, or UAG)
•
3 codons UAG, UAA, UGA are not recognized by any tRNA = STOP codons
•
these codons are recognized by RFs, which enter A site instead of a charged tRNA
•
entry of RF causes release of polypeptide from P site
Protein Function
•
two major classes of protein, structural (e.g. myosin, keratin) and active (e.g.
enzymes such as PEP carboxylase, alcohol dehydrogenase)
•
specificity of protein action is due to specificity of physical interaction
e.g. enzyme and substrate interaction (Fig 3-23)
•
active site is a region of the protein critical for the interaction and therefore for
protein function (Fig 3-24)
7. Relating mutations to protein function and phenotypic changes
Types of mutations (see Chapters 10 and 11 for more detail):
•
the normal DNA sequence is called “wild type” and is designated either by
UPPERCASE letters, or by a “+” superscript (nomenclature depends on the
organism)
•
a mutant allele that changes the DNA sequence is designated by either lowercase
letters, or by the absence of the “+” superscript
Examples of simple mutations
1. single nucleotide change
2. single nucleotide addition or deletion (= frameshift)
Phenotype = the outward manifestation of the genotype, i.e. the form an individual
takes as the result of the genes present in that organism
17
•
the normal organism, containing a wild type set of genes, will display a wild type
phenotype
•
alterations to DNA sequence (mutations) within the gene may alter the time, place
or level of expression (if mutation is in the regulatory region), or the efficiency of
protein function (if mutation is in the coding region) (Figure 3-27)
•
if mutation alters expression or efficiency of protein, a change in phenotype may
arise
•
e.g. gene involved in short term memory in flies
question: based on the mutant phenotype, what does the wild type DUNCE+ allele do?
NOTE: Basic principle of genetic analysis – a gene and its function are identified
based on what happens to the organism when the gene is malfunctioning (usually
through mutation)
1. Geneticists identify genes important to a process through mutation
2. Geneticists identify region of a gene important to its function through mutation
(Figure 3-27, Figure 3-29)
Dominant and Recessive alleles
•
dominance and recessive relationships are determined based on the phenotype of
an individual who is heterozygous (two different alleles are present) at the gene
•
can only be defined in diploid individuals
Dominant: an allele that expresses its phenotype in the heterozygote
Recessive: an allele whose phenotype is not expressed in the heterozygote
haplo-sufficiency or haplo-insufficiency of a gene determines whether loss-of –
function mutant alleles are dominant or recessive (Figure 3-30)
•
Haplo-sufficiency: If one copy (“haplo”) of the normal gene is able to confer the
normal phenotype (i.e. the heterozygote is normal)
•
Haplo-insufficiency: If one copy of the normal gene is unable to confer the normal
phenotype (i.e. the heterozygote is mutant)
18
e.g.
haplo-sufficient gene
haplo-insufficient gene
genotype
phenotype
genotype
phenotype
a+a+
normal
B+B+
normal
a+a
normal
B+B
defective
aa
defective
BB
defective
(i.e. a+ is dominant to a)
(i.e. B is dominant to B+)