Download Transcription in Bacteria

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Gel electrophoresis of nucleic acids wikipedia , lookup

SR protein wikipedia , lookup

Molecular cloning wikipedia , lookup

Histone acetylation and deacetylation wikipedia , lookup

Gene regulatory network wikipedia , lookup

RNA interference wikipedia , lookup

Molecular evolution wikipedia , lookup

Cre-Lox recombination wikipedia , lookup

Community fingerprinting wikipedia , lookup

Point mutation wikipedia , lookup

Two-hybrid screening wikipedia , lookup

Gene wikipedia , lookup

Biosynthesis wikipedia , lookup

Real-time polymerase chain reaction wikipedia , lookup

RNA silencing wikipedia , lookup

Non-coding DNA wikipedia , lookup

Transcription factor wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Messenger RNA wikipedia , lookup

Replisome wikipedia , lookup

RNA wikipedia , lookup

Polyadenylation wikipedia , lookup

Nucleic acid analogue wikipedia , lookup

RNA-Seq wikipedia , lookup

Deoxyribozyme wikipedia , lookup

Epitranscriptome wikipedia , lookup

Non-coding RNA wikipedia , lookup

Promoter (genetics) wikipedia , lookup

Gene expression wikipedia , lookup

Silencer (genetics) wikipedia , lookup

Eukaryotic transcription wikipedia , lookup

RNA polymerase II holoenzyme wikipedia , lookup

Transcriptional regulation wikipedia , lookup

Transcript
Transcription in Bacteria
Transcription in Bacteria
Transcription in Bacteria
Transcription is the first step of gene expression, in which a
particular segment of DNA is copied into RNA by the enzyme,
RNA polymerase.
If the gene transcribed encodes a protein, the result of
transcription is messenger RNA (mRNA), which then will be used
to create that protein via the process of translation.
Alternatively, the transcribed gene may encode for either noncoding RNA genes (such as microRNA) or ribosomal RNA (rRNA)
or transfer RNA (tRNA), other components of the proteinassembly process, or other ribozymes.
Mechanism of transcription
• RNA polymerase is the enzyme
that catalyzes RNA synthesis.
• Using DNA as a template, RNA
polymerase joins, or polymerizes,
nucleoside triphosphates (NTPs)
by phosphodiester bonds from 5'
to 3'.
• In bacteria there is one type of
RNA polymerase and transcription
and translation are coupled (they
occur within a single cellular
compartment).
• As soon as transcription of the
mRNA begins, ribosomes initiate
protein synthesis.
• The whole process occurs within
minutes.
Transcription in E. coli and in Eucaryotes
Procaryotes
Genes are grouped into operons
Eucaryotes
Genes are not grouped in operons
mRNA may contain transcript of
several genes (poly-cistronic)
each mRNA contains only
transcript of a single gene
(mono-cistronic)
Transcription and translation are coupled.
Transcript is translated already during
transcription.
Transcription and translation are
NOT coupled.
Transcription takes place
in nucleus, translation in cytosol.
Gene regulation takes place by
modification of transcription rate
Gene regulation via transcription
and by RNA-processing,
RNA stability etc.
mRNAs are not processed in prokaryotes
mRNAs are processed in in
eukaryotes (splicing , CAP, poly-A tail)
rRNA and tRNA are processed both in eukaryotes and prokaryotes
Transcription and
translation is coupled in
prokaryotes
Initiation of transcription
Sense strand → nontemplate strand = mRNA
Antisense strand → template strand
As soon the growing
mRNA chain separates
from DNA, ribosomes
attach to it and begin
translation on the 5’ end
of the molecule
following right behind
the RNA polymerase
while it is transcribing
3
1
2
How does transcription get started?
RNA polymerase binds to a region on DNA called a PROMOTER
Bacterial promoter structure
Bacterial promoters are not absolutely conserved but they do
have a consensus sequence.
Conserved sequence:
sequence When sequences of DNA have exactly
the same series of nucleotides in a given region.
Consensus sequence:
sequence there is some variation in the sequence
but certain nucleotides are present at high frequency.
Bacterial promoter structure
It is convention to indicate the start of transcription by the number +1 and to
use positive numbers to count farther down the DNA in the direction of
transcription (downstream).
If transcription is proceeding to the right, then the direction to the left is
called upstream with the bases indicated by negative numbers.
For the majority of E. coli genes, the promoter consensus sequence
consists of two hexamer sequences.
Centered at −10 is the consensus sequence TATAAT, which is also
known as the TATA box or Pribnow box. (capital letters indicate bases
found in those positions in more than 50% of promoters analyzed; small
letters = less than 50%).
Bacterial promoter structure
Another region with similar sequences among many promoters is
centered at −35.
The consensus sequence at −35 is TTGACA.
The spacing between the −10 and −35 sequences and the start
point for transcription is important, and deletions or insertions
that change the spacing are deleterious.
How does RNA polymerase recognize and bind to
the promoter?
Structure of bacterial RNA polymerase
Comprised of a core enzyme plus a transcription factor called the sigma factor (σ).
Together they form the complete, fully functional enzyme complex called the
holoenzyme.
Structure of bacterial RNA polymerase
The core enzyme
The core enzyme is the component of the holoenzyme that catalyzes polymerization.
It is 400 kD in size and has five subunits: two copies
of the α-subunit (αI, αII), and one copy each of the
β-, β′-, and ω-subunits.
X-ray crystallographic studies revealed a crab claw
shape. Subunits αI, αII, and ω form the base of the
claw, and subunits β and β′ form the pincers. These
pincers form an internal channel (2.7 nm wide). The
enzyme active site is located at this internal channel
where an essential Mg2+ ion is bound.
The core enzyme has high affinity for most DNA. In the absence of σ,
it can initiate synthesis anywhere on a DNA template in vitro.
The σ factor is responsible for decreasing the nonspecific binding affinity of
the RNA polymerase.
The sequence, structure, and function are evolutionarily conserved from
bacteria to humans.
Structure of bacterial RNA polymerase
Sigma factor
In bacteria there are many different σ factors. In E. coli the most
abundant σ factor is σ70. It has a higher binding affinity for the
RNA polymerase core enzyme than other σ factors.
Most of the σ factors share four regions of amino acid sequence
homology that play a role in recognizing the promoter
These four regions are further divided
into subdomains with specific functions.
In the holoenzyme the globular domain
of the σ factors are spread out across
the face of the crab claw.
Structure of bacterial RNA polymerase
Sigma factor
• σ70 is required for specific binding of RNA polymerase to the promoter of
the majority of genes in E. coli. It stimulates a change in shape of the
internal channel where transcription takes place.
• The −35 and the −10 sequences are necessary for recognition by the σ70
factor
• The −10 sequence is the region of contact for the core enzyme.
• In addition, the −10 sequence is necessary for the initial melting of the
DNA to expose the template strand.
• A domain of the σ70 factor binds to the nontemplate strand of the −10
sequence in a sequence-specific manner that stabilizes the initial
transcription bubble.
The sigma factor, σ70 (MW = 72000)
Fragments 2.1 and 2.2 of σ70 bind strongly to β'.
Adjacent helical segments located in fragments 2.3
and 2.4 are involved in recognition of the -10
region of the promoter. The 2.3 region is required
for melting.
In addition, sequences near the N-terminal (1.1 and
1.2) of σ70 were found to be inhibitory to DNA
binding.
The addition of σ to the polymerase core gives the RNA polymerase holoenzyme recognizing a site at
-10 to form the closed complex. In the holoenzyme form, an additional DNA binding domain of σ, the
region 4.2, become unmasked, and this recognizes a second site at -35, approximately 2 helical turns of
DNA away. If the -35 site is recognized, the holoenzyme melts the region -11 to +3 in the DNA, giving
the open complex, and the bubble is stabilized by the ssDNA binding domain of σ at region 2.3.
The region 2.5 interacts with dsDNA from -11 to –17 (spacer region).
Melting of the transcription bubble admits the template strand to the catalytic site, allowing initiation to
proceed.
Alternative sigma factors respond to general environmental changes
E. Coli sigma factors recognize promoters with different consensus sequences
Sigma70 (rpoD)
(-35)TTGACA (-10)TATAAT
Primary sigma factor, or housekeeping sigma factor.
Sigma54 (rpoN)
(-35)CTGGCAC (-10)TTGCA
Alternative sigma factor involved in transcribing nitrogen-regulated genes (among
others).
Sigma32 (rpoH)
(-35)TNNCNCCCTTGAA (-10)CCCATNT
Heat shock factor involved in activation of genes after heat shock.
SigmaS (rpoS)
intrinsic curvature
(-10)TGNCCATA(C/A)T
Alternative sigma factor transcribing genes of stationary phase of growth.
Note the extended -10 element.
The use of different sigma factors gives E. coli flexibility in responding to different
DNAse I Footprinting
1. Prepare end-labeled DNA.
2. Bind protein.
3. Mild digestion with DNAse I (randomly
cleaves ds DNA on each strand)
4. Separate DNA fragments on denaturing
acrylamide gels.
DNase I footprint performed
on an end-labeled DNA
fragmentFIS
Footprint
Samples in lanes 2-4
had increasing amounts
of the DNA-binding
protein (lambda protein
cII); lane 1 had no
protein.
Partially DNase I digested DNA is subjected to linear PCR
DNA-protein
complex
DNase I
Products of DNase I
digestion are primer
extended by linear PCR
using a 5’ end-labeled
oligonucleotide
Sequencing gel
Gel retardation assay
Gel Shift
Electro Mobility Shift Assay (EMSA)
Band Shift
No protein
add protein
Non-denaturing PAGE
Incubating a purified protein, or a complex mixture
of proteins e.g. nuclear or cell extract, with a 32P
end-labelled DNA fragment containing the putative
protein binding site (from promoter region).
*
*
Reaction products are then analysed on a nondenaturing polyacrylamide gel.
The specificity of the DNA-binding protein for the
putative binding site is established by competition
experiments using DNA fragments or
oligonucleotides containing a binding site for the
protein of interest, or other unrelated DNA
sequences.
Retarded
mobility
due to
protein
binding
Free DNA probe
virB
virF
virG
Bound DNA
EMSA
Free DNA
Evaluating the
Binding Affinity
Primer extension
mRNA
5’
3’
annealing
+
G A T
primer-32P
C
Early-log
37°C 10°C
Mid-log Late-log
37°C 10°C
37°C 10°C
mRNA
5’
primer -32P
3’
-10
+1
reverse transcriptase
+24
mRNA
5’
primer
-32P
3’
+42
cDNA
run on denaturating gel
+77
cspA mRNA
The transcription process
consists of three stages:
stages:
Initiation
Elongation
Termination
Initiation
Initiation of transcription
The RNA polymerase holoenzyme initially binds to the
promoter at nucleotide positions −35 and −10 relative to the
transcription start site (+1) to form a closed promoter
complex.
The term “closed” indicates that the DNA remains doublestranded and the complex is reversible.
Initiation of transcription
The complex then undergoes a structural transition to the “open” form
in which approximately 18 bp around the transcription start site are
melted to expose the template strand of the DNA.
Transcription is aided by negative supercoiling of the promoter region
of some genes. Formation of the open complex is generally irreversible
and transcription initiated in the presence of NTPs.
No primer is required for initiation by RNA polymerase.
Initiation of transcription
During initial transcription, RNA polymerase produces and releases short RNA
transcripts of less then ten ribonucleotides (abortive synthesis) before escaping
the promoter (promotor clearance).
It is not clear why RNA polymerase must undergo this period of abortive initiation
before achieving escape, but a region of the σ factor appear to be involved.
Infact, in this step there is sequential displacement of some domains of σ that
would otherwise act as a barrier to the extension of the nascent RNA as it
emerges from the RNA exit channel.
Elongation
Elongation
After about 9-12 nt of RNA have been synthesized, the initiation complex
enters the elongation stage.
As RNA polymerase moves during elongation, it holds the DNA strands apart,
forming a transcription “bubble.”
The moving polymerase protects a “footprint” of ~30 bp along the DNA against
nuclease digestion.
Within the transcription bubble, one strand of DNA acts as the template for
RNA synthesis by complementary base pairing.
Transcription always proceeds in the 5′→3′ direction.
Proofreading
E. coli RNA polymerase synthesizes RNA with remarkable fidelity in vivo. Its low
error rate may be achieved by two proofreading mechanisms.
Pyrophosphorolytic editing
The RNA polymerase use its active site in a simple back-reaction, to catalyze
the removal of an incorrectly inserted ribonucleotide by reincorporation of Ppi.
Hydrolytic editing
The polymerase backtracks by one or more nucleotides and cleaves the RNA
product, removing the error-containing sequence.
Hydrolytic editing is stimulated by Gre factors, which both enhance hydrolytic
function and serve as elongation stimulating factor.
Gre factors ensure that polymerase elongates efficiently and help overcome
arrest in presence of a mismatch.
Termination
Termination of transcription
The RNA polymerase core enzyme moves down the DNA until a
stop signal or terminator sequence is reached by the RNA
polymerase.
There are two types of terminators recognized,
• Rho-dependent
• Rho-independent terminators
E. coli uses both kinds of transcript terminators.
Termination of transcription
RhoRho
-independent termination
Also called “intrinsic terminators” because they cause termination of transcription in the
absence of any external factors.
Terminator is characterized by a consensus sequence that is an inverted repeat.
Stem-loop structures can form within the mRNA just before the last base
transcribed, by the pairing of complementary bases within the inverted repeat.
The inverted repeat sequence in the mRNA is followed by seven to eight uracilcontaining nucleotides. A hybrid helix of U in the RNA base paired with A in the DNA is
less stable than other complementary base pairs.
This property, combined with formation of the stem loop in the exit channel of RNA
polymerase, is sufficient to cause the enzyme to pause, resulting in transcript release.
Termination of transcription
RhoRho
-dependent termination
Rho-dependent termination is controlled by the ability of the Rho protein
to gain access to the mRNA.
Terminator is an inverted repeat with no simple consensus sequence and
no string of Ts in the nontemplate strand.
Rho is a ring-shaped, hexameric
helicase protein with a distinct RNAbinding domain and an ATP-binding
domain.
Rho binds specifically to a C-rich site
called a Rho utilization (or rut site)
at the 5′ end of the newly formed
RNA, as it emerges from the exit
site of RNA polymerase.
Temporary release of one subunit of the hexamer allows the 3′ segment of the
nascent transcript to enter the central channel of the Rho ring.
Termination of transcription
RhoRho
-dependent termination
In an ATP-dependent process, Rho travels along the RNA, “chasing”
the RNA polymerase. When the polymerase stalls at the terminator stemloop structure, Rho catches up and unwinds the weak DNA–RNA hybrid.
This causes termination of RNA synthesis and release of all the
components.
Bacterial gene
expression