* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download What is transcription
Gene regulatory network wikipedia , lookup
Genetic code wikipedia , lookup
Gel electrophoresis of nucleic acids wikipedia , lookup
Community fingerprinting wikipedia , lookup
Molecular cloning wikipedia , lookup
Molecular evolution wikipedia , lookup
Transcription factor wikipedia , lookup
RNA interference wikipedia , lookup
Cre-Lox recombination wikipedia , lookup
Messenger RNA wikipedia , lookup
Artificial gene synthesis wikipedia , lookup
Biosynthesis wikipedia , lookup
Real-time polymerase chain reaction wikipedia , lookup
Non-coding DNA wikipedia , lookup
Polyadenylation wikipedia , lookup
RNA silencing wikipedia , lookup
Epitranscriptome wikipedia , lookup
Nucleic acid analogue wikipedia , lookup
Gene expression wikipedia , lookup
Promoter (genetics) wikipedia , lookup
Silencer (genetics) wikipedia , lookup
Deoxyribozyme wikipedia , lookup
Non-coding RNA wikipedia , lookup
RNA polymerase II holoenzyme wikipedia , lookup
Molecular Biology (3/30~4/25, 2007) Ch. 9 What is transcription? How transcription works? Stages Machinery Molecular mechanism Ch. 10,11 How transcription is regulated? Regulators Mechanisms Ch. 12, 11 Examples of transcriptional regulation Phage strategy RNA silencing 沈湯龍 (Tang-Long Shen) 助理教授 細胞生物學 一號館315室 Tel: 3366-4998; E-mail: [email protected] Transcription in Prokaryotes vs. Eukaryotes Prokaryotic Cell Because there is no nucleus to separate the processes of transcription and translation, when bacterial genes are transcribed, their transcripts can immediately be translated. Eukaryotic Cell Transcription and translation are spatially and temporally separated in eukaryotic cells; that is, transcription occurs in the nucleus to produce a premRNA molecule. The pre-mRNA is typically processed to produce the mature mRNA, which exits the nucleus and is translated in the cytoplasm. Transcription in prokaryotes The basis of life What is transcription? Central Dogma of Biology: DNA → RNA → protein ☆ Gene Expression: Transcription Transcription = DNA → RNA ☆ Gene functions (majority) are expressed as the proteins they encode: Translation Translation = RNA → protein RNA is structurally similar to DNA But…….. Gene Transcription: DNA → RNA genetic information flows from DNA to RNA by RNA polymerase RNA is identical in sequence with one strand of the DNA (but T→U), called coding strand. Four stages of transcription: 1. Promoter recognition and initial melting (binary complex formation) 2. Initiation (ternary complex formation) 3. Elongation 4. Termination Transcription Unit RNA polymerase Transcription unit binding A transcription unit is the distance between sites of initiation and termination by RNA polymerase; may include more than one gene (particularly in prokaryotes). release May include more than , one gene 5’ no number 0 5’ 3’ (Primary transcript) 3’ A relative location on a linear sequence mRNA How transcription works? Basic principles of transcription Template recognition: polymerase and duplex DNA Initiation: polymerase* and promoters Elongation: RNA polymerase Termination: terminator abortive initiation Initiation • Binding of an RNA polymerase to the dsDNA • (Slide) to find the promoter • Unwind the DNA helix • Synthesis of the RNA strand at the start site (initiation site), this position called position +1 Transcription Bubble To fulfill the principle process of transcription, that is complementary base pairing, a transient bubble has to be created. Two strands of DNA are separated (about 12~14 bp in length). Template strand is used to synthesize a complementary sequence of RNA. The length of RNA-DNA hybrid within the bubble is about 8~9 bp. As RNA polymerase moves along the DNA, the transient bubble moves along with and the RNA chain grows continuously. Transcription Bubble RNA-DNA hybrid length Ternary Complex: ~ 8 to 9 bases, it is short and transient Polymerase-DNA-RNA Function of RNA Polymerase Unwinding and Rewind DNA NTPs polymerized to a RNA chain Moving in the DNA About 25-base RNA molecule associated with the ternary complex at any moment. Elongation 5’ RNA extension 3’ Progression of transcription bubble is association with RNA polymerase movement on DNA Movement models 1. Sliding: 2. inchworm DNA rewind behind DNA unwind ahead RNA Reaction in Transcription (RNA polymerization) DNA replication RNA polymerization Direction 5’ to 3’ ~40 nt/sec 5 4 3 NTP γ 2 β Substrates ATP, UTP, GTP, CTP 5’ → 3’ ~800 bp/sec α NTP 5 4 Phosphate α,β,γ N → C termini ~15 aa/sec 3 2 1 NTP 5 Nucleotide γ 3 β Ribose 5C -- 1,2,3,4, 5 α NTP 5 3 Protein translation Stages of transcription (5’) Promoter : closed complex Terminator (3’) : open complex Binary Promoter clears Bubble moves on Ternary Abortive initiation: to ensure the initiation in a right way. Movement models 1. Sliding: common 2. inchworm (before the 10th base is added on nascent RNA chain within the bubble) move Extending RNA chain is accomplished with RNA poly (bubble) moves along DNA. The bases after 9th enable added on the growing RNA chain. Recognize termination signal Release RNA chain (by disrupt RNA:DNA hybrid) Dissociation of RNA pol Machinery in transcription Transcription in Prokaryotes RNA polymerase Prokaryotes have a single RNA polymerase enzyme--synthesizes mRNAs, rRNAs, and tRNAs Transcribe over > 1000 transcription units. The complexity is modified by interacting with diverse regulatory factors. Eukaryotes have three RNA polymerase Enzymes: RNA Pol I rRNA RNA Pol II mRNA RNA Pol III tRNA, 5S rRNA E. coli RNA polymerase RNA polymerase binds to the promoter Core enzyme + sigma factor = holoenzyme 155 KD 36.5 KD 11 KD 36.5 KD 70 KD Initiation only 151 KD 465kD Both initiation & elongation Structure and functions of E. coli RNA Polymerase 2 a subunits Enzyme assembly, Promoter recognition, factor binding b subunit Catalytic Center b' subunit Catalytic Center Template-binding s subunit Promoter specificity Eubacteria RNA polymerase (Pol) About 7000 RNA polymerase molecules are present in an E. coli cell. Most of them are engaged in transcription. In a short period of time, 2000-5000 Pol molecules can be synthesized. E. coli Polymerase: α subunit • • • • Two identical subunits in the core enzyme Encoded by the rpoA gene Required for core protein assembly May play a role in promoter recognition and regulatory factors interaction • ADP-ribosylation on an arginine upon T4 infection E. coli polymerase: b subunit 1. 2. • Encoded by rpoB gene. The catalytic center of the RNA polymerase Rifampicin (used for anti-tuberculosis): bind to the β subunit (12A away from active site), and inhibit transcription initiation. Blocking the path for extending RNA chain beyond 2-3 nts. Mutation in rpoB gene can result in rifampicin resistance. • Streptolydigins:resistant mutations are mapped to rpoB gene as well. Inhibits transcription elongation but not initiation. 3. b subunit may contain two domains responsible for transcription initiation and elongation E. coli polymerase: b’ subunit 1. 2. • 3. Encoded by the rpoC gene . Binds two Zn 2+ /Mg 2+ ions and may participate in the catalytic function of the polymerase Heparin:binds to the b’ subunit and inhibits transcription in vitro due to it competes with DNA for binding to the polymerase. b’ subunit may be responsible for binding to the template DNA . E. coli polymerase: s factor 1. 2. 3. 4. 5. Many prokaryotes contain multiple s factors to recognize different promoters. The most common s factor in E. coli is s70. (differential specificity) Binding of the s factor converts the core RNA pol into the holoenzyme. s factor is critical in promoter recognition, by decreasing the affinity of the core enzyme for nonspecific DNA sites (104) and increasing the affinity for the corresponding promoter s factor is released from the RNA pol after initiation (RNA chain is 8-9 nt) Less amount of s factor is required in cells than that of the other subunits of the RNA pol. Holoenzyme on promoter recognition (Core enzyme + sigma factor = holoenzyme) Core enzyme has the ability to synthesize RNA on a DNA template, but cannot initiate transcription at the proper sites. Holoenzyme has ~104-fold lower affinity for loose binding complexes than core. About 60 min half-life reduce to <1 sec. Holoenzyme has ~103-fold higher affinity for specific binding to promoters than core with a half life of several hours. Totally, sigma factor can result in 107 increase in DNA binding specificity. Core enzyme does not distinguish between promoters and other sequences of DNA. Sigma factor is required only for initiation reversible Wide range Faster Tight binding Fastest Less than 10 bases Beyond 10 bases leads to elongation Slow Recycle of sigma factor for the utilization of core enzyme Sigma factor is much less in number than core enzyme Evidence: 1/3 of sigma factors are not associated with core enzyme while elongation recycled Immediately after initiation Molecular structure of RNA polymerases in functioning Architecture of RNA polymerases (prokaryotes) T7 RNA polymerase (<100 kD) Bacterial RNA polymerase (465kD) Multiple subunits: 2α+β+β’+(σ) 25A wide Enzyme movement ~200 nts/sec Specificity recognition between enzyme and DNA bases (upstream of startpoint +1) ~40nts/sec A channel/groove on the surface ~25A wide forms a path for DNA. Path holds for 16 bp in prokaryotes 25 bp in eukaryotes More DNA bp can reside on the enzyme Further crystal structure will provide more direct and detailed view in a molecular level. Architecture of RNA polymerases (eukaryotes) Yeast RNA polymerase contains 12 subunits (10 are shown here) Nevertheless, it shares similar organization as bacterial one. Cleft between two large subunits forms as an active center A channel/groove on the surface forms a path for DNA. 25 bp DNA can be held in the path. Ternary Complex Channel within RNA polymerase Active center + Enzyme movement DNA in and out DNA out rudder RNA dissociated DNA turns DNA in Flexible ss DNA RNA flipped out Rigid straight duplex DNA entry (control by bridge protein) How many bp(s) in the bubble? Contact among the ternary structure in the active site These contacts can stabilize the single strand nucleic acid chains. Cycle of making and breaking bonds between enzyme and nucleic acids nt enters, adds, and interacts with the bridge protein nt still interacts with the bridge protein, which leads the protein to bending due to Pol moves one bp forward. Meanwhile, bridge blocks free nt enters. Finally, bridge releases Its interaction with newly added nt on RNA chain. Change in conformation of “bridge” protein is closely related to translocation of the enzyme along the nucleic acid. How does RNA polymerase find promoter sequences? Random diffusion RNA polymerase found promoters is very faster. Diffusion in the whole genome cannot support this fast. Directed walk (Direct displacement) vs. Random walk No DNA protein is known to work in this way Enzyme moves preferentially from a weak site to a strong site Transitions in shape and size of RNA polymerase during transcription Covered DNA length 75-80 bp (-55 to +20) 60 bp (-35 to +20s) 30-40 bp (interact w/ RNA pol) How to resume the stalled/pausing RNA polymerase? Cleavage 3’ end of RNA chain (Create a 3’-OH for further polymerization) Backtracks of RNA polymerase as a whole A constant distance between active site and frond end To correct mispositioned template during stall Accessory factors are needed such as: GreA and GreB for E. coli RNA polymerase TFIIS for eukaryotic RNA polymerase II One more function of RNA polymerase: * cleavage activity is from RNA polymerase itself. unwind Rewind DNA/RNA binding polymerize RNA Sequence elements in Transcription Promoter Coding sequence Terminator What is a promoter? •The sequence of DNA needed for RNA polymerase to bind to the template and accomplish the initiation reaction. •Its structure (not transcribed) is the signal (others are needed to be converted into RNAs or proteins). It is a cis-acting site. •Different from sequences whose role is to be transcribed or translated. What signal (structure) of a promoter provides? AT has only 2 H-bonds, which is easier to be broken (Open binary complex formation) (recognition domain (Closed binary complex formation) (i.e. the distance of separation between -10 and -35; intermediate sequence is irrelevant) Pribnow, D.: Nucleotide Sequence of an RNA Polymerase Binding Site at an Early T7 Promoter. PNAS 72, 784 (1975). Pribnow, D.: Bacteriophage T7 early promoters: nucleotide sequences of two RNA polymerase binding sites. J. Mol. Biol. 99, 419 (1975). Schaller, H. et al.: Nucleotide Sequence of an RNA Polymerase Binding Site from the DNA of Bacteriophage fd. PNAS 72, 737 (1975). The sequence comparison of five E. coli promoters Consensus TTGACA TATAAT Consensus: the most common base sequence to appear at such points on the DNA helix; there may be variations in various organisms Prokaryotic promoters display four conserved features: 1. Startpoint: >90% PURINE (A or G) 2. -10 consensus sequence (Pribnow box)--TAtAaT T80 A95 t45 A60 a50 T96 3. -35 consensus sequence--TTGACa T82 T84 G78 A65 C54 a45 4. Distance (spacing) between the -10 and -35 sequences (The distance is critical in holding the two sites at the appropriate separation for the geometry of RNA polymerase.) 5. UP element. TA rich sequence upstream of promoter. Functions of promoter domains -35 recognition domain Closed binary complex formation -10 unwinding domain: due to A-T pairs need lower energy to disrupt (melt) Open binary complex formation Sequence around the startpoint (+1 to +30): influences the initiation event. Rate of promoter clearance Other ancillary proteins may help RNA polymerase to recognize deficient promoters. Other structures may exist in a promoter A-T rich sequence It interacts with the α subunit of the RNA polymerase, which to ensure the higher gene expression. 100-fold variation in vitro Down mutation: mutations are tend to be concentrated in the most highly conserved positions. Up mutation: less cases happen within promoters RNA polymerase-promoter interactions A promoter with consensus sequences for the -10 and -35 regions (boxed) is shown; the sequences of actual promoters deviate from those shown here. The "jaws" of RNA polymerase are shown on the right of the molecule. This region of the RNA polymerase would grasp the DNA downstream of the catalytic site. Contacts between RNA polymerase and promoter DNA are shown by the solid lines. Not all contacts occur in every RNA polymerase-promoter interaction, but in all known cases (including promoters activated by regulator proteins), at a minimum, some contacts between and the 10 region appear to be required. J Bacteriol, June 1998, p. 3019-3025, Vol. 180, No. 12 -10 sequence (Pribnow box) • 6 bp sequence which is centered at around the –10 position (Pribnow, 1975). • A consensus sequence of TATAAT • The first two bases(TA) and the final T are most highly conserved among other E. coli promoters • This hexamer is separated by 5 to 9 bp from position +1, and the distance is critical • DNA unwinding is initiated at promoter by the polymerase -35 sequence: enhances recognition and interaction with the polymerase s factor • A conserved hexamer sequence around position –35 • A consensus sequence of TTGACA • The first three positions (TTG) are the most conserved among E. coli promoters. • Separated by 16-18 bp from the –10 box in 90% of all promoters Transcription start site The sequence around the start site influences initiation • A purine (A or G) in 90% of all genes • Often, there are C and T bases on either side of the start site nucleotide (i.e. CGT or CAT) Promoter efficiency (1) • There is considerable variation in sequence between different promoters, and the transcription efficiency can vary by up to 1000fold . • The –35 sequence, -10 sequence, and sequence around the start sites all influence initiation efficiency. Promoter efficiency (2) • The sequence of the first 30 bases to be transcribed controls the rate at which the RNA polymerase clears the promoter, hence influences the rate of the transcription and the overall promoter strength . • Strand separation in the initiation reaction • Some promoter sequence are not strong enough to initiate transcription under normal condition, activating factor is required for initiation. For example, Lac promoter Plac requires cAMP receptor protein (CRP ) DNA unwinding • Necessary to unwind the DNA so that the antisense strand to become accessible for base pairing, carried out by the polymerase. • Negative supercoiling enhances the transcription of many genes but not all (e.g. gyrase) by facilitating unwinding . • The initial unwinding of the DNA results in formation of an open complex with the polymerase and this process is referred to as tight binding Supercoiling during transcription At initiation ∵ Supercoiled structure requires less free energy for the initial melting of DNA ∴ it enhances the efficiency of transcription in vitro After initiation DNA is rotated during RNA pol movement; front is overwound and behind is released. A twin domain on transcribing DNA formed RNA polymerase binds to one face of DNA (-9 to +3 for unwinding) Touch down extension recognition contact initiation Sigma factor controls promoter recognition Different sigma is used for distinct responses "housekeeping" starvation/stationary extracytoplasmic stress nitrogen-limitation flagellar sigma factor The specificity is determined by recognizing different consensus sequences in promoters Bacillus subtilis sigma factors s factor gene function 43 rpoD, sigA housekeeping 28 sigD flagella/chemotaxis 29 spoIIGB, sigE sporulation 30 spo0H, sigH sporulation 32 sigC ? 37 sigB ? spoIIAC spoIIAC sporulation ? spoIIIC sporulation gp28 SPO1 28 phage middle gp33/34 SPO1 33,34 phage late Sigma factors may be organized into cascades A new sigma factor displaces the previous sigma factor Sigma factors directly contact DNA which contributes the binding specificities of sigma factors (most conserved) melting 2.3 2.4 4.2 domain1 15A Release N-terminal autoinhibition due conformation change via interaction with RNA polymerase conformation change 20A Free Holo: inside the active site Complex: displace from active site Coding stand Transcription initiation DNA-dependent RNA polymerases are promoter binding, DNA strand melting, RNA chain initiation and nascent RNA chain formation, and abortive RNA synthesis occurs finally escape from the promoter sequences. rate-limiting for the synthesis of productive RNAs What is the role of sigma factor in abortive initiation/promoter clearance (escape)? Elongation Elongation • Add ribonucleotides to the 3’-end (OH group) • The RNA polymerase extend the growing RNA chain in the direction of 5’ 3’ (E. coli: 40 nt/sec) • The enzyme itself moves in 3’ to 5’ along the antisense DNA strand. RNA chain elongation • σFactor is released to form a ternary complex of the pol-DNA-RNA (newly synthesized), causing the polymerase to progress along the DNA (promoter clearance) • Transcription bubble (unwound DNA region, ~ 17 bp) moves along the DNA with RNA polymerase which unwinds DNA at the front and rewinds it at the rear • 3’ part of RNA forms hybrid helix (ca. 12bp) with antisense DNA strand. • The E. coli polymerase moves at an average rate of ~ 40 nt per sec, depending on the local DNA sequence. Termination • The dissociation of the transcription complex from the template strand and separation of RNA strand from DNA • Occurring at the terminator (often stem-loop or hairpin structure), some need rho protein as accessory factor. it is a regulatory event Hence, it is possible to readthrough the terminator (anti-termination) in a signal-dependent manner. RNA chain termination • Termination: dissociation of RNA > re-annealing of DNA > release of RNA pol • Terminator sequence (stop signal): • RNA hairpin very common • Accessory rho protein The DNA sequences required for termination are located prior to the terminator sequence. Formation of a hairpin in the RNA may be necessary RNA hairpin structure: an intrinsic terminator near the base of the stem. Hairpin leads RNA pol to slow/pause The rU.dA RNA –DNA hybrid has an unusually weak base-paired structure; it requires the least energy of any RNA-DNA hybrid to break the association between the two strands. A model for intrinsic termination Rho-dependant termination • Some genes contain terminator sequences requiring an accessory factor,the rho protein (ρ) to mediated transcription termination • Rho binds to specific sites in the single-stranded RNA • Rho hydrolyses ATP and moves along the nascent RNA towards the transcription complex then enables the polymerase to terminate transcription Termination efficiency determinants: @ The Sequence of the hairpin @ The length of the U-run @ Sequences both upstream and downstream of the intrinsic terminator @ Ancillary proteins @ others rich poor A bias sequence preceding actual terminator site (RNA) is important for termination efficiency (rho dependent terminator). ~275kD