* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Module 7 – Microbial Molecular Biology and Genetics
Transposable element wikipedia , lookup
Mitochondrial DNA wikipedia , lookup
SNP genotyping wikipedia , lookup
Zinc finger nuclease wikipedia , lookup
Non-coding RNA wikipedia , lookup
History of RNA biology wikipedia , lookup
Human genome wikipedia , lookup
Genome evolution wikipedia , lookup
Bisulfite sequencing wikipedia , lookup
Epigenetics of human development wikipedia , lookup
Gel electrophoresis of nucleic acids wikipedia , lookup
United Kingdom National DNA Database wikipedia , lookup
Genealogical DNA test wikipedia , lookup
Genetic code wikipedia , lookup
Nutriepigenomics wikipedia , lookup
Cancer epigenetics wikipedia , lookup
Genomic library wikipedia , lookup
DNA damage theory of aging wikipedia , lookup
Genetic engineering wikipedia , lookup
DNA polymerase wikipedia , lookup
DNA vaccination wikipedia , lookup
Cell-free fetal DNA wikipedia , lookup
Designer baby wikipedia , lookup
Epigenomics wikipedia , lookup
Molecular cloning wikipedia , lookup
Site-specific recombinase technology wikipedia , lookup
No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup
Nucleic acid double helix wikipedia , lookup
DNA supercoil wikipedia , lookup
Genome editing wikipedia , lookup
Point mutation wikipedia , lookup
Non-coding DNA wikipedia , lookup
Extrachromosomal DNA wikipedia , lookup
Vectors in gene therapy wikipedia , lookup
Microevolution wikipedia , lookup
Cre-Lox recombination wikipedia , lookup
Nucleic acid analogue wikipedia , lookup
Helitron (biology) wikipedia , lookup
Primary transcript wikipedia , lookup
Therapeutic gene modulation wikipedia , lookup
History of genetic engineering wikipedia , lookup
NPTEL – Biotechnology – Microbiology Module 7 – Microbial Molecular Biology and Genetics Lecture 1 - Structure and function of genetic material Overview: • Griffith's experiment showed that something caused the transformation of bacteria. • Avery, MacLeod, and McCarty showed that DNA was responsible, but people were still unconvinced. • Hershey and Chase finally proved that DNA is the genetic material. • DNA is a polymer of nucleotides. • Chargaff’s rules • Watson and Crick summarized the above findings and postulated their model of DNA structure. • A, B, Z DNA • RNA structure Griffith Experiment • The bacteria in the experiment had two different strains: R strain (harmless) and S strain (harmful). • Live R strain had no effect on the mice, while live S strain killed the mice. • Heat-killed S strain failed to kill the mice. • When live R strain (harmless) was mixed with heat-killed S strain (now harmless), and the injected mice died. • CONCLUSION: The heat-killed cells were somehow able to retain and transfer information. Joint initiative of IITs and IISc – Funded by MHRD Page 1 of 91 NPTEL – Biotechnology – Microbiology Fig 1. Griffith's experiment discovering a "transforming principle" in heat-killed virulent smooth pneumococcus is that it enables the transformation of rough non-virulent pneumococcus. (This file is licensed under the Creative Commons Attribution –Share Alike 3.0 Unported license). Griffith experimented with two different strains of the bacteria Diplococcus pneumoniae: strain R (rough) and strain S (smooth). The S cells have a protective protein coat which protects them from being destroyed by the host cell's immune system. Therefore, the R strain is harmless while the S strain is harmful. Griffith injected mice with live strain R bacteria. The mice were found healthy and contained no living bacteria. However, when he injected the mice with the S strain, the mice died and Griffith found live S cells in their bodies. He then injected the mice with heat-killed S bacteria. The mice did not die and contained no live bacteria. Nevertheless, when he injected mice with live R cells and heat-killed S cells, the mice died. From this he concluded that the heatkilled cells, although they were not living, still passed their hereditary material to the living R cells somehow. Avery-MacLeod-McCarty Experiment • Extracted components from heat-killed S bacteria. • After each extraction, S cells were mixed with R bacteria. • R bacteria transformed each time until DNA was extracted from S cells. • Avery and his colleagues concluded that DNA was the "transforming principle." Joint initiative of IITs and IISc – Funded by MHRD Page 2 of 91 NPTEL – Biotechnology – Microbiology In 1944, Oswald Avery, Colin MacLeod, and Maclyn McCarty provided additional experimental evidence using test tubes to strengthen Griffith's "transforming principle." Like Griffith, Avery and his colleagues used harmless R bacteria to determine the genetic factor of bacteria. First, they lysed heat-killed S cells extracted from Streptococcus pneumonia. When the lysate combined with R bacteria, virulent S bacteria were produced. To determine the factor responsible for transformation, Avery, MacLeod, and McCarty removed the sugar coats, proteins, RNA, and DNA from the lysate. The R bacteria remained non-virulent only when the DNA was removed from the lysate. In all other cases, the R bacteria were transformed. This experiment showed that DNA was the "transforming principle." Hershey-Chase Experiment: Fig 2. Hershey- Chase experimented with radioactive phosphorous and sulfur to confirm that DNA is the “Transforming principle”. (Author: Thomasione. This is a file from the Wikimedia Commons; GNU Free Documentation License) • Labeled bacteriophage DNA with radioactive phosphorus • After bacteriophage infected bacteria, phosphorus was found in bacteria • Labeled protein coat with radioactive sulfur • After bacteriophage infected bacteria, no sulfur was found in bacteria • Hershey and Chase provided further evidence that DNA was the "transforming principle". Joint initiative of IITs and IISc – Funded by MHRD Page 3 of 91 NPTEL – Biotechnology – Microbiology Although Avery, MacLeod, and McCarty discovered some evidence to show that DNA was responsible for the transfer of information, many people were still skeptical and believed it was protein. Alfred Hershey and Martha Chase were determined to provide more concrete evidence to prove that DNA was the genetic material in bacteriophages. In their first experiment, Hershey and Chase labeled bacteriophage DNA by injecting radioactive phosphorus into the bacteriophage. Because DNA contains phosphorus and amino acids do not, only the DNA was tagged. After the bacteriophage infected a strain of E. coli, Hershey and Chase observed radioactive phosphorus in the bacteria. In their second experiment, Hershey and Chase injected the bacteriophage with radioactive sulfur in order to tag only the protein coat. This time, after the bacteriophage infected the E. coli, Hershey and Chase did not observe the presence of sulfur in the bacteria. From their experiments, Hershey and Chase concluded that DNA was responsible for transferring information in Griffith’s experiment. Hershey and Chase’s experiment finally convinced everyone of DNA’s role as the genetic material in bacteriophages. Nucleic Acid Structure: Nucleotides A nucleotide is composed of a nucleobase (nitrogenous base), a five-carbon sugar (either ribose or 2-deoxyribose), and one phosphate group. Without the phosphate group, the nucleobase and sugar compose a nucleoside. A nucleotide can thus also be called a nucleoside monophosphate. The phosphate group’s form bonds with the 2, 3, or 5-carbon of the sugar, with the 5-carbon site most common. Cyclic nucleotides form when the phosphate group is bound to two of the sugar's hydroxyl groups. Nucleotides contain either a purine or a pyrimidine base. Ribonucleotides are nucleotides in which the sugar is ribose. Deoxyribonucleotides are nucleotides in which the sugar is deoxyribose. Nucleic acids are polymeric macromolecules made from nucleotide monomers. In DNA, the purine bases are adenine and guanine, while the pyrimidines are thymine and cytosine. RNA uses uracil in place of thymine. Adenine always pairs with thymine by 2 hydrogen bonds, while guanine pairs with cytosine through 3 hydrogen bonds, each due to their unique structures. Joint initiative of IITs and IISc – Funded by MHRD Page 4 of 91 NPTEL – Biotechnology – Microbiology Fig. 3. Nucleotide base structures: Purines and pyrimidines and its mono, di, tri phosphates. Table 1. Naming nucleosides and nucleotides Definitions Adenine (A) Bases Guanine Cytosine (G) (C) Uracyl (U) The combination of a ribose and a base Adenosine Guanosine Cytidine Uridine constitutes a nucleoside. The combination of a phosphate, a ribose Adenylate Guanylate Cytidylate Uridylate and a base constitutes a nucleotide. Chargaff's rules: Chargaff's rules state that DNA from any cell of all organisms should have a 1:1 ratio of pyrimidine and purine bases and, more specifically, that the amount of guanine is equal to cytosine and the amount of adenine is equal to thymine. This pattern is found in both strands of the DNA. They were discovered by Austrian chemist Erwin Chargaff. The rule holds that a double-stranded DNA molecule globally has percentage base pair equality: %A = %T and %G = %C. The rigorous validation of the rule constitutes the basis of Watson-Crick pairs in the DNA double helix. DNA as a double helix DNA is a long polymer made from repeating units called nucleotides. As first discovered by James D. Watson and Francis Crick, the structure of DNA of all species comprises two helical chains each coiled round the same axis, and each with a pitch of 34 Ångströms (3.4 nanometres) and a radius of 10 Ångströms (1.0 nanometres). According to another study, when measured in a particular solution, the DNA chain measured 22 to 26 Ångströms wide (2.2 to 2.6 nanometres), and one nucleotide unit Joint initiative of IITs and IISc – Funded by MHRD Page 5 of 91 NPTEL – Biotechnology – Microbiology measured 3.3 Å (0.33 nm) long. Although each individual repeating unit is very small, DNA polymers can be very large molecules containing millions of nucleotides. For instance, the largest human chromosome, chromosome number 1, is approximately 220 million base pairs long. Fig 4. The structure of DNA showing with detailed structure of the four bases, adenine, cytosine, guanine and thymine, and the location of the major and minor groove. In living organisms DNA does not usually exist as a single molecule, but instead as a pair of molecules that are held tightly together. These two long strands entwine like vines, in the shape of a double helix. The nucleotide repeats contain both the segment of the backbone of the molecule, which holds the chain together, and a nucleobase, which interacts with the other DNA strand in the helix. A nucleobase linked to a sugar is called a nucleoside and a base linked to a sugar and one or more phosphate groups is called a nucleotide. Polymers comprising multiple linked nucleotides (as in DNA) are called a polynucleotide. The backbone of the DNA strand is made from alternating phosphate and sugar residues. The sugar in DNA is 2-deoxyribose, which is a pentose (five-carbon) sugar. The sugars are joined together by phosphate groups that form phosphodiester bonds between the third and fifth carbon atoms of adjacent sugar rings. These asymmetric bonds mean a strand of DNA has a direction. In a double helix the direction of the nucleotides in one strand is opposite to their direction in the other strand: the strands are antiparallel. The asymmetric ends of DNA strands are called the 5′ (five prime) and 3′ (three prime) ends, Joint initiative of IITs and IISc – Funded by MHRD Page 6 of 91 NPTEL – Biotechnology – Microbiology with the 5' end having a terminal phosphate group and the 3' end a terminal hydroxyl group. One major difference between DNA and RNA is the sugar, with the 2-deoxyribose in DNA being replaced by the alternative pentose sugar ribose in RNA. The DNA double helix is stabilized primarily by two forces: hydrogen bonds between nucleotides and base-stacking interactions among the aromatic nucleobases. In the aqueous environment of the cell, the conjugated π bonds of nucleotide bases align perpendicular to the axis of the DNA molecule, minimizing their interaction with the solvation shell and therefore, the Gibbs free energy. The four bases found in DNA are adenine (abbreviated A), cytosine (C), guanine (G) and thymine (T). These four bases are attached to the sugar/phosphate to form the complete nucleotide. Fig. 5. Chemical structure of DNA, with colored label identifying the four bases as well as the phosphate and deoxyribose components of the backbone. The nucleobases are classified into two types: the purines, A and G, being fused five- and six-membered heterocyclic compounds, and the pyrimidines, the six-membered rings C and T. A fifth pyrimidine nucleobase, uracil (U), usually takes the place of thymine in RNA and differs from thymine by lacking a methyl group on its ring. Uracil is not usually found in DNA, occurring only as a breakdown product of cytosine. In addition to RNA and DNA a large number of artificial nucleic acid analogues have also been created to study the proprieties of nucleic acids, or for use in biotechnology. Twin helical strands form the DNA backbone. Another double helix may be found by tracing the spaces, or grooves, between the strands. Grooves are adjacent to the base pairs and may provide a binding site. As the strands are not directly opposite each other, the grooves are unequally sized. One groove, the major groove, is 22 Å wide and the other, the minor groove, is 12 Å wide. The narrowness of the minor groove means that the edges of the bases are more accessible in the major groove. As a result, proteins Joint initiative of IITs and IISc – Funded by MHRD Page 7 of 91 NPTEL – Biotechnology – Microbiology like transcription factors that can bind to specific sequences in double-stranded DNA usually make contacts to the sides of the bases exposed in the major groove. In a DNA double helix, each type of nucleobase on one strand normally interacts with just one type of nucleobase on the other strand. This is called complementary base pairing. Here, purines form hydrogen bonds to pyrimidines, with A bonding only to T, and C bonding only to G. This arrangement of two nucleotides binding together across the double helix is called a base pair. As hydrogen bonds are not covalent, they can be broken and rejoined relatively easily. The two strands of DNA in a double helix can therefore be pulled apart like a zipper, either by a mechanical force or high temperature. As a result of this complementarity, all the information in the double-stranded sequence of a DNA helix is duplicated on each strand, which is vital in DNA replication. Indeed, this reversible and specific interaction between complementary base pairs is critical for all the functions of DNA in living organisms. The two types of base pairs form different numbers of hydrogen bonds, AT forming two hydrogen bonds, and GC forming three hydrogen bonds. DNA with high GC-content is more stable than DNA with low GC-content. As noted above, most DNA molecules are actually two polymer strands, bound together in a helical fashion by noncovalent bonds; this double stranded structure (dsDNA) is maintained largely by the intrastrand base stacking interactions, which are strongest for G, C stacks. The two strands can come apart – a process known as melting – to form two ss DNA molecules. Melting occurs when conditions favor ssDNA; such conditions are high temperature, low salt and high pH (low pH also melts DNA, but since DNA is unstable due to acid depurination, low pH is rarely used). A, B and Z DNA In a DNA molecule, the two strands are not parallel, but intertwined with each other. Each strand looks like a helix. The two strands form a "double helix" structure, which was first discovered by James D. Watson and Francis Crick in 1953. In this structure, also known as the B form, the helix makes a turn every 3.4 nm, and the distance between two neighboring base pairs is 0.34 nm. Hence, there are about 10 pairs per turn. The intertwined strands make two grooves of different widths, referred to as the major groove and the minor groove, which may facilitate binding with specific proteins. Joint initiative of IITs and IISc – Funded by MHRD Page 8 of 91 NPTEL – Biotechnology – Microbiology In a solution with higher salt concentrations or with alcohol added, the DNA structure may change to an A form, which is still right-handed, but every 2.3 nm makes a turn and there are 11 base pairs per turn. Another DNA structure is called the Z form, because its bases seem to zigzag. Z DNA is left-handed. One turn spans 4.6 nm, comprising 12 base pairs. The DNA molecule with alternating G-C sequences in alcohol or high salt solution tends to have such structure. RNA Each nucleotide in RNA contains a ribose sugar, with carbons numbered 1' through 5'. A base is attached to the 1' position, in general, adenine (A), cytosine (C), guanine (G), or uracil (U). Adenine and guanine are purines, cytosine, and uracil are pyrimidines. A phosphate group is attached to the 3' position of one ribose and the 5' position of the next. The phosphate groups have a negative charge each at physiological pH, making RNA a charged molecule (polyanion). An important structural feature of RNA that distinguishes it from DNA is the presence of a hydroxyl group at the 2' position of the ribose sugar. The presence of this functional group causes the helix to adopt the A-form geometry rather than the B-form most commonly observed in DNA. This results in a very deep and narrow major groove and a shallow and wide minor groove. A second consequence of the presence of the 2'hydroxyl group is that in conformationally flexible regions of an RNA molecule (that is, not involved in formation of a double helix), it can chemically attack the adjacent phosphodiester bond to cleave the backbone. RNA is transcribed with only four bases (adenine, cytosine, guanine and uracil), but these bases and attached sugars can be modified in numerous ways as the RNAs mature. Pseudouridine (Ψ), in which the linkage between uracil and ribose is changed from a C– N bond to a C–C bond, and ribothymidine (T) are found in various places (the most notable ones being in the TΨC loop of tRNA). Another notable modified base is hypoxanthine, a deaminated adenine base whose nucleoside is called inosine (I). Inosine plays a key role in the wobble hypothesis of the genetic code. There are nearly 100 other naturally occurring modified nucleosides, of which pseudouridine and nucleosides with 2'-O-methylribose are the most common. The specific roles of many of these modifications in RNA are not fully understood. Joint initiative of IITs and IISc – Funded by MHRD Page 9 of 91 NPTEL – Biotechnology – Microbiology The functional form of single-stranded RNA molecule requires a specific tertiary structure. The scaffold for this structure is provided by secondary structural elements that are hydrogen bonds within the molecule. This leads to several recognizable "domains" of secondary structure like hairpin loops, bulges, and internal loops. Since RNA is charged, metal ions such as Mg2+ are needed to stabilize many secondary and tertiary structures. REFERENCES: Text Books: 1. Jeffery C. Pommerville. Alcamo’s Fundamentals of Microbiology (Tenth Edition). Jones and Bartlett Student edition. 2. Gerard J. Tortora, Berdell R. Funke, Christine L. Case. Pearson - Microbiology: An Introduction. Benjamin Cummings. 3. J. Krebs, E.S. Goldstein, Stephen T. Kilpatrick. Lewin’s Genes X. Jones and Bartlett Publishers. Reference Books: 1. Lansing M. Prescott, John P. Harley and Donald A. Klein. Microbiology. Mc Graw Hill companies. Joint initiative of IITs and IISc – Funded by MHRD Page 10 of 91 NPTEL – Biotechnology – Microbiology Module – 7 Microbial Molecular Biology and Genetics Lecture 2 - DNA replication DNA replication Templated replication • Sense and antisense strands: The sense strand of a double-stranded DNA molecule has a base sequence similar to that of the RNA that is transcribed from the DNA. The antisense strand (also known as template strand) has a sequence that is the reverse complement of the sense strand (base paired to it with an antiparallel 5' to 3' polarity). • Semiconservative replication: Discovery of the double helical structure of DNA immediately suggested a mechanism for precise replication -- namely separation of the two strands and template synthesis of a new copy of each of the missing complementary strands. This model predicted that each new double helix should have one strand of parental DNA and one strand of newly synthesized DNA. Fig 6. Three postulated methods of replication: 1. Semiconservative, 2. Conservative, 3. Dispersive Joint initiative of IITs and IISc – Funded by MHRD Page 11 of 91 NPTEL – Biotechnology – Microbiology • Messelson-Stahl experiment: Semiconservative replication was demonstrated by starting with bacteria whose DNA was labeled with heavy nitrogen (15N) and then allowing various amounts of growth to occur in media labeled with normal nitrogen. After one round of replication, the entire DNA had an intermediate density. After two rounds, one half was light and the other half still had an intermediate density. There was no conservation of heavy DNA. However, the data suggested that each of the labeled strands from the original DNA remained intact and separated from its partner in each replication cycle. Initiation of replication • Origin of Replication: The DNA of a bacterial chromosome is a closed circular structure. DNA replication begins at a specific site (origin of replication) characterized by the presence of repeated 9 base and 13 base nucleotide sequences. The repeating units are referred to as 9-mers and 13-mers. The 13 mers are AT rich, making easier to separate the two strands of the double-stranded DNA. • DNAa protein: The protein coded by the DNA a gene binds to the repeated 9mers. This forms a tight loop and generates a strain that causes strand separation in the region containing the AT-rich 13-mers • Helicase: Enzymes called helicases use energy derived from ATP to further separate the two strands of the DNA double helix. • Topoisomerase I: Separation of the two strands of the DNA double helix requires substantial unwinding of the helix. An enzyme known as topoisomerase I (gyrase) relieves twisting strain that is generated by unwinding the double helix. It is believed to act by cutting one of the strands such that the other strand can rotate freely to relieve the strain, and then resealing the strand that has been temporarily cut. • Keeping the helix open: Single stranded DNA-binding proteins (SSBPs) attach to the single stranded DNA generated by unwinding the double helix and temporarily keep it from reforming double helical structures. Joint initiative of IITs and IISc – Funded by MHRD Page 12 of 91 NPTEL – Biotechnology – Microbiology Patterns of replication • Replication forks: When the two strands of double helical DNA separate and replication of both strands begins, a forked or Y-shaped structure is formed. • Bidirectional replication: In bacterial cells, replication starts at a specific origin of replication within the circular DNA molecule and proceeds in both directions away from the origin. This results in the formation of a replication "bubble", which continues to elongate as replication proceeds. A similar pattern is seen in eukaryotic chromosomes, except that multiple origins are involved. • Theta structure in bacteria: As replication proceeds in both directions around the circular chromosomes of bacteria, a structure reminiscent of the Greek letter theta is formed. • Replicons: The very long DNA double helices found in eukaryotic chromosomes contain multiple origins of replication, which often initiate bidirectional replication more or less synchronously. Each unit of replication is called a replicon. Replication continues until the replication bubbles fuse to yield fully replicated DNA strands. • Other patterns There are also two alternative strategies for replication of circular DNA that are more complex than the simple bidirectional model. These are the rolling circle mode (sigma mode) used by some types of viruses, and the D loop mode, used for replication of mitochondrial and chloroplast DNAs. 5'-to-3' synthesis • Unidirectional addition of nucleotides: Polymerization of DNA, and also of RNA, occurs by condensing a 5'-nucleoside triphosphate (dNTP or NTP) onto the 3' hydroxyl group of another nucleotide, or onto the 3'-end of a growing polynucleotide chain. No mechanism exists for extending the 5'-end of a nucleotide chain. • Energy for synthesis: Hydrolysis of the dNTP or NTP that is being added provides the energy needed to form a covalent bond. The outer two phosphates of the triphosphate are split off and the innermost phosphate forms an ester linkage Joint initiative of IITs and IISc – Funded by MHRD Page 13 of 91 NPTEL – Biotechnology – Microbiology with the 3'-hydroxyl group at the end of the pre-existing chain. This results in a phosphodiester bond RNA priming • No new DNA starts: Deoxynucleotides can only be added to 3'-end of a preexisting strand of DNA (or RNA). There are no enzymes capable of initiating the synthesis of a DNA-templated DNA molecule at the level of a single nucleotide. This makes necessary to use an indirect priming procedure. • RNA primers: A variety of enzymes are capable of initiating new DNAtemplated RNA synthesis (as in transcription). New DNA synthesis is primed with a short segment of RNA that is later removed • Primase: A separate enzyme in the initiation complex called primase synthesizes a short RNA primer each time that new DNA synthesis begins, including all new starts in the discontinuous pattern of synthesis described below. Leading and lagging strands • Unidirectional synthesis of antiparallel DNA: The inability to synthesize new chains in a 3' to 5' direction adds a major complication to the replication of antiparallel double-stranded DNA. At any replication fork, one of the template strands has a 3' to 5' orientation, which is what is needed for synthesis of a new complementary strand in a 5' to 3' direction. Synthesis on that template is primed and starts very quickly. However, the other template strand has a 5' to 3' orientation and is thus unable to support synthesis beginning at the origin and moving away from it in a 3' to 5' direction. Because of this, the 5' to 3' template strand accumulates in a single stranded configuration until there is a sufficient length so that synthesis of its complementary antiparallel strand can be primed and initiated in a 5'-to-3' direction ("backward" toward the origin of replication). The strand whose synthesis begins immediately is called the "leading" strand, and the one whose synthesis is delayed is called the "lagging" strand. Joint initiative of IITs and IISc – Funded by MHRD Page 14 of 91 NPTEL – Biotechnology – Microbiology • Discontinuous synthesis -- Okazaki fragments: As synthesis of the leading strand continues, more and more of the single stranded DNA of the lagging strand is unwound. Each time that a sufficient length is reached, synthesis of a new segment of the complementary strand begins. If the replicating DNA is denatured (separated into individual strands) before the newly synthesized pieces of the lagging strand have been ligated together, a number of relatively small fragments of newly synthesized DNA will be recovered, together with the much longer strands produced by continuous synthesis in the leading strand. The small fragments are called "Okazaki fragments", named for the person who first discovered them. DNA polymerase III Template DNA synthesis: After the RNA primer has reached an adequate length, DNA polymerase III begins synthesis of DNA, which proceeds to completion in the leading strand, and proceeds until the 5'-end of the previous primer is encountered in the lagging strand. • DNA polymerase III holoenzyme: DNA polymerase III is a highly complex dimeric aggregate, consisting of 20 or more protein subunits. The alpha subunits perform the actual DNA synthesis, but operate in conjunction with multiple accessory proteins • Simultaneous synthesis of leading and lagging strands: There is yet another complicating factor in DNA synthesis. It is now generally believed that the leading and lagging strands are synthesized simultaneously by a single dimeric DNA polymerase III complex. This requires formation of a looped structure with the leading and lagging strands so positioned that their synthesis can occur side by side in the same orientation, despite the fact that the newly synthesized chains are growing in opposite directions relative to the overall DNA that is being replicated. A similar looping also appears to occur in eukaryotic DNA replication. • Clamping function of beta subunits: The beta subunits appear to have a clamping function that keeps the leading and lagging strands appropriately aligned with the catalytically active alpha subunits. Joint initiative of IITs and IISc – Funded by MHRD Page 15 of 91 NPTEL – Biotechnology – Microbiology • Replisome complex: The entire DNA-synthesizing complex at each replication fork, which also includes topoisomerase, helicase, and primase, is sometimes referred to as a replisome. DNA polymerase I • Not the primary enzyme for DNA synthesis: DNA polymerase I is so named because it was the first of the DNA polymerase enzymes to be isolated and characterized. Although it is capable of template-directed DNA synthesis, it is now known not to be the enzyme primarily responsible for new DNA synthesis, but it does have other important roles, as described below. • 5' to 3' exonuclease activity: DNA polymerase I has an unusual 5' to 3' exonuclease activity. This gives it the ability to start at a single-stranded break and progressively remove nucleotides and replace them in a 5' to 3' direction. • Removal of RNA primer: In the lagging strand, when DNA polymerase III runs into the 5'-end of the previous RNA primer, it is unable to proceed further. Although the new DNA butts up against the primer, it is not covalently joined to it. DNA polymerase III dissociates, leaving a "nick" (a single stranded gap) between the new DNA and the primer. Starting at the nick, DNA polymerase I removes the primer ribonucleotides one at a time, using its 5' to 3' exonuclease activity, and replaces them with deoxyribonucleotides, using its DNA polymerase activity.. • DNA repair: DNA polymerase I is also used to fill in short gaps in DNA, often as a part of a repair process that excises part of a damaged strand and replaces it with new DNA templated from the remaining strand. Proofreading and DNA repair • Removal of mismatched bases: DNA polymerases III and I both have 3' to 5' exonuclease activity. This allows them to remove a mismatched nucleotide that has just been added to a growing DNA chain and make another attempt to insert the correct nucleotide. This "proofreading" function helps to reduce the number of mistakes in DNA synthesis that would otherwise result in mutation.. Joint initiative of IITs and IISc – Funded by MHRD Page 16 of 91 NPTEL – Biotechnology – Microbiology • DNA polymerase II: There is yet another prokaryotic DNA polymerase, designated DNA polymerase II, whose main function appears to be to synthesize replacement DNA during DNA repair. Final steps in DNA synthesis • DNA Ligase: After the last ribonucleotide is removed from the primer and replaced with a deoxynucleotide, there is still a nick in the newly synthesized lagging strand. This nick is closed by DNA ligase, which forms a covalent phosphodiester bond between the Okazaki fragments, joining them into a continuous strand of DNA. Note that Okazaki fragments accumulate when ligase function is impaired. • Topoisomerase II: Its role is to cut and reseal the newly synthesized DNAs as needed so that they can separate from each other (it is easy to visualize two circular genomes linked through each other at the end of replication). Fig. 7. Diagram of DNA polymerase extending a DNA strand and proof-reading. Joint initiative of IITs and IISc – Funded by MHRD Page 17 of 91 NPTEL – Biotechnology – Microbiology REFERENCES: Text Books: 1. Jeffery C. Pommerville. Alcamo’s Fundamentals of Microbiology (Tenth Edition). Jones and Bartlett Student edition. 2. Gerard J. Tortora, Berdell R. Funke, Christine L. Case. Pearson - Microbiology: An Introduction. Benjamin Cummings. 3. J. Krebs, E.S. Goldstein, Stephen T. Kilpatrick. Lewin’s Genes X. Jones and Bartlett Publishers. Reference Books: 1. Lansing M. Prescott, John P. Harley and Donald A. Klein. Microbiology. Mc Graw Hill companies. Joint initiative of IITs and IISc – Funded by MHRD Page 18 of 91 NPTEL – Biotechnology – Microbiology Module – 7 Microbial Molecular Biology and Genetics Lecture 3 - The genetic code and the gene structure The Genetic code Since there are 20 different kinds of amino acids in proteins and only four kinds of nucleotides in DNA, the relationship between the gene and its most elementary functional product, i.e., between DNA and protein, can hardly be interpreted through a code of one nucleotide = one amino acid. A coding sequence of two nucleotides for one amino acid, or a doublet code, would produce only 16 possible coding combinations, or codons. Codons are group of nucleotides that specifies one amino acid. By the genetic code (George Gamov, 1954), we mean, a collection of base sequences (codons) that correspond to each amino acid and to translation signals. A codon size of three nucleotides for one amino acid are triplet codon seems more likely, since it produces 64 possible codons, however, only 20 amino acids need to be coded, 44 codons in a triplet code seem to be superfluous. To account for the excess of codons beyond the necessary 20, we can suppose that more than one codon can code for a particular amino acid. For example, if each kind of amino acid were coded by three different possible codons, 60 possible codons would be accounted for. A code in which there is more than one codon for the same amino acid, is called degenerate. It is also possible that some or all of the codons in excess of 20 do not code for any amino acid and are therefore nonsense codons. Fig. 8. Genetic Code. Joint initiative of IITs and IISc – Funded by MHRD Page 19 of 91 NPTEL – Biotechnology – Microbiology The genetic code has following general properties, mostly applicable to the genes of all the organisms: Genetic code is triplet. As discussed earlier, singlet and doublet codons cannot form 20 combinations, which is the minimum requirement, therefore triplet codon is a necessity, so that all the amino acids must be coded. Genetic code is non-overlapping. During translation, the codons are read one after another, in a sequence. One base of a codon is not used by the other codons. Therefore, if there are six bases, they will code for two amino acids only. e.g.; in case of non-overlapping, a gene sequence of UUUCCC only two amino acids will be coded, phenylalanine (UUU) and proline (CCC), whereas for an overlapping code, more than two amino acids could be coded, phenylalanine (UUU), serine (UCC) and proline (CCC). Fig. 9. Overlapping and non-overlapping genetic code Genetic code is commaless. The bases are read one after the other in the codons, i.e., no bases or codons are reserved for punctuation or comma. When the first amino acid is coded, the second will be coded by the next three bases immediately, and no base will be wasted to serve as a comma. Once the translation begins, the codons are read one after the other with no break or demarcating signals in between them. Genetic code is non-ambiguous. Each codon has a particular amino acid for coding, and it will code for that amino acid only. There is one to one relationship between codon to amino acid. However, there is an exception, AUG codes for methionine and GUG codes for valine, but if AUG is absent, then GUG codes for methionine, as starting codon for Joint initiative of IITs and IISc – Funded by MHRD Page 20 of 91 NPTEL – Biotechnology – Microbiology protein synthesis. In an ambiguous code, one codon can code for more than one amino acid. In certain rare cases, the genetic code is found to be ambiguous i.e, some codons codes for different amino acids under different conditions, e.g. in streptomycin sensitive strain of E.coli the codon UUU normally codes for phenylalanine, but it may also code for isoleucine, serine when treated with streptomycin. This ambiguity is enhanced at high Mg ion concentration, low temperature and the presence of ethyl alcohol. Genetic code is degenerate. Since there are more codons than the amino acids, more than one codon may specify the same amino acid. Such different codons that specify the same amino acid are called as synonymous codons. e.g., UUU = UUC = phenylalanine. Genetic code has start/stop signals. Some codons are specially meant for initiation and termination of protein synthesis. e.g.: AUG codes for methionine, serves as initiation codon in eukaryotes and GUG in case of prokaryotes. Three codons UAG (amber), UAA (ochre), UGA (opal) are called as termination codons, because they terminate protein synthesis. Earlier, they were called as nonsense codons, because they do not code for any amino acid, but, since they are involved in termination of protein synthesis, they are called as termination codons. The initiation and termination codons are known as signals and this phenomenon is known as punctuation. Genetic code is polar. It means that the genetic code has a fixed start and termination ends, and is always read in a fixed direction, i.e. in 5’→ 3’ direction and the polypeptide chain is synthesized in N→C direction i.e., from amino group (NH2) to carboxylic group (COOH). Genetic code is universal. The same genetic code is applicable to all organisms, from bacteria to man, i.e.; the codons have the same meaning in all the organisms. e.g., UUU = phenylalanine in bacteria, mouse, man and tobacco. The universality of the genetic code, however, does not mean that DNA base ratios must be similar in different species for genes specifying similar proteins. The fact that the code is degenerate enables many bases to be changed by mutation in a sequence of mRNA, but this mRNA could still produce the same amino acid sequence. Joint initiative of IITs and IISc – Funded by MHRD Page 21 of 91 NPTEL – Biotechnology – Microbiology In 1979 investigators began started DNA sequencing of mitochondrial DNA in humans, cattle and mice. During their experiments, they were surprised to learn that the genetic code used by the mitochondrial DNA was not the same as the universal genetic code. e.g. UGA, which is a non-sense codon, but it codes for tryptophan in mtDNA, AGG which codes for arginine is a non-sense codon in mtDNA. So, extra chromosomal DNA such as mtDNA and ctDNA do not come under the universality of the genetic code. Wobble Hypothesis. Out of the 64 codons, three are involved in termination process. So, there are only 61 codons specifying the amino acids, and the cell should have 61 different types of tRNAs, each having a different anticodon for the recognition of codons. However, the actual number of tRNA is found to be much less than 61. This means that the anticodons of tRNA read more than one codon on the mRNA. Crick (1966) proposed a hypothesis to explain the degeneracy of the genetic code; the hypothesis is known as Wobble hypothesis. According to this hypothesis, the major degeneracy occurs at the third position, i.e., the third codon is not important in base pairing, and the actual pairing occurs only in the first two codon-anticodon pairs. The base at the 5′ end of the anticodon and the base at the 3′ end of the codon form hydrogen bonds without any specificity. The third base is called as the wobble base. This wobble base of codon lacks specificity and the base in the first position of the anticodon is usually abnormal e.g., inosine, tyrosine, etc. These abnormal bases are able to pair up with more than one nitrogen base at the same position e.g., inosine (I) can pair up with A, C and U. The pairing between unusual base of tRNA and wobble base of mRNA is called wobble pairing. Gene structure: A gene is a linear sequence of nucleotides or codons (= follow-up of three nucleotides) with a fixed start and end point. In its most simplified version one gene codes the information for one enzyme (--> "one gene - one enzyme hypothesis"). This hypothesis has been modified over the years to the "one gene - one polypeptide" hypothesis, which holds true for most bacterial genes. Many enzymes and proteins for which genes carry the code Joint initiative of IITs and IISc – Funded by MHRD Page 22 of 91 NPTEL – Biotechnology – Microbiology are made up from more than one polypeptide requiring more than one gene. "The DNA segment which codes for a single polypeptide sequence is often referred to as a cistron." Most genes consist of discrete sequences of codons that are "read" (transcribed) on only one way to generate one translated polypeptide sequence. The code is NOT overlapping and there is only one defined start point with one reading frame, i.e. the way in which the DNA nucleotides are arranged in codons. While each gene harbors the genetic information of the DNA strand and typically codes for a specific protein or enzyme, viral, prokaryotic and eukaryotic genes structures differ significantly. In contrary to bacterial genes, genes in eukaryotic life forms are more spaced apart and also are organized in so-called exon-intron structure; - exons are coding segments of a genes, while introns are non-coding gene regions which play a role in splicing and gene regulation; - genetic information of cistrons in prokaryotes is usually continuous and introns rarely occur; moreover, bacterial genes are often organized in so-called operons; - transcription of bacterial operons generates polygenic or polycistronic mRNA, i.e. which is RNA that is translated into more than one polypeptide. Fig. 10. A bacterial structural gene. Joint initiative of IITs and IISc – Funded by MHRD Page 23 of 91 NPTEL – Biotechnology – Microbiology Functional structure of a gene: Fig 11. Diagram of the "typical" eukaryotic protein-coding gene. Promoters andenhancers determine what portions of the DNA will be transcribed into theprecursor mRNA (pre-mRNA). The pre-mRNA is then spliced into messenger RNA(mRNA) which is later translated into protein. • All genes have regulatory regions in addition to regions that explicitly code for a protein or RNA product. A regulatory region shared by almost all genes is known as the promoter, which provides a position that is recognized by the transcription machinery when a gene is about to be transcribed and expressed. A gene can have more than one promoter, resulting in RNAs that differ in how far they extend in the 5' end. Although promoter regions have a consensus sequence that is the most common sequence at this position, some genes have "strong" promoters that bind the transcription machinery well, and others have "weak" promoters that bind poorly. These weak promoters usually permit a lower rate of transcription than the strong promoters, because the transcription machinery binds to them and initiates transcription less frequently. Other possible regulatory regions include enhancers, which can compensate for a weak promoter. Most regulatory regions are "upstream"—that is, before or toward the 5' end of the transcription initiation site. Eukaryotic promoter regions are much more complex and difficult to identify than prokaryotic promoters. • Many prokaryotic genes are organized into operons, or groups of genes whose products have related functions and which are transcribed as a unit. By contrast, eukaryotic genes are transcribed only one at a time, but may include long stretches of DNA called introns which are transcribed but never translated into Joint initiative of IITs and IISc – Funded by MHRD Page 24 of 91 NPTEL – Biotechnology – Microbiology protein (they are spliced out before translation). Splicing can also occur in prokaryotic genes, but is less common than in eukaryotes. Of the two nucleotide strands of the DNA of a gene, only one strand contains coded genetic information and directs synthesis of RNA via a cell process called DNA transcription. This strand is called the template strand. During DNA transcription, only one strand of the double-helix of a gene, the so-called sense or template strand, serves as the so-called template for DNA transcription to form the new RNA molecule. While during DNA replication certain enzymes called helicases help to unwind and open the DNA double helix at the ori, it is the RNA polymerase enzyme which unwinds DNA during DNA transcription to create single stranded DNA regions.The distinct places along the DNA strand where transcription begins are called transcription start sites. - at the transcription start site, a complex protein cluster, the so-called transcriptome, assembles itself; in eukaryotic cells, the transcriptome is comprised of many protein components. Along these unwind and single-stranded DNA regions, new nucleotides are paired according to the Watson-Crick base-pairing rule (--> A with T; and G with C). The polymerization of the new nucleotides during the process of DNA transcription is catalyzed by an enzyme called RNA polymerase (RNA POL). The RNA polymerase recognizes certain structures on the DNA strand, called promoter regions, or promoters. Promoters are sequences of DNA which are usually located upstream from the actual coding or transcribed region of a gene. Different genes have promoters with different DNA sequences. In E.coli the promoter has two different functions which relate to two different regions within the promoter, each with a unique consensus sequence: 1. RNA polymerase recognition site - Mediated by the RNA polymerase recognition site, with the consensus sequence 5'-TTGACA-3' on the non-template strand of the gene - The site of the initial association of the RNA polymerase enzyme with the gene - Located about 35bp (-35 region) away from the transcription start site 2. RNA polymerase binding site - this DNA sequence is also known as the "Pribnow box" - centered at the -10bp region of the gene - this is the region where the RNA polymerase enzyme starts to localized unwinding of the Joint initiative of IITs and IISc – Funded by MHRD Page 25 of 91 NPTEL – Biotechnology – Microbiology DNA double strand of the gene - both gene regions tells the ribosome where to dock on and where to start the DNA transcription process on a gene - promoter regions often contain important regulatory sites, such as the operator region or the CAP binding site (see bacterial gene regulation!), which allow the fine tuning of DNA transcription depending on different environmental conditions. Another important stretch of DNA on a gene is the so-called leader sequence. - this gene region located downstream of the promoter region is transcribed first into mRNA, but is does NOT contain a translated codon; - it is a non-translated DNA sequence with two known functions: 1. it is important for initiation of protein translation at the small subunit of the ribosome - this happens with the help of the Shine-Dalgarno consensus sequence, (5'-AGGA3'), which is complementary to a sequence on the 16S rRNA transcript 2. it is involved in the regulation of DNA transcription, e.g. during attenuation. Following the non-transcribed leader sequence of a gene follows the coding region. - this code carrying region directs the actual synthesis of the polypeptide chain at the ribosome - a coding sequence of a gene usually begins with the consensus sequence 5'-TAC-5'; - this sequence after transcription by the RNA polymerase into the RNA translation initiation codon 5'-AUG-3' is translated by the ribosome into the amino acid Nformylmethionine; - the remainder of the coding region of a gene consists of a unique sequence of triplet codons, e.g. UCG, that specifies the sequence of amino acids for that particular gene; Another important sequence or region on a typical gene is the terminator sequence. - these gene region located at the end of a gene sequence is important to enforce the termination of DNA transcription by stopping the RNA polymerase; - it often after a non-transcribed so-called trailer sequence, which is important for the proper expression of the coding region of a gene. Joint initiative of IITs and IISc – Funded by MHRD Page 26 of 91 NPTEL – Biotechnology – Microbiology In order to begin gene transcription, the RNA polymerase enzyme further requires a number of helper proteins, the so-called transcription factors (TFs) - important bacterial transcription factors Sigma and Rho; - important eukaryotic transcription factors are myc, AP-1, TFIIA, TFIIB, etc.; - the RNA polymerase plus the transcription factors recognize and bind to the so-called promoter region (TATA box), which is located shortly before the beginning of a typical gene REFERENCES: Text Books: 1. Jeffery C. Pommerville. Alcamo’s Fundamentals of Microbiology (Tenth Edition). Jones and Bartlett Student edition. 2. Gerard J. Tortora, Berdell R. Funke, Christine L. Case. Pearson - Microbiology: An Introduction. Benjamin Cummings. 3. J. Krebs, E.S. Goldstein, Stephen T. Kilpatrick. Lewin’s Genes X. Jones and Bartlett Publishers. Reference Books: 1. Lansing M. Prescott, John P. Harley and Donald A. Klein. Microbiology. Mc Graw Hill companies. Joint initiative of IITs and IISc – Funded by MHRD Page 27 of 91 NPTEL – Biotechnology – Microbiology Module 7 – Microbial Molecular Biology and Genetics Lecture 4 - The expression of genes Expression of genes • DNA serves two major functions: 1. Encoding structural information that can be converted into RNA and (usually), hence into protein sequence. 2. Encoding regulatory signals that allow certain proteins to decide where to begin or terminate reading DNA • The "central dogma" of molecular biology: information flows from nucleic acids to proteins, not the reverse: DNA ----> RNA ----> polypeptides (proteins) Transcription = the process of making RNA from DNA templates • Translation = the process of making polypeptides Transcription • Transcription is a fundamental cellular process: RNA polymerases "transcribe" the genetic information on DNA into RNA strands. All cells have RNA polymerases (RNAP). • The RNA polymerases increase in complexity as you go from viruses (example, T7 RNA polymerase is made up of a single protein), to bacterial systems (one RNA polymerase made up of the proteins - beta, beta', 2 x alpha, omega and the sigma factor), and finally to eukaryotic systems (Three RNA polymerases - Pol I, Pol II, and Pol III, each with ten or more subunits). • While the RNA polymerases have become increasing complex as life evolved, their overall structure (as evidenced by crystallographic structures of bacterial RNA polymerase and Pol II) show remarkable similarity. There is also sequence similarity between the bacterial polymerase protein subunits and the proteins that make up the eukaryotic polymerases. • Transcription of any gene usually involves three distinct stages: Joint initiative of IITs and IISc – Funded by MHRD Page 28 of 91 NPTEL – Biotechnology – Microbiology o First, the RNA polymerase has to find the start site of a gene. The "holoenzyme" form of the polymerase does this by looking for the "promoter" site that exists just upstream of the gene start site. This process is termed "transcription initiation". This is followed by opening up (melting) of the duplex DNA to form an "open complex". o This is followed by a rapid change into the "elongation" phase of transcription where the "core polymerase" part of the RNA polymerase rapidly transcribes an RNA strand that is complementary to the "template" strand of the DNA. The change into the elongation phase usually occurs after a few bases of RNA have been transcribed (typically about 8-9 bases of RNA in bacterial system which form a RNA-DNA hybrid with the template strand), and involves a "clamping down" on the DNA to prevent the polymerase from falling off the DNA. o The final stage of the transcription of a gene is "termination", after the stop codon of the gene. The process of termination usually involves sequences where the polymerase slows down or stalls, and the polymerase-RNA-DNA complex (often proteins such as rho and NusA are involved in bacterial systems). • Genes have to be transcribed to mRNAs before they can be translated into proteins; more or less mRNA from a particular gene equals more or less of the protein encoded by the gene. Transcription is, thus, an important point in the control of gene "expression". Most genes are controlled transcriptionally, usually by regulation of the level of transcriptional initiation. For example, if a gene has a strong promoter, it will be more highly expressed when compared to another gene with a weak promoter site. Similarly if a regulatory protein can bind the promoter site of a gene (and prevent transcription initiation), then it can turn off the expression of that gene. • Transcriptional control of genetic expression is vital for cellular functions, and many diseases and cancers are results of defects in the transcriptional control of essentials genes. Joint initiative of IITs and IISc – Funded by MHRD Page 29 of 91 NPTEL – Biotechnology – Microbiology Transcription initiation • Transcription initiation in bacteria (prokaryotes) involves sigma factors. The sigma factor combine with the core RNA polymerase to form a holoenzyme that is competent for promoter binding. The core RNA polymerase, by itself, cannot bind the promoter site. • The sigma factor can be thought of as the specificity factor in the RNA polymerase. Each bacteria has several different sigma factors that recognize slightly different promoter sequences. Predominant among these is the sigma70 (70 kDa in E. coli; also called rpoD), which initiates the transcription of most genes in exponentially growing cells. There are two general classes of sigma factors - sigma 70 class and sigma 54 class: • The sigma 70 class of sigma factors share extensive sequence homology, and bind to two conserved sequences upstream of the gene start site (the -35 box and the -10 box). Each of these sigma factors recognizes slight variations in these conserved sequence boxes. Sigma70 binds promoter DNA as a part of the holoenzyme, and binds DNA very poorly in absence of the core enzyme. • The sigma54 class of genes (54 kDa in E. coli; also called rpoN) controls a much smaller set of genes than sigma70. It recognizes different conserved sequences (the 12 and -24 boxes). Unlike Sigma70, sigma54 can bind DNA even in the absence of the core RNA polymerase. It however lacks the ability to melt promoter DNA on its own - for this it needs to interact with other activator proteins that bind further upstream of the promoter site, as well as the core RNAP. • In eukaryotic systems, transcription initiation is very different. There are no sigma factors. Instead, the central protein in forming the "pre-initiation complex " (PIC) is the TATA binding protein (TBP), that binds to the TATA box, a conserved sequence just upstream of the initiation region. A large number of other general transcription factors such as TFIIB, TFIIE, TFIIF, TFIIH (TF stands for transcription factor; II stands for Pol II; there are similar factors for Pol I and Pol II) and others assemble to form the multisubunit TFIID complex. This PIC then recruits the RNA polymerase (Pol I, II, or III in eukaryotic cells) to initiate transcription. The PIC often remains at the promoter site, and is then available to initiate another round of transcription. Joint initiative of IITs and IISc – Funded by MHRD Page 30 of 91 NPTEL – Biotechnology – Microbiology • TBP is a universal transcription factor, and is seen in all eukaryotes and archaea. It sharply bends DNA at the TATA box. Transcription elongation • Once initiation is complete and the open complex forms, the RNA polymerase begins to read the template strand and add corresponding RNA nucleotides. This process is not always efficient, and the polymerase may make several passes at this. After a long enough RNA-DNA hybrid is made, the polymerase clears the promoter region and moves rapidly downstream. This is preceeded by a large conformational change in the polymerase core enzyme, as it clamps down on the DNA and becomes quite processive. • While transcription elongation is quite rapid, the polymerase does not transcribe all sequences with equal efficieny. The elongation rate is not uniform, and there can be pausing or stalling. Elongation factors (GreA/GreB in bacterial systems; TFIIS in eukaryotes) act to help the polymerase along by stimulating backtracking and cleavage of the newly formed RNA (from the 3' end). • Other factors are involved in the elongation cycle - for example, in eukaryotic Pol II there are the elongins and ELL proteins that increase the elongation rate, as well as factors to remodel chromatin. Fig. 12. Transcription elongation Joint initiative of IITs and IISc – Funded by MHRD Page 31 of 91 NPTEL – Biotechnology – Microbiology Transcription activation & repression • There are multiple modes of transcription activation. All these usually involve different protein factors (activators) that bind DNA sequences (enhancer sequences) in and around the promoter site. All these act to increase the affinity of the initiation complex or RNA holoenzyme at the promoter (thus enhancing the chance that a productive open complex will form and transcription will initiate). • When enhancer elements combine with poor promoter sequences, activators can modulate the activity of a gene by several hundred or thousand fold. For example, if a gene has a poor -35 or -10 box in its promoter site, it will be poorly expressed - an activator protein can, in such a case, enhance the activity of the gene by several orders of magnitude, by recruiting the RNA polymerase (or other initiation factors) to the promoter site. • Repression, on the other hand, works by having protein (repressors) that sit on or close to the promoter regions of the DNA, preventing RNA polymerase or initiator/activator proteins from starting transcription initiation (thereby "turningoff" the gene). • Activators and repressors are usually DNA binding proteins. These proteins have common DNA binding motifs such as Zn-fingers, helix-turn-helix, etc. Activators also have regions that interact with different domains of the RNA polymerase (parts of alpha, sigma, etc.). • In addition to protein activators, there are also DNA sequences that can directly interact with RNA polymerase components. For example, the alpha subunit in the bacterial polymerase has two domains - the alpha NTD (n-terminal domain) and the alpha CTD (c-terminal domain), linked by a flexible linker region. The alpha CTD can bind DNA sequences (UP elements) upstream of the promoter region, enhancing the affinity of the polymerase on the promoter. Joint initiative of IITs and IISc – Funded by MHRD Page 32 of 91 NPTEL – Biotechnology – Microbiology Transcription termination • Termination in bacterial system can be broadly classified as "rho independent", and "rho dependent". o In rho independent termination, there is formation of a stable GC rich stem-loop in the newly synthesized RNA followed by a string of U's (A's in the template strand) spaced about 20 bases downstream (these sites are often called intrinsic terminators). The stem loop "snares" the polymerase, slowing or stalling it. This pause, coupled with the low stability of the RNA-DNA hybrid at the active site (run of A=U basepairs) allows the RNA polymerase to fall off the template DNA and terminates the RNA transcription for that gene. o In rho dependent termination, rho binds the newly formed RNA (as hexamers), and stalls the RNA polymerase by interacting with it. In some models, the Rho hexamer translocates on the RNA. Rho termination activity is stimulated by ATP hydrolysis. This activity is greatly enhanced by Nus factors. • In eukaryotic cells, transcription termination involves cleavage of the elongating RNA chain by specific endonucleases which recognize particular sequences (AAUAAA) in the newly formed RNA. Once this happens, the RNA elongation complex is destabilized, and falls off the DNA. It is then available to attach to another nearby PIC, and start transcribing again. Fig. 13. Process of transcription in bacteria Joint initiative of IITs and IISc – Funded by MHRD Page 33 of 91 NPTEL – Biotechnology – Microbiology RNA processing (for eukaryotes) Eukaryotic genes contain Exons (which encode information that will end up being translated into protein) separated by Introns (sequences that will not encode proteins). At the 5' end of a gene we find a promoter and often a CpG island. Both of these elements regulate the transcription of a gene. At the 3' end of a gene we find a stop sequence, and a signal for polyadenylation (AAUAAA). In Eukaryotes this primary RNA transcript is then processed into a messenger RNA (mRNA) if the gene is to be translated into protein. This involves removing introns, which do not encode for protein, and splicing the remaining exons together. At the 5´ end of the mRNA a 7-Methyguanosine residue is added to provide a protective cap. At the 3´ end of the mRNA, 50-100 adenosine residues are added, generating a poly A tail. REVERSE TRANSCRIPTION Some viruses (such as HIV, the cause of AIDS), have the ability to transcribe RNA into DNA. HIV has an RNA genome that is duplicated into DNA. The resulting DNA can be merged with the DNA genome of the host cell. The main enzyme responsible for synthesis of DNA from an RNA template is called reverse transcriptase. In the case of HIV, reverse transcriptase is responsible for synthesizing a complementary DNA strand (cDNA) to the viral RNA genome. An associated enzyme, ribonuclease H, digests the RNA strand, and reverse transcriptase synthesizes a complementary strand of DNA to form a double helix DNA structure. This cDNA is integrated into the host cell's genome via another enzyme (integrase) causing the host cell to generate viral proteins that reassemble into new viral particles. In HIV, subsequent to this, the host cell undergoes programmed cell death, apoptosis of T cells. However, in other retroviruses, the host cell remains intact as the virus buds out of the cell. Some eukaryotic cells contain an enzyme with reverse transcription activity called telomerase. Telomerase is a reverse transcriptase that lengthens the ends of linear chromosomes. Telomerase carries an RNA template from which it synthesizes DNA repeating sequence, or "junk" DNA. This repeated sequence of DNA is important because, every time a linear chromosome is duplicated, it is shortened in length. With "junk" DNA at the ends of chromosomes, the shortening eliminates some of the nonJoint initiative of IITs and IISc – Funded by MHRD Page 34 of 91 NPTEL – Biotechnology – Microbiology essential, repeated sequence rather than the protein-encoding DNA sequence farther away from the chromosome end. Telomerase is often activated in cancer cells to enable cancer cells to duplicate their genomes indefinitely without losing important protein-coding DNA sequence. Activation of telomerase could be part of the process that allows cancer cells to become immortal. Translation: the Synthesis of Proteins Role of mRNA • carries codons (3-nucleotide sequences) arranged in linear fashion that code for amino acids • also carries signals needed to tell how to recognize ribosomes, start and stop signals for decoding protein • leader sequence on small ribosome subunit binds to complementary sequence on mRNA, allows initial formation of RNA-ribosome complex. Fig. 14. Role of mRNA Role of ribosome • two parts: a small subunit and a large subunit. These are separated except when attached to m-RNA • ribosomes contain a set of ribosomal proteins and several types of ribosomal RNA (r-RNA) • ribosomes catalyze the formation of peptide bonds; intially thought to be due to protein activity (enzyme), but now known to be due to RNA catalytic activity (ribozyme). Joint initiative of IITs and IISc – Funded by MHRD Page 35 of 91 NPTEL – Biotechnology – Microbiology Fig. 15. Ribosome Role of tRNA • structure: 4 loops, anticodon, AA binding site • ~ 60 types in bacteria (>100 in mammals) • only 73-93 nucleotides long • some bases modified after transcription; like pseudouridine. • extensive hairpin loops • anticodon site: recognizes codon on mRNA • Activation of tRNA: adding amino acids o requires special enzyme: AA-tRNA activating enzymes o ATP required, forms AA-AMP + PP, then AA-tRNA + AMP Fig. 16. tRNA strcuture Joint initiative of IITs and IISc – Funded by MHRD Page 36 of 91 NPTEL – Biotechnology – Microbiology Initiation of Translation • small ribosomal subunit initiates binding to mRNA • locates 5' end of mRNA • small subunit ribosome finds first AUG codon = start codon • large ribosome binds • tRNA carries the amino acid methionine to first position Fig. 17. Initiation of translation Elongation of Translation • 2 adjacent sites on ribosome: P (Peptide) and A (Amino Acid) site • A site accepts a new tRNA-AA • P site holds existing chain • peptide transferred from P site tRNA to A-site AA • enzyme activity is in ribosomal RNA (ribozyme) • also required: Energy (GTP) and elongation factors Fig. 18. Elongation of translation Joint initiative of IITs and IISc – Funded by MHRD Page 37 of 91 NPTEL – Biotechnology – Microbiology Termination of Translation • reach a "stop codon" UAG, UAA, or UGA • no t-RNAs bind • instead, specific release factors required • Net cost of translation: 4 phosphate bonds/amino acid added. Fig. 19. Termination of translation NOTE: Typical RNA polymerization rate (~ 40 nts/second at 37 °C in bacteria - close to translation rate of about ~15 aa/second). Joint initiative of IITs and IISc – Funded by MHRD Page 38 of 91 NPTEL – Biotechnology – Microbiology Fig. 20. The complete process of translation Joint initiative of IITs and IISc – Funded by MHRD Page 39 of 91 NPTEL – Biotechnology – Microbiology REFERENCES: Text Books: 1. Jeffery C. Pommerville. Alcamo’s Fundamentals of Microbiology (Tenth Edition). Jones and Bartlett Student edition. 2. Gerard J. Tortora, Berdell R. Funke, Christine L. Case. Pearson - Microbiology: An Introduction. Benjamin Cummings. 3. J. Krebs, E.S. Goldstein, Stephen T. Kilpatrick. Lewin’s Genes X. Jones and Bartlett Publishers. Reference Books: 1. Lansing M. Prescott, John P. Harley and Donald A. Klein. Microbiology. Mc Graw Hill companies. Joint initiative of IITs and IISc – Funded by MHRD Page 40 of 91 NPTEL – Biotechnology – Microbiology Module 7 – Microbial Molecular Biology and Genetics Lecture 5 - REGULATION OF GENES GENE REGULATION Each cell expresses, or turns on, only a fraction of its genes. The rest of the genes are repressed, or turned off. The process of turning genes on and off is known as gene regulation. Gene regulation also allows cells to react quickly to changes in their environments. Gene regulation can occur at any point during gene expression, but most commonly occurs at the level of transcription (when the information in a gene’s DNA is transferred to mRNA). Signals from the environment or from other cells activate proteins called transcription factors. These proteins bind to regulatory regions of a gene and increase or decrease the level of transcription. By controlling the level of transcription, this process can determine the amount of protein product that is made by a gene at any given time. Introduction • E. coli lives in colon. It has a metabolic pathway that allows for the synthesis of the amino acid tryptophan (Trp). o This pathway starts with a precurser molecule and proceeds through five enzyme catalyzed steps before reaching the final Trp product. • It is important that E. coli be able to control the rate of Trp synthesis because the amount of Trp available from the environment varies considerably. o If we eat a meal with little or no Trp, the E. coli in our gut must compensate by making more. o If we eat a meal rich in Trp, E. coli doesn't want to waste valuable resources or energy to produce the amino acid because it is readily available for use. • • Therefore, E. coli uses the amount of Trp present to regulate the pathway. o If levels are not adequate, the rate of Trp synthesis is increased. o If levels are adequate, the rate of Trp synthesis is inhibited. There are two ways of regulating the Trp pathway: Joint initiative of IITs and IISc – Funded by MHRD Page 41 of 91 NPTEL – Biotechnology – Microbiology o The first method works to decrease the synthesis of Trp by inhibiting the first enzyme in the pathway, preventing the rest of the pathway from proceeding. o What inhibits the first enzyme? Trp does! The more Trp in the cell, the more that can bind to the first enzyme and prevent it from catalyzing the first step. o This method of regulation is feedback inhibition in which the end product of a pathway acts as an inhibitor of an enzyme in that pathway. • The other method of control stops the production of the enzymes in the pathway at the transcription level. o Remember that enzymes are proteins that must be transcribed and translated from the genetic code. o If the genes for the enzymes are not transcribed to mRNA, then translation to the enzymes cannot occur. o • Without enzymes, there is no Trp synthesis. This method of control is called regulation of gene expression because control is taking place at the genetic level. This method of control will now be examined in detail. Trp Operon: Fig. 21. Structure of trp operon Joint initiative of IITs and IISc – Funded by MHRD Page 42 of 91 NPTEL – Biotechnology – Microbiology • The genes for the five enzymes in the Trp synthesis pathway are clustered on the same chromosome in what is called the Trp operon. • The Trp operon has three components: o Five Structural Genes: These genes contain the genetic code for the five enzymes in the Trp synthesis pathway o One Promoter: o DNA segment where RNA polymerase binds and starts transcription One Operator: DNA segment found between the promoter and structural genes. It determines if transcription will take place. If the operator is turned "on", transcription will occur. • When nothing is bonded to the operator, the operon is "on". o RNA polymerase binds to the promoter and transcription is initiated. o The five structural genes are transcribed to one mRNA strand. o The mRNA will then be translated into the enzymes that control the Trp synthesis pathway. • The operon is turned "off" by a specific protein called the repressor. o The repressor is a product of the regulator gene which is found some distance from the operon. Transcription of the regulator produces mRNA which is translated into the repressor. The repressor is inactive in this form and cannot bind properly to the operator with this conformation. o To become active and bind properly to the operator, a co-repressor must associate with the repressor. The co--repressor for this system is Trp This makes sense because E. coli does not want to synthesize Trp if it is available from the environment The more Trp available, the more that can associate with repressor molecules. Joint initiative of IITs and IISc – Funded by MHRD Page 43 of 91 NPTEL – Biotechnology – Microbiology o An active repressor binds to the operator blocking the attachment of RNA polymerase to the promoter. o Without RNA polymerase, transcription and translation of the structural genes can't occur and the enzymes needed for Trp synthesis are not made. Repressible vs Inducible Systems • The Trp pathway is anabolic as Trp is being synthesized. The Trp and other regulated anabolic pathways are usually repressible because the system can be repressed by an overabundance of the end product. o The end product, Trp, in this case, decreases or stops the transcription of the enzymes necessary for its production. • Regulated catabolic pathways, on the other hand, are usually inducible because the pathway is stimulated rather than inhibited by a specific molecule. An example of an inducible system is lactose metabolism. Fig. 22. Inducible and repressible operons Joint initiative of IITs and IISc – Funded by MHRD Page 44 of 91 NPTEL – Biotechnology – Microbiology The lac Operon • The genes that code for the enzymes needed for lactose catabolism are clustered on the same chromosome in what is called the lac operon. • The Lac operon has three components: o Three Structural Genes: These contain the genetic code for the three enzymes in the lac catabolic pathway o One Promoter: o DNA segment where RNA polymerase binds and starts transcription One Operator: DNA segment found between the promoter and structural genes. It determines if transcription will take place. If the operator in turned "on", transcription will occur. Fig. 23. The lac operon • As in the Trp operon, the Lac operon is turned "off" by a specific protein called the repressor. o The repressor is the product of the regulator gene which is found outside the operon. Transcription of the regulator produces mRNA which is translated into the repressor. o But unlike the Trp operon, the repressor is active in this form and does not require a co-repressor. Joint initiative of IITs and IISc – Funded by MHRD Page 45 of 91 NPTEL – Biotechnology – Microbiology o The active repressor binds to the operator blocking the advancement of RNA polymerase to the structural genes. o Without RNA polymerase, transcription and translation of the genes can't occur and the enzymes needed for Lac metabolism are not made. • What turns the Lac operon "on"? Lactose does! o This makes sense because the cell only needs to make enzymes to catabolize lactose if lactose is present. • When lactose enters the cell, allolactose, an isomer of lactose is formed. • Allolactose binds to the repressor and alters its conformation so that it can't bind to the operator. • RNA polymerase can now start transcription. o The three structural genes are transcribed to one mRNA strand. o The mRNA will then be translated into the enzymes that control lactose catabolism. • In this sense, allolactose is an inducer. Fig. 24. The lac operon: a model of gene regulation in prokaryotic cells Joint initiative of IITs and IISc – Funded by MHRD Page 46 of 91 NPTEL – Biotechnology – Microbiology Negative vs Positive Control • While the Trp operon is an example of repressible gene regulation and the Lac operon is an example of inducible gene regulation, both are examples of negative control of genes because both operons are shut "off" by an active repressor. • Gene regulation would be positive; on the other hand, if an activator molecule turned the operon "on". • The Lac operon is also an example of a positive control system and is turned on by the cAMP-CAP complex, as described below: • E. coli can be described as a fussy eater. o Its first choice at every meal is glucose because glucose supplies maximum energy for growth. o Therefore, E. coli will only metabolize lactose if concentrations of glucose are low. • For this to work, there must be a signal to tell the Lac operon that glucose is not available and to start transcribing the genes to metabolize lactose. o This signal is a small molecule called cyclic AMP (cAMP). The amount of cAMP present in a cell is inversely proportional to the amount of glucose present. As a result, the absence of glucose results in an increase in cAMP in the cell. • The following describes the situation where there is lactose but no glucose available to the cell: o No glucose means high levels of cAMP. o cAMP binds to a molecule known as CAP. o CAP, when in association with cAMP, can bind to the promoter at the CAP binding site. o Here, the cAMP-CAP complex stimulates transcription by helping RNA polymerase bind to the promoter. RNA polymerase has a weak affinity for the Lac promoter and will not bind without this help. Joint initiative of IITs and IISc – Funded by MHRD Page 47 of 91 NPTEL – Biotechnology – Microbiology o Remember with lactose present so is allolactose. Allolactose binds to the repressor and prevents it from binding to the operator. Therefore, transcription and translation of the genes can occur. o The following depicts what happens when glucose and lactose are both present for E. coli to metabolize: With glucose present, there is very little or no cAMP. It cannot bind to the CAP binding site. Without this complex, RNA polymerase cannot bind to the promoter and transcription cannot occur. Even though allolactose is present and blocks the action of the repressor, there is no transcription of the lac genes because glucose is present. Fig. 25. Overall structural elements of Lac Operon Joint initiative of IITs and IISc – Funded by MHRD Page 48 of 91 NPTEL – Biotechnology – Microbiology Fig. 26. Lactose present, glucose scarce (cAMP level high): abundant lac mRNA synthesized Fig. 27. Lactose present, glucose present (cAMO level low): little lac mRNA synthesized Joint initiative of IITs and IISc – Funded by MHRD Page 49 of 91 NPTEL – Biotechnology – Microbiology REFERENCES: Text Books: 1. Jeffery C. Pommerville. Alcamo’s Fundamentals of Microbiology (Tenth Edition). Jones and Bartlett Student edition. 2. Gerard J. Tortora, Berdell R. Funke, Christine L. Case. Pearson - Microbiology: An Introduction. Benjamin Cummings. 3. J. Krebs, E.S. Goldstein, Stephen T. Kilpatrick. Lewin’s Genes X. Jones and Bartlett Publishers. Reference Books: 1. Lansing M. Prescott, John P. Harley and Donald A. Klein. Microbiology. Mc Graw Hill companies. Other References: 1. http://www.nature.com/scitable/topic/gene-expression-and-regulation-15 2. http://ghr.nlm.nih.gov/handbook/howgeneswork/geneonoff Joint initiative of IITs and IISc – Funded by MHRD Page 50 of 91 NPTEL – Biotechnology – Microbiology Module 7 – Microbial Molecular Biology and Genetics Lecture 6 – DNA Repair and Microbial Recombination DNA REPAIR It can be defined as “any one of the cellular process that attempts to correct errors in cellular DNA which occurred during cell division or by external environment”. There are many cellular proteins and enzymes which help in repairing most of the errors by means of different mechanisms. Photoreactivation • Catalyzed by DNA-Photolyase (DPL) • Reverses cyclobutyl pyrimidine dimers resulting from UV irradiation. • It has low abundance (10 to 20 molecules per cell) in E. coli. • DPL binds irradiated DNA 100x better than non-irradiated DNA. • DPL contains a non-covalently associated chromophore (FADH or FADH2). • The photochemical mechanism of photorepair has been proposed to be lightdependent redox reaction between the singlet excited state of FADH2 and the pyrimidine dimer. • Photolyase has not yet been identified in placental mammals. Fig. 28. Photoreactivation Joint initiative of IITs and IISc – Funded by MHRD Page 51 of 91 NPTEL – Biotechnology – Microbiology Aberrant Methylation • Catalyzed by 6-O-methylguanine methyltransferase (6-O-MGM) • 6-O-MGM recognizes 6-O-methylguanine in DNA and removes the methyl group, transferring the group to an amino acid on itself in a "suicide" mechanism. • 6-O-MGM is encoded by the ada gene in E. coli and the MGMT gene in eukaryotes. • Mice knocked out for MGMT are cancer-prone and sensitive to methylation agents Common Features of Excision-Type Repair Pathways • Recognition: Altered DNA is recognized and bound by a specific damagerecognition protein. This first step recruits other components required for the repair reaction. • Excision: Damaged base(s), and in some cases adjacent nucleotides, are excised from the strand by exonucleases, resulting in a gapped DNA. • Resynthesis: The gap is refilled by a DNA polymerase using the complementary strand as a template. Base-Excision Repair (BER) • Damaged bases are removed as free bases. • BER primarily handles oxidative and alkylative damage. • BER is thought to have an important role in aging. • BER recognizes base deamination, oxidative damage, and other minor base modifications. • Five gene products are required for BER: glycosylase, AP endonuclease, phosphodiesterase, DNA polymerase, and DNA ligase. • DNA glycosylase recognizes the damaged base and removes it, generating an AP (apurinic, apyramidinic) site. • AP endonuclease cleaves the phosphodiester bond, generating a single-strand break with a 5'-terminal deoxyribophosphate moiety. • The 5'-deoxyribophosphate is excised by action of a DNA phosphodiesterase. Joint initiative of IITs and IISc – Funded by MHRD Page 52 of 91 NPTEL – Biotechnology – Microbiology • The resulting single-nucleotide gap is repaired by DNA polymerase β (beta). • The resulting nick is sealed by DNA ligase. • BER is relatively inefficient, due to the large number of peptides needed to recognize each damage type. Nucleotide-Excision Repair (NER) • Damaged bases are removed as oligonucleotides. • NER is primarily responsible for removal of UV-induced damage and bulky adducts, but also removes ~20% of oxidative damage. • Deficiencies cause many human disorders. • Xeroderma Pigmentosum (both classical and variant) and Cockayne's syndrome are caused by defects in NER, resulting in various detrimental effects upon exposure to UV light. Fig. 29. a) Nucleotide excision repair; b) base excision repair DNA Mismatch Repair (MMR) • Mismatches are removed as long oligonucleotides. • MMR is primarily responsible for removal of replicative errors. • Also prevents recombination of non-homologous sequences Joint initiative of IITs and IISc – Funded by MHRD Page 53 of 91 NPTEL – Biotechnology – Microbiology • Deficient in many human cancers. • MMR recognizes base-base mismatch and small insertion/deletion loops. Fig. 30. DNA mismatch repair Error prone translation DNAsynthesis When this pathway is active, DNA repair becomes significantly less accurate and a high mutation rate occurs. It is a part of cellular stress response to extensive DNA damage, also known as SOS response. The cell, at all cases had to survive, so DNA gets repaired, although it contains a lot of errors. DNA RECOMBINATION DNA recombination involves the exchange of genetic material either between multiple chromosomes or between different regions of the same chromosome. This process is generally mediated by homology; that is, homologous regions of chromosomes line up in preparation for exchange, and some degree of sequence identity is required. • Homologous recombination occurs between two homologous chromosomes. • Nonhomologous, or illegitimate recombination occurs between two different chromosomes, though the segment at recombination sites may be related. Joint initiative of IITs and IISc – Funded by MHRD Page 54 of 91 NPTEL – Biotechnology – Microbiology • Site-specific recombination can result in integration of viral, bacterial, or plasmid DNA into a chromosome at a specific location (such as att). • Replicative recombination results in sequence transposition, and is mediated by transposase enzymes. Types of Homologous Recombination • Reciprocal recombination results from two chromosomes exchanging the same amount of DNA. • Non-reciprocal recombination results from two chromosomes exchanging different amounts of DNA. • Intramolecular recombination results from recombination in chromosomal loops. o Direct repeats cause plasmid formation from the intervening loop. o Inverted repeats cause the intervening loop to change direction. • Double crossover results in exchanging a short segment of DNA, rather than a large segment of chromosome from the crossover point to the terminus. Holliday Model for Homologous Recombination • Two DNA molecules with nicks induced by an endonuclease in strands of the same polarity can invade each other with free single strands. • A Holliday junction, or Chi structure, is formed as a recombination intermediate, and is solidified by ligase sealing the nicks, uniting the two homologues. • Branch migration allows exchange of material, since the loose area around the junction allows unzipping/rezipping. • New nicks allow separation of recombined DNA and resolution of the recombination intermediate. • The Holliday model assumes reciprocal and equal exchange of genetic material between DNA molecules. Joint initiative of IITs and IISc – Funded by MHRD Page 55 of 91 NPTEL – Biotechnology – Microbiology Fig. 31. Holliday model Double-Strand Break Repair Model (Szostak Model) for Recombination • Double-strand breaks are introduced into one of the homologues. • Broken ends are recessed by specific exonucleases which produce longer 3' single strands. • The single strands can invade a region of homologous DNA. • The invading end serves as a primer for a polymerase which extends it, unwidnding the template DNA and generating a D-loop. A Holliday junction is formed, and DNA synthesis proceeds until it reaches the opposing end of the recombining chromosome. • The displaced DNA of the opposing end of the invading chromosome anneals to the template (invaded chromosome), forming a second Holliday junction by ligation of the strands. Holliday junctions can migrate prior to resolution with resolvase endonucleases. Joint initiative of IITs and IISc – Funded by MHRD Page 56 of 91 NPTEL – Biotechnology – Microbiology • Resolution can cut the invading strands, resulting in a non-crossover event, since each chromosome has only a tiny fragment of DNA from the other chromosome. • Crossover results from cutting the strands not participating in the recombination, generating a crossover by leaving a large segment of each chromosome attached to the other. • The Szostak model accounts for non-reciprocal recombination. Fig. 32. Double and single strand break repair model Factors Involved in Each Recombination Step • The Szostak model is an error-free pathway (since it can only occur in homologues), and is of major importance in yeasts and mammals. • A 5' to 3' recession reaction is mediated by endonucleases. • Strand invasion requires multiple Rad proteins, including DNA helicases and DNA end-binding proteins. • New DNA synthesis requires DNA Pol δ and ε (delta and epsilon). Repair of DSBs by Non-Homologous End Joining (NHEJ) • NHEJ is an error-prone pathway, and is the major pathway in mammals for repair of DSBs. Joint initiative of IITs and IISc – Funded by MHRD Page 57 of 91 NPTEL – Biotechnology – Microbiology • DNA-PKcs (DNA protein kinase, catalytic subunit) binds ends of the DSB (double-strand break). • Endonucleases create blunt (no overhang) ends. • Synapsis is achieved through microhomologies. • Ends are ligated, and, due to end-blunting and other factors, small insertions and deletions are introduced. The errors are often of little consequence, since most DNA is non-coding. Site specific recombination Site-specific recombination, also known as conservative site-specific recombination, is a type of genetic recombination in which DNA strand exchange takes place between segments possessing only a limited degree of sequence homology. Site- specific recombinases perform rearrangements of DNA segments by recognizing and binding to short DNA sequences (sites), at which they cleave the DNA backbone, exchange the two DNA helices involved and rejoin the DNA strands. While in some sitespecific recombination systems just a recombinase enzyme and the recombination sites is enough to perform all these reactions, in other systems a number of accessory proteins and/or accessory sites are also needed. Site-specific recombination systems are highly specific, fast and efficient, even when faced with complex eukaryotic genomes. They are employed in a variety of cellular processes, including bacterial genome replication, differentiation and pathogenesis, and movement of mobile genetic elements. For the same reasons, they present a potential basis for the development of genetic engineering tools. Recombination sites are typically between 30 and 200 nucleotides in length and consist of two motifs with a partial inverted-repeat symmetry, to which the recombinase binds, and which flank a central crossover sequence at which the recombination takes place. The pairs of sites between which the recombination occurs are usually identical, but there are exceptions (e.g. attP and attB of λ integrase,) Joint initiative of IITs and IISc – Funded by MHRD Page 58 of 91 NPTEL – Biotechnology – Microbiology Fig. 33. Insertion and excision mediated by aligned Lox sites and the cre recombinase. Red X designates recombination. REFERENCES: Text Books: 1. Jeffery C. Pommerville. Alcamo’s Fundamentals of Microbiology (Tenth Edition). Jones and Bartlett Student edition. 2. Gerard J. Tortora, Berdell R. Funke, Christine L. Case. Pearson - Microbiology: An Introduction. Benjamin Cummings. 3. J. Krebs, E.S. Goldstein, Stephen T. Kilpatrick. Lewin’s Genes X. Jones and Bartlett Publishers. Reference Books: 1. Lansing M. Prescott, John P. Harley and Donald A. Klein. Microbiology. Mc Graw Hill companies. Other References: 1. http://www.nature.com/scitable/topicpage/genetic-recombination-514 Joint initiative of IITs and IISc – Funded by MHRD Page 59 of 91 NPTEL – Biotechnology – Microbiology Module 7 – Microbial Molecular Biology and Genetics Lecture 7 –Bacterial Plasmids and Transposable elements PLASMIDS It is an extra chromosomal DNA molecule separate from the chromosomal DNA which is able to replicate independently of the chromosomal DNA. Most commonly found as small circular, double-stranded DNA molecules in bacteria, plasmids are sometimes present in archaea and eukaryotic organisms. In nature, plasmids carry genes that may benefit survival of the organism (e.g. antibiotic resistance), and can frequently be transmitted from one bacterium to another (even of another species) via horizontal gene transfer. Artificial plasmids are widely used as vectors in molecular cloning, serving to drive the replication of recombinant DNA sequences within host organisms. • They vary from 1kb to 1000kb • The term plasmid was first introduced by the American molecular biologist Joshua Lederberg in 1952. • The plasmid does not contribute to the genome of the bacterial cell it is present in but often translates proteins of significance. For example o It contains the antibiotic resistance gene that helps the bacteria survive from that specific antibiotic. o It can provide the bacteria with an ability to fix elemental nitrogen or to degrade calcitrant organic compounds which provide an advantage under conditions of nutrient deprivation. Fig. 34. This image shows a line drawing of a bacterium with its chromosomal DNA and several plasmids within it. Joint initiative of IITs and IISc – Funded by MHRD Page 60 of 91 NPTEL – Biotechnology – Microbiology Plasmids used in genetic engineering are called vectors. Plasmids serve as important tools in genetics and biotechnology labs, where they are commonly used to multiply (make many copies of) or express particular genes. Many plasmids are commercially available for such uses. The gene to be replicated is inserted into copies of a plasmid containing genes that make cells resistant to particular antibiotics and a multiple cloning site (MCS, or polylinker), which is a short region containing several commonly used restriction sites allowing the easy insertion of DNA fragments at this location. Next, the plasmids are inserted into bacteria by a process called transformation. Then, the bacteria are exposed to the particular antibiotics. Only bacteria that take up copies of the plasmid survive, since the plasmid makes them resistant. In particular, the protecting genes are expressed (used to make a protein) and the expressed protein breaks down the antibiotics. In this way, the antibiotics act as a filter to select only the modified bacteria. Now these bacteria can be grown in large amounts, harvested, and lysed (often using the alkaline lysis method) to isolate the plasmid of interest. Another major use of plasmids is to make large amounts of proteins. In this case, researchers grow bacteria containing a plasmid harboring the gene of interest. Just as the bacterium produces proteins to confer its antibiotic resistance, it can also be induced to produce large amounts of proteins from the inserted gene. This is a cheap and easy way of mass-producing a gene or the protein it then codes for, for example, insulin or even antibiotics. A plasmid can contain inserts of up to 30-40 kbp. To clone longer lengths of DNA, lambda phage with lysogeny genes deleted, cosmids, bacterial artificial chromosomes, or yeast artificial chromosomes are used. Replication of plasmid Non Integrative Replication In this type of replication the plasmid DNA once introduced into the cell grows as per the copy number and the multiplication of the cells Integrative Plasmid Episome Under certain conditions some plasmids may integrate into the bacterial chromosome. They are called episome or integrative plasmids. At this stage they replicate along with the bacterial chromosome. The plasmids in this way are classified into 2 types Joint initiative of IITs and IISc – Funded by MHRD Page 61 of 91 NPTEL – Biotechnology – Microbiology Relaxed plasmids They are the ones which are normally maintained at multiple copies per cell. Stringent plasmids They are the ones which have a limited number of copies per cell. In this case of plasmid replication, the plasmid DNA is integrated in the bacterial chromosome and grows along with the cell. It uses the bacterial machinery for division. A good example of this type of replication is Ti Plasmid, often used in agricultural genetic engineering experiments. It completely uses the cell genetic mechanism to grow. Fig. 35. Replication of plasmid Types of plasmid • Fertility plasmid: That contains the tra genes required for conjugation else known as F plasmids. For example F plasmids of E coli. Joint initiative of IITs and IISc – Funded by MHRD Page 62 of 91 NPTEL – Biotechnology – Microbiology Fig. 36. Fertility plasmid • Resistance Plasmid or R plasmids Carry genes of resistance to one or more antibacterial agents such as ampicilin, etc • Col plasmid: It contains genes for production of bacteriocins, proteins that kill other bacteria example col E1. • Virulent plasmids: Which in turn convert bacteria into a pathogen. For example, the Ti plasmid of Agrobacterium tumefaciens induces crown gal disease on dicot of the palnts Joint initiative of IITs and IISc – Funded by MHRD Page 63 of 91 NPTEL – Biotechnology – Microbiology Table 2. Size of some plasmids as vectors Plasmid Size Mol. Wt Marker Organism Apr Escherichia coli K12 in kb puc 8 2.1 2.7 JM83 col E1 6.4 6.6 rP4 54.0 56.4 E1 imm Escherichia coli JC411 Apr, Kmr, Escherichia coli K12 J53 Tcr ToL 117.0 78.0 pseudomonas pTiach 5 213.0 142.0 A. tumefaciens TRANSPOSABLE ELEMENTS A transposable element (TE) is a DNA sequence that can change its relative position (self-transpose) within the genome of a single cell. Barbara McClintock's discovery of these jumping genes early in her career earned her a Nobel prize in 1983. They are often considered "junk DNA”. Fig. 37. Bacterial composite transposon Transposable elements are only one of several types of mobile genetic elements. They are assigned to one of two classes according to their mechanism of transposition, which can be described as either “copy or paste" (for class I TEs) or "cut and paste" (for class II TEs). Class I (retrotransposons): They copy themselves in two stages, first from DNA to RNA by transcription, then from RNA back to DNA by reverse transcription. The Joint initiative of IITs and IISc – Funded by MHRD Page 64 of 91 NPTEL – Biotechnology – Microbiology DNA copy is then inserted into the genome in a new position. Reverse transcription is catalyzed by a reverse transcriptase, which is often coded by the TE itself. Retrotransposons behave very similarly to retroviruses, such asHIV. There are three main orders of retrotransposons (other orders are less abundant): Those with long terminal repeats (LTRs): encode reverse transcriptase, similar to retroviruses; LINEs: encode reverse transcriptase, lack LTRs, transcribed by RNA polymerase II; SINEs: do not code for reverse transcriptase, transcribed by RNA polymerase III. Retroviruses can be considered as TEs. Indeed, after entering a host cell and converting their RNA into DNA, retroviruses integrate this DNA into the DNA of the host cell. The integrated DNA form (provirus) of the retrovirus is viewed as a particularly specialized form of eukaryotic retrotransposon, which is able to encode RNA intermediates that usually can leave the host cells and infect other cells. The transposition cycle of retroviruses also has similarities to that of prokaryotic TEs. The similarities suggest a distant familial relationship between these two TEs types. Class II (DNA transposons): By contrast, the cut-and-paste transposition mechanism of class II TEs does not involve an RNA intermediate. The transpositions are catalyzed by various types oftransposase enzymes. Some transposases can bind non-specifically to any target site, while others bind to specific sequence targets. The transposase makes a staggered cut at the target site producing sticky ends, cuts out the DNA transposon and ligates it into the target site. A DNA polymerase fills in the resulting gaps from the sticky ends and DNA ligase closes the sugar-phosphate backbone. This results in target site duplication and the insertion sites of DNA transposons may be identified by short direct repeats (a staggered cut in the target DNA filled by DNA polymerase) followed by inverted repeats (which are important for the TE excision by transposase). The duplications at the target site can result in gene duplication, which plays an important role in evolution. Cut-and-paste TEs may be duplicated if transposition takes place during S phase of the cell cycle when the "donor" site has already been replicated, but the "target" site has not. Joint initiative of IITs and IISc – Funded by MHRD Page 65 of 91 NPTEL – Biotechnology – Microbiology Both classes of TEs may lose their ability to synthesise reverse transcriptase or transposase through mutation, yet continue to jump through the genome because other TEs are still producing the necessary enzymes. Hence, they can be classified as either "autonomous" or "non-autonomous". For instance for the class II TEs, the autonomous ones have an intact gene that encodes an active transposase enzyme; the TE does not need another source of transposase for its transposition. In contrast, non-autonomous elements encode defective polypeptides and accordingly require transposase from another source. REFERENCES: Text Books: 1. Jeffery C. Pommerville. Alcamo’s Fundamentals of Microbiology (Tenth Edition). Jones and Bartlett Student edition. 2. Gerard J. Tortora, Berdell R. Funke, Christine L. Case. Pearson - Microbiology: An Introduction. Benjamin Cummings. 3. J. Krebs, E.S. Goldstein, Stephen T. Kilpatrick. Lewin’s Genes X. Jones and Bartlett Publishers. Reference Books: 1. Lansing M. Prescott, John P. Harley and Donald A. Klein. Microbiology. Mc Graw Hill companies. Joint initiative of IITs and IISc – Funded by MHRD Page 66 of 91 NPTEL – Biotechnology – Microbiology Module 7 – Microbial Molecular Biology and Genetics Lecture 8 – Mutations and their chemical basis, detection and isolation of mutants MUTATIONS • A Mutation occurs when a DNA gene is damaged or changed in such a way as to alter the genetic message carried by that gene. • A Mutagen is an agent of substance that can bring about a permanent alteration to the physical composition of a DNA gene such that the genetic message is changed. • Once the gene has been damaged or changed the mRNA transcribed from that gene will now carry an altered message. • The polypeptide made by translating the altered mRNA will now contain a different sequence of amino acids. The function of the protein made by folding this polypeptide will probably be changed or lost. Chemical Mutagens – They change the sequence of bases in a gene in a number of ways: • Mimic the correct nucleotide bases in a DNA molecule, but fail to base pair correctly during DNA replication. • Remove parts of the nucleotide (such as the amino group on adenine), again causing improper base pairing during DNA replication. • Add hydrocarbon groups to various nucleotides, also causing incorrect base pairing during DNA replication. Joint initiative of IITs and IISc – Funded by MHRD Page 67 of 91 NPTEL – Biotechnology – Microbiology Radiation - High energy radiation from a radioactive material or from X-rays is absorbed by the atoms in water molecules surrounding the DNA. This energy is transferred to the electrons which then fly away from the atom. Left behind is a free radical, which is a highly dangerous and highly reactive molecule that attacks the DNA molecule and alters it in many ways. Radiation can also cause double strand breaks in the DNA molecule, which the cell's repair mechanisms cannot put right. Sunlight- It contains ultraviolet radiation (the component that causes a suntan) which, when absorbed by the DNA causes a cross link to form between certain adjacent bases. In most normal cases the cells can repair this damage, but unrepaired dimers of this sort cause the replicating system to skip over the mistake leaving a gap, which is supposed to be filled in later. Unprotected exposure to UV radiation by the human skin can cause serious damage and may lead to skin cancer and extensive skin tumors. Spontaneous - Mutations occur without exposure to any obvious mutagenic agent. Sometimes DNA nucleotides shift without warning to a different chemical form (know as an isomer) which in turn will form a different series of hydrogen bonds with it's partner. This leads to mistakes at the time of DNA replication. • EXAMPLES : 1. Nitrous Acid: Nitrous Acid affects DNA complementation. The acid randomly modifies the base adenine so that it will pair with cytosine instead of thymine. This change is made evident during DNA replication when a new base pair appears in daughter cells in a later generation. 2. A Base Analog: A base analog is a compound sufficiently similar to one of the four DNA bases but have different pairing properties. Joint initiative of IITs and IISc – Funded by MHRD Page 68 of 91 NPTEL – Biotechnology – Microbiology For example, 5-bromouracil is the analog of thymine but sometimes pairs with guanine and 2-aminopurine is the analog of adanine but sometimes pairs with cytosine. The incorporation of a base analog will result in base pair substitution . 3. UV Light: Exposure to direct UV light induces covalent linking between adjacent thymine nucleotides on a DNA strand forming a thymine dimer. These dimers cause the strand to buckle, disrupting normal base pairing. This prevents proper replication and transcription. Bacteria have enzymes to fix the damage created by UV light. An enzyme cuts the DNA at two point and removes the damaged portion. DNA polymerase synthesizes a new DNA segment using the healthly strand as a template. DNA ligase joins the new fragment to the old strand. Types of Mutations: NOTE: For all others examples cited below, the below given DNA sequence shall be considered as wild type: ATG CCG TGT CAG ATG TTC AUG CCG UGU CAG AUG UUC Met Pro Cys Gln met ---------------- DNA ------------ mRNA phe ------ Amino acid sequence 1. Synonymous / Silent Mutations 1. No alteration in polypeptide product of the gene 2. Single base pair substitution 3. Occur in the third position of a codon 4. Codes for the same amino acid 5. No alteration of the protein Joint initiative of IITs and IISc – Funded by MHRD Page 69 of 91 NPTEL – Biotechnology – Microbiology Example: Samesense Mutation: a codon is changed to a different codon that specifies the same amino acid. ATG CCG TGC CAG ATG TTC -----MUTATED DNA (compare with wild type) AUG CCG UGC CAG AUG UUC ----- mRNA Met Pro Cys Gln Met Phe -------- amino acid sequence 2. Non-Synonymous Mutations 1. Occur less frequently than synonymous mutations 2. Leads to alteration in the encoded polypeptide 3. Result in abnormal function→ disease Occur in one of three main ways: 1. Missense 2. Nonsense 3. Frameshift Missence mutation: a codon is changed to a different codon that specifies a different amino acid. ATG CCG TGG CAG ATG TTC -----MUTATED DNA AUG CCG UGG CAG AUG UUC ----- mRNA Met Pro Trp Gln Met Phe -------- amino acid sequence Nonsense Mutation: A codon that specifies an amino acid is changed to a stop codon. (termination codon), this mutation usually destroys the function of the gene product. ATG CCG TGA CAG ATG TTC ----- MUTATED DNA AUG CCG UGA CAG AUG UUC ----- Met Pro STOP mRNA -------- amino acid sequence Joint initiative of IITs and IISc – Funded by MHRD Page 70 of 91 NPTEL – Biotechnology – Microbiology Frameshift Mutations: One or 2 nucleotide pairs are inserted into or deleted from the molecule, causing an alteration of the reading frame. As the result of this shift, codons downstream of the insertion or deletion site specify an entirely new sequence of amino acids. Depending on where the insertion or deletion occurs in the gene, different effects can be generated. In addition to producing an entirely new polypeptide sequence immediately after the change, frameshift mutations usually produce a stop or termination codon within a short distance of the mutation. This codon terminates the already altered polypepetide chain. A frame shift in a gene specifying an enzyme usually result in a loss of enzyme activity. If the enzyme is an essential one, the effect on the organism can be disastrous. Fig. 38. Types of mutations Joint initiative of IITs and IISc – Funded by MHRD Page 71 of 91 NPTEL – Biotechnology – Microbiology Mutation in non-coding DNA They will have a phenotyping effect if it occurs in 1. Regulatory elements (e.g. TATA box) → affect level of gene expression 2. Splicing of introns: highly conserved GT & AG at the end of introns Either: a. coding sequences being lost b. intronic sequences being added to the mRNA Functional effects of Mutation on the Protein 1. Loss of function 2. Gain of Function Loss of Function Reduce in activity In heterzygous state →half normal levels of the protein product Or complete loss of the gene product Gain of Function: Increased levels of gene expression Development of a new function(s) of the gene product Chromosomal Mutations: Chromosomal mutations are grouped into four broad types: 1) Deletion: are chromosomal changes in which one or more genes or segments of chromosomal DNA are lost. Joint initiative of IITs and IISc – Funded by MHRD Page 72 of 91 NPTEL – Biotechnology – Microbiology 2) Duplication: are chromosomal changes in which one copy or more copies of a gene are present on the same chromosome 3) Inversion: in which a segment of DNA is released and rotated 180 degrees before being reinserted into the DNA. If the inverted DNA segment carries part of the proteincoding sequence, the resulting protein would be drastically altered and most likely nonfunctional. 4) Translocation: occur when a segment of DNA moves from one chromosome and inserted into a different nonhomologous chromosome. Translocation can also be reciprocal, that is two nonhomologous chromosome may break and trade pieces of DNA. Translocation mutations frequently cause problems in meiosis and sometimes lead to aneuploidy (the gain or loss of chromosome) Joint initiative of IITs and IISc – Funded by MHRD Page 73 of 91 NPTEL – Biotechnology – Microbiology Mutant Isolation • How can you tell if there are any mutant colonies in a culture? This is done by either positive (direct) selection or by negative (indirect) selection. 1. Positive Selection: Positive selection entails growing the culture on a medium that will allow for the growth of only the mutant colonies. If, for example, we want to find mutants that resistant to penicillin, we grow the culture on a medium that contains penicillin. Only those colonies that are resistant to penicillin will grow and we can identify them directly. Fig. 39. Mutant isolation 2. Negative Selection: Negative selection is used to identify mutants that have lost the ability to perform a certain function that their parents had. Auxotrophic mutants, for example, are bacteria that have lost the ability to synthesize an essential nutrient. Joint initiative of IITs and IISc – Funded by MHRD Page 74 of 91 NPTEL – Biotechnology – Microbiology The replica-plating technique is used to identify mutants by negative selection. the replica-plating technique can be used, for example, to identify mutants that have lost the ability to synthesize the amino acid histidine. Therefore, mutants are His- and require histidine in order to survive. Inoculate a histidine enriched medium with bacteria. Incubate so that cells can form colonies. This is the master plate. Press a sterile velvet surface into the colonies of the master plate. Some cells from each of the colonies adhere to the velvet. Prepare two mediums, one with histidine, the other without histidine. Transfer cells from the velvet to each plate. Compare growth on the two plates after incubation. Colonies that grow on the histidine enriched medium but not on the medium lacking histidine are His- mutants Joint initiative of IITs and IISc – Funded by MHRD Page 75 of 91 NPTEL – Biotechnology – Microbiology Fig. 40. Negative selection Joint initiative of IITs and IISc – Funded by MHRD Page 76 of 91 NPTEL – Biotechnology – Microbiology Fig. 41. Ames test The Ames Test utilizes a histidine auxotroph of Salmonella to determine if a chemical agent is a mutagen. Though some spontaneous back mutations (a reversion back to the strain of Salmonella that can synthesize histadine) can occur, if many colonies of the microbe appear on the minimal plate after the addition of the test chemical, this is an indication the the chemical is a mutagen. Joint initiative of IITs and IISc – Funded by MHRD Page 77 of 91 NPTEL – Biotechnology – Microbiology REFERENCES: Text Books: 1. Jeffery C. Pommerville. Alcamo’s Fundamentals of Microbiology (Tenth Edition). Jones and Bartlett Student edition. 2. Gerard J. Tortora, Berdell R. Funke, Christine L. Case. Pearson - Microbiology: An Introduction. Benjamin Cummings. 3. J. Krebs, E.S. Goldstein, Stephen T. Kilpatrick. Lewin’s Genes X. Jones and Bartlett Publishers. Reference Books: 1. Lansing M. Prescott, John P. Harley and Donald A. Klein. Microbiology. Mc Graw Hill companies. Joint initiative of IITs and IISc – Funded by MHRD Page 78 of 91 NPTEL – Biotechnology – Microbiology Module 7 – Microbial Molecular Biology and Genetics Lecture 9 - BACTERIAL CONJUGATION, TRANSDUCTION AND TRANSFORMATION Sometimes when two pieces of DNA come into contact with each other, sections of each DNA strand will be exchanged. This is usually done through a process called crossing over in which the DNA breaks and is attached on the other DNA strand leading to the transfer of genes and possibly the formation of new genes. Genetic recombination is the transfer of DNA from one organism to another. The transferred donor DNA may then be integrated into the recipient's nucleoid by various mechanisms. In the case of homologous recombination, homologous DNA sequences having nearly the same nucleotide sequences are exchanged by means of breakage and reunion of paired DNA segments. Genetic information can be transferred from organism to organism through vertical transfer (from a parent to offspring) or through horizontal transfer methods such as conjugation, transformation or transduction. Bacterial genes are usually transferred to members of the same species but occasionally transfer to other species can also occur Horizontal gene transfer, also known as lateral gene transfer, is a process in which an organism transfers genetic material to another organism that is not its offspring. The ability of Bacteria and Archaea to adapt to new environments as a part of bacterial evolution most frequently results from the acquisition of new genes through horizontal gene transfer rather than by the alteration of gene functions through mutations. (It is estimated that as much as 20% of the genome of Escherichia coli originated from horizontal gene transfer.) Horizontal gene transfer is able to cause rather large-scale changes in a bacterial genome. For example, certain bacteria contain multiple virulence genes called pathogenicity islands that are located on large, unstable regions of the bacterial genome. These pathogenicity islands can be transmitted to other bacteria by horizontal gene transfer. However, if these transferred genes provide no selective advantage to the bacteria that acquire them, they are usually lost by deletion. In this way the size of the bacterium's genome can remain approximately the same size over time. Joint initiative of IITs and IISc – Funded by MHRD Page 79 of 91 NPTEL – Biotechnology – Microbiology There are three mechanisms of horizontal gene transfer in bacteria: transformation, transduction, and conjugation. The most common mechanism for horizontal gene transmission among bacteria, especially from a donor bacterial species to different recipient species, is conjugation. Although bacteria can acquire new genes through transformation and transduction, this is usually a more rare transfer among bacteria of the same species or closely related species. CONJUGATION Bacterial conjugation is the transfer of genetic material between bacterial cells by direct cell-to-cell contact or by a bridge-like connection between two cells. Discovered in 1946 by Joshua Lederberg and Edward Tatum, conjugation is a mechanism of horizontal gene transfer as are transformation and transduction although these two other mechanisms do not involve cell-to-cell contact. Fig. 42. Genetic recombination Lederberg and Tatum did not directly prove that physical contact of the cells was necessary for gene transfer. This evidence was provided by Bernard Davis (1950), who constructed the U tube consisting of two pieces of curved glass tubing fused at the base to form a U shape with a fritted glass filter between the halves. The filter allows the passage of media and not bacteria. Davis discovered that when two auxotrophic strains were Joint initiative of IITs and IISc – Funded by MHRD Page 80 of 91 NPTEL – Biotechnology – Microbiology separated by the filter, gene transfer could not take place. Therefore direct contact is necessary for the recombination to take place as discovered by Lederberg and Tatum. Fig. 43. U-tube experiement Bacterial conjugation is often regarded as the bacterial equivalent of sexual reproduction or mating since it involves the exchange of genetic material. During conjugation the donor cell provides a conjugative or mobilizable genetic element that is most often a plasmid or transposon. Most conjugative plasmids have systems ensuring that the recipient cell does not already contain a similar element. The genetic information transferred is often beneficial to the recipient. Benefits may include antibiotic resistance, xenobiotic tolerance or the ability to use new metabolites. Such beneficial plasmids may be considered bacterial endosymbionts. Other elements, however, may be viewed as bacterial parasites and conjugation as a mechanism evolved by them to allow for their spread. The prototypical conjugative plasmid is the F-plasmid, or F-factor. The Fplasmid is an episome (a plasmid that can integrate itself into the bacterial chromosome by homologous recombination) with a length of about 100 kb. It carries its own origin of replication, the oriV, and an origin of transfer, or oriT. There can only be one copy of the F-plasmid in a given bacterium, either free or integrated, and bacteria that possess a copy are called F-positive or F-plus (denoted F+). Cells that lack F plasmids are called F-negative or F-minus (F-) and as such can function as recipient cells. Joint initiative of IITs and IISc – Funded by MHRD Page 81 of 91 NPTEL – Biotechnology – Microbiology Fig. 44. Conjugation Among other genetic information the F-plasmid carries a tra and trb locus, which together are about 33 kb long and consist of about 40 genes. The tra locus includes the pilin gene and regulatory genes, which together form pili on the cell surface. The locus also includes the genes for the proteins that attach themselves to the surface of Fbacteria and initiate conjugation. Though there is some debate on the exact mechanism of conjugation it seems that the pili are not the structures through which DNA exchange occurs. This has been shown in experiments where the pilus are allowed to make contact, but then are denatured with SDS and yet DNA transformation still proceeds. Several proteins coded for in the tra or trb locus seem to open a channel between the bacteria and it is thought that the traD enzyme, located at the base of the pilus, initiates membrane fusion. When conjugation is initiated by a signal the relaxase enzyme creates a nick in one of the strands of the conjugative plasmid at the oriT. Relaxase may work alone or in a complex of over a dozen proteins known collectively as a relaxosome. In the F-plasmid system the relaxase enzyme is called TraI and the relaxosome consists of TraI, TraY, TraM and the integrated host factor IHF. The nicked strand, or T-strand, is then unwound from the unbroken strand and transferred to the recipient cell in a 5'-terminus to 3'terminus direction. The remaining strand is replicated either independent of conjugative action (vegetative replication beginning at the oriV) or in concert with conjugation (conjugative replication similar to the rolling circle replication of lambda phage). Conjugative replication may require a second nick before successful transfer can occur. A recent report claims to have inhibited conjugation with chemicals that mimic an Joint initiative of IITs and IISc – Funded by MHRD Page 82 of 91 NPTEL – Biotechnology – Microbiology intermediate step of this second nicking event. If the F-plasmid that is transferred has previously been integrated into the donor’s genome some of the donor’s chromosomal DNA may also be transferred with the plasmid DNA. The amount of chromosomal DNA that is transferred depends on how long the two conjugating bacteria remain in contact. In common laboratory strains of E. coli the transfer of the entire bacterial chromosome takes about 100 minutes. The transferred DNA can then be integrated into the recipient genome via homologous recombination. A cell culture that contains in its population cells with non-integrated F-plasmids usually also contains a few cells that have accidentally integrated their plasmids. It is these cells that are responsible for the low-frequency chromosomal gene transfers that occur in such cultures. Some strains of bacteria with an integrated F-plasmid can be isolated and grown in pure culture. Because such strains transfer chromosomal genes very efficiently they are called Hfr (high frequency of recombination). The E. coli genome was originally mapped by interrupted mating experiments in which various Hfr cells in the process of conjugation were sheared from recipients after less than 100 minutes (initially using a Waring blender). The genes that were transferred were then investigated. Fig. 45. Conjugation – formation of an Hfr cell Joint initiative of IITs and IISc – Funded by MHRD Page 83 of 91 NPTEL – Biotechnology – Microbiology TRANSDUCTION: Transduction, a process of genetic recombination in bacteria in which genes from a host cell (a bacterium) are incorporated into the genome of a bacterial virus (bacteriophage) and then carried to another host cell when the bacteriophage initiates another cycle of infection. In general transduction, any of the genes of the host cell may be involved in the process; in special transduction, however, only a few specific genes are transduced. It has been exploited as a remarkable molecular biological technique for altering the genetic construction of bacteria, for locating bacterial genes, and for many other genetic experiments. Transduction happens through either the lytic cycle or the lysogenic cycle. If the lysogenic cycle is adopted, the phage chromosome is integrated (by covalent bonds) into the bacterial chromosome, where it can remain dormant for thousands of generations. If the lysogen is induced (by UV light for example), the phage genome is excised from the bacterial chromosome and initiates the lytic cycle, which culminates in lysis of the cell and the release of phage particles. The lytic cycle leads to the production of new phage particles which are released by lysis of the host. Fig. 46. Lytic and lysogenic cyles. Joint initiative of IITs and IISc – Funded by MHRD Page 84 of 91 NPTEL – Biotechnology – Microbiology The packaging of bacteriophage DNA has low fidelity and small pieces of bacterial DNA, together with the bacteriophage genome, may become packaged into the bacteriophage genome. At the same time, some phage genes are left behind in the bacterial chromosome. There are generally three types of recombination events that can lead to this incorporation of bacterial DNA into the viral DNA, leading to two modes of recombination. Generalized Transduction Generalized transduction is the process by which any bacterial gene may be transferred to another bacterium via a bacteriophage, and typically carries only bacterial DNA and no viral DNA. In essence, this is the packaging of bacterial DNA into a viral envelope. This may occur in two main ways, recombination and headful packaging. If bacteriophages undertake the lytic cycle of infection upon entering a bacterium, the virus will take control of the cell’s machinery for use in replicating its own viral DNA. If by chance bacterial chromosomal DNA is inserted into the viral capsid which is usually used to encapsulate the viral DNA, the mistake will lead to generalized transduction. If the virus replicates using 'headful packaging', it attempts to fill the nucleocapsid with genetic material. If the viral genome results in spare capacity, viral packaging mechanisms may incorporate bacterial genetic material into the new virion. The new virus capsule now loaded with part bacterial DNA continues to infect another bacterial cell. This bacterial material may become recombined into another bacterium upon infection. When the new DNA is inserted into this recipient cell it can fall to one of three fates 1. The DNA will be absorbed by the cell and be recycled for spare parts. 2. If the DNA was originally a plasmid, it will re-circularize inside the new cell and become a plasmid again. 3. If the new DNA matches with a homologous region of the recipient cell’s chromosome, it will exchange DNA material similar to the actions in conjugation. Joint initiative of IITs and IISc – Funded by MHRD Page 85 of 91 NPTEL – Biotechnology – Microbiology This type of recombination is random and the amount recombined depends on the size of the virus being used. Fig. 47. Generalized Transduction Specialized Transduction Specialized transduction is the process by which genes that are near the bacteriophage genome may be transferred to another bacterium via a bacteriophage. The genes that get transferred (donor genes) always depend on where the phage genome is located on the chromosome. This second type of recombination event which is the result of mistakes in the transition from a virus' lysogenic to lytic cycle is called specialized transduction, and non-viral DNA is carried as an insertion/substitution. If a virus incorrectly removes itself from the bacterial chromosome, bacterial DNA from either end of the phage DNA may be packaged into the viral capsid. Specialized transduction leads to three possible outcomes: 1. DNA can be absorbed and recycled for spare parts. 2. The bacterial DNA can match up with a homologous DNA in the recipient cell and exchange it. The recipient cell now has DNA from both itself and the other bacterial cell. Joint initiative of IITs and IISc – Funded by MHRD Page 86 of 91 NPTEL – Biotechnology – Microbiology 3. DNA can insert itself into the genome of the recipient cell as if still acting like a virus resulting in a double copy of the bacterial genes. When the partially encapsulated phage material infects another cell and becomes a "prophage" (is covalently bonded into the infected cell's chromosome), the partially coded prophage DNA is called a "heterogenote". Esther Lederberg, Larry Morse, Herman Kalckar, Michael Yarmolinsky, and Yukinori Hirota went on to do detailed studies of Galactosemia. Specialized transduction was used in these studies for gene mapping. At about this time, Esther Lederberg, Julius Adler, and Enrico Calef were also engaged in similar research involving Maltophilia. Example of specialized transduction is λ phages in Escherichia coli discovered by Esther Lederberg as well as Fertility Factor F, also discovered by Esther Lederberg. Fig. 48. Specialized transduction Joint initiative of IITs and IISc – Funded by MHRD Page 87 of 91 NPTEL – Biotechnology – Microbiology TRANSFORMATION: Transformation involves the uptake of free or naked DNA released by donor by a recipient. It was the first example of genetic exchange in bacteria to have been discovered. This was first demonstrated in an experiment conducted by Griffith in 1928. The presence of a capsule around some strains of pneumococci gives the colonies a glistening, smooth (S) appearance while pneumococci lacking capsules have produce rough (R) colonies. Strains of pneumococci with a capsule (type I) are virulent and can kill a mouse whereas strains lacking it (type II) are harmless. Griffith found that mice died when they were injected with a mixture of live non capsulated (R, type II) strains and heat killed capsulated (S, type I) strains. Neither of these two when injected alone could kill the mice, only the mixture of two proved fatal. Live S strains with capsule were isolated from the blood of the animal suggesting that some factor from the dead S cells converted the R strains into S type. The factor that transformed the other strain was found to be DNA by Avery, McLeod and McCarty in 1944. Fig. 49. Evidence of transformation Joint initiative of IITs and IISc – Funded by MHRD Page 88 of 91 NPTEL – Biotechnology – Microbiology Transformation is gene transfer resulting from the uptake by a recipient cell of naked DNA from a donor cell. Certain bacteria (e.g. Bacillus, Haemophilus, Neisseria, Pneumococcus) can take up DNA from the environment and the DNA that is taken up can be incorporated into the recipient's chromosome. By 1926 the quest to determine the mechanism for genetic inheritance had reached the molecular level. Previous discoveries by Gregor Mendel, Walter Sutton, Thomas Hunt Morgan, and numerous other scientists had narrowed the search to the chromosomes located in the nucleus of most cells. But the question of what molecule was actually the genetic material had not been answered. In 1928 Frederick Griffith, in a series of experiments with Diplococcus pneumonia (bacterium responsible for pneumonia), witnessed a miraculous transformation. During the course of his experiment, a living organism (bacteria) had changed in physical form. The pneumococcus bacterium occurs naturally in two forms with distinctively different characteristics. The virulent (S-strain) form has a smooth polysaccharide capsule that is essential for infection. The nonvirulent (R-strain) lacks the polysaccharide capsule, giving it a rough appearance. Mice injected with S-strain of the pneumococcus bacteria die from pneumonic infection within a few days, while mice injected with the Rstrain bacteria continue to live. Injection with heat-killed S-strain bacteria also results in the mice surviving. Griffith was surprised to find in his experiments that mice injected with a mixture of heat-killed S-strain and live but nonvirulent R-strain produced lethal results. In fact, Griffith discovered living forms of the S-strain bacteria in the infected mice. He hypothesize that the R-strain bacteria had somehow been transformed by the heat-killed S-strain bacteria. Some "transforming principle", transferred from the heat-killed S-strain, had enabled the R-strain to synthesize a smooth polysaccharide coat and become virulent. Oswald Avery, Colin McCleod, and Maclyn McCarty (1934-1944) at the Rockefeller Institute, building on Griffith's work, showed that only DNA could cause the transformation. They isolated a cell-free extract from the S-strain bacteria and were able Joint initiative of IITs and IISc – Funded by MHRD Page 89 of 91 NPTEL – Biotechnology – Microbiology to transform living R-strain into a culture containing both S-strain and R-strain cells. The purified extract contained Griffith's "transforming principle". Through biochemical testing, they showed it to be deoxyribonucleic acid (DNA). Fig. 50. Transformation (a) with DNA fragments and (b) with a plasmid Fig. 51. Mechanism of Transformation Joint initiative of IITs and IISc – Funded by MHRD Page 90 of 91 NPTEL – Biotechnology – Microbiology REFERENCES: Text Books: 1. Jeffery C. Pommerville. Alcamo’s Fundamentals of Microbiology (Tenth Edition). Jones and Bartlett Student edition. 2. Gerard J. Tortora, Berdell R. Funke, Christine L. Case. Pearson - Microbiology: An Introduction. Benjamin Cummings. 3. J. Krebs, E.S. Goldstein, Stephen T. Kilpatrick. Lewin’s Genes X. Jones and Bartlett Publishers. Reference Books: 1. Lansing M. Prescott, John P. Harley and Donald A. Klein. Microbiology. Mc Graw Hill companies. Joint initiative of IITs and IISc – Funded by MHRD Page 91 of 91