* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download No Slide Title
Molecular cloning wikipedia , lookup
Epigenomics wikipedia , lookup
Neocentromere wikipedia , lookup
Copy-number variation wikipedia , lookup
Ridge (biology) wikipedia , lookup
Mitochondrial DNA wikipedia , lookup
Polycomb Group Proteins and Cancer wikipedia , lookup
Zinc finger nuclease wikipedia , lookup
Public health genomics wikipedia , lookup
Gene desert wikipedia , lookup
Long non-coding RNA wikipedia , lookup
History of RNA biology wikipedia , lookup
Deoxyribozyme wikipedia , lookup
Nutriepigenomics wikipedia , lookup
Genetic engineering wikipedia , lookup
Gene expression profiling wikipedia , lookup
X-inactivation wikipedia , lookup
Primary transcript wikipedia , lookup
Gene expression programming wikipedia , lookup
Point mutation wikipedia , lookup
Epigenetics of human development wikipedia , lookup
Genomic imprinting wikipedia , lookup
Cre-Lox recombination wikipedia , lookup
Vectors in gene therapy wikipedia , lookup
Metagenomics wikipedia , lookup
Microsatellite wikipedia , lookup
Extrachromosomal DNA wikipedia , lookup
Minimal genome wikipedia , lookup
No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup
Genome (book) wikipedia , lookup
History of genetic engineering wikipedia , lookup
Pathogenomics wikipedia , lookup
Human genome wikipedia , lookup
Therapeutic gene modulation wikipedia , lookup
Designer baby wikipedia , lookup
Short interspersed nuclear elements (SINEs) wikipedia , lookup
Genomic library wikipedia , lookup
Microevolution wikipedia , lookup
Genome editing wikipedia , lookup
Site-specific recombinase technology wikipedia , lookup
Artificial gene synthesis wikipedia , lookup
Genome evolution wikipedia , lookup
Non-coding DNA wikipedia , lookup
Jumping s Gene Mobile & Transposable Elements 1 Ethnobotany 2 mosaic kernels 3 1938: Marcus Rhoades reported odd phenotypic ratios in corn. Self pollination of a pigmented corn kernel yielded: 12 : 3 : 1 pigmented : dotted : colorless 4 A hypothesis: Two mutations at unlinked loci: 1. pigment gene A1 mutated to colorless mutant a1, and 2. a dominant allele for dotting (Dt) appeared. The presence of the Dt allele caused spots of pigment to appear. 5 Barbara McClintock 1902-1992 6 Barbara McClintock 1902-1992 7 8 9 10 Transposition = the movement of genetic information from one chromosomal location, the donor site, to another, the target site. 11 DNA sequences that can change their genomic location intragenomically either autonomously or non-autonomously are called transposable elements. 12 “copy-and-paste” “cut-and-paste” 13 Retrotransposons can be divided into five orders on the basis of their mechanistic features, sequence organization, and reverse transcriptase phylogeny: LTR retrotransposons, DIRS-like elements, Penelope-like elements, LINEs, and SINEs. 14 15 When a transposable element is inserted into a host genome, a small segment of the host DNA (usually 4-12 bp) is duplicated at the insertion site. 16 17 18 Transposition = the movement of genetic information from one chromosomal location, the donor site, to another, the target site. 19 DNA sequences that can change their genomic location intragenomically either autonomously or non-autonomously are called transposable elements. 20 Classifications of Transposable Elements 21 Transposition may be replicative or conservative. Replicative transposition will result in two copies of the element, one at the donor site and one at the target site. Following conservative transposition the transposable element will only be found at the target site, with no change in copy number. 22 Conservative transposition = “cut-and-paste” transposition 23 Duplicative transposition = “copy-and-paste” transposition 24 DNA-mediated and RNA-mediated transposable elements: 1.Class I transposable elements (retrotransposons). 2.Class II transposable elements (DNA transposons). 25 26 Autonomous and nonautonomous transposable elements Autonomous transposable elements encode all the components of the transposition machinery. Nonautonomous transposable elements appropriate the transposition machinery of autonomous transposable elements. 27 Active and fossil transposable elements A transposable element is defined as active if it contains all the necessary sequence elements for either autonomous or nonautonomous transposition. Active elements may be rendered defective by different types of mutation, in which case they are referred to as fossil transposable elements. 28 Active and fossil transposable elements A transposable-element family may contain different combinations of active autonomous, active nonautonomous, fossil autonomous, and fossil nonautonomous transposable elements. For example, the human genome contains approximately 50,000 fossil autonomous and 200,000 fossil nonautonomous DNA transposons. Intriguingly, the human genome seems to contain NO active DNA transposons. 29 According to the numbers and kinds of genes they contain, DNA-mediated transposable elements are divided into insertion sequences and transposons. 30 31 Found in Escherichia coli and Shigella dysinteria. Length = 770 nucleotides, including two inverted terminal repeats, 23 bp each. Contains two out-of-phase reading frames, insA and insB, from which a single protein is produced by translational frameshifting at a run of adenines. The N-terminal is an inhibitor of transposition; the Cterminal is a transposase, an enzyme that catalyzes the insertion of transposable elements into insertion sites. 32 Carriers and non-carriers of the insertion sequence can be separated by centrifugation because the carriers are heavier. 33 INSERTION SEQUENCES (IS) Insertion sequences were first discovered in the gal operon of E. coli. Galactose (gal) operon galE galT galK galM galE = UDP-galactose 4-epimerase galT = galactose-l-phosphate uridylyltransferase galK = galactokinase galM = mutarotase 34 Galactose (gal) operon galT galE galK galM IS Insertion of an IS affects only the transcription of the genes downstream from the insertion. For example, if the IS occurs in the galT gene, the galT, galK and galM genes will be disrupted, but galE will not be. This phenomenon is known as a POLAR mutation. 35 36 37 38 39 Stupidity 40 Composite Hypercomposite transposons contain two or more transposons. 41 (symmetrical-inverted) Tn3 from E. coli confers streptomycin resistance. tnpR and bla are transcribed on one strand; tnpA on the other. Tn3 is flanked by 38-bplong inverted repeats. (asymmetrical) Tn554 from Staphylococcus aureus lacks terminal repeats and contains 8 protein-coding genes. Three of the genes are transcribed as a unit and encode transposases (tnpA, tnpB, and tnpO). The spc and ermA genes confer spectinomycin and erythromycin resistance, 42 respectively. Composite transposon Tn9 from Escherichia coli contains two copies of IS1 flanking the cat gene, which encodes a chloramphenicol-resistance protein. 43 Some mobile elements can transpose themselves in all cells; others are cellspecific. Tc1 elements in the nematode Caenorhabditis elegans and P elements in Drosophila melanogaster are usually mobile only in germ cells. 44 Transposition of many elements is regulated by developmental stage. From an evolutionary point of view, the developmental timing of transposition is particularly important, because it affects the propagation of the transposable element to future generations. 45 LINE-1 transposable elements in mammals are particularly active during leptotene and zygotene, when DNA-strand breakages occur. This offers an opportunity for transposable elements to insert themselves into new sites. 46 Genomic locations of target sites for transposition: Exclusive genomic preference: In the vast majority of cases IS4 incorporates itself in the galactosidase operon of Escherichia coli, and thus each bacterium contains mostly one copy of IS4. Complete randomness: Bacteriophage Mu transposes itself at random within the genome. Intermediate genomic preference: 40% of all Tn10 transposons in E. coli are found in the lacZ gene, which constitutes a minute fraction of the host genome. 47 Genomic locations of target sites for transposition: Affinity for a particular nucleotide composition: IS1 favors AT-rich sites. Affinity for a particular sequence: IS630 has a special affinity for 5'—CTAG—3' sequences. Chromosomal preference: TRIM elements in Drosophila miranda exhibit a preference for the Y chromosome. 48 Hotspots for P element insertion in the X chromosome of Drosophila melanogaster 49 Genomic locations of target sites for transposition: The DIRS-1 transposable element in the slime mold Dictyostelium discoideum 50 Genomic locations of target sites for transposition: Self affinity: DIRS-1 preferentially inserts itself into other DIRS-1 sequences. D. discoideum contains, on average, ~40 intact copies of DIRS-1 and ~300 fragments. oldest newest Active DIRS oldest 51 Species specificity: DIRS in Dictyostelium discoideum only. mariner moves from species to species, even if the species belong to different taxonomic kingdoms. 52 Retroelements are sequences that contain a gene for reverse transcriptase, which catalyzes the synthesis of cDNA from an RNA template. Not all retroelements possess the intrinsic capability to transpose. Therefore, not all retroelements are transposable elements. Retroelements that transpose do so by retroposition. 53 Central Dogma 54 Modified Central Dogma 55 Not the Central Dogma 56 That would have been nice… 57 That would have been nice… Joanna Masel 58 59 Retroelements Retroelements are DNA or RNA sequences that contain a gene encoding the enzyme reverse transcriptase, which catalyzes the synthesis of DNA from an RNA template. The resulting DNA molecule is called complementary DNA (cDNA). Not all retroelements are transposable or mobile. Retroelements can be divided into three categories: (1) transposable elements that move within a genome by replicative RNA-mediated transposition (but may also move intergenomically) (2) mobile nontransposable elements that only move intergenomically (3) non-mobile elements 60 Masayori Inouye Rutgers University 61 Retrons are widely distributed among bacterial species. Within each bacterial species retrons tend to be rare. Natural populations of retron-carrying genomes possess a single retron copy, either in the intergenic part of the genome or inside a prophage (a viral genome that had became integrated into the bacterial chromosome). 62 multicopy single-stranded DNA (msDNA) 63 TERT genes Human telomeres consist of the sequence TTAGGG tandemly repeated many thousand times. Because of asymmetrical DNA replication, a few of these repeats are lost from the tips of the chromosomes each replication cycle. 64 TERT genes Telomerases are nucleoproteins whose function is to add DNAsequence repeats to the 3' end of the DNA strands in the telomeres at the ends of linear eukaryotic chromosomes. The de novo addition of TTAGGG repeats by the enzyme telomerase partially or wholly compensates for telomere shortening. Telomerases in all eukaryotic species share at least two components essential for catalytic activity: a telomerase reverse transcriptase protein (TERT) and a telomerase RNA. The TERT encoding gene is a retroelement. In humans, this retroelement is located on chromosome 5. 65 Retroplasmids Extragenomic DNA and RNA molecules (plasmids) are frequently detected in fungal mitochondria. They can be divided into: - Genomically derived plasmids (similar in sequence to the mitochondrial sequence) - Autonomously-replicating true plasmids that exhibit no sequence similarity with the host mitochondrial genome. 66 Group-II introns Group-II introns are a subclass of self-splicing introns. Some group II introns contain protein-coding genes for endonuclease and reverse transcriptase. The latter may act as retrotransposable genetic elements. 68 Group-II introns Group-II introns can integrate into the homologous position of an intronless allele of the same gene (homing), and at much lower frequencies into other sites (retroposition). Retroposition 69 Twintrons are introns-within-introns excised by sequential splicing reactions. Group II twintrons have presumably been formed by the insertion of a group II intron into an existing group II intron. 70 Group-III introns are short ORF-less introns found in a small number of protist eukaryotes, such as Euglena gracilis. They appear to be group-II introns from which the central ORF-containing portion has been removed. Thus, group-III introns are essentially nonautonomous group-II introns. Group III twintrons are known. 71 Retrotransposons are transposable elements that use RNAmediated transposition, but do not construct virion particles, i.e., they lack the env (envelope) gene, and so, unlike retroviruses, cannot independently transport themselves across cells. Initially, the retrotransposons were divided into LTR retrotransposons and non-LTR retrotransposons (or retroposons) according to whether or not their coding sequences were flanked by long terminal repeats (LTRs). Subsequent evolutionary studies indicated that while most nonLTR retrotransposons constitute a monophyletic group, the LTR-retrotransposons are paraphyletic. Some LTR-retrotransposons have secondarily acquired env-like reading frames that may enable them to move from cell to cell (i.e., 72 they are in practice viruses). LTR retrotransposons Non-LTR retrotransposons 73 74 75 76 Integrase 77 78 Pararetroviruses: 1. HepaDNAviruses, e.g., hepatitis B virus 79 Pararetroviruses: 2. Caulimoviruses, e.g., cauliflower mosaic virus 80 81 Pararetroviruses are not transposable elements 82 83 84 RETROSEQUENCES Restrosequences (or retrotranscripts) are genomic sequences that have been derived through the reverse transcription of RNA and subsequent integration of the resulting cDNA into the genome. Retrosequences lack the ability to produce reverse transcriptase, and have been produced through the use of a reverse transcriptase from a retroelement. 85 86 Diagnostic features of mRNA derived retrosequences 1. lack of introns 2. precise boundaries coinciding with the transcribed regions 3. stretches of poly(A) at the 3’ end 4. short direct repeats at both ends 5. truncations 6. posttranscriptional modifications 87 88 many many few 89 Retrosequences: Retrogenes Processed genes Semiprocessed genes Retropseudogenes Processed pseudogenes Semiprocessed pseudogenes 90 91 92 Number of human retropseudogenes and number of parental functional genes __________________________________________________________________________ Number of Number of Gene genes retropseudogenes __________________________________________________________________________ argininosuccinate synthetase 1 14 -actin 1 ~20 -tubulin 2 15-20 Cu/Zn superoxide dismutase 1 >4 cytochrome c 2 20-30 dihydrofolate reductase 1 ~5 G3PD 1 ~25 lactate dehydrogenase A 1 10 lactate dehydrogenase B 1 3 lactate dehydrogenase C 1 6 laminin 1 >20 nonmuscle tropomyosin 1 >3 nucleophosmin B23 1 7-9 phosphoglycerate kinase 1 2 prohibitin 1 >4 prothymosin 1 >5 ribosomal protein L32 1 ~20 triosephosphate isomerase 1 5-6 93 __________________________________________________________________________ Genes tend to “bombard” the genome with dead copies of themselves. The “Vesuvian” paradigm 94 How do you get reverse transcribed sequences to become incorporated into the germline genome if the gene itself is not transcribed in the germline? 95 96 Pseudogenes are affected by two evolutionary processes: Compositional assimilation: The accumulation of mutations which obliterate the similarity between the pseudogene and its functional paralogue. The nucleotide composition of the pseudogene will come to resemble its surroundings, eventually “blending” into it. Abridgment: Due to the excess of deletions over insertions, pseudogenes become increasingly shorter compared to the functional gene. 97 It takes on average 400 million years for a mammalian retropseudogene to lose half of its length. Mammals are ~200 million years old and, therefore, the mammalian genome is expected to contain reptilian pseudogenes. These ancient pseudogenes have by now become unrecognizable. 98 Mammalian processed pseudogenes are created at a much faster rate than the rate by which they are obliterated by deletion. 99 Processed pseudogenes are abundant in mammals. Processed pseudogenes are rare in amphibians, rarer in birds, and even rarer in Drosophila. 100 Creation rates rates Deletion 101 Hypothesis: Retroposition occurs mainly in the female germline Spermatogenesis is similar among animals. Oogenesis in mammals differs from that in the other animals by a prolonged lambrush stage (= suspended animation) that lasts from birth to ovulation (up to 40 years in humans). Lampbrush chromosome 102 Creation rates are determined by the length of the suspended animation during oogenesis 20-40 years in humans. 2-4 months in amphibians. Less than 3 weeks in birds. Less than 1 day in Drosophila. 103 Prediction: Retrosequences should be found in high numbers on the X chromosome, in intermediate numbers on autosomes, and be rare on the Y chromosome. 104 Prediction: Retrosequences should be found in high numbers on the X chromosome, in intermediate numbers on autosomes, and be rare on the Y chromosome. Density of Processed Pseudogenes in Human Chromosomes (from Bischof et al. 2006) Mean Density in Autosomes Density in X-chromosome Density in Y-chromosome 2.28 ± 0.40 3.01 0.74 105 Differences among organisms in numbers of retropseudogenes can also be due to deletion rate. DNA loss in Drosophila is ~75 times faster than that in mammals. This high rate may explain the dearth of pseudogenes in Drosophila. 106