* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
Download PPT3
DNA barcoding wikipedia , lookup
Gene expression wikipedia , lookup
Gene desert wikipedia , lookup
Nucleic acid analogue wikipedia , lookup
Gel electrophoresis of nucleic acids wikipedia , lookup
Transcriptional regulation wikipedia , lookup
History of molecular evolution wikipedia , lookup
Silencer (genetics) wikipedia , lookup
Gene regulatory network wikipedia , lookup
List of types of proteins wikipedia , lookup
Promoter (genetics) wikipedia , lookup
Deoxyribozyme wikipedia , lookup
Molecular ecology wikipedia , lookup
Holliday junction wikipedia , lookup
Genome evolution wikipedia , lookup
Point mutation wikipedia , lookup
Community fingerprinting wikipedia , lookup
Molecular cloning wikipedia , lookup
Endogenous retrovirus wikipedia , lookup
Non-coding DNA wikipedia , lookup
Vectors in gene therapy wikipedia , lookup
Artificial gene synthesis wikipedia , lookup
Biochemistry, computing in biology Course outline 1 Introduction 2 Theoretical background Biochemistry/molecular biology 3 Theoretical background computer science 4 History of the field 5 Splicing systems 6 P systems 7 Hairpins 8 Detection techniques 9 Micro technology introduction 10 Microchips and fluidics 11 Self assembly 12 Regulatory networks 13 Molecular motors 14 DNA nanowires 15 Protein computers 16 DNA computing - summery 17 Presentation of essay and discussion Recombination Recombination and crossover Recombination and crossover Recombination and crossover If no exchange of genes (i.e. phenotypic marker) occurs, recombination event can not be detected Recombination and crossover Introduction to ciliates literature Genome Gymnastics: Unique Modes of DNA Evolution and Processing in Ciliates. David M. Prescott, Nature Reviews Genetics Computational power of gene rearrangement. Lila Kari and Laura Landweber, DIMACS series in discreet mathematics and theoretical computer science The ciliate Very ancient ( ~ 2 . 109 years ago) Very rich group ( ~ 10000 genetically different organisms) Very important from the evolutionary point of view The ciliate DNA molecules in micronucleus are very long (hundreds of kilo bps) DNA molecules in macronucleus are genesize, short (average ~ 2000 bps) The ciliate The ciliate tree Baldauf et al. 2000. Science 290:972. Urostyla grandis Eschaneustyla sp. Holosticha kessleri Uroleptus Bar: 50 mm Scrambled Genes sp. S.Found lemnae O. trifallax O. nova Bar: 25 mm Bar: 100 mm Bar: 100 mm The ciliate The ciliate Dapi staining of the ciliate Nuclei Micronucleus the small nucleus containing a single copy of the genome that is used for sexual reproduction Macronucleus the large nucleus that carries up to several hundred copies of the genome and controls metabolism and asexual reproduction Lifecycle of a ciliate Micronucleus Macronucleus Cutting, splicing, elimination, reordering, and amplification of DNA Prescott, 2000 The ciliate, meiosis The ciliate, reproduction MIC MAC Cell Pairing Meiosis and Nuclear Exchange Nuclear Fusion and Duplication of the Zygotic Nucleus Macronuclear Development and Nuclear Degeneration Polytenization Chromatid breakage De novo telomere formation Modified from Larry Klobutcher & Carolyn Jahn Ann. Review Microbiology, 2002 Computing in ciliates The ciliate Astounding feats of ‘DNA computing’ are routine in this ‘simple’ single -celled organism— a protozoan. In initial micronucleus, DNA is‘junky’ and scrambled, but…. ….it reassembles itself in proper sequence by means of computer-like acrobatics (unscrambling, throwing out genetic ‘junk’)—in macronucleus The complexity of spirotrich biology Telomere Pointers MAC MIC MDS: macronuclear destined sequences IES: internal eliminated segments Splicing Fractioned genes The complexity of gene scrambling Intervening non-coding DNA regions (IES: internal eliminated segments) interrupt protein-coding sequences (MDS macronuclear destined sequences) IESs are removed during macronuclear development MDSs are unscrambled Prescott, 2000 Scramble genes -TBP, actin I, DNA pol -TBP Prescott et al., 1998 Actin I Oxytricha nova Hogan et al., 2001 DNA polymerase Landweber et al., 2000 Degree of scrambling in -TBP Prescott et al, 1998 Unscrambling of actin I Hogan et al, 2001 Degree of scrambling in DNA pol Landweber et al, 2000 DNA folding and recombination DNA pol DNA folding and recombination DNA folding and recombination DNA pol DNA pol : Hairpin loop Prescott, 2000 Recombination -TBP Prescott et al, 1998 Tracing evolutionary scrambling (i) Isolate the micronuclear and macronuclear forms of the -TBP gene (ii) Compare the micronuclear and macronuclear gene structures (MDS and IESs) to determine whether the gene is scrambled (iii) Compare homologous MDSs and scrambling patterns in various stichotrich species (earlier diverging species vs later diverging species) (iv) Trace a parsimonious evolutionary scrambling pathway Comparisons of scrambling complexity Oxytrichidae and Paraurostyla weissei Uroleptus sp. The evolution of recombination Paraurostyla weissei Uroleptus sp. Stylonychia mytilus 100 100 Oxytricha nova 100 Oxytricha trifallax Evolutionary scrambling pathway Holosticha sp. S. mytilus O. nova O. trifallax P. weissei Uroleptus sp. Formal theory Ciliate computing The process of gene unscrambling in hypotrichous ciliates represents one of nature’s ingenious solutions to the computational problem of gene assembly. With some essential genes fragmented in as many as 50 pieces, these organisms rely on a set of sequence and structural detangle their coding regions. clues to For example, pointer sequences present at the junctions between coding and non-coding sequences permit reassembly of the functional copy. As the process of gene unscrambling appears to follow a precise algorithm or set of algorithms, the question remains: what is the actual problem being solved? The problem in the cell Genomic Copies of some Protein-coding genes are obscured by intervening nonprotein-coding DNA sequence elements (internally eliminated sequences, IES) Protein-coding sequences (macronuclear destined sequences, MDS) are present in a permuted order, and must be rearranged. Assumption By clever structural alignment…, the cell decides which sequences are IES and MDS, as well as which are guides. After this decision, the process is simply sorting, O(n). Decision process unknown, but amounts to finding the correct path. Most Costly. Ciliate computing there is some as yet undiscovered “oracle”mechanism within the cell, or the cell simulates non-determinism the former solution lacks biological credibility and the latter implies exponential time and space explosion. What we want is a deterministic algorithm for applying the inter- and intramolecular recombination operations to descramble an arbitrary gene. Ciliate computing The first proposed step in gene unscrambling— alignment or combinatorial pattern matching— may involve searches through several possible matches, via either intra-molecular or intermolecular strand associations. This part could be similar to Adleman’s (1994) DNA solution of a directed Hamiltonian path problem. Ciliate computing The second step—homologous recombination at aligned repeats—involves the choice of whether to retain the coding or the non-coding segment between each pair of recombination junctions. This decision process could even be equivalent to solving an n-bit instance of a satisfiability problem, where n is the number of scrambled segments. Ciliate computing We use develop our knowledge a model for of the first step to the guided homologous recombinations and prove that such a model has the computational power of a Turing machine, the accepted formal model of computation. This indicates that, in principle, these unicellular organisms may have the capacity to perform at least any computation electronic computer. carried out by an Ciliate computing, the naïve model Assume the cell simply reconstructs the genes by matching up pointers. Just one problem... pointer sequences are not unique. In fact, may have multiplicities greater than 13. The proposed solution to this was that the cell would simply try every possible combination of pointers until it found the right two. How the cell computes Relies on short repeat sequences to act as guides in homologous recombination events Splints analogous to edges in Adleman One example represents solution city HP (50 pieces reordered) of 50 Formal model Guided recombination system uxwxv uxv wx Formal model Context necessary for a recombination between repeats x (p, x, q) ~ (p’, x, q’) Formal model, splicing operation Formal Language Model uxwxv uxv wx Where u=u’p, w=qw’=w’’p’, v=q’v’ Intramolecular recombination. x. The guide is Delete x wx from original. Intermolecuar recombination. Exchange. Strand This is a universal Turing machine (proven by Tom Head) Formal model, splicing operation Gene unscrambling algorithm Ciliate computing Gene assembly in ciliates Micronucleus: cell mating Macronucleus: RNA transcripts (expression) Micro: I0 M1 I1 M2 I2 M3 … Ik Mk Ik+1 M = P1 N P2 Macro: permutation of (possibly rotated) M1,…, Mk and I0 ,…, Ik+1are removed Molecular operators Molecular operators Molecular operators Molecular operators Molecular operators Molecular operators Molecular operators Molecular operators Molecular operators Molecular operators Molecular operators Molecular operators Molecular operators Pointers The pointer sequences must be in spatial proximity during unscrambling Topology must be faithfully reproduced somehow Relocation of a locus Recombination event attaches Minor Locus to end of Major Locus