* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Slide 1
Point mutation wikipedia , lookup
Gel electrophoresis wikipedia , lookup
Vectors in gene therapy wikipedia , lookup
Promoter (genetics) wikipedia , lookup
Real-time polymerase chain reaction wikipedia , lookup
Metalloprotein wikipedia , lookup
Biochemistry wikipedia , lookup
Non-coding DNA wikipedia , lookup
Messenger RNA wikipedia , lookup
Protein structure prediction wikipedia , lookup
Biosynthesis wikipedia , lookup
Transcriptional regulation wikipedia , lookup
Silencer (genetics) wikipedia , lookup
Genetic code wikipedia , lookup
RNA interference wikipedia , lookup
RNA polymerase II holoenzyme wikipedia , lookup
Eukaryotic transcription wikipedia , lookup
Polyadenylation wikipedia , lookup
Gene expression wikipedia , lookup
Nucleic acid analogue wikipedia , lookup
Epitranscriptome wikipedia , lookup
Measuring Selection in RNA. Naila Mimouni, Rune Lyngsoe and Jotun Hein Department of Statistics, Oxford University Aim Preliminary Results: • Extract selection information from conservation of secondary structure of alignments of homologous RNA sequences from different species, for different RNA families. Overall: Motivation RNA Evolution: + Mutation on the DNA (before it is translated to RNA). + Selection on the RNA to preserve structure stability and function. Protein Selection: + Periodicity of the 3 bases forming the amino acid. + The genetic code. + Ka/Ks. RNA Selection: No equivalent of Ka/Ks. Method Basic idea: Exploit conservation of RNA secondary structure. Stem Classes: I: II: Two Approaches: A) Counting Approach: + Reconstruct phylogeny: Non-directionality. + Count Substitution according to their location: stem lengthening/shortening in loops/stems respectively. + Class Definitions: III: B) Evolutionary Modelling: + Reconstructing phylogeny: directionality. + Novel Rates: Singlet, doublet and junk rates. + Homologous RNAs partitioned according to similar homology. + Likelihood test: Given a gene: pseudogene Vs functional. Loop Classes I: Data: + Rfam: RNA sequence alignments with conserved structure. + 503 different RNA families. + Largest dataset for investigating RNA selection. + Initial focus: 47 miRNA families, each containing aligned sequences with conserved structure. III: Future Work + Maximum likelihood for phylogeny recontruction. + Devise the singlet, doublet rates. + construct selection model + Expand to other RNA families. 1: Knudsen, B. & Hein, J. 2003. Pfold: RNA secondary structure prediction using stochastic context-free grammars. Nucleic Acids Research, 31(13), 3423-8. 2: Schroeder, S. J., Burkard, M. E. & Turner, D. H. 2001. The Energetics of Small Internal Loops in RNA. Biopolymers Nucleic Acid Sci., 52, 157-167. II: