* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Lecture15
Long non-coding RNA wikipedia , lookup
Gene desert wikipedia , lookup
Extrachromosomal DNA wikipedia , lookup
Human genetic variation wikipedia , lookup
Copy-number variation wikipedia , lookup
Gene expression profiling wikipedia , lookup
Polycomb Group Proteins and Cancer wikipedia , lookup
Short interspersed nuclear elements (SINEs) wikipedia , lookup
Ridge (biology) wikipedia , lookup
Microevolution wikipedia , lookup
Transposable element wikipedia , lookup
Epigenetics of human development wikipedia , lookup
Gene expression programming wikipedia , lookup
No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup
Designer baby wikipedia , lookup
History of genetic engineering wikipedia , lookup
Oncogenomics wikipedia , lookup
Skewed X-inactivation wikipedia , lookup
Metagenomics wikipedia , lookup
Artificial gene synthesis wikipedia , lookup
Genomic imprinting wikipedia , lookup
Public health genomics wikipedia , lookup
Mitochondrial DNA wikipedia , lookup
Non-coding DNA wikipedia , lookup
Segmental Duplication on the Human Y Chromosome wikipedia , lookup
Whole genome sequencing wikipedia , lookup
Y chromosome wikipedia , lookup
Helitron (biology) wikipedia , lookup
Genomic library wikipedia , lookup
X-inactivation wikipedia , lookup
Genome editing wikipedia , lookup
Site-specific recombinase technology wikipedia , lookup
Pathogenomics wikipedia , lookup
Human genome wikipedia , lookup
Neocentromere wikipedia , lookup
Genome (book) wikipedia , lookup
Minimal genome wikipedia , lookup
Bioinformatics Lecture 15 • Chromosome rearrangements • Chromosome and genome comparison versus gene comparison • Permutations and breakpoint graphs • Transforming Men into Mouse • Evolutionary reconstructions Types of chromosome rearrangements • Translocations • Inversions • Insertions A B C D E F K L M N O P K L M C D E F A B O P A C D E F K O D E F K L M N B A B C • Duplications A B C • Deletions A B C • Fusions/Fissions A B C E F D E E F N M L P K L L M N E F A B C E F N K L K L O P K L O P O O P O P P Chromosome and genome comparisons versus gene comparisons • Comparisons of genes, proteins and non-coding sequences is not the only way to study relations between different species. • Attempts were made from 1930s to use chromosome rearrangements information for this purpose. • It has been shown that genomes consist of a relatively moderate number of “conserved” so called syntenic blocks, which carry nearly the same or very similar set of genes. The latest study revealed 281 syntenic blocks, which were observed when human and mouse genomes were compared. • This type of analysis is independent and different from sequence data and provide very useful information, which can not be obtained by other means. • When the number of rearrangements is small (<10), tracing back rearrangements is not too demanding. However, sophisticated mathematics and specialised computer programmes are required to reconstruct chains of events using real numbers of rearrangements. Transformation of cabbage in turnip A most parsimonious rearrangement scenario for transformation of worm mtDNA into human (26 reversals) human mtDNA Red arrows show direction of reversals. You may continue this process. mtDNA of worm Ascaris suum Permutations. Sorting by reversals. • • • ? Genome rearrangements can be modelled by a combinatorial problem of sorting by reversals. This problem is known in computer science as the pancake flipping problem. A chef is sloppy and the size of pancakes vary significantly. A waiter rearrange them by flipping, thus putting pancakes in the right order. Bill Gates and Christos Papadimitriou tried to solve the problem while being undergraduates at Harvard (1979). They proved that the prefix reversal diameter of the symmetric group, dpref(n) = maxSn dpref (), is less than or equal to 5/3 (n) + 5/3 and that for infinitely many n, dpref(n) 17/16(n). The pancake flipping problem thus still remains unsolved in general form. Breakpoints, breakpoint graph, and maximum cycle decomposition vertices (nodes) edges The estimate of reversal distance in terms of breakpoints is not very accurate. Another parameter (size of maximum cycle decomposition of the breakpoint graph) is better and plays an important role in estimating reversal distance (Pevzner 2000). For most of biological examples, d() = n + 1 – c(), where c() is a maximum number of edge-disjoint alternative cycles and n is number of elements in permutations. This procedure reduces the reversal distance problem to the maximum cycle decomposition problem. Modelling a signed permutation by an unsigned permutation Genes are directed fragments of DNA (5’ 3’) and a sequence of n genes is represented by signed (+ or -) permutation. Every reversal changes both the order and the signs of the elements within that fragment, chromosome or genome. Signed permutations and physical maps Physical maps usually do not provide information about the direction of genes, and therefore lead to representation of a genome as an unsigned permutation . But it can be done using signed permutations. Optimal sorting of a permutation by five reversals Breakpoint graph of this permutation Transformation of a signed permutation into an unsigned permutation and the breakpoint graph G() Interleaving graph H with two oriented and one unoriented component Multichromosomal genomes • Complications are inevitable on the way from genomes containing one chromosome to multichromosomal genomes. • Internal and terminal inversion should be distinguished and translocations are considered as internal if it is neither a fusion nor fission. • A new order of chromosomal segments and chromosome in a genome called concatenate. There exist 2N concatenates in a genome with N elements (chromosomes/chromosome arms). • Sometime certain types of rearrangements can be mimicked by others. Translocations can be mimicked by inversions inversion Evolutionary transformation of genome A into genome B A -3-2-1+4 +1+2+3+4 +5+6+7+8 inversion +9+10+11 +5+6+7+8 translocation +1+2+3+4 +1+2+3+4 +5+6+7+11 +5+6+7+8 +9+10+8 +9+10+11 +9+10+11 B +1+2+3+4+5+6+7+11 +9+10+8 fission +1+2+3+4+5+6+7+11 +9 +10+8 fusion Two different most parsimonious scenarios that transform the order of the 11 synteny blocks on the mouse X chromosome into order on the human X chromosome. Genome rearrangements and phylogenetic studies • Cytogenetic methods could reveal only a small fraction of numerous rearrangements, which took place in evolution. • Huge amount of “rearrangement” information hidden in genomes became available very recently, when whole genomes were sequenced and reassembled. • In the near future the amount of information in this field will grow quickly and phylogenetic reconstructions using rearrangements data, which are independent from sequence data, will be an important area of research. • This is already true for simple genome.