* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Beyond Galled Trees – Decomposition and Computation of Galled
Survey
Document related concepts
Transcript
Drawing Phylogenetic Networks Daniel H. Huson joint work with Tobias Kloepper and Regula Rupp 1Future Directions in Phylogenetics, Cambridge, December 2007 Split Networks and Cluster Networks Split network Data: binary sequences (Kumar, 1998) Cluster network 2 How to Draw Cluster Networks? Data: 61 genes (Leebens-Mack et al, MBE, 2005) 3 Cladograms For Trees 4 Phylograms and Radial Diagrams 5 Drawing a Cladogram for a Tree Assign x-coords in a postorder traversal of N: if v is a leaf: x(v) = 0 4 else: x(v) = max x of children + 1 1 0 0 3 0 2 1 0 0 0 7 Drawing a Cladogram for a Tree 1 Assign y-coords in a postorder traversal: if v is a leaf: y(v) = number of leaves visited else: y(v) = mean y of children 1.5 2 2.625 3 3.75 4.125 4.5 4 5 6 8 Naïve Algorithm for Drawing Networks Network N: p q p Choose a guide tree T Compute coordinates for T Draw network using tree coordinates 9 Naïve Algorithm for Drawing Networks Q R P Problems: x-coordinates: P and Q have different x-coordinates y-coordinates: R isn‘t placed between P and Q Unnecessary edge crossings 10 Better x-Coordinates Assign x-coords in a postorder traversal of N: if v is a leaf: x(v) = 0 else: x(v) = max x of children + 1 2 1 0 0 1 4 0 0 3 2 0 11 Better y-Coordinates? Need to introduce: – – LSA guide tree topological embedding 12 Lowest Single Ancestor The LSA of a node v is the last node ( v) on all paths from to v: v lsa(v) 13 LSA Guide Tree Connect each reticulate node to its LSA and remove all reticulate edges: LSA tree T cladogram 14 Topological Embedding A topological embedding is given by an ordering of the children of each node v: v e q v d r q r r w p b p w a Any gives rise to a planar drawing of T 15 Better y-Coordinates Choose so that reticulate nodes are placed between their sources: Order subtrees in preorder traversal of T 16 Resulting Cladogram Use diagonal or curved lines for reticulate edges 17 Additional Twist for y-Coordinates LSA guide tree has true and false leaves: Network N LSA tree T A leaf is false if it is only a leaf in T 18 Additional Twist for y-Coordinates False leaves produce uneven spacing of leaves in N: 19 Additional Twist for y-Coordinates Assign integer coordinates to true leaves, fractional to false ones: 1 2 3 3½ 4 5 6 20 Circular Cladograms Compute polar coordinates in similar way: 21 Phylograms and Radial Diagrams y-coordinates: as for cladogram x-coordinates: preorder traversal 22 Example 1 Multiple gene trees Leebens-Mack et al, MBE, 2005 61 chloroplast genes for 26 plants Filtered cluster networks – – – – 50% 30% 20% 10% 23 Example 2: Splits vs Clusters Split network of consensus splits from 106 maximum-parsimony trees (Rokas et al, 2003) (Holland et al, 2004) Weighted cluster network 24 Weights for Reticulate Edges? Use LSA to determine weight of reticulate edge: 1,2,3,4,5 1,3,4 v lsa(v) 2,5 Use average weight on paths to lsa(v) 25 Example 3 Arndt von Haeseler over 12,000 trees 76.5 % 11.4 % 11.4 % Weighted cluster network 26 Outlook All algorithms discussed have been implemented and will be made available in Dendroscope2 Dendroscope2 will be released in first half of 2008 Dendroscope 1 (for trees only) is available from: www-ab.informatik.uni-tuebingen.de/software 27