* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download PPT - Bioinformatics Research Group at SRI International
Survey
Document related concepts
Transcript
Navigation to Related Objects Mario Latendresse October 2010, Pathway Tools Workshop Bioinformatics Research Group SRI International [email protected] 1 SRI International Bioinformatics Available on the Web 2 Applies for compounds, genes, proteins, reactions, pathways Once such an object is found, a new menu appears on the right on the top menu bar The menu offers to search the object found: 1. In all databases or 2. Another database The “In all databases” is demanding (BioCyc has over 1000 databases); more so for genes and proteins SRI International Bioinformatics Does this Object Exist in Other PGDBs? 3 SRI International Bioinformatics How is Similarity Computed? For pathways, reactions, and compounds: based on frame-ids (unique identifiers) For proteins and genes: based on names, synonyms, and orthologs The result page tells you, for each database, if found based on name/synonyms or orthologs, or both 4 SRI International Bioinformatics Demo on the “another database” 1. 2. 3. 4. 5. 5 Web At BioCyc.org, select E. coli K12 as the database to search Search for the glycolysis pathway using the Search->Pathways command Once the glycolysis pathway is displayed, Select the command Pathway->Show this pathway in another database Select the database to search (e.g., MetaCyc) If it exists in the other database, the glycolysis pathway is displayed for this database SRI International Bioinformatics of Demo on the Web of “in all databases” 1. 2. 3. 4. 6 At BioCyc.org, select E. coli K12 as the database to search Search for compound N-methyltryptophan. Once the compound is displayed, select the command Compound->Show this compound in all databases About 28 databases (the number depends on the current version of BioCyc.org) are found containing that compound SRI International Bioinformatics Result for “In all databases” for a Compound 7 SRI International Bioinformatics For Genes and Proteins The search for similar proteins and genes is based on names (name and synonyms) and orthologs The result page gives the reasons of the match 8 SRI International Bioinformatics Result “In all databases” for a Gene 9 SRI International Bioinformatics Implementation The two commands are implemented using BioVelo. A general purpose query language for Pathway Tools The query itself can be saved since it appears as a URL in the browser 10 SRI International Bioinformatics Implementation (2) Searching all databases for a pathway [ ( Z1,r ): Z1<-dbs, r := Z1~GLYCOLYSIS ] Searching all databases for a compound [ (Z1,r,r^Chemical-formula): Z1 <- dbs, r:=Z1~PEPTIDE-TRYPTOPHAN ] Searching all databases for a reaction [ (Z1,r,r^EC-number): Z1 <- dbs, r := Z1~L-IDITOL-2-DEHYDROGENASE-RXN ] Searching all databases for a gene [ r : l1 := find-objects(META,ECOLI~EG11024), l2 := find-orthologs(META,ECOLI~EG11024), both := l1**l2 , ort := l2 -- both, sim := l1 -- both, r <- … both … ort … sim …] 11 SRI International Bioinformatics Navigation to Related Objects Questions? 12 SRI International Bioinformatics