* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Modelling interactomes
Paracrine signalling wikipedia , lookup
Ribosomally synthesized and post-translationally modified peptides wikipedia , lookup
Signal transduction wikipedia , lookup
Biochemistry wikipedia , lookup
Ancestral sequence reconstruction wikipedia , lookup
Magnesium transporter wikipedia , lookup
Gene expression wikipedia , lookup
G protein–coupled receptor wikipedia , lookup
Structural alignment wikipedia , lookup
Expression vector wikipedia , lookup
Metalloprotein wikipedia , lookup
Bimolecular fluorescence complementation wikipedia , lookup
Western blot wikipedia , lookup
Protein purification wikipedia , lookup
Homology modeling wikipedia , lookup
Proteolysis wikipedia , lookup
Two-hybrid screening wikipedia , lookup
MODELLING INTERACTOMES RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How does the genome of an organism specify its behaviour and characteristics? How can we use this information to improve human health and quality of life? PROTEOME ~60,000 in human ~60,000 in rice ~4500 in bacteria Several thousand distinct sequence families STRUCTURE A few thousand distinct structural folds FUNCTION Tens of thousands of functions EXPRESSION Different expression patterns based on time and location INTERACTION Interaction and expression are interdependent with structure and function PROTEIN FOLDING Gene …-CTA-AAA-GAA-GGT-GTT-AGC-AAG-GTT-… Protein sequence …-L-K-E-G-V-S-K-D-… One amino acid Unfolded protein Spontaneous self-organisation (~1 second) Native biologically relevant state • Not unique • Mobile • Inactive • Expanded • Irregular PROTEIN FOLDING Gene …-CTA-AAA-GAA-GGT-GTT-AGC-AAG-GTT-… Protein sequence …-L-K-E-G-V-S-K-D-… One amino acid Unfolded protein Spontaneous self-organisation (~1 second) Native biologically relevant state • Not unique • Mobile • Inactive • Expanded • Irregular • Unique shape • Precisely ordered • Stable/functional • Globular/compact • Helices and sheets STRUCTURE One distance constraint for every six residues 0 2 Experiment (X-ray, NMR) One distance constraint for every ten residues Cα RMSD 4 ACCURACY Computation (de novo) Computation (template-based) Hybrid (Iterative Bayesian interpretation of noisy NMR data with structure simulations) 6 STRUCTURE T0290 – peptidyl-prolyl isomerase from H. sapiens T0288 – PRKCA-binding from H. sapiens 0.5 Å Cα RMSD for 173 residues (60% identity) 2.2 Å Cα RMSD for 93 residues (25% identity) T0332 – methyltransferase from H. sapiens T0364 – hypothetical from P. putida 2.0 Å Cα RMSD for 159 residues (23% identity) 5.3 Å Cα RMSD for 153 residues (11% identity) Liu/Hong-Hung/Ngan FUNCTION Ion binding energy prediction with a correlation of 0.7 Calcium ions predicted to < 0.05 Å RMSD in 130 cases Meta-functional signature accuracy Meta-functional signature for DXS model from M. tuberculosis Wang/Cheng INTERACTION Transcription factor bound to DNA promoter regulog model from S. cerevisiae Prediction of binding energies of HIV protease mutants and inhibitors using docking with dynamics BtubA/BtubB interolog model from P. dejongeii (35% identity to eukaryotic tubulins) McDermott/Wichadakul/Staley/Horst/Manocheewa/Jenwitheesuk/Bernard SYSTEMS Example predicted protein interaction network from M. tuberculosis (107 proteins with 762 unique interactions) Proteins PPIs TRIs H. sapiens 26,741 17,652 828,807 1,045,622 S. cerevisiae 5,801 5,175 192,505 2,456 O.sativa (6) 125,568 19,810 338,783 439,990 E. coli 4,208 885 1,980 54,619 In sum, we can predict functions for more than 50% of a proteome, approximately ten million protein-protein and protein-DNA interactions with an expected accuracy of 50%. Utility in identifying function, essential proteins, and host pathogen interactions McDermott/Wichadakul SYSTEMS Combining protein-protein and protein-DNA interaction networks to determine regulatory circuits McDermott/Rashid/Wichadakul INFRASTRUCTURE ~500,000 molecules over 50+proteomes served using a 1.2 TB PostgreSQL database and a sophisticated AJAX webapplication and XML-RPC API http://bioverse.compbio.washington.edu http://protinfo.compbio.washington.edu Guerquin/Frazier INFRASTRUCTURE Guerquin/Frazier INFRASTRUCTURE http://bioverse.compbio.washington.edu/integrator Chang/Rashid APPLICATION: RICE INTERACTOMICS Proteome Number of proteins O. sativa japonica KOME cDNAs O. sativa indica BGI 9311 O. sativa japonica Syngenta O. sativa indica IRGSP O. sativa japonica nrKOME cDNAs O. sativa indica BGI pa64 Total Total (unique) 25,875 40,925 38,071 36,658 19,057 37,712 198,298 125,568 Number annotated (%) 11,841 22,278 20,874 20,481 7478 15,286 Number in protein network Number of protein interactions (44%) (55%) (55%) (56%) (39%) (41%) 4705 5849 5911 5835 3047 5780 88,102 95,149 104,640 110,118 38,793 98,779 98,238 (50%) 60,272 (48%) 31,127 19,810 535,581 338,783 http://bioverse.compbio.washington.edu http://protinfo.compbio.washington.edu McDermott/Wichadakul APPLICATION: RICE INTERACTOMICS BGI/McDermott APPLICATION: DRUG DISCOVERY HSV CMV KHSV Jenwitheesuk APPLICATION: DRUG DISCOVERY Computionally predicted broad spectrum human herpesvirus protease inhibitors is effective in vitro against members from all three classes and is comparable or better than anti-herpes drugs HSV KHSV CMV Our protease inhibitor acts synergistically with acylovir (a nucleoside analogue that inhibits replication) and it is less likely to lead to resistant strains compared to acylovir HSV HSV Lagunoff APPLICATION: NANOTECHNOLOGY Oren/Sarikaya/Tamerler FUTURE + Structural genomics + Functional genomics Computational biology MODELLING PROTEIN AND PROTEOME STRUCTURE FUNCTION AT THE ATOMIC LEVEL IS NECESSARY TO UNDERSTAND THE RELATIONSHIPS BETWEEN SINGLE MOLECULES, SYSTEMS, PATHWAYS, CELLS, AND ORGANISMS ACKNOWLEDGEMENTS Current group members: •Baishali Chanda •Brady Bernard •Chuck Mader •Ersin Emre Oren •Ekachai Jenwitheesuk •Gong Cheng •Imran Rashid •Jeremy Horst •Ling-Hong Hung •Michal Guerquin •Shu Feng •Siriphan Manocheewa •Somsak Phattarasukol •Stewart Moughon •Tianyun Liu •Vania Wang •Weerayuth Kittichotirat •Zach Frazier •Reene Ireton, Program Manager Past group members: •Aaron Chang •David Nickle •Duangdao Wichadukul •Duncan Milburn •Jason McDermott •Marissa LaMadrid •Kai Wang •Kristina Montgomery •Shing-Chung Ngan •Vanessa Steinhilb •Yi-Ling Cheng ACKNOWLEDGEMENTS Collaborators: Funding agencies: •BGI -Gane Wong -Jun Yu - Jun Wang -et al. •BIOTEC/KMUTT •MSE -Mehmet Sarikaya -Candan Tamerler -et al. •UW Microbiology -James Staley -John Mittler -Michael Lagunoff -Roger Bumgarner -Wesley Van Voorhis -et al. •National Institutes of Health •National Science Foundation -DBI -IIS •Searle Scholars Program •Puget Sound Partners in Global Health •Washington Research Foundation •UW -Advanced Technology Initiative -TGIF E. coli INTERACTIONS McDermott M. tuberculosis INTERACTIONS McDermott C. elegans INTERACTIONS McDermott H. sapiens INTERACTIONS McDermott Network-based annotation for C. elegans McDermott KEY PROTEINS IN ANTHRAX Articulation points McDermott HOST PATHOGEN INTERACTIONS McDermott