Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15th, 2013 The Chemical Space There are atoms and space. Everything else is opinion. —Democritus (ca. 460 BC – ca. 370 BC) 2 CBS, Department of Systems Biology Systems Chemical Biology 3 CBS, Department of Systems Biology Today’s Learning Objectives • To introduce you to the field of chemoinformatics and the most commonly used terms and methods • To show examples of the use of chemoinformatics in modern drug research • To give you practical experience through hands-on exercises 4 CBS, Department of Systems Biology Drug Discovery Process Disease 5 Drug target CBS, Department of Systems Biology Drug candidate Animal studies Clinical studies Marketed drug A drug candidate… ... is a (ligand) compound that binds to a biological target (protein, enzyme, receptor, ...) and in this way either initiates a process (agonist) or inhibits it (antagonist/inhibitor) The structure/conformation of the ligand is complementary to the space defined by the protein’s active site The binding is caused by favorable interactions between the ligand and the side chains of the amino acids in the active site. (electrostatic interactions, hydrogen bonds, hydrophobic contacts...) 6 CBS, Department of Systems Biology 7 CBS, Department of Systems Biology Wet-lab drug discovery process HTS Screening collection 106 cmp. 8 Actives 103 actives CBS, Department of Systems Biology Wet-lab drug discovery process HTS Screening collection 106 cmp. Actives 103 actives High rate of false actives!!! High throughput is not enough to get high output….. 9 CBS, Department of Systems Biology Wet-lab drug discovery process HTS Screening collection 106 cmp. 10 Follow-up Actives 103 actives CBS, Department of Systems Biology Chemical structure Purity Mechanism Activity value Wet-lab drug discovery process HTS Screening collection 106 cmp. Follow-up Actives Hits 103 actives 1-10 hits Analogues synthesis and testing ADMET properties 11 CBS, Department of Systems Biology Wet-lab drug discovery process HTS Screening collection 106 cmp. Follow-up Actives Hits 103 actives 1-10 hits Hit-to-lead Lead series 0-3 lead series Analogues synthesis and testing ADMET properties 12 CBS, Department of Systems Biology Wet-lab drug discovery process HTS Screening collection 106 cmp. Follow-up Actives Hits 103 actives 1-10 hits Hit-to-lead Lead-to-drug Lead series 0-3 lead series Drug candidate 0-1 Analogues synthesis and testing ADMET properties 13 CBS, Department of Systems Biology 14 CBS, Department of Systems Biology Failures 15 CBS, Department of Systems Biology We need more.. to find less.. 16 CBS, Department of Systems Biology Drug Discovery Process Chemoinformatics Disease 17 Drug target CBS, Department of Systems Biology Drug candidate Animal studies Clinical studies Marketed drug Wet-lab + Dry-lab drug discovery in vitro in silico + in vitro Diverse set of molecules tested Computational methods to select in the lab subsets (to be tested in the lab) based on prediction of drug-likeness, solubility, binding, pharmacokinetics, toxicity, side effects, ... 18 CBS, Department of Systems Biology The Lipinski ‘rule of five’ for druglikeness prediction Octanol-water partition coefficient (logP) ≤ 5 Molecular weight ≤ 500 # hydrogen bond acceptors (HBA) ≤ 10 # hydrogen bond donors (HBD) ≤ 5 If two or more of these rules are violated, the compound might have problems with oral bioavailability. (Lipinski et al., Adv. Drug Delivery Rev., 23, 1997, 3.) 19 CBS, Department of Systems Biology Exercise : Drug-likeness by ’rule of five’ •Go to the following webpage www.molsoft.com/mprop •Draw proguanil, calculate properties and decide if this compound is a drug according to ’rule of five’ 20 CBS, Department of Systems Biology 21 CBS, Department of Systems Biology Proguanil antimalarian tablets 22 CBS, Department of Systems Biology Chemoinformatics Gathering and systematic use of chemical information, and application of this information to predict the behavior of unknown compounds in silico. data 23 CBS, Department of Systems Biology prediction Major Aspects of Chemoinformatics •Databases: Development of databases for storage and retrieval of small molecule structures and their properties. •Machine learning: Training of Decision Trees, Neural Networks, Self Organizing Maps, etc. on molecular data. •Predictions: Molecular properties relevant to drugs, virtual screening of chemical libraries, system chemical biology networks… 24 CBS, Department of Systems Biology Major Aspects of Chemoinformatics •Databases: Development of databases for storage and retrieval of small molecule structures and their properties. •Machine learning: Training of Decision Trees, Neural Networks, Self Organizing Maps, etc. on molecular data. •Predictions: Molecular properties relevant to drugs, virtual screening of chemical libraries, system chemical biology networks… 25 CBS, Department of Systems Biology 26 CBS, Department of Systems Biology Representing a chemical structure • How much information do you want to include? – atoms present – connections between atoms • bond types – stereochemical configuration – charges – isotopes – 3D-coordinates for atoms C8H9NO3 27 CBS, Department of Systems Biology Representing a chemical structure • How much information do you want to include? – atoms present – connections between atoms • bond types – stereochemical configuration – charges – isotopes – 3D-coordinates for atoms OH CH2 H2N O CH OH 28 CBS, Department of Systems Biology Representing a chemical structure • How much information do you want to include? – atoms present – connections between atoms • bond types (aromatic ring identification) – stereochemical configuration – charges – isotopes – 3D-coordinates for atoms OH CH2 H2N O CH OH 29 CBS, Department of Systems Biology Representing a chemical structure • How much information do you want to include? – atoms present – connections between atoms • bond types – stereochemical configuration – charges – isotopes – 3D-coordinates for atoms OH CH2 H2N O CH OH 30 CBS, Department of Systems Biology Representing a chemical structure • How much information do you want to include? – atoms present – connections between atoms • bond types – stereochemical configuration – charges – isotopes – 3D-coordinates for atoms OH CH2 + H3N O CH O 31 CBS, Department of Systems Biology Representing a chemical structure • How much information do you want to include? – atoms present – connections between atoms • bond types – stereochemical configuration – charges – isotopes – 3D-coordinates for atoms OH CH2 H2N O 14 CH OH 32 CBS, Department of Systems Biology Representing a chemical structure • How much information do you want to include? – atoms present – connections between atoms • bond types – stereochemical configuration – charges – isotopes – 3D-coordinates for atoms 33 CBS, Department of Systems Biology From chemists to representations 34 CBS, Department of Systems Biology Structural representation of molecules Line notations Structural representation of molecules Connection tables 35 CBS, Department of Systems Biology SMILES (Simplified Molecular Input Line Entry System) Canonical SMILES: unique for each structure Isomeric SMILES: describe isotopism, configuration around double bonds and tetrahedral centers, chirality 36 CBS, Department of Systems Biology InChI (IUPAC International Chemical Identifier) 37 CBS, Department of Systems Biology MOLfile format (.sdf) 38 CBS, Department of Systems Biology Small molecule databases 39 CBS, Department of Systems Biology Try it yourself! • Go to PubChem: pubchem.ncbi.nlm.nih.gov/ • Type proguanil and press Go • Click on the first result on the list 40 CBS, Department of Systems Biology Try it yourself! • Scroll down and find the SMILES and InChI 41 CBS, Department of Systems Biology Try it yourself! • Click on SDF (top right icon) • Select: 2D SDF: Display 42 CBS, Department of Systems Biology Try it yourself! • Go back and click again on SDF • Select: 3D SDF: Display 43 CBS, Department of Systems Biology Questions? 44 CBS, Department of Systems Biology 45 CBS, Department of Systems Biology