Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Virtual Screening C371 Fall 2004 INTRODUCTION • Virtual screening – Computational or in silico analog of biological screening – Score, rank, and/or filter a set of structures using one or more computational procedures – Helps decide: • Which compounds to screen • Which libraries to synthesize • Which compounds to purchase from an external source – Also used to analyze the results of HTS screening runs Ways to Assess Structures from a Virtual Screening Experiment • Use a previously derived mathematical model that predicts the biological activity of each structure • Run substructure queries to eliminate molecules with undesirable functionality • Use a docking program to ID structures predicted to bind strongly to the active site of a protein (if target structure is known) • Filters remove structures not wanted in a succession of screening methods Main Classes of Virtual Screening Methods • Depend on the amount of structural and bioactivity data available – One active molecule known: perform similarity search (ligand-based virtual screening) – Several active molecules known: try to ID a common 3D pharmacophore, then do a 3D database search – Reasonable number of active and inactive structures known: train a machine learning technique – 3D structure of the protein known: use protein-ligand docking Virtual Screening Methods for NonSpecific Targets • Prediction of the likelihood that a molecule has “drug-like” characteristics and possesses desired physicochemical properties “DRUG-LIKENESS” AND COMPOUND FILTERS • Which features of drug molecules confer biological activity? • Substructure filters to eliminate molecules known to have problems – For a specific target, may have to modify or extend the filters • Analyze the values of simple properties (MW, logP, No. of rotatable bonds) Lipinski Rule of Five • Poor absorption or permeation is more likely when: – MW > 500 – LogP >5 – More than 5 H-bond donors (sum of OH and NH groups) – More than 10 H-bond acceptors (sum of N and O atoms) Other Findings • 70% of drug-like molecules have: – Between 0 and 2 H-bond donors – Between 2 and 9 H-bond acceptors – Between 2 and 8 rotatable bonds – Between 1 and 4 rings • Other techniques (neural networks, genetic algorithms, decision trees) consider more complex possibilities “Lead-Likeness” • Increase in molecular complexity occurs during optimization phase of a lead molecule STRUCTURE-BASED VIRTUAL SCREENING • Protein-Ligand Docking – Aims to predict 3D structures when a molecule “docks” to a protein • Need a way to explore the space of possible protein-ligand geometries (poses) • Need to score or rank the poses to ID most likely binding mode and assign a priority to the molecules – Problem: involves many degrees of freedom (rotation, conformation) and solvent effects • Conformations of ligands in complexes often have very similar geometries to minimum-energy conformations of the isolated ligand Protein-Ligand Docking Methods • Modern methods explore orientational and conformational degrees of freedom at the same time – Monte Carlo algorithms (change conformation of the ligand or subject the molecule to a translation or rotation within the binding site – Genetic algorithms – Incremental construction approaches • Problem: Lack of a comprehensive knowledge base Distinguish “Docking” and “Scoring” • Docking involves the prediction of the binding mode of individual molecules – Goal: ID orientation closest in geometry to the observed X-ray structure • Scoring ranks the ligands using some function related to the free energy of association of the two units – DOCK function looks at atom pairs of between 2.3-3.5 Angstroms – Pair-wise linear potential looks at attractive and repulsive regions, taking into account steric and hydrogen bonding interactions Structure-Based Virtual Screening: Other Aspects • Computationally intensive and complex • Multitude of possible parameters figure into docking programs • Docking programs require 3D conformation as the starting point or require partial atomic charges for protein and ligand • X-Ray Crystallographic studies don’t include hydrogens, but most docking programs require them PREDICTION OF ADMET PROPERTIES • Requirements for a drug: – Must bind tightly to the biological target in vivo – Must pass through one or more physiological bariers (cell membrane or blood-brain barrier) – Must remain long enough to take effect – Must be removed from the body by metabolism, excretion, or other means • ADMET: Absorption, Distribution, metabolism, Excretion (Elimination), Toxicity ADMET (cont’d) • Permeability through the intestinal cell membrane or blood-brain barrier – Paucity of experimental data in vivo studies, especially for humans Hydrogen Bonding Descriptors • Count of the numbers of donors and acceptors • Calculation of the overall propensity to be an acceptor or donor • Modeling solubility, octanol/water partition coefficient, and blood-brain barrier permeability Polar Surface Area • Amount of molecular surface due to polar atoms (N and O plus attached hydrogens) • Especially good for prediction of oral absorption and brain penetration • Polar surface are greater than 140 square Angstroms has been associated with poor absorption Descriptors Based on 3D Fields • Molecular descriptors quantify the molecule’s overall size and shape and the balance between hydrophilicity, hydrophobicity, and hydrogen bonding Toxicity Prediction • Very difficult problem • Most limit studies to single toxicological phenomenon or a single class of compounds (e.g., Polycyclic aromatic hydrocarbons) • Some based on known toxic effects SUMMARY • Virtual screening methods are central to many cheminformatics problems in: – Design – Selection – Analysis • Increasing numbers of molecules can be evaluated using these techniques • Reliability and accuracy remain as problems in docking and predicting ADMET properties • Need much more reliable and consistent experimental data