Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
SYNSPROUT A tool for generation synthetically accessible ligands by De Novo design I CA MS Krisztina Boda ([email protected]) Supervisor: Prof. Peter Johnson Topics of Discussion Sprout Components De Novo structure generation in Classic SPROUT SynSPROUT approach Automatic Fragment Library Generation Retro-synthetic knowledge base Fragmenting MDDR Synthetic knowledge base Structure generation in SynSPROUT Example Future work SPROUT Components Detecting potential binding pockets of the protein structures Identification of interaction sites favourable hydrogen bonding Docking structures at the target interaction sites Generating 3D molecular structures of novel ligands by linking the docked starting fragments together Tools for scoring, sorting and clustering the answer set Structure Generation in Classic Sprout 1st Phase Primary molecular structure generation 2nd Phase Conversion of structure graph into molecule CAESA Generalised Fuse fragments Sequential method to build structure graph Heuristics to avoid combinatorial explosion New bond Atom substitution N N O O N Estimating of synthetic accessibility Spiro O O Synthetic Sprout Approach Build synthetic constraints into structure generation Retro Synthetic Knowledge Base Synthetic Knowledge Base Reliable high yielding reactions Ease of synthesis is a key factor in drug development Generating structures and postpruning them is inefficient BREAKING STRUCTURES INTO SMALLER COMPONENTS VIRTUAL SYNTHESIS IN RECEPTOR CAVITY Readily synthetisable putative ligand structures Drug-like structures Fragment Library Pool of readily available starting materials Automatic Fragment Library Generation 2D Drug-like Structures Atom & Ring Perception Retro-Synthetic Knowledge Base Detecting RetroSynthetic Patterns Retro-synthetic patterns Fragmentation Retro-synthetic rules Clustering Identical Fragments Synthetic Knowledge Base Functional Groups Synthetic rules Synthetic Fragment Library Perception Knowledge Base Aromatic Normalisation Hybridisation H-bonding properties Detecting Functional Groups Filter 3D Fragment Generation Corina Conformation Generation Omega Retro-Synthetic Knowledge Base Retro-Synthetic Patterns Describing atom SPCENTER HETS HS EPS ARYL CONNECTS RING CHEMICAL-LABEL <Amide> C[HETS=2](=O)–N–C CHEMICAL-LABEL <Ether> C[HETS=1]–O-C[SPCENTER=3][HETS=1] H N O S O Describing bond –, =, #, % CHAIN RING Describing ring Branching atoms N O O OH CHEMICAL-LABEL <Sulphonamide> O=S(=O)-N O N H O S Retro-Synthetic Knowledge Base Retro-Synthetic Rules EXPLANATION Amide Formation IF Amide THEN delete-bond between 1 and 3 add-atom – O to 1 END-THEN H2N S N O O 1 2 O 3 1 3 H N O2 4 S OH OH HO O N O O OH OH Cl EXPLANATION Ether Formation IF Ether THEN delete-bond between 2 and 3 add-atom – O, Cl, Br to 3 END-THEN OH O OH Br O Fragmenting MDDR Enamine Formation Michael Addition Disulphide Thioester Acetal fom Carbonyl Seconadry Enamine Acetal from Hemiacetal Urethane Aminal from Hemiaminal Aldol Formation Urea Conjugated Enone Isocyanate from amine/acid Imine Formation Friedel Crafts Acylation Sulphide Sulphonamide Ester Formation Wittig Reaction Ullman reaction Alkylation Alpha to Carbonyl Ether Formation Amide Formation Grignard Reaction Reductive Amination 78515 Drug-like structures 403780 Detected retro-synthetic patterns 404187 Generated 2D fragments 73141 Unique fragments Ranking by frequency of occurrences 1 + 1 0 10000 20000 Filtering out undesirable fragments 30000 40000 50000 60000 Distribution of applied retro-synthetic rules Fragment Library 70000 80000 Synthetic Fragment Number of occurrences 407 H H A AD A H Carboxylic Acid (Acceptor) H Phenol (Acceptor-Donor) Synthetic Knowledge Base CHEMICAL-LABEL <Carboxylic Acid> C[SPCENTRE=2];[ARYL=NO](=O)-O[HS=1] CHEMICAL-LABEL <Primary Amine> C-N[HS=2];[CONNECTION=1] Available Func.Groups Carboxylic Acid Primary / Secondary Amine Phenol Aryl Halide Carbonyl Alcohol Joining Rules EXPLANATION Amide Formation IF Carboxylic Acid INTER Primary Amine THEN destroy-atom 3 form-bond - between 1 and 5 change-hybridization 5 to SP2 DIHEDRAL 0 0 DIHEDRAL 0 180 BOND-LENGTH 1.35 END-THEN Steps of formation Hybridization change Bond type Bond length Dihedral penalty/angle Available Synthetic Rules 2 O 1 OH 3 H + H O N5 4 H N Amide Formation Ether Formation Ester Formations Ullman Reaction Aldol Wittig Reductive Amination Docking starting fragment to sites De Novo Drug Design in SynSPROUT Selecting two sites to be connected Donor site H2N Bi-directional consecutive search Overlapping common fragments NH2 O OH Acceptor Site Combined structures are re-docked OH Docking starting fragment to sites De Novo Drug Design in SynSPROUT 1.Amide Formation ( Carboxylic Acid –Primary Amine ) Selecting two sites to be connected Donor site O O 1 Bi-directional consecutive search Overlapping common fragments O NH O OH Carboxylic Acid Acceptor Site Combined structures are re-docked BF search towards acceptor site O OH NH2 Primary Amine O OH Docking starting fragment to sites De Novo Drug Design in SynSPROUT 1.Amide Formation ( Carboxylic Acid –Primary Amine ) 2.Reductive Amination ( Carbonyl – Primary Amine ) 2 Selecting two sites to be connected Bi-directional consecutive search Donor site N H O OH OH Primary Amine Overlapping common fragments H2N O Acceptor Site Combined structures are re-docked DF search towards acceptor site O Carbonyl OH OH Docking starting fragment to sites De Novo Drug Design in SynSPROUT 1.Amide Formation ( Carboxylic Acid –Primary Amine ) 2.Reductive Amination ( Carbonyl – Primary Amine ) Selecting two sites to be connected 2 N H O O Overlap O OH O O NH O Acceptor Site O Combined structures are re-docked OH 1 NH Bi-directional consecutive search Overlapping common fragments Donor site N H OH OH O OH OH CDK2 Library : 300 fragments/1055 conformations Docked:935 971 57 6800 Docked:359 Docked:1791 Docked:898 CDK2 Act Score : -7.91 2 3 1 1 Ether formation ( Primary Alkyl Halide – Alcohol ) 2 C–S–C ( SH – Primary Alkyl Halide ) 3 Amine alkylation ( Primary Amine – Primary Alkyl Halide ) Conclusion Fragment Library Generation De Novo Structure Generation Diverse libraries with fragments rich in biological motifs Promising structures with estimated high binding affinity Future Work Enhance the program to perform more complex synthetic reactions Acknowledgements Peter Johnson Attilla Ting for atom perception Colin Martin for building retro-synthetic knowledge base all members of ICAMS