Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Protein Production for Structure-Based Drug Design Stephen Chambers ~ Head of Gene Expression Vertex Pharmaceuticals Incorporated NIGMS 2004 PSI Protein Production & Crystallization Workshop March 29-31, 2004 Vertex: Building a Major Drug Company Focus: Small molecule drugs for major diseases • Established: 1989; Public: 1991 • Common stock: NASDAQ: VRTX • >700 employees • 3 sites – Cambridge, US (headquarters) – Oxford, UK – San Diego, CA Traditional vs. Vertex Approach to Drug Discovery Therapeutic Area Perspective • Single target approach Target Family Perspective • Chemogenomics multi-target approach Expanding Parallel Drug Discovery into Gene Families Content • Structural Genomics vs Structure Based Drug Design – Much in common • Expression strategy for higher output – – – – Parallel expression of E.coli & insect cells Quantitative analysis Illustrated using examples from Vertex Kinase program Application to other protein families • Integration into a broader process with other disciplines Structure Based Drug Design • Drug Discovery (& Development) • Human proteins – Complex post-translationally modified proteins – Heterogeneous proteins – ‘Difficult’ proteins • Highest value structures contain inhibitor • Premium given to high-output (cf high-throughput) • Failure not an option Protein Expression Bottleneck DNA Cloning Expression Purification Enzymology Crystallography Consumers: • Protein Biochemistry – soluble, purifiable protein • Enzymology – soluble, active protein – 0.1-10 mg of protein • Crystallography – soluble, crystallizable protein – 5-100 mg of protein Expression Process Triage Expression Prior to Production: • • • • • Expression systems Growth conditions Cell lines Constructs Mutants pBEV: Dual-System Expression Vector (Chambers et al 2004) Expression Systems E.COLI YEAST INSECT CELLS MAMMALIAN CELLS Proteolytic Cleavage Glycosylation +/- +/- + + - + + + Secretion +/- + + + Folding +/- +/- + + Phosphorylation - + + + Yield (%) based on dry wt 1-5 1 30 <1 (Valk & Keus 1990) Automated baculo-viral production qPCR used to determine viral titer Highly Engineered Process Using Standard Equipment HiGro Shaker (Genomic Solutions Inc) Genesis (Tecan) Ni-NTA Magnetic Agarose Beads Wave Reactor (Wave Biotech) Cup-horn sonicator (Misonix Inc) HT-Expression & Automated Purification Full-Length Kinases in E.coli & Insect cells Increasing solubility in E.coli: TK>AGC>STE>>CMGC>CAMK Model Behavior in E.coli (http://www.hgmp.mrc.ac.uk/Software/EMBOSS/Apps/cai.html) (http://www.biotech.ou.edu/) Effect of Protein Size on Expression & Solubility in E.coli Quantitative Analysis of Expression Strategies: Decision Tree Analysis of Kinase Expression Actual Payoff in Kinase Structures 14% Structures 86% Structures Novel Kinase Structures in PDB (Yon & Jhoti 2003) HT-Expression Allows the Exploration of Diversity: Rapid Identification of Well Expressing Proteins GSK3 2.7 Å (ter Haar et al 2001) Pim-1 2.4 Å Crystal Structure of MAP Kappa 2 Kinase MAPK2K ∆46 2.8 Å MAPK2K MAPK2K∆46 (Meng et al 2002) Crystal Structure of Aurora-2 Kinase Limited Proteolysis Aurora2K ∆107 2.9 Å (Cheetham et al 2003) Crystal Structure of FLT-3 Kinase FLT-3(H564-S993) 2.1 Å DOMAIN H564-S993 H564-S993 H564-S993 H564-S993 H564-V958 H564-V958 H564-V958 H564-V958 H564-V958 H564-V958 H564-V958 H564-F936 H564-F936 H564-F936 N587-S993 N587-S993 N587-S993 N587-S993 N587-V958 N587-V958 N587-V958 N587-V958 N587-V958 N587-V958 N587-F936 N587-F936 N587-F936 N587-F936 W603- S9 93 W603- S9 93 W603- S9 93 W603- V958 W603- V958 W603- V958 W603- F936 DELETION DUPLICATION MUTANT ITD H711-V782 F723-H761 ITD D835Y H711-V782 H711-H761 F723-V782 F723-H761 H711-V782 F723-H761 ITD H711-V782 F723-H761 ITD H711-V782 H711-H761 F723-V782 F723-H761 ITD H711-V782 F723-H761 H711-V782 F723-H761 H711-V782 F723-H761 (Griffith et al 2004) HT-Expression Allows the Exploration of Diversity: Mutations HT-Expression Allows the Exploration of Diversity: Utilizing Different Cell Lines Insect Cell Expressed Structures in PDB E.coli Expressed Structures in PDB From the Vertex Portfolio Source of Proteins Source of Structures Expressing Difficult Proteins in Insect Cells: Proteases Cathepsins Serine proteases Metallo proteases HT-Expression Allows the Exploration of Diversity: Protein Families (Phosphatases) HT-Cloning & Expression Process HT-Purification & Crystallography Process No Protein Left Behind: Rescuing Insoluble Proteins CDC25a 1.7 Å ICE 2.6 Å No Protein Left Behind: Rescuing Soluble Proteins Integrated Platform Serving Structural Biology Miniaturized Automated Crystallization Multiple Inhibitor Structures for Drug Design Novel Kinase Structures: Not as Interesting as the Active Site An Active Site with Various Inhibitors Conclusions • Demonstrated efficient protein production integrated into a platform for structure-based drug-design • Insect cells expression negates many of the deficiencies observed in E.coli expression • High-throughput expression used to identify soluble expressed protein – proteins that are difficult to express and insoluble are usually difficult to purify and crystallize • Parallel expression in E.coli and insect cells, providing greater number of soluble expressed protein, increases operational efficiency • Process applicable to a range of gene families Acknowledgements Protein Expression Protein Biochemistry Molecular Cloning Enzymology X-Ray Crystallography