Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Center for Comparative Genomics and Bioinformatics, PSU, UP, 2005 Computational Modeling of Genome-wide Transcriptional Regulation Frank Pugh Department of Biochemistry and Molecular Biology Yousry Azmy Department of Mechanical & Nuclear Engineering The Pennsylvania State University 1. Motivation Ultimate goal of systems biology: Virtual cell Model cell as series of coupled chemical reactions Computationally predict its behavior in response to environmental perturbations Enable in silico drug interaction testing Guide experimental inquiry This project is an early step to achieve this goal: Establish smaller definable systems Construct computational models for these systems Experimentally test & validate (hopefully!) the models CCBG Presentation PSU, University Park, July 13, 2005 2 of 12 Department of Mechanical and Nuclear Engineering 1. Model Foundation Define cell in terms of massive series of coupled reactions: Genetic networks: describe circuitry of how genes influence expression of other genes, … Protein networks: describe physical interactions among all proteins in a cell Transcriptional regulation: thousands of genes, each potentially regulated by the combinatorial actions of hundreds of transcription regulatory proteins Starting point for network model: View network as series of reversible events that dynamically move: Forward: transcription machinery assembly Backward: disassembly or inhibition Transcriptional output: net flux of these forward and reverse events CCBG Presentation PSU, University Park, July 13, 2005 3 of 12 Department of Mechanical and Nuclear Engineering 1. Project Objectives Phenomenological model of yeast biochemical processes: Construct model that replicates changes in gene expression in response to experimental perturbations of transcription machinery Implies strong coupling between construction (computation) & validation (experiment) Large number of potential experiments to fully test all possible response permutations precludes exhaustive investigation Simplifying compromise: Construction/validation mode: Employ existing experimental results of the data a construct model, i.e. computing its parameters Remaining data a validate and refine the constructed model Portion Predictive mode: Execute model for new experimental settings & verify measured values new cases break model a compute new model parameters If new set of parameters cannot be found a deficiency of model Seek & verify new connection scheme: Repeat validation sequence If Prospective mode: Guide future experiments Identify new experiments deemed interesting to biochemistry/biology CCBG Presentation PSU, University Park, July 13, 2005 4 of 12 Department of Mechanical and Nuclear Engineering 2. TBP Model Model TATA binding protein (TBP) regulatory interactions Crystallographic structures of TBP and its regulators arranged according to their expected assembly/disassembly pathway. TAF1 is not shown This is way more biochemistry than I know! CCBG Presentation PSU, University Park, July 13, 2005 5 of 12 Department of Mechanical and Nuclear Engineering 2. Model Assumptions Initial model is phenomenological not quantitative: Determine sense of change not magnitude Ignore indirect effects due to one output affecting another output: Supported by experimental observation Only two-states on/off mechanisms are included in initial model Model distinguishes between state of: Switches: Binary on/off experimental control Flow: Three state in/out/no-flow depending on potential drop CCBG Presentation PSU, University Park, July 13, 2005 6 of 12 Department of Mechanical and Nuclear Engineering 2. Analogy to Electric Circuit Computational model based on analogy to electric circuit v1 v6 s1 v2 s2 i r6 r1 1 r8 q10 q8 i6 k8 k10 r12 k12 q14 q12 r10 v3 s3 k16 r4 i r13 r3 3 r14 q15 k13 k14 k15 i4 k9 v4 q17 s4 q16 r18 k18 CCBG Presentation PSU, University Park, July 13, 2005 7 of 12 q9 r7 v7 i7 r19 k19 r20 k20 r17 k17 i2 r2 q11 r9 r11 r16 q13 r15 k11 v5 i5 r21 q18 k21 s5 r5 r 0 q19 v=0 i0 Department of Mechanical and Nuclear Engineering 2. Construction of Model An electric circuit is fully determined by: Connection scheme: Consequence of biochemistry Model parameters: Voltage at each external node: vn Resistors: rn Setting of switches: sn Applying Kirchoff’s laws to each switch setting combination a internal voltages qn & currents kn 5800 Replicas of electric circuit: Each represents one gene: Yields circuit output i0 All circuits in initial model possess the same ~10 switches Each circuit will possess a unique set of model parameters: vn & rn Voltage at output point arbitrarily set to zero (ground) Same switch setting for all circuits (genes) in given experiment CCBG Presentation PSU, University Park, July 13, 2005 8 of 12 Department of Mechanical and Nuclear Engineering 3. Illustration of Model Construction Given the 5-switch TBP circuit depicted on slide 7: (/gene) Total number of currents: 14 internal + 8 external = 22 Total number of internal node voltages: 12 Kirchoff’s laws a 34 linear equations in 34 unknowns For given switch setting s = {s1, s2, s3, s4, s5}, sn = 0,1 Solve for circuit output i0(s,v,r) in terms of 29 unknown model parameters: • • v = {vn, n=1,…,7} r = {rn, n=0,…,21} Total number of switch states (experimental i0) = 25 = 32 Overdetermined system of nonlinear relations in model parameters: Least-squares fit? Expect imbalance between number of relations & unknowns to grow with circuit complexity CCBG Presentation PSU, University Park, July 13, 2005 9 of 12 Department of Mechanical and Nuclear Engineering 3. Computational Challenges Yeast transcription machinery possesses: At least 100 switches that can be controlled one at a time About 5,800 circuits each with a single measurable output a 2100 possible experiments: combinations of on/off switch states This is 1030 possibilities, each producing ~ 5,800 measured values! Discount ~99% as biochemically irrelevant a 1028 experiments to fully validate or refine the model Computationally prohibitive proposition! Initial proposal: Examine ~ 10 interactions centered around TBP Large symbolic problem: Numerical solution algorithm? Inverse problem syndrome: Solution sensitivity Accounting for experimental errors in model parameters Anything else? CCBG Presentation PSU, University Park, July 13, 2005 10 of 12 Department of Mechanical and Nuclear Engineering 4. Current Status Unguided data acquisition in Pugh’s lab Proof of principle study of computational model: Employ 5-switch circuit model of TBP interactions Obtain symbolic expression for i0(s,v,r): Mathematica NoteBook composed Runs out of memory due to large expression! CCBG Presentation PSU, University Park, July 13, 2005 11 of 12 Department of Mechanical and Nuclear Engineering 4. Remaining Research Implement computational model in modular code: User access via GUI: Access & modify data, visualize circuit,… Parallelization via MPI Experiment with preliminary circuit in code Develop solution algorithm for given set of experimental data Develop algorithm to accommodate amended set of experimental data Code verification & model validation: Design & conduct new experiments likely to test validity of model Success: Sufficient number of experimental results not involved in computing model parameters are predicted by computer code Automate model refinement process to achieve validation: Develop algorithm to isolate pipe connections causing model failure Design interface to permit user to view possible modifications and select one or more for testing Design and conduct guided experiments CCBG Presentation PSU, University Park, July 13, 2005 12 of 12 Department of Mechanical and Nuclear Engineering Reduced Model i1 s1 i6 s2 k10 r8 q10 q8 r10 k8 r12 k12 q14 i3 s3 r13 q12 k16 r16 q13 q15 r15 k15 i7 q11 r9 r11 k9 q9 r19 k19 i4 r14 k13 k11 i2 s4 q16 r20 k20 s5 k14 r17 k17 q17 i5 r18 k18 CCBG Presentation PSU, University Park, July 13, 2005 13 of 12 r21 q18 k21 q19 i 0 Department of Mechanical and Nuclear Engineering