* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Verification of Real Time Systems
Survey
Document related concepts
Transcript
Modeling and Analysis Techniques in Systems Biology. CS 6221 Lecture 1 P.S. Thiagarajan Basic Info • • • • • P.S. Thiagarajan COM2 #03 – 55 ; Tel Ext. 67998 [email protected] www.comp.nus.edu.sg/~thiagu Course web page: – www.comp.nus.edu.sg/~cs6221 – We will be using the IVLE system extensively. 2 Office Hours • Send mail first and fix an appointment. 3 Course Material • Selected Parts of the text book : – Systems Biology in Practice: E. Klipp, R. Herwig, A. Kowald, C. Wierling, H. Lehrach (Wiley) • Selected Survey papers, book chapters. • Lecture slides. • Research Articles. 4 Assignments • Lab Assignments –3 – tool based (Cell Illustrator, COPASI, SimBio) – Individual 5 Term Papers • Read a paper or –more likely- a bunch of papers on a topic. • Summarize in the form of a term paper. • First assignment: Common • Second assignment: – More substantial – Can be aligned to your interests 6 Talk • Give talk based on the second term paper. – 25 + 5 minutes. 7 Grading (Tentative) • Lab assignments 45% (15 + 15 + 15) • Term papers 40% (15 + 25) • Talk: 15% 8 What is the Course About? • Computational systems biology – Computational aspects of systems biology. • Systems biology: – Not just focus on individual components. • genes, mRNAs, proteins, membranes, ligands …. – But study a system of such components and their interactions. • Many different views of systems biology. 9 Why Systems Biology? • Biology has traditionally –and extremely successfully!- focused on what individual parts of a cell do . • Bio-chemistry of large and small molecules – The structure of DNA and RNA – Proteins, ligands,… 10 Why Systems Biology? • But functionality of a system is determined crucially by the interactions of the parts. • Many fundamental biological processes are dynamic. – cell growith/division/differentiation – Metabolism,…. • Many diseases are marked by malfunctioning of these processes. 11 Why Systems Biology? • Advances in experimental technology are producing vast amounts of data concerning biological processes. – Which genes get expressed “when” in controlled conditions. • One would like to understand this data in a systemic way. • Enter: computational systems biology! 12 The CSB Approach • View selected biological processes as dynamical systems. – – – – Model Simulate Analyze Predict • Many research communities study dynamical systems … 13 What do we need ? • Biology for computer scientists. – basic biological sub-systems/processes – experimental techniques. • Modeling, analysis and simulation techniques. • Biologists as collaborators! 14 Current Status • Modeling techniques. – Mathematical • differential equations, Linear algebra, probability theory, statistics, Boolean networks, Markov chains, Bayesian networks,…. – CS-specific: • Automata, Petri nets, Hybrid functional Petri nets, hybrid automata, Bayesian networks/inferencing/learning, Markov chains, Model checking…. 15 Current Status • Metabolism – Kinetics “laws” (models). – Enzyme kinetics, law of mass action, Michelis-Menten kinetics – Metabolic network models and flux analysis. 16 Current Status • Signal Transduction • Receptor-ligand interactions • Protein actors • signaling dynamics 17 Current Status • Other biological processes • • • • biological oscillations protein folding kinetics cell cycle Gene expression, regulation 18 Current Status • Modeling tools • Cell Illustrator, COPASI, SimBio, ….. 19 What shall we do? • Selected basic topics. – To illustrate the current state of the field. – To critically examine what is missing. – To discuss promising lines of research. 20 What can CS offer? • We “know” how to deal with complex systems. – Hierarchy • silicon realization of circuits, digital design, microarchitectures, assemble language, programming languages, GUIs, … – separation of concerns. – concepts (models), techniques, tools at each layer and for connecting the layers. 21 What can CS offer? • Deal with other disciplines. – Multi-media – Control – Manufacturing – Communications – Business! • Using computing power via algorithms and data structures! • Computational thinking?! 22 What can CS offer? • Find the right level abstractions. – approximations • Handle distributed dynamics • Deal with hybrid behaviors • Build tools. 23 What the Course is NOT about. • We will not deal with: – Traditional “Bio-Informatics” topics • data mining, sequence analysis, … – Computational aspects of structural biology • Proteins structure, folding… 24 Contents • Bio-chemical networks – The basics of chemical kinetics • Three types of bio-chemical networks – Gene networks – Metabolic networks – Signaling pathways 25 Bio-pathways • Many studies of biological sub-systems boil down to studying: – bio-pathways • A network of bio-chemical reactions. 26 The hierarchy of bio-chemical networks Bio-Chemical reactions Metabolic pathways A network of Bio-Chemical reactions Signaling pathways Gene regulatory networks Interacting networks of Bio-Chemical reactions Cell functions 27 Biopathways 28 Gene Regulatory networks • Boolean models • Differential equations • Bayesian networks. 29 Metabolic pathways • Petri nets • Linear algebra • Flux analysis 30 Signaling Pathways • • • • Differential equations. Hybrid functional Petri nets Hybrid automata Stochastic models. 31 Our Research • ODEs based modeling. – Parameter estimation techniques • Stochastic approximations of ODEs dynamics. – Parameter estimation, sensitivity analysis • GPU implementations • Probabilistic (statistical ) model checking 32 Our Research • Collaboration with biologists: – Signaling pathways: • • • • AKT/MAPK pathway Complement pathway TLR3-TLR7 signaling pathways DNA damage/repair pathways • www.comp.nus.edu.sg/~rpsysbio 33 Expected Outcomes • Have a sound grasp of: – current modeling and simulation techniques (Signaling pathways) – Reaction kinetics – stochastic models and simulations – Analysis techniques: • Parameter estimation, sensitivity analysis 34 Expected Outcomes • Be aware of the limitations of current techniques and state of knowledge • Be ready to undertake modeling and simulation work. 35 Let us get started. 36 Basic Biology: Sources • Chapter 2 (Biology in a Nutshell) of the book “Systems Biology in Practice” by E. Klipp et.al. • Chapter 1 (Molecular Biology for Computer Scientists) of the book “Artificial Intelligence and Molecular Biology” by Lawrence Hunter. • The internet! 37 A major goal of biology • Understand the molecular biology of eukaryotic cells. • Cell: the basic building block. – Two major families: Prokaryotes and Eukaryotes. – Eukaryotes • More complex; genetic material is contained in the nucleus; • Most multi-cellular organisms are made up of eukroyotes.; WE are made up of these types of cells. 38 Cells • In multi-cellular organisms; – Cells are differentiated. – Different types of cells have different functions (and composition). – Groups of cells for specific functionalities • tissues. • we have 14 different types of tissues. 39 Source ? 40 Major Classes of Bio-Molecules • • • • Carbohydrates Lipids Proteins Nucleic acids 41 Proteins • Many functions! • Build up the cytoskeletal structure of the cell (the scaffolding) • Responsible for cell movements (motility) • Serve as catalytic enzymes for bio-chemical reactions. • Induce signal transductions. • Control transcriptions and translation of genes • Control degradation of proteins. 42 Proteins • Proteins consist of polypeptides. – Polypeptide - a LONG chain of amino acids bonded together by peptide bonds between adjacent amino acid residues. • The order of amino acids constituting a peptide is fundamental. – Primary structure – coded by genetic information 43 Proteins • 20 (23?) different amino acids • A protein can have 50 – 4000 amino acids sequence. (50 – 1000 is the typical range) • 201000 possible proteins! • Actually, only a tiny fraction is found in nature. 44 Nucleic Acids • DNA (Deoxyribonucleic acid) molecules store genetic information. – Present in all living organisms • RNA (Ribonucleic acid) takes part in a large number of processes. – Transferring hereditary information in the DNA to synthesize proteins. 45 The Central Dogma • First enunciated by Francis Crick in 1958[1] – re-stated in a Nature paper published in 1970:[2] • Three major classes of information-carrying biopolymers: – DNA, RNA, proteins – Information encoded as sequences of molecules. 46 The Central Dogma • In principle there can be 9 types of transfers: DNA RNA Proteins DNA RNA Proteins 47 The Central Dogma • The “simple” form of central dogma states: DNA RNA Proteins 48 The Central Dogma • Information cannot be transferred back from protein to either protein or nucleic acid. • 'once information gets into protein, it can't flow back to nucleic acid.' 49 Current Known Information Flows Special flows occur in retro viruses ! 50 Information Flows (Replication) DNA DNA DNA mRNA (Transcription) Proteins (Translation) mRNA 51 Mechanism of Cellular Functions – Replication (of DNA) – Transcription of RNA and Processing –by splicingto yield mRNA which migrates to the cytoplasm. – Translation (by ribosomes) of the code carried by mRNA into proteins. 52 Legend: 53 Post-translational Modifications • Proteins undergo many modifications to implement cellular functions – Phosphorylation (Activation of proteins) – Dephosphorylation (Deactivation of proteins) – Methylation and acetylation (Gene silencing. Plays a role in cell differentiation) – Cleavage (Cutting of genes and proteins. For degradation and apoptosis) – Ubiquitination (Marking of proteins for further degradation) 54 Interaction roles of proteins • Proteins have specific roles in the form of chemical interactions. – Kinase (Catalyzes phosphorylation, thereby activating other proteins) – Phosphatase (Catalyzes dephosphorylation) – Transcriptional Co-factors 55 Role of Bio-pathways • Apoptosis – programmed cell death • Differentiation – Cells getting specialized for specific functions • Cell-cycle – Growth and replication of cells • Many others! 56 Example: Wnt Signaling Pathway • Most studies on each of the two types of pathways (Signaling and GRN) done in isolation • Wnt canonical pathway, starts with the binding of the Wnt ligand to Frz receptor • Chain of chemical reactions occur, which results in the transcription factor β-Catenin being translocated to the nucleus • Cofactor with TCF/LEF to up-regulate the transcription of several genes 57 Wnt Signaling Pathway (Canonical) Wnt Dsh Degradation Cytoplasmic When It will inhibit Wnt ligand the Complex B-catenin formation bindscan will to form be Frz, of then the phosphorylated Dsh translocate when degradation is GSK-3B recruited to by binds tothe the and phosphorylates plasma complex nucleus membrane where and gets it binds ‘marked’ and APC to anddegradation for gets co-factors Axin activated Tcf and Lef p Dsh GSK-3B p B-Catenin Lef B-Catenin p APC p Axin Tcf 58 Reaction Kinetics Sources • Chapter 5 (Metabolism) of the book “Systems Biology in Practice” by E. Klipp et.al. • Other related material to be uploaded. 60 Bio-Chemical Reactions • Bio-Chemical reaction: – A basic unit of biological processes. – Convert molecules of one type into another • Can be modeled at different levels of abstraction (time scales). – Microscopic: single molecules and their interactions – Macroscopic: Concentrations and rates (changes of concentration per time unit). 61 62 Reactions • Bio-chemical reaction: – Involves bio-molecules. • Proteins, carbohydrates, lipids,… – Creation and transformation of bio-molecules. – Control the flow of energy , materials and information through the cell. 63 Kinetic Models of Reactions • Reaction: – A chemical process resulting in inter-conversion of the reactants. • motion of electrons cause chemical bonds to break and form. • Reaction types – Isomerization • structural rearrangement (transform one isomer to another) • no change in net atomic composition 64 Reaction Types • Direct combination or synthesis: – two or more chemical elements or compounds unite to form a more complex product. • 2H2 + O2 → 2H2O • Chemical decomposition – a compound is decomposed into smaller compounds: • 2H2O → 2H2 + O2 65 Reaction Types • Single displacement or substitution – an element being displaced out of a compound by a more reactive element: • 2Na + 2HCl → 2NaCl + H2 • Double displacement – two compounds in aqueous solution exchange elements or ions to form different compounds. • NaCl + AgNO3 → NaNO3 + AgCl 66 Reaction Kinetics • Kinetics: – Determine reaction rates • Fix reaction law and • determine reaction rate constant • Solve the equation capturing the dynamics. • The reaction rate for a product or reactant in a particular reaction: – the amount (in moles or mass units) per unit time per unit volume that is formed or removed. 67 Reaction Rates • Influenced by: – Temperature – Concentration – Pressure – Light – Order (zero, first, second) – catalyst 68 Rate Laws • Rate law: – An equation that relates the concentrations of the reactants to the rate. • Differential equations are often used to describe these laws. • Assumption: The reactants participating in the reactions are abundant. 69 Rate Laws • Mass action law: – The reaction rate is proportional to the probability of collision of the reactants – Proportional to the concentration of the reactants to the power of their molecularities. 70 Mass action law S1 + S2 V P V = k. [S1] [S2] [S1] is the concentration (Moles/ litre) of S1 [S2] is the concentration (Moles/ litre) of S k is the rate constant V, the rate of the reaction 71 Mass action law S1 + S2 V1 V2 2P V = (V1) - (V2) = k1. [S1] [S2] – k2 [P]2 [S1] ([S2]) is the concentration (Moles/litre) of S1 (S2) k1 and k2 are the rate constants V1, the rate of the forward reaction V2, the rate of the backward reaction V, the net rate Molecularity is 1 for each substrate (reactant) of the forward reaction and 2 for the backward reaction 72 Mass-action Kinetics k1 E+S ES k3 E+P k2 73 To be continued…….. 74 75 • Assuming mass law kinetics we can write down a system of ordinary differential equations for the 6 species. • But we don’t know how to solve systems of ordinary (non-linear) differential equations even for dimension 4! • We must resort to numerical integration. 76 Given: 77 Initial values chosen “randomly” 78 Michaelis-Menton Kinetics • Describes the rate of enzyme-mediated reactions in an amalgamated fashion: – Based on mass action law. – Subject to some assumptions • Enzymes – Protein (bio-)catalysts • Catalyst: – A substance that accelerates the rate of a reaction without being used up. – The speed up can be enormous! 79 Enzymes • Substrate binds temporarily to the enzyme. – Lowers the activation energy needed for the reaction. • The rate at which an enzyme works is influenced by: – concentration of the substrate – Temperature • beyond a certain point, the protein can get denatured – Its 3 dimensional structure gets disrupted 80 Enzymes • The rate at which an enzyme works is influenced by: – The presence of inhibitors • molecules that bind to the same site as the substrate (competitive) – prevents the substrate from binding • molecules that bind to some other site of the enzyme but reduces its catalytic power (non-competitive) – pH (the concentration of hydrogen ions in a solution) • affects the 3 dimensional shape 81 Michaelis-Menton Kinetics k1 E+S ES k3 E+P k2 i) A reversible formation of the Enzyme-Substrate complex ES ii) Irreversible release of the product P from the enzyme. This is for a single substrate; no backward reaction; at least negligible if we focus on the initial phase of the reaction. 82 Michaelis-Menten Kinetics 83 Michaelis-Menton Kinetics k1 E+S ES k3 E+P k2 Use mass action law to model each reaction. 84 (1) This is the rate at which P is being produced. Assumption1: [ES] concentration changes much more slowly than those of [S] and [P] (quasisteady-state) We can then write: 85 This simplifies to: (2) 86 Michaelis-Menton Kinetics (1) (2) Define (Michaelis constant) (3) 87 Assumption1: [ES] concentration changes much more slowly than those of [S] and [P] (quasisteady-state) Assumption2: The total enzyme concentration does not change with time. [E0] = [E] + [ES] [E0] - initial concentration 88 Michaelis-Menton Kinetics [ E ] [ E0 ] [ ES ] [S ]([ E0 ] [ ES ]) K M [ ES ] [ S ][ E0 ] K M [ ES ] [ ES ][ S ] [ S ][ E0 ] [ ES ] [S ] K M 89 Michaelis-Menton Kinetics [ S ][ E0 ] [ ES ] [S ] K M v k3[ ES ] (1) [ S ][ E0 ] v k3 [S ] K M 90 Michaelis-Menton Kinetics Vmax is achieved when all of the enzyme (E0) is substrate-bound. v k3[ ES ] (assumption: [S] >> [E0]) at maximum rate, [ ES ] [ E0 ] Thus, vmax k3[ ES ] k3[ E0 ] 91 Michaelis-Menton Kinetics [ S ][ E0 ] v k3 [S ] K M vmax k3[ E0 ] vmax [ S ] v [S ] K M This is the Michaelis-Menten equation! 92 Michaelis-Menton Kinetics [ S ][ E0 ] v k3 [S ] K M vmax k3[ E0 ] vmax [ S ] v [S ] K M This is the Michaelis-Menten equation! So what? 93 Michaelis-Menton Kinetics vmax [ S ] v [S ] K M Consider the case: vmax v 2 vmax vmax [ S ] 2 [S ] K M [S ] KM 2[S ] K M [S ] The KM of an enzyme is therefore the substrate concentration at which the reaction occurs at half of the maximum rate. 94 Michaelis-Menton Kinetics 95 Michaelis-Menton Kinetics 96 Michaelis-Menton Kinetics • KM is an indicator of the affinity that an enzyme has for a given substrate, and hence the stability of the enzyme-substrate complex. • At low [S], it is the availability of substrate that is the limiting factor. • As more substrate is added there is a rapid increase in the initial rate of the reaction. 97 Modeling Bio-Chemical networks • Enzyme catalyzed reaction (a) E S S+E k1 k2 S.E k3 S.E t1 : k1[S][E] E+P P t3 : k3[S.E] t2 : k2[S.E] (b) E S P t4 : Vmax[S] / (KM + [S]) 98