Download On the deduction of chemical reaction rate constants from

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
On the deduction of chemical reaction rate constants
from measurements of time series of concentration
Paola Lecca
The Microsoft Research – University of Trento
Centre for Computational and Systems Biology
http://www.cosbi.eu
E-mail: [email protected]
Modelling biochemical networks
The reaction system. Components (atoms, ions, molecules,
proteins, etc): A
,B ,C
,D
,E
, and F
.
A
B
D
C
E
F
To build a mathematical model of a biochemical systems we have
1. to decide on the model structure (connectivity)
2. to estimate the involved parameters values : rate constants
Knowledge inference
 Novel high-throughput techniques of molecular biology are
capable of producing in vivo and in vitro time series data of
variables that can be used to deduce the kinetics governing
the time evolution of the system.
 These data implicitly contain information about the
biological systems, such its functional connectivity and
regulation.
 The hidden information is to be extracted with methods of
network inference and parameter estimation.
Parameter estimation
 Parameter estimation (also known as model
calibration) aims to find the parameters of the model
which give the best fit to experimental data.
 Parameter estimation continues to be the bottleneck
of the computational analysis of biological systems.
Parameters are needed to run quantitative
simulations of the models.
Types of experimental data
We need time-series of concentrations.
Different techniques allow to collect such
types of data.
GC-MS: Gas Chromatography-Mass Spectrometry
LC-MS: Liquid Chromatography-Mass Spectrometry
CE-LIF: Capillary Electrophoresis-Laser Induced Fluorescence
NMR: Nuclear Magnetic Resonance Spectroscopy
NMR has rapid analysis times but has relatively
low sensitivity- the tip of the iceberg.
GC-MS and LC-MS provide good selectivity and
sensitivity.
CE-LIF provides very high sensitivity but lower
selectivity.
Adapted from: Sumner LW et al. Phytochemistry 2003, 62: 817-836
Sources of data
 Literature (papers, on-line databases): NF-kB (radiolabbeling), cell cycle (micro-arrays).
 Experiments at Department of Bio-organic Chemistry of
University of Trento, Italy: enzymatic kinetics of
Glutathione S-transferase (Mass Spectrometrym NMR).
 Dipartimento di Biotecnologie, Fondazione San Raffaele,
Milano, Italy: metabolic pathways (NMR).
ML approach
The values of the model’s parameters have to be the most likely values
giving the observed time course of the concentration.
Maximum likelihood (ML) approach can be used to achieve this
goal. The main steps are:
 to build a suitable expression of the joint transitional density for
expressing the probability density function of the observed
outcomes in terms of measured system variables and
parameters;
 and to optimize this function to determine unknown parameters.
The model of kinetics
The kinetics is described by a set of rate equations. If there are
and
species
is the concentration of the i-th species, the i-th rate equation is
Is the vector of concentrations
of chemicals that are present
in the expression of the
function for the i-th species .
is the entire parameter vector
of the system.
For each equation there is a
parameter vector as follows:
The model of kinetics
The general form of the i-th rate equation is as follows
where
.
The rate equation can be discretized in the following way:
where k = 1, 2, …, M. The discretized rate equation is viewed as a model
of increments/decrements of reactant concentrations.
The noise
We assume we have noisy observations
at times
, where
with mean zero and variance
is a Gaussian noise term
.
We track the noise in the measurements explicitly to form a
likelihood function in the optimization, i. e. s is a parameter of the
model and undergoes the same procedure of estimation as the rate
constants.
The likelihood
The Markov nature of the discretized
model of rate equation implies the
conditional independence of the
AND
Gaussian model for the noise
variations of the concentration between
different time points.
The probability to observe the value
given the
model at time
, and the set of parameter
vector
(and its symmetric expression), as
The likelihood
The likelihood for the observed increments/decrements
therefore will be
where
,
is the covariance matrix of the increment/decrement vector and
Non integer values of the coefficients a make estimating
the expectation E analytically difficult.
Approximation
At the first order, for small s, we can approximate the Gaussian noise with
zero mean and variance s with an uniform distribution defined on the interval
so that
Where
and thence
Variance of parameters
We introduce the sensitivity matrix S(t) evaluated at the sampling instants t1, …,
tM, and the matrix G defined as follows.
Variance of parameters
The square root of the p-th diagonal element of the
inverse of the Fisher information matrix
gives an estimate of the standard deviation of the p-th
component of parameter vector.
KInfer (Knowledge Inference)
KInfer is a free software downloadable
from our web page
http://www.cosbi.eu
KInfer
KInfer
Automatic conversion into O.D.Es system.
Uploading or entering
the set of reactions.
Possibility of specifying the system
directly through O.D.E., if desired.
KInfer
To upload the set of reactions.
To upload the experimental time courses.
To set the optimization algorithm parameters.
To estimate the initial guesses
for the parameters.
KInfer
Infer the parameters!
Show the results!
KInfer
KInfer
Automatic calculation of initial
guesses on the parameters.
KInfer
A didactical example of biochemical network
Golding’s model of gene transcription
Goutsias model of gene transcription regulation
Goutsias model of gene transcription regulation
Simulation of behavior of monomer M
Goutsias model of gene transcription regulation
Simulation of behavior of dimer D
Goutsias model of gene transcription regulation
Simulation of behavior of mRNA
Electrical properties of neuron membrane
(Hodgkin – Huxley circuit representation)
Electrical properties of neuron membrane
(Hodgkin – Huxley circuit representation)
Simulations
NF-kB
 Nuclear transcription factor found in most animal cell types.
 Involved in cellular response to stress, cytokines, free radicals,
ultraviolet irradiation, bacterial/viral antigens.
 Plays a key role in regulating immune response to infection.
 Activation of the NF-kB transcription factor can be triggered by
exposing cells to a multitude of external stimuli such as tumour
necrosis factor (TNF-a and interleukin 1 (IL-1a). These cytokines
initiate numerous and diverse intracellular signalling cascades, most
of which activate the IKK complex.
IKKγ
IKKα
IKKβ
The three catalytic
subunits of IKK complex.
The cytoplasmic inhibitors of NF-kB (the IkBs) are
phosphorylated by activated IKK at specific Nterminal residues, tagging them for polyubiquitination and rapid proteasomal degradation.
NF-kB is released upon activation where it then
translocates to the nucleus to induce the transcription
of genes encoding regulators of immune and
inflammatory responses and also genes involved in
apoptosis and cell proliferation.
An NF-kB subnetwork
Since recombinant human IKK2 (rhIKK2) phosphorylates GSTIkB in vitro, we examined the activity of the synthesised GST-IkB
followed by the association reaction and dissociation, as in the
following chemical reaction system
NF-kB subnetwork parameters
Conclusions
Some strength points of our approach.
 Its probabilistic formulation is key to a principled handling of
the noise inherent in biological data.
 It allows a number of further extensions, such as a fully
Bayesian treatment of the parameter inference and automated
model selection strategies based on the comparison between
marginal likelihoods of different models.
 Automatic estimation of the initial guesses of the parameters.
 Estimation of the experimental errors on the parameters.
Acknowledgements
 Alida Palmisano, CoSBi, Italy
 Corrado Priami, CoSBi and University of Trento, Italy
 Guido Sanguinetti, University of Sheffield, UK
 Adaoha Ihekwaba, CoSBi, Italy