Download the proposal

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Cytokinesis wikipedia , lookup

Magnesium transporter wikipedia , lookup

Protein (nutrient) wikipedia , lookup

Endomembrane system wikipedia , lookup

Protein wikipedia , lookup

Protein moonlighting wikipedia , lookup

Protein phosphorylation wikipedia , lookup

Multi-state modeling of biomolecules wikipedia , lookup

Homology modeling wikipedia , lookup

Intrinsically disordered proteins wikipedia , lookup

Nuclear magnetic resonance spectroscopy of proteins wikipedia , lookup

JADE1 wikipedia , lookup

Western blot wikipedia , lookup

Proteolysis wikipedia , lookup

List of types of proteins wikipedia , lookup

Protein–protein interaction wikipedia , lookup

G protein–coupled receptor wikipedia , lookup

Signal transduction wikipedia , lookup

Transcript
COMPUTER SCIENCES MASTER PROGRAM
PROCESS COMPUTATIONAL MODELING COURSE
PROFESSOR: FRANK VALENCIA, PhD.
TITLE PROPOSAL:
MODELING BIOMOLECULAR INTERACTIONS WITH NTCC PROCESS CALCULI
Diana Hermith, BSc. Molecular Biology
Current Graduate Student in Computer Sciences
November 11th de 2008
______________________________________________________________________
Modeling Biomolecular Interactions with NTCC Process Calculi
(Draft Proposal)
Hermith, Diana
[email protected]
INTRODUCTION
physically and logically.
Most biological functions are mediated
which proteins interact, how they do so,
by
These
and what function is performed by their
interactions can be physical, such as
complex is at least as important as
when two proteins form a complex, or
predicting
“logical,” such as when one or more
structure
proteins control the behavior of one or
Historically, the problem of detecting
more other proteins without physical
the site where two proteins (or a protein
interaction.
Examples of physical
and a nucleic acid molecule) interact
interactions are stable complexes, in
has been treated separately from the
which the functional unit is formed by
problem of finding where a small
more than one protein chain, as in the
molecule such as an inhibitor or a
case of the glycogen phosphorylase
substrate binds to a macromolecule.
enzyme, and transient associations, in
The two cases are similar only at a first
which the protein chains are stable by
glance; they differ quite substantially in
themselves but can also interact to
the details of the techniques.1
protein
interactions.
the
of
Detecting
threedimensional
individual
proteins.
transmit a signal or as a response to
external
conditions.
logical
Metabolic pathways provide us with
interactions, one protein affects another
many examples of logical interactions.
protein by, for example, regulating its
The concentration of a product is often
expression
the
“sensed” by other proteins in its
concentration of a factor that, in turn, is
synthetic cascade and modulates their
sensed by the target protein. The two
activity. The presence of hormones is
modes of interaction are not exclusive.
1
or
In
changing
The same proteins can interact both
Anna Tramontano. The Ten Most Wanted
Solutions in Protein Bioinformatics. Anna
Tramontano Chapman & Hall/CRC; ISBN:
1584884916; 216pp.; 2005.
1
COMPUTER SCIENCES MASTER PROGRAM
PROCESS COMPUTATIONAL MODELING COURSE
PROFESSOR: FRANK VALENCIA, PhD.
TITLE PROPOSAL:
MODELING BIOMOLECULAR INTERACTIONS WITH NTCC PROCESS CALCULI
Diana Hermith, BSc. Molecular Biology
Current Graduate Student in Computer Sciences
November 11th de 2008
______________________________________________________________________
detected by cell surface receptors and
transmitted to other proteins in the cell
that can interact with the genetic
material to activate or repress genes.
External stimuli, such as the presence of
food or poison, is sensed by a bacterium
and transmitted to its flagella to direct
the cell towards or away from the
region of highest concentration of the
sensed
substance.
These
logical
interactions can coexist with physical
interactions. For example, hemoglobin
senses the binding of oxygen and
transmits the information from one of
its subunits to the others via physical
interaction.
Other examples can be
found in cell surface receptors. These
molecules have an extracellular domain,
a
membrane
domain,
and
an
intracellular domain.
Fig. 1. The binding of a ligand to the
extracellular
domain
of
a
transmembrane receptor might cause its
binding to a coreceptor (which can be
the same or a different protein). The
subsequent interaction between the
intracellular domains can trigger
signaling, for example, by activating a
transcription factor that, in turn,
activates the required genes.
Principles of Signal Transduction
In biology, signal transduction refers to
Binding of a ligand to the extracellular
domain can cause these molecules to
form dimers (i.e., to associate with
another
receptor
association
intracellular
of
chain).
the
domains
The
corresponding
allows
the
molecule to transmit the signal inside
the cell (Figure 1).
any process by which a cell converts
one kind of signal or stimulus into
another.
Most processes of signal
transduction involve ordered sequences
of biochemical reactions inside the cell,
which are carried out by enzymes,
activated
by
a
second
messengers,
signal
transduction
resulting
in
pathway.
Such processes are usually
2
COMPUTER SCIENCES MASTER PROGRAM
PROCESS COMPUTATIONAL MODELING COURSE
PROFESSOR: FRANK VALENCIA, PhD.
TITLE PROPOSAL:
MODELING BIOMOLECULAR INTERACTIONS WITH NTCC PROCESS CALCULI
Diana Hermith, BSc. Molecular Biology
Current Graduate Student in Computer Sciences
November 11th de 2008
______________________________________________________________________
rapid, lasting on the order of
milliseconds in the case of ion flux, or
minutes for the activation of proteinand lipid-mediated kinase cascades, but
some can take hours, and even days (as
is the case with gene expression), to
complete. The number of proteins and
other molecules participating in the
Fig 2. Signal Transduction
events involving signal transduction
increases as the process emanates from
Introduction to the Problem : G
the initial stimulus, resulting in a "signal
Protein Signal Cascade3
cascade," beginning with a relatively
small stimulus that elicits a large
Most signal molecules targeted to a cell
response.
bind at the cell surface to receptors
This is referred to as
amplification of the signal.2
embedded in the plasma membrane.
Only signal molecules able to cross the
An environmental signal, such as a
plasma
membrane
hormone, is first received by interaction
hormones) interact with intracellular
with a cellular component, most often a
receptors. A large family of cell surface
cell-surface receptor. The information
receptors have a common structural
that the signal has arrived is then
motif,
converted into other chemical forms,
Rhodopsin was the first of these to have
or transduced. The signal is often
its 7-helix structure confirmed by X-ray
amplified before evoking a response.
crystallography.
7
(e.g.,
transmembrane
steroid
-helices.
Feedback pathways regulate the entire
signaling process.
Rhodopsin is unique. It senses light, via
a bound chromophore, retinal. Most 7helix receptors have domains facing the
3
2
http://en.wikipedia.org/wiki/Signal_transduction
http://www.rpi.edu/dept/bcbp/molbiochem/MB
Web/mb1/part2/signals.htm#animat1
3
COMPUTER SCIENCES MASTER PROGRAM
PROCESS COMPUTATIONAL MODELING COURSE
PROFESSOR: FRANK VALENCIA, PhD.
TITLE PROPOSAL:
MODELING BIOMOLECULAR INTERACTIONS WITH NTCC PROCESS CALCULI
Diana Hermith, BSc. Molecular Biology
Current Graduate Student in Computer Sciences
November 11th de 2008
______________________________________________________________________
extracellular side of the plasma
G-proteins are heterotrimeric, with 3
membrane that recognize and bind
subunits , , .
signal
For
activates cyclic-AMP formation within
example., the -adrenergic receptor is
a cell is called a stimulatory G-protein,
activated
designated Gs with alpha subunit Gs.
molecules
by
(ligands).
epinephrine
and
norepinephrine.
A G-protein that
Gs is activated, e.g., by receptors for the
hormones epinephrine and glucagon.
The signal is usually passed from a 7-
The -adrenergic receptor is the GPCR
helix receptor to an intracellular G-
for epinephrine.
protein. Seven-helix receptors are thus
called GPCR, or G-Protein-Coupled
Receptors.
hormone
signal
Approx. 800 different
GPCRs are encoded in the human
outside
GPCR
plasma
membrane
genome.
G-protein-Coupled
Receptors

AC
GDP GTP
may
dimerize or form oligomeric complexes
within the membrane. Ligand binding
GTP
GDP
cytosol
ATP cAMP + PPi
Fig. 3 The G Protein Signal Cascade
may promote oligomerization, which
The  subunit of a G-protein (G) binds
may in turn affect activity of the
GTP, and can hydrolyze it to GDP + Pi..
receptor.
 and  subunits have covalently
proteins
Various GPCR-interacting
(GIPs)
modulate
receptor
attached lipid anchors that bind a G-
function. Effects of GIPs may include:
protein
altered
cytosolic surface.
ligand
affinity,
receptor
to
the
plasma
membrane
Adenylate Cyclase
dimerization or oligomerization, control
(AC) is a transmembrane protein, with
of
cytosolic domains forming the catalytic
receptor
localization,
including
transfer to or removal from the plasma
site.
membrane, promoting close association
with other signal proteins.
4
COMPUTER SCIENCES MASTER PROGRAM
PROCESS COMPUTATIONAL MODELING COURSE
PROFESSOR: FRANK VALENCIA, PhD.
TITLE PROPOSAL:
MODELING BIOMOLECULAR INTERACTIONS WITH NTCC PROCESS CALCULI
Diana Hermith, BSc. Molecular Biology
Current Graduate Student in Computer Sciences
November 11th de 2008
______________________________________________________________________
The sequence of events by which a
Phase 5. Protein Kinase A (cAMP
hormone activates cAMP signaling
Dependent Protein Kinase) catalyzes
include the following phases:
transfer of phosphate from ATP to
serine or threonine residues of various
Phase 1. Initially G has bound GDP,
cellular proteins, altering their activity.
and   and  subunits are complexed
together. G, the complex of  and 
The turn off of the signal involves these
subunits, inhibits G.
kind of possibilities:
Phase 2. Hormone binding, usually to
P1. G hydrolyzes GTP to GDP + Pi.
an extracellular domain of a 7-helix
(GTPase). The presence of GDP on G
receptor
causes it to rebind to the inhibitory bg
(GPCR),
causes
a
conformational change in the receptor
complex.
Adenylate Cyclase is no
that is transmitted to a G-protein on the
longer activated.
cytosolic side of the membrane. The
nucleotide-binding site on G. becomes
P2.
Phosphodiesterases
more accessible to the cytosol, where
hydrolysis of cAMP  AMP.
catalyze
[GTP] > [GDP]. G. releases GDP and
binds GTP (GDP-GTP exchange).
P3. Receptor desensitization varies with
the hormone.
In some cases the
Phase 3. Substitution of GTP for GDP
activated receptor is phosphorylated via
causes another conformational change
a G-protein Receptor Kinase.
in G.. G.-GTP dissociates from the
phosphorylated receptor then may bind
inhibitory  complex and can now bind
to a protein -arrestin.
to and activate Adenylate Cyclase.
promotes removal of the receptor from
The
-Arrestin
the membrane by clathrin-mediated
Phase 4. Adenylate Cyclase, activated
endocytosis. -Arrestin may also bind a
by the stimulatory G.-GTP, catalyzes
cytosolic Phosphodiesterase, bringing
synthesis of cAMP.
this enzyme close to where cAMP is
5
COMPUTER SCIENCES MASTER PROGRAM
PROCESS COMPUTATIONAL MODELING COURSE
PROFESSOR: FRANK VALENCIA, PhD.
TITLE PROPOSAL:
MODELING BIOMOLECULAR INTERACTIONS WITH NTCC PROCESS CALCULI
Diana Hermith, BSc. Molecular Biology
Current Graduate Student in Computer Sciences
November 11th de 2008
______________________________________________________________________
being produced, contributing to signal
proteins. For example, G inhibits
turnoff.
one of several isoforms of Adenylate
Cyclase, contributing to rapid signal
P4.
Protein
Phosphatase
catalyzes
removal by hydrolysis of phosphates
turnoff in cells that
express that
enzyme.
that were attached to proteins via
THE PROPOSAL:
Protein Kinase A.
Gain a comprehensive and predictive
The signal amplification is an important
understanding
of
the
dynamic,
feature of signal cascades.
interconnected
processes
underlying
hormone
molecule
can
lead
One
to
living systems, in these case, the G
formation of many cAMP molecules.
Protein Signal Cascade.
For these
Each catalytic subunit of Protein Kinase
purpose, is necessary to have some
A catalyzes phosphorylation of many
information about dynamics, molecular
proteins during the life-time of the
structure and biochemical detail of
cAMP.
interaction, to explore and apply formal
semantics to simulate, analyze and
Different isoforms of G have different
compare the biomolecular G Protein
signal
Signal Cascade System.
roles.
stimulatory Gs
activates
For
example,
the
when it binds GTP,
Adenylate
cyclase.
An
inhibitory Gi, when it binds GTP,
inhibits Adenylate cyclase.
Different
So in these
first draft, we not yet include, the
syntax, the structural congruece (bound
variables, laws) and reaction rules, but
we now, that the molecules involves in
the signal pathways have interaction
effectors and their receptors induce Gi,
capability (we can use the channel
to exchange GDP for GTP than those
abstracción),
that activate Gs.
(communication)
G
The complex of
that is released when G binds
GTP is itself an effector that binds to
and activates or inhibits several other
(state
and/or
reactions/interactions
and
modification
channel/conformational
change), with the same principles
specify chemistry, organic chemistry,
6
COMPUTER SCIENCES MASTER PROGRAM
PROCESS COMPUTATIONAL MODELING COURSE
PROFESSOR: FRANK VALENCIA, PhD.
TITLE PROPOSAL:
MODELING BIOMOLECULAR INTERACTIONS WITH NTCC PROCESS CALCULI
Diana Hermith, BSc. Molecular Biology
Current Graduate Student in Computer Sciences
November 11th de 2008
______________________________________________________________________
enzymatic
reactions,
metabolic
appropiate reasoning about this kind of
pathways, signal-transduction pathways
systems.
and ultimately the entire cell. Biology
is driven by quantities (e.g., energy,
The principles of modeling in NTCC
time, affinity, distance, amount of
calculus the G Protein Signal Cascade
components)
need
will be include the molecule as a
stochastic variant of NTCC process
computational process, the biochemical
algebras must be considered simulation
interaction as communication, the use of
techniques come into play.
process algebra to model the G Protein
so
we
would
Signal Cascade applying the NTCC
The goal 1 would be, identify and
process calculi.
characterize
biomolecular
approach should be an unified view of
machinary of G Protein Signal Cascade
the system, the simulation and analysis
as
using
and a comparative power and scalability
Process algebra: the NTCC calculus.
to enrich the model with experimental
The
(quantitative) data.
the
concurrent
second
computation,
one,
develop
the
The benefits of the
The idea in this
computational capabilities to advance
sense, is to explore and combine the
understanding of complex biological
metodology with implementations than
systems and predict their behavior.
permit an interplay between collecting
data
in
experiments
(experimental
Previous abstractions shows, that in
biology) and the NTCC model, with the
concurrent
aim
computational
processes,
to
capture
some
mechanistic
each biological entity is a process that
understanding of how the systems under
may carry some state and interacts with
study works, by validating the model
other processes. Prior proposals4 based
under
on process algebras, such as the pi
correspond to the experiments and by
calculus can be applied for make an
comparing
various
the
conditions
outcomes
that
to
the
experimental data, one can identify
4
Regev et al. 2001.
http://www.wisdom.weizmann.ac.il/~udi/papers
/pi_calculus.pdf
discrepancies
mechanisms
between
and
the
hypothetical
experimental
7
COMPUTER SCIENCES MASTER PROGRAM
PROCESS COMPUTATIONAL MODELING COURSE
PROFESSOR: FRANK VALENCIA, PhD.
TITLE PROPOSAL:
MODELING BIOMOLECULAR INTERACTIONS WITH NTCC PROCESS CALCULI
Diana Hermith, BSc. Molecular Biology
Current Graduate Student in Computer Sciences
November 11th de 2008
______________________________________________________________________
observations. These differences can be
•Message: Protein A phosphorylates a
used to suggest new hypotheses, which
residue on B
serve to adjust the model and need to be
•Meaning of message: This enables
validated
Protein B to bind to C
experimentally,
or
new
experiments, which can confirm or
xpected results we are thinking to
falsify the modeling hypotheses.
obtain: a unified view of structure and
The goal will be find an appropriate
dynamics of G Protein Signal Cascade,
model for G Protein Signal Cascade that
a
include molecular structure, behavior
(complexes,
and biological formal semantics.
residues) in visible form, a complex
detailed
molecular
information
molecules,
domains,
dynamic behavior (feedback, cross-talk,
Therefore it is necessary consider the
split and merge), a modular system.
molecule as a computational process,
represent a structure by its potential
We want construct a NTCC model that
behavior (by the process in which it can
could resolve some of these biological
participate). For example, an enzyme
queries: describe the sequence of events
(protein molecule) as the enzymatic
by
reaction process in which it may
epinephrine
participate.
It is also necessary, to
production of cyclic AMP within a cell;
consider
the
as
include the roles of the receptor
communication (molecular interaction
(GPCR), the different subunits of the
and modification as communication and
stimulatory G protein, and Adenylate
change
each
Cyclase. How is the signal turned off at
interaction enables or disables other
each step?. How we can be simulate the
interactions. For example, consider the
role of -arrestin?.
simple following model:
describe the activation of cAMP-
•A system: Proteins A, B, and C
Dependent
•Communication: Protein A and B can
Kinase A)?. What causes the enzyme to
interact
be inhibited in the absence of cyclic
of
interaction
channel
names),
which
AMP?.
a
or
hormone
glucagon
Protein
such
as
activates
How we can
Kinase
(Protein
How is activation by cyclic
8
COMPUTER SCIENCES MASTER PROGRAM
PROCESS COMPUTATIONAL MODELING COURSE
PROFESSOR: FRANK VALENCIA, PhD.
TITLE PROPOSAL:
MODELING BIOMOLECULAR INTERACTIONS WITH NTCC PROCESS CALCULI
Diana Hermith, BSc. Molecular Biology
Current Graduate Student in Computer Sciences
November 11th de 2008
______________________________________________________________________
AMP turned off?. How we can explain
overlap between the two proteins atoms;
the reaction catalyzed by the activated
that is, to treat each atom as a soft rather
cAMP-Dependent
Kinase?.
than a hard sphere so that limited
What reaction is catalyzed by the
interpenetration of the atoms of the two
enzyme Protein Phosphatase?.
proteins is allowed in the final docked
Protein
conformation.
For molecular structure considerations,
it is important think about the process of
BIBLIOGRAPHY
successful “docking”; it depends on
[1] Executable Cell Biology. Jasmin
Fisher and Thomas A Henzinger.
Nature Biotechnology, volume 25
number 11 november 2007.
three things: how we represent the
protein structures, taking into account
their flexibility; how we search the
conformational space of the possible
solutions; and how we evaluate and
rank the solutions. A protein structure,
and especially the side chains involved
in
binding,
substantial
can
undergo
rearrangements
formation of the complex.
[2]
Representing
Biomolecular
Processes with Computer Process
Algebra: Pi Calculus programs of
Signal Transduction pathways. Aviv
Regev, William Silverman and Ehud
Shapiro.
American Association for
Artificial Intelligence (www.aaai.org).
quite
upon
In some
cases, the observation of the protein
[3] Anna Tramontano. The Ten Most
Wanted
Solutions
in
Protein
Bioinformatics. Anna Tramontano
Chapman
&
Hall/CRC;
ISBN:
1584884916; 216pp.; 2005.
structure might suggest specific regions
that
are
likely
conformation
upon
to
modify
binding.
their
For
example, regions between domains can
be specifically allowed to assume a
discrete set of conformations. Another
possibility is to use a single-protein
structure conformation and introduce
the treatment of flexibility later in the
procedure, by not penalizing limited
[4] Representation and simulation of
biochemical processes using the picalculus process algebra. A Regev, W
Silverman,
E
Shapiro.
2001.
http://www.wisdom.weizmann.ac.il/~ud
i/papers/pi_calculus.pdf
[5] Timed Concurrent Constraint
Programming in Systems Biology. A.
Arbelaez, J. Gutierrez, J. Perez.
http://avispa.puj.edu.co
[6] Timed Concurrent Constraint
Programming for Analysing Biological
Systems. J. Gutierrez, J. Perez, C.
9
COMPUTER SCIENCES MASTER PROGRAM
PROCESS COMPUTATIONAL MODELING COURSE
PROFESSOR: FRANK VALENCIA, PhD.
TITLE PROPOSAL:
MODELING BIOMOLECULAR INTERACTIONS WITH NTCC PROCESS CALCULI
Diana Hermith, BSc. Molecular Biology
Current Graduate Student in Computer Sciences
November 11th de 2008
______________________________________________________________________
Rueda
and
F.
Valencia.
http://www.elsevier.nl/locate/entcs
[7]
Modelamiento
de
Sistemas
Biológicos usando Cálculos de Procesos
Concurrentes. J. Gutierrez, J. Perez, C.
Rueda. Epiciclos. Cali (Colombia),
diciembre de 2005.
[8] A Calculus for Modelling,
Simulating
and
Analysing
Compartmentalized Biological Systems.
R. Mardare and A. Ihekwaba.
Computation in Modern Science and
Engineering: Proceedings of the
International
Conference
on
Computational Methods in Science and
Engineering 2007 (iccmse 2007): Vol.
2, parts A and B. AIP conference
proceedings, Vol 963, pp. 642-646
(2007).
[9] Systems Biology: Looking at
opportunities and challenges in applying
systems theory to molecular and cell
biology. O. Wolkenhauer, H. Kitano
and K-H Cho. IEEE Control Systems
Magazine, August 2003.
[10] Computing in Molecular Biology:
Mapping and Interpreting Biological
Information. E. Lander, R. Langridge
and D. Saccocio.
IEEE Control
Systems Magazine, November 1991.
[11] Computational Approaches to
Discovering Semantics in Molecular
Biology. R. Lipton, T. Marr and D.
Welsh. Proceedings ot the IEE, Vol.
77, No. 7, July 1989.
10