Download The construction of LacplantCyc and the use of physiological and

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Homologous recombination wikipedia , lookup

Helicase wikipedia , lookup

Helitron (biology) wikipedia , lookup

Transcript
Reconstructing the metabolic network of a bacterium from
its genome: the construction of LacplantCyc
In silico reconstruction of the metabolic pathways of Lactobacillus plantarum: comparing predictions
of nutrient requirements with growth experiments
Teusink, van Enckevort, Francke, Wiersma, Wegkamp, Smid and Siezen
2005 Appl. Environ. Microbiol. 71: 7253-7262
Christof Francke
What do we mean by reconstruction?
...... the collection and visualization of all potential
physiologically relevant cellular processes. ..........
Reconstructing the metabolic network of a bacterium from its genome
Francke, Siezen and Teusink
Trends in Microbiol. 2005 13: 550-558
Why do we want to do it?
...... it serves to sort the individual proteins and thus
the potential molecular functions, into a context (like pathways or
protein complexes) and as such:
- allows for improved functional annotation
- provides a platform to visualize and analyze 'omics' data
- yields a network the topology of which can be studied
- can be converted to a model (metabolic engineering)
How do we annotate?
The attribute function is ambiguous
context independent (molecular function or properties)
- catalyze certain reactions
- interact with certain proteins
- bind to a specific DNA sequence
context dependent (role)
- act in a certain pathway
- be a member of a certain protein complex(es)
- act as a transcription factor
We are interested in lactic acid bacteria
#
(2003) Proc Natl Acad Sci USA 100,1990
#
annotation database
PlantDB
the construction of LacplantCyc
recovering gene - protein - reaction -pathway relations
Pathway Tools (SRI) uses gene-annotation (EC-numbers) and
reference database (MetaCyc) to arrive automatically at an
• encyclopedia of
genes connected to proteins
proteins connected to reactions
reactions connected in pathways
initial automatic reconstruction: some remarks
Are the assignments correct and which functions are there
that have not been retrieved?
- presence of pathways
- gaps in pathways
- are these numbers correct?
- same reactions
and pathways
- manual changes are not
correctly incorporated
the actual labour: curation
What we have done:
consult reference databases
What we have done:
add information that is not recovered from MetaCyc.
Transporters are not
recovered by pathway tools
What we have done:
add information that is not recovered from MetaCyc.
- include newly discovered and or
organism specific reactions and pathways
- add information on complex formation
pyruvate
EI
PEP
EI
~P
HPr
~P
HPr
170
170
~P
dak1
170
dak2 ~P
dihydroxy acetone
dak1
dak2
dihydroxy acetone ~P
What we have done:
evaluation of the attributed molecular function for each individual case
Do we trust the gene - protein - reaction association when we
consider the similarity between the sequence of the gene-product
and the sequence of a protein with the specified molecular function
(evidence based on experiment)?
- determine orthology
(use phylogeny and gene-context to determine evolutionary relationship)
- check experimental evidence
The evaluation of the attributed molecular function:
Improved annotation of homologous proteins
the use of phylogeny and orthologous relations
experimental: kojibiose phosphorylase
map1*
experimental: trehalose phosphorylase
map4
* there are slight but significant
differences in alignment of cluster 1
which might point to slightly altered
specificity
L. plantarum has four homologs annotated
as maltose phosphorylase
The evaluation of the attributed molecular function:
Improved annotation of homologous proteins
the use of gene context and metabolic context
map2
map3
experimental: maltose phosphorylase
active
experimental: maltose phosphorylase
passive
the evaluation of gaps in pathways:
A
B
are genes
really missing
?

an example: Tetrahydrofolate synthesis by Lactobacillus plantarum
the evaluation of gaps in pathways:
track missing genes
C
D
Gene identifier
product name
gene name
EC-code
lp_0014
lp_0119
lp_0897
lp_0962
lp_1144
lp_1627
lp_1737
lp_1885
replicative DNA helicase DnaC
NTP pyrophosphohydrolase
NTP pyrophosphohydrolase (putative)
ATP-dependent DNA helicase RecQ
ATP-dependent DNA helicase PcrA
ATP-dependent DNA helicase RecG
ATP-dependent helicase DinG
ATP-dependent DNA helicase RecQ
dnaC
lp_0119
lp_0897
recQ1
pcrA
recG
dinG
recQ2
3.6.1.3.6.1.3.6.1.3.6.1.3.6.1.3.6.1.3.6.1.3.6.1.-
Lp_3295
xanthosine triphosphate pyrophosphatase xtp2 no EC code
the evaluation of gaps in pathways:
the use of knowledge on physiology
absent
absent
predicted
growth dependence
Validation:
no tetrahydrofolate detectable without addition of p-aminobenzoate
to the medium
the evaluation of reactions and pathways:
the use knowledge on physiology
Lactobacilli do not have a TCA cycle and
therefore do not produce succinyl-CoA
==> In all reactions succinyl-CoA is used as a
substrate it has to be replaced by acetyl-CoA
TCA cycle
cleaning up the database
the removal of redundant pathways
a comparison of the automatic and curated LacplantCyc
- inconsistencies between observed nutrient requirements
and pathway predictions may lead to new insights about
regulation
automatic




further research needed
using LacplantCyc
using LacplantCyc
- to visualize -omics data
- to compare the metabolic network between different species
we need improved visualization
remarks
About the use of Cyc to visualize 'omics' data
- it preferably requires
* an interactive overview with more information
* the possibility of having multiple selectable overviews
* colouring of the genes instead of the reactions
using LacplantCyc
- to help the reconstruction of the metabolic
network of a related species through orthologous
relationships between proteins
Accelerating the reconstruction of genome-scale metabolic networks
Notebaart, van Enckevort, Francke, Siezen and Teusink
- to serve as the starting point for making a
metabolic model (constraint based modeling)
in preparation
remarks
About the use of:
Cyc as a source of gene-reaction-pathway association information to
be used in other applications
- requires easy export of these associations
Cyc as a starting point for modeling
- requires balanced reactions, detailed and correct molecular
information on compounds and balance checks
final remarks
Pathway Tools is very nice to quickly connect reaction and pathway
information to a gene which has been annotated with an EC-code.
However:
- Generation of a reliable reconstruction requires a lot of work and the
implementation of changes is not always straightforward (problems
with certain frames) and requires a lot of steps.
- Better control over the editor of individual pathways and the pathway
overview would be an important asset.
- Application of automatic procedures after curation unfortunately
destroys the changes that were carefully implemented.
- Multiple editors with straightforward import and export functions
would enhance the usefulness.
acknowledgements
Frank van Enckevort
Christof Francke
Richard Notebaart
Roland Siezen
Eddy Smid
Bas Teusink
Arno Wegkamp
Anne Wiersma
LacplantCyc can be found at www.lacplantcyc.nl