Download NIDA-svisit-20071219-PARE - Yale Bioinformatics -

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

MicroRNA wikipedia , lookup

Immunoprecipitation wikipedia , lookup

List of types of proteins wikipedia , lookup

RNA-Seq wikipedia , lookup

Histone acetylation and deacetylation wikipedia , lookup

Non-coding RNA wikipedia , lookup

Gene expression profiling wikipedia , lookup

Secreted frizzled-related protein 1 wikipedia , lookup

Silencer (genetics) wikipedia , lookup

G protein–coupled receptor wikipedia , lookup

Protein wikipedia , lookup

Magnesium transporter wikipedia , lookup

Ancestral sequence reconstruction wikipedia , lookup

Protein design wikipedia , lookup

Protein folding wikipedia , lookup

Protein moonlighting wikipedia , lookup

Protein structure prediction wikipedia , lookup

QPNC-PAGE wikipedia , lookup

Interactome wikipedia , lookup

Protein (nutrient) wikipedia , lookup

Messenger RNA wikipedia , lookup

Western blot wikipedia , lookup

Protein adsorption wikipedia , lookup

Proteolysis wikipedia , lookup

Gene expression wikipedia , lookup

Epitranscriptome wikipedia , lookup

De novo protein synthesis theory of memory formation wikipedia , lookup

Protein–protein interaction wikipedia , lookup

Nuclear magnetic resonance spectroscopy of proteins wikipedia , lookup

Transcript
Relating Protein Abundance &
mRNA Expression
Mark B Gerstein
Yale (Comp. Bio. & Bioinformatics)
NIDA site visit at Yale
2007.12.19
Why relate amounts of protein & mRNA
Gene expression major place for regulation
(easy to measure)
vs.
Concentration of protein major determinant of activity
Expectations from simple kinetic
models:
dPi
= ks,i [mRNAi ] – kd,i Pi
dt
ks;i [mRNAi ]
At steady state: Pi =
kd,i
where ks,i and kd,i are the protein synthesis
and degradation rate constants
[Greenbaum et al. Bioinformatics 2002, 18, 587.]
Outliers from trend interesting
Relationship of Protein Abundance to
Complexes and Pathways
In protein complexes, one expects stoichiometric abundance of component proteins and
that mRNA expression levels should be correlated with protein abundance
…Among pathways, this is expected to a lesser degree between interacting proteins
Protein complexes
[Graphic: http://proton.chem.yale.edu]
Protein interaction networks
[Graphic: Jeong et al, Nature, 41:411]
Sources of experimental data
mRNA expression levels
Microarrays
Affymetrix
PCR
SAGE
Protein abundance
2D Gel Electrophoresis
Multiple staining options
Small dynamic range
DIGE
Cy3 vs. Cy5 labeling
Large dynamic range
ICAT, iTRAQ
MS-based
Relative abundance (ratio of isotopically
labeled species)
Large dynamic range
MudPIT
LC-MS/MS
SILAC
Stable isotope labeling with amino acids in
cell culture - for MS analysis
TAP-Tag
Weissman and O’Shea (Oct. 2003)
[http://www.biology.ucsc.edu/mcd/images/microarray.gif]
PARE: proteomics.gersteinlab.org
Upload or use pre-loaded
mRNA, protein datasets
Open-source code
Downloadable
Analyze all or
analyze MIPS or GO subset
[Yu et al., BMC Bioinfo. '07]
PARE: a web-based tool for correlating
mRNA expression and protein abundance
PARE
main page
Select MIPS, GO
subsets (opt.)
Display results
(1) Select mRNA, protein datasets:
-use pre-loaded datasets
-upload datasets
(2) Choose categorization method:
-correlate all
-MIPS complexes
-GO biological processes
-GO molecular function
-GO cellular component
(3) Display
-Linear or log-log correlation for selected subset(s)
-Tabulate data, correlation values for selected subset(s
-Label (on plot) and tabulate outlying datapoints
[Yu et al., BMC Bioinfo. '07]
PARE output
Correlated data
Calculation of
mutual information
[Yu et al., BMC Bioinfo. '07]
Log-log plot of correlation
-linear fit
-outliers labeled
Correlation of subsets (GO, MIPS)
[Yu et al., BMC Bioinfo. '07]
Yeast ref. datasets:
“Correlate all”
vs.
GO cellular component subsets
for particular cellular locations
PARE: pre-loaded datasets
[Yu et al., BMC Bioinfo. '07]
Connecting PARE with datasets
from NIDA investigators
Protein abundance (iTRAQ datasets)
Mouse (Nairn lab) - samples from 3 brain regions:
cortex, striatum, hippocampus
Green monkey (Taylor lab) - several brain regions caudate, dlPFC, mPFC, NacC, NacS, PFC11,
PFC13, PrCO, putamen
each treated with saline, PCP, and cocaine
mRNA datasets obtained from expression database
Mouse - Sandberg et al. PNAS 2000, 97, 11038.
Mouse brain: correlation of mRNA &
protein expression
Mouse brain, p < 0.15
For 96 genes with
differential expression
in hippocampus vs.
cortex
3
protein (hippocampus/cortex)
protein abundance ratio
(hippocampus/cortex)
2.5
2
Plan to correlate
abundance for
individual pathways
and complexes with C.
Bruce ("John", talking
later)
1.5
1
0.5
0
0
0.5
1
1.5
2
m RNA (hippocam pus/cortex)
mRNA expression ratio
(hippocampus/cortex)
2.5
3
Acknowledgements
iTRAQ datasets:
Angus Nairn, Erika Andrade, Dilja Krueger
Jane Taylor
Chris Colangelo, Mark Shifman (YPED)
PARE:
Anne Burba, Eric Yu