* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download NIDA-svisit-20071219-PARE - Yale Bioinformatics -
Immunoprecipitation wikipedia , lookup
List of types of proteins wikipedia , lookup
Histone acetylation and deacetylation wikipedia , lookup
Non-coding RNA wikipedia , lookup
Gene expression profiling wikipedia , lookup
Secreted frizzled-related protein 1 wikipedia , lookup
Silencer (genetics) wikipedia , lookup
G protein–coupled receptor wikipedia , lookup
Magnesium transporter wikipedia , lookup
Ancestral sequence reconstruction wikipedia , lookup
Protein design wikipedia , lookup
Protein folding wikipedia , lookup
Protein moonlighting wikipedia , lookup
Protein structure prediction wikipedia , lookup
Interactome wikipedia , lookup
Protein (nutrient) wikipedia , lookup
Messenger RNA wikipedia , lookup
Western blot wikipedia , lookup
Protein adsorption wikipedia , lookup
Proteolysis wikipedia , lookup
Gene expression wikipedia , lookup
Epitranscriptome wikipedia , lookup
De novo protein synthesis theory of memory formation wikipedia , lookup
Protein–protein interaction wikipedia , lookup
Nuclear magnetic resonance spectroscopy of proteins wikipedia , lookup
Relating Protein Abundance & mRNA Expression Mark B Gerstein Yale (Comp. Bio. & Bioinformatics) NIDA site visit at Yale 2007.12.19 Why relate amounts of protein & mRNA Gene expression major place for regulation (easy to measure) vs. Concentration of protein major determinant of activity Expectations from simple kinetic models: dPi = ks,i [mRNAi ] – kd,i Pi dt ks;i [mRNAi ] At steady state: Pi = kd,i where ks,i and kd,i are the protein synthesis and degradation rate constants [Greenbaum et al. Bioinformatics 2002, 18, 587.] Outliers from trend interesting Relationship of Protein Abundance to Complexes and Pathways In protein complexes, one expects stoichiometric abundance of component proteins and that mRNA expression levels should be correlated with protein abundance …Among pathways, this is expected to a lesser degree between interacting proteins Protein complexes [Graphic: http://proton.chem.yale.edu] Protein interaction networks [Graphic: Jeong et al, Nature, 41:411] Sources of experimental data mRNA expression levels Microarrays Affymetrix PCR SAGE Protein abundance 2D Gel Electrophoresis Multiple staining options Small dynamic range DIGE Cy3 vs. Cy5 labeling Large dynamic range ICAT, iTRAQ MS-based Relative abundance (ratio of isotopically labeled species) Large dynamic range MudPIT LC-MS/MS SILAC Stable isotope labeling with amino acids in cell culture - for MS analysis TAP-Tag Weissman and O’Shea (Oct. 2003) [http://www.biology.ucsc.edu/mcd/images/microarray.gif] PARE: proteomics.gersteinlab.org Upload or use pre-loaded mRNA, protein datasets Open-source code Downloadable Analyze all or analyze MIPS or GO subset [Yu et al., BMC Bioinfo. '07] PARE: a web-based tool for correlating mRNA expression and protein abundance PARE main page Select MIPS, GO subsets (opt.) Display results (1) Select mRNA, protein datasets: -use pre-loaded datasets -upload datasets (2) Choose categorization method: -correlate all -MIPS complexes -GO biological processes -GO molecular function -GO cellular component (3) Display -Linear or log-log correlation for selected subset(s) -Tabulate data, correlation values for selected subset(s -Label (on plot) and tabulate outlying datapoints [Yu et al., BMC Bioinfo. '07] PARE output Correlated data Calculation of mutual information [Yu et al., BMC Bioinfo. '07] Log-log plot of correlation -linear fit -outliers labeled Correlation of subsets (GO, MIPS) [Yu et al., BMC Bioinfo. '07] Yeast ref. datasets: “Correlate all” vs. GO cellular component subsets for particular cellular locations PARE: pre-loaded datasets [Yu et al., BMC Bioinfo. '07] Connecting PARE with datasets from NIDA investigators Protein abundance (iTRAQ datasets) Mouse (Nairn lab) - samples from 3 brain regions: cortex, striatum, hippocampus Green monkey (Taylor lab) - several brain regions caudate, dlPFC, mPFC, NacC, NacS, PFC11, PFC13, PrCO, putamen each treated with saline, PCP, and cocaine mRNA datasets obtained from expression database Mouse - Sandberg et al. PNAS 2000, 97, 11038. Mouse brain: correlation of mRNA & protein expression Mouse brain, p < 0.15 For 96 genes with differential expression in hippocampus vs. cortex 3 protein (hippocampus/cortex) protein abundance ratio (hippocampus/cortex) 2.5 2 Plan to correlate abundance for individual pathways and complexes with C. Bruce ("John", talking later) 1.5 1 0.5 0 0 0.5 1 1.5 2 m RNA (hippocam pus/cortex) mRNA expression ratio (hippocampus/cortex) 2.5 3 Acknowledgements iTRAQ datasets: Angus Nairn, Erika Andrade, Dilja Krueger Jane Taylor Chris Colangelo, Mark Shifman (YPED) PARE: Anne Burba, Eric Yu