* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Proteomic studies of the environmentally important methanotroph
Rosetta@home wikipedia , lookup
Structural alignment wikipedia , lookup
Degradomics wikipedia , lookup
Circular dichroism wikipedia , lookup
Protein design wikipedia , lookup
List of types of proteins wikipedia , lookup
Protein domain wikipedia , lookup
Protein folding wikipedia , lookup
Homology modeling wikipedia , lookup
Protein structure prediction wikipedia , lookup
Bimolecular fluorescence complementation wikipedia , lookup
Intrinsically disordered proteins wikipedia , lookup
Protein moonlighting wikipedia , lookup
Protein purification wikipedia , lookup
Western blot wikipedia , lookup
Nuclear magnetic resonance spectroscopy of proteins wikipedia , lookup
Proteomic studies of the environmentally important methanotroph Methylocella silvestris Konstantinos Thalassinos, Vibhuti Patel, Susan Slade, Nisha Patel, Joanne Connoly, Andrew Crombie, Colin Murrell, James Scrivens The Central Dogma of Molecular Biology Proteome Complexity Human Genome 20,000 – 25,000 genes1 A single gene can give rise to many different proteins by the process of alternative splicing In complex genes, alternative splicing can generate dozens or even hundreds of different mRNA isoforms2 It is estimated that almost 50% of all proteins contain one or more post-translational modifications3 1International Human Genome Sequencing Consortium (2004). "Finishing the euchromatic sequence of the human genome.". Nature 431 (7011): 931-945 2Missler, M. and Sudhof, T. C. (1998). Neurexins: three genes and 1001 products. Trends Genet. 14, 20–26. 3Apweiler, R., Hermjakob, H. and Sharon, N. (1999). On the frequency of protein glycosylation, as deduced from analysis of the SWISS-PROT database. Biochim.Biophys. Acta 1473, 4–8. What is Proteomics? Proteomics is the global characterisation of protein products (sequence, post-translational modifications, protein-protein interactions) expressed by a given genome at a specific point in time Unlike the genome, the proteome is a dynamic entity Challenges and limitations Biological Dynamic range 4 orders of magnitude in cells and 10 orders of magnitude in plasma Post-translational modifications Alternative splicing Technical Sample requirements Complex, time-consuming sample preparation Days of experimental time Pedrioli, P. G., et al., (2004). A common open representation of mass spectrometry data and its application to proteomics research. Nat. Biotechnol. 22, 1459–1466. Proteomics Approaches Profiling Differential Identify which proteins are present in a sample Probe for changes in protein expression levels between a number of environmental states Detailed characterisation Identify the presence and map the position of all posttranslational modifications present on each protein Essentials of a Mass Spectrometer A mass spectrometer separates ions according to their mass-to-charge (m/z) ratio Direct Sample Introduction ESI Liquid Chromatography MALDI Quadrupole TOF Ion Trap FT MS and Tandem MS data MS/MS fragment ion nomenclature Limitations of Current Database Search Programs Finding a peptide match after a database search is easy, but knowing whether it is correct is not It is almost always possible to match a MS/MS spectrum to a peptide in the database Incorrect matches often (but not always) result from use of low quality peptide MS/MS data to search the database Even high quality data can produce invalid identifications Actual peptide sequence is not in the database searched (under the search conditions used) Probability of a false positive assignment is much higher for proteins identified with only one peptide (known as one-hitwonders) According to publishing guidelines more than 1 peptide per protein is required Methylocella silvestris Acidophillic aerobic methanotroph Methanotrophs use methane as their sole carbon and energy source Key position in the global methane cycle Effects of multi-carbon substrates on activity Soluble methane monooxygenase (sMMO) The recently discovered genus of Methylocella is capable of utilising certain multi-carbon compounds as well as methane Aims To measure changes in protein expression of Methylocella silvestris under varying growth conditions. Relate changes in proteome to important biological pathways. Compare existing methods and new approaches using the same instrumentation and software without bias. Proteomes analysed H3C CH3 H H O HO OH H OH H N O H Propane Succinate H H H Methanol O H HO H Methane Acetate CH3 H H Methylamine Creation of protein database M. silvestris genome recently published Predict all open reading frames Use custom Perl scripts to create appropriate FASTA formatted database Experimental approaches used SDS-PAGE gels iTRAQ Uses 4 isobaric tags for quantitation MSE (IdentityE) No quantitation Uses an internal standard for quantitation Database search results saved in MySQL database Proteins identified by each methodology Gels iTRAQ IdentityE Overlap of protein identifications Including single-peptide identifications Two peptides or more iTRAQ reporter ion ratios Replicate analyses of iTRAQ-labelled samples IdentityE estimated quantitation Log(e) ratio plot of common proteins expressed under methane and acetate growth. Summary Gels iTRAQ IdentityE Protein loading 14 µg 800 – 1000 µg 0.5 – 0.75 µg Total experimental time 4 days 6 days Less than 3 days 30-40 hours 30-40 hours 6 hours per sample Number of proteins identified 95 171 399 Average number of peptides per protein 3.7 2 10 17.2 % 11.5 % 50.4 % -- 3 4 Total instrument time Average sequence coverage Dynamic range covered by relative quantitation Conclusions All the methodologies employed provided good profiling coverage of the respective proteome. iTRAQ and IdentityE both provide information on protein identity and changes in expression. IdentityE More confident protein identifications Lower protein requirements Significantly less instrument demands Significantly reduced sample preparation time Provides a stand-alone quantitative estimate of the proteins present in any given sample Publication A comparison of labelling and label-free mass spectrometry-based proteomics approaches. Konstantinos Thalassinos, Vibhuti Patel, Susan Slade, Nisha Patel, Joanne Connoly, Andrew Crombie, Colin Murrell, James Scrivens. Journal of Proteome Research Ongoing studies Continue the studies on the effect of growth substrate on the proteome of M. silvestris. Relate results back to M. silvestris cell biochemistry in particular the pathways of multi-carbon substrate assimilation Biological significance of results obtained Distinct protein profiles for each substrate: Certain proteins only expressed under methane. Certain proteins only expressed under acetate. Significantly lower levels of key enzyme soluble methane monooxygenase (sMMO) when grown under acetate. Pathway mapping Kyoto Encyclopedia of Genes and Genomes (KEGG) KEGG Automatic Annotation Server (KAAS) Manually curated database of biological pathways. Functional annotation of genes by BLAST comparisons against KEGG database. Develop a program to map the KAAS results back onto KEGG pathways http://www.genome.jp/kegg/ Biological pathways Acknowledgements Acknowledgements Multi-dimensional protein identification techniques (MudPIT) Profiling Quantitation Proteome Tryptic digest Label with isobaric tags Strong cation exchange chromatography LC-ESI-MS/MS Analyse ratios of tags 115 iTRAQ 117 Database searches 114 Protein identification Relative levels of identified proteins 116 Alternative approach to profiling and quantitative proteomics Waters IdentityE and ExpressionE 500 ng sample loading plus an internal standard Approximately 2 hours analysis/sample New approach to profiling and quantitative proteomics Validation of MSE data IdentityE and ExpressionE Quantitation is based on relationship between ESI signal response and protein concentration The average ESI signal response of the 3 most intense tryptic peptides per mole of protein is constant (CV +/- 10%)$ Quantify at the protein level (gross changes) or peptide level (minor fluctuations in protein expression) $Absolute Quantification of Proteins by LCMSE Silva et al., MCP vol. 5 issue 1 (2006) 144-156 Protein identifications common to iTRAQ and IdentityE Proteins identified in the iTRAQ and IdentityE experiments, including onehit-wonders.