Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Institute of Biotechnology and Department of Biosciences Faculty of Biological and Environmental Sciences and Helsinki Graduate Program in Biotechnology and Molecular Biology University of Helsinki Helsinki, Finland Proteomic characterization of host response to viral infection Niina Lietzén ACADEMIC DISSERTATION To be presented for public examination with the permission of the Faculty of Biological and Environmental Sciences of the University of Helsinki in the Auditorium 2402 (Telkänpönttö) at Viikki Biocenter 3, Viikinkaari 1, Helsinki, on May 16th at 12 o´clock noon. Helsinki 2012 Supervisors Docent Tuula Nyman Institute of Biotechnology University of Helsinki Helsinki, Finland Docent Sampsa Matikainen Unit of Excellence in Immunotoxicology Finnish Institute of Occupational Health Helsinki, Finland Thesis committee Docent Jaana Vesterinen Institute of Biomedicine Biochemistry and Developmental Biology University of Helsinki Helsinki, Finland Professor Juho Rousu Department of Information and Computer Science Aalto University Espoo, Finland Reviewers Docent Jaana Vesterinen Institute of Biomedicine Biochemistry and Developmental Biology University of Helsinki Helsinki, Finland Docent Sampsa Hautaniemi Institute of Biomedicine and Genome-Scale Biology Research Program Faculty of Medicine University of Helsinki Helsinki, Finland Opponent Professor Kris Gevaert VIB Department of Medical Protein Research and Department of Biochemistry University of Ghent Ghent, Belgium Custos Professor Kari Keinänen Department of Biosciences Faculty of Biological and Environmental Sciences University of Helsinki Helsinki, Finland Cover figure: Cytoscape protein interaction network (Niina Lietzén) ISBN 978-952-10-7955-9 (paperback) ISBN 978-952-10-7956-6 (PDF) ISSN 1799-7372 http://ethesis.helsinki.fi Unigrafia Helsinki 2012 TABLE OF CONTENTS LIST OF ORIGINAL PAPERS ABBREVIATIONS ABSTRACT 1. INTRODUCTION..................................................................................................................... 1 1.1 Proteins and proteomics ................................................................................................. 1 1.2 Methods in proteomics ................................................................................................... 2 1.2.1 Sample prefractionation ........................................................................................... 3 1.2.2 Protein and peptide separation ................................................................................ 4 1.2.2.1 Gel-based methods ............................................................................................ 5 1.2.1.2 Chromatographic methods ................................................................................ 6 1.2.2 Mass spectrometry in proteomics ............................................................................. 8 1.2.2.1 Protein identification by mass spectrometry ...................................................... 9 1.2.3 Database search engines in protein identification ................................................... 11 1.2.4 Quantitative proteomics ......................................................................................... 12 1.2.5 Data analysis .......................................................................................................... 16 1.3 Innate immune system .................................................................................................. 18 1.3.1 Innate immune recognition of pathogens ............................................................... 19 1.3.2 Innate immune responses against viral infection .................................................... 21 1.3.3 Influenza A virus ..................................................................................................... 23 2. AIMS OF THE STUDY ............................................................................................................ 26 3. MATERIALS AND METHODS ................................................................................................. 27 3.1 Cells and stimulations.................................................................................................... 27 3.2 Subcellular fractionation and secretome analysis .......................................................... 27 3.3 Gel-based methods used in proteomic experiments ...................................................... 28 3.4 Quantitative analysis using iTRAQ.................................................................................. 29 3.5 Mass spectrometry ........................................................................................................ 30 3.6 Database searches......................................................................................................... 30 3.7 Protein classification, interaction networks and clustering analysis................................ 31 3.8 Immunological analyses................................................................................................. 32 3.9 Reagents ....................................................................................................................... 32 4. RESULTS .............................................................................................................................. 34 4.1 Proteomics is an efficient method to study innate immune responses in human keratinocytes and macrophages .......................................................................................... 34 4.2 Mascot and Paragon give comparable protein identification results .............................. 36 4.3 Virus-induced responses in HaCaT keratinocytes and human primary macrophages ...... 37 4.3.1 Several inflammatory pathways are activated in human primary macrophages ...... 37 4.3.2 Viral infection triggers caspase-dependent apoptosis in human macrophages and keratinocytes .................................................................................................................. 39 4.3.2.1 Prediction and identification of potential caspase cleavage targets from proteomic data ............................................................................................................ 39 4.3.3 Influenza A virus infection and polyI:C transfection trigger significant protein secretion from human primary macrophages .................................................................. 41 5. DISCUSSION ........................................................................................................................ 43 6. CONCLUSIONS AND FUTURE PERSPECTIVES ........................................................................ 49 ACKNOWLEDGEMENTS ........................................................................................................... 50 REFERENCES ........................................................................................................................... 51 LIST OF ORIGINAL PAPERS This thesis is based on the following original articles that are referred to in the text by their Roman numerals I-V. I Öhman T*, Lietzén N*, Välimäki E, Melchjorsen J, Matikainen S, Nyman TA (2010) Cytosolic RNA recognition pathway activates 14-3-3 protein mediated signaling and caspase-dependent disruption of cytokeratin network in human keratinocytes. Journal of Proteome Research 9: 1549-1564. - NL did the image analysis and comparison of 2D gels and participated in protein identification, computational data analysis and writing the manuscript. II Lietzén N, Öhman T*, Rintahaka J*, Julkunen I, Aittokallio T, Matikainen S#, Nyman TA# (2011) Quantitative subcellular proteome and secretome profiling of influenza A virusinfected human primary macrophages. PLoS Pathogens 7: e1001340. - NL was responsible of proteomic work, data analysis and writing the manuscript. III Rintahaka J, Lietzén N, Öhman T, Nyman TA, Matikainen S (2011) Recognition of cytoplasmic RNA results in cathepsin-dependent inflammasome activation and apoptosis in human macrophages. The Journal of Immunology 186: 3085-3092. - NL did the protein identifications and database searches for proteomic part of the work and participated in making the manuscript. IV Lietzén N*, Natri L*, Nevalainen OS, Salmi J, Nyman TA (2010) Compid: A new software tool to integrate and compare MS/MS based protein identification results from Mascot and Paragon. Journal of Proteome Research 9: 6795-6800. - NL was responsible of testing the program and writing the manuscript. NL also participated in developing the concept of the tool. V Piippo M, Lietzén N, Nevalainen OS, Salmi J, Nyman TA (2010) Pripper: prediction of caspase cleavage sites from whole proteomes. BMC Bioinformatics 11: 320. - NL was responsible of testing the tool and analyzing proteomic data with the tool. NL participated in writing the manuscript. *,# Authors with equal contribution The original articles were reprinted with the permission of the original copyright holders. ABBREVIATIONS 2-DE two-dimensional gel electrophoresis 2D DIGE two-dimensional differential gel electrophoresis ESI electrospray ionization FDR false discovery rate FT-ICR fourier transform-ion cyclotron resonance GO Gene Ontology HILIC hydrophilic interaction liquid chromatography IFN interferon ICAT isotope coded affinity tags iTRAQ isobaric tag for relative and absolute quantitation LC liquid chromatography LIT linear ion trap MALDI matrix assisted laser desorption ionization MS mass spectrometry MS/MS tandem mass spectrometry m/z mass-to-charge ratio NLR NOD-like receptor PAMP pathogen-associated molecular pattern pI isoelectric point PMF peptide mass fingerprint polyI:C polyinosic-polycytidylic acid PRR pattern recognition receptor Q quadrupole mass analyzer RLR RIG-I-like receptor ROS reactive oxygen species RPLC reversed-phase liquid chromatography SCX strong cation exchange chromatography SDS-PAGE sodium dodecyl sulfate polyacrylamide gel electrophoresis SEC size exclusion chromatography SILAC stable isotope labeling of amino acids in cell culture TLR toll-like receptor TMT tandem mass tag TOF time-of-flight mass analyzer ABSTRACT Proteomics is defined as large-scale study of proteins, and with current proteomic methods thousands of proteins can be identified and quantified from a single experiment. Efficient methods have also been developed for protein localization, posttranslational modification and interaction studies. Mass spectrometry has an important role in proteomics and it is currently used in almost all proteomic experiments to detect and characterize the proteins or peptides in a sample. In addition, various bioinformatics tools have become increasingly important for proteomics by improving data analysis and by helping in the biological interpretation of complex proteomic data. The combination of proteomics and bioinformatics is nowadays an important tool to study cellular signaling mechanisms under different conditions, for example viral infection. Viruses entering a host cell are first recognized by host´s innate immune receptors. This recognition activates multiple signaling cascades resulting in antiviral immune responses, inflammation and finally programmed cell death, apoptosis, of the infected cell. The detailed mechanisms of host cell defense responses activated after viral infection are still partially unknown. The aim of this project was to develop and utilize proteomic and bioinformatic methods to characterize host responses to viral infection. Three different proteomic approaches were used in this project to study virus-induced changes in the proteomes of human epithelial cells and macrophages. First, cytosolic viral RNA-induced responses in HaCaT keratinocytes were studied using cell fractionation, two-dimensional gel electrophoresis and mass spectrometry (MS). Second, influenza A virus-induced changes in the mitochondrial, cytoplasmic and nuclear cell fractions as well as in the secretomes of human primary macrophages were characterized using iTRAQ labeling-based quantitative proteomics. Third, cytosolic viral RNA-triggered protein secretion from human primary macrophages was studied using qualitative high-throughput proteomics utilizing SDS-PAGE and liquid chromatographic separations and MS. Various bioinformatics tools were also used to analyze the protein identification and quantitation data. In addition, two computational tools, Compid and Pripper, were developed to simplify the analysis of our proteomic data. Compid simplifies the comparison of protein identification results from different database search engines and Pripper enables the large-scale prediction of caspase cleavage products and their identification from the collected MS data. Our studies showed that both influenza A virus and cytosolic viral RNA trigger significant changes in the proteomes of human primary macrophages and HaCaT keratinocytes. Virusinduced changes in the expression of 14-3-3 signaling proteins as well as rearrangement of host cell cytoskeleton were detected in HaCaT keratinocytes. Caspase-3-dependent apoptosis was detected in polyI:C transfected HaCaT keratinocytes as well as in influenza A virus infected and polyI:C transfected human primary macrophages. Our studies with human primary macrophages also showed that several inflammatory pathways, and especially the NLRP3 inflammasome, are activated as a result of viral cytosolic RNA and influenza A virus infection. Additionally, we showed that cathepsins, src tyrosine kinase and P2X7 receptor were involved in the inflammasome activation. Finally, we showed that influenza A virus infection and polyI:C transfection triggered extensive secretion of various different proteins. In conclusion, our proteomic experiments have given an extensive view of cellular events activated in human macrophages and keratinocytes after viral infection. 1. INTRODUCTION Host cell defense responses against viruses are initiated immediately after viruses’ invasion to the cell. Innate immune system is a complex network of interconnected biological pathways that are responsible of organism’s first defence responses against viruses. Therefore, the study of individual molecules or pathways is usually not sufficient to describe the effects of virus on host cells (Gardy et al. 2009). Proteomics can be used to study protein expression levels, localizations, posttranslational modifications and interactions in a cell at certain conditions (Fields 2001). Various different methods have been developed for these purposes, and especially with mass spectrometry (MS)-based proteomics, thousands of proteins can be characterized in a single experiment (Geiger et al. 2012, Boisvert et al. 2012, Phanstiel et al. 2011). Proteomics has been used, for example, to study the expression and functions of viral proteins (Shaw et al. 2008). Additionally, proteomics has been used to study interactions between host and virus proteins (Naji et al. 2012) as well as virus-induced changes in host cell proteomes (Vogels et al. 2011, Emmott et al. 2010b). Proteomic studies can give important information about the cellular mechanisms activated by viral infection and could be utilized for example to evaluate the pathogeneity of different viruses (Rasheed et al. 2009) or to develop drugs and vaccines against viruses. In this Ph.D. project, different proteomic approaches were utilized to study virus-induced events in human primary macrophages and epithelial cells. 1.1 PROTEINS AND PROTEOMICS Proteins are the workhorses of a cell. They are the molecular instruments expressing genetic information stored in DNA or RNA. Proteins are involved in almost all biological processes of a cell. Gene expression and thus the level of proteins in a cell is constantly regulated by several different processes. Rate of transcription, posttranscriptional processing and degradation of mRNA as well as rate of translation, posttranslational modification, degradation and transport of proteins all affect on the protein contents of a cell in certain conditions (Figure 1). The most important factor affecting protein levels in a cell is the rate of translation (Schwanhäusser et al. 2011). Therefore, the study of protein contents of a cell gives the most information about the biological processes active in a cell at certain conditions. 1 Figure 1. Gene expression in eukaryotic cells is regulated at several different stages. Proteome is the entire set of proteins expressed by a cell, tissue or organism at given time under certain conditions (Wilkins et al. 1996). The concept of proteomics evolved in 1990s to describe large-scale studies of proteins. In addition to protein levels, protein-protein interactions, protein localization and posttranslational modifications influence the physiological state of a cell. Therefore, the aim of proteomics is to identify all the proteins present in a sample, to quantify them and to study their localizations, posttranslational modifications and interactions (Fields 2001). At present, thousands of proteins can be identified and quantified in a single proteomic experiment (Geiger et al. 2012, Boisvert et al. 2012, Luber et al. 2010). Efficient methods have also been developed for extensive studies of protein posttranslational modifications and interactions (Phanstiel et al. 2011, Kim et al. 2011, Li et al. 2011, Rees et al. 2011). Thus, proteomics can be used to study molecular mechanisms active in a variety of biological systems. 1.2 METHODS IN PROTEOMICS There is a large variety of methods available for proteomic experiments nowadays. The method of choice is often determined by the biological question, sample material, costs and instrumentation available for the experiments. However, most of the modern proteomic experiments utilize MS in the analysis of complex protein samples. MS-based proteomic analyses can be roughly divided into two classes: bottom-up and top-down proteomics (Kelleher et al. 1999). The most common approach at the moment is bottom-up proteomics which relies on MS analysis of proteolytic peptides followed by protein inference using computational methods. Despite the wide range of methods available, there is a common workflow for almost all bottom-up proteomic experiments (Figure 2). In top-down proteomics, intact proteins are 2 injected into a mass spectrometer and fragmented there to characterize them. Top-down proteomic experiments are still quite rare because of both technical challenges and the difficulties in data analysis (Zhou et al. 2012). Figure 2. General workflow for bottom-up proteomic experiments. 1.2.1 Sample prefractionation Proteomic samples originate from various sources: for example from cell cultures, tissues or biological fluids. There are often thousands or tens of thousands of distinct proteins in one sample and concentration range between low- and high-abundant proteins can be several orders of magnitude. Therefore, various prefractionation and enrichment methods are often needed to improve the analysis of such complex samples. Subcellular fractionation is often used in proteomics to simplify complex samples. Mitochondrial, cytoplasmic, and nuclear fractions as well as other cell compartments can be extracted from intact cells and studied separately (Andreyev et al. 2010, Qattan et al. 2010, Du et al. 2010). Enrichment of proteins into different cell fractions may facilitate the detection of low-abundant proteins (Du et al. 2010). Additionally, subcellular fractionation can give important insights into cellular events since protein localization is often important for its function (Qattan et al. 2010). Sucrose gradient density centrifugation, immunoaffinity purification and free-flow electrophoresis are often used for cell fractionation in proteomic experiments (Hartwig et al. 2009, Lee et al. 2010). In addition, different commercial kits have been developed for the enrichment of specific subcellular organelles (Hartwig et al. 2009). Regardless of the method used for cell fractionation, the enriched fractions usually contain impurities from other cell compartments. On the other hand, proteins may also exist in multiple subcellular compartments and those localizations may vary between different conditions (Qattan et al. 2010, Boisvert et al. 2010, Lee et al. 2010). Also, database annotations of proteins´ subcellular locations are still incomplete and often show only one location per protein. Therefore, it can be difficult to evaluate the quality and results of cell fractionation experiments and care must be taken when reporting these results. 3 In addition to cell fractionation, several different affinity-based methods can be used to extract a certain group of proteins from complex biological samples. In phosphoproteomics, several different affinity enrichment methods like immunoaffinity enrichments, titanium dioxide chromatography and immobilized metal ion affinity chromatography are used to extract phosphoproteins or –peptides from complex protein mixtures (Thingholm et al. 2009). Affinity enrichments are also used to study other types of posttranslational modifications like protein glycosylation (Vandenborre et al. 2010) and ubiquitination (Hjerpe et al. 2009) as well as protein-protein interactions (Rees et al. 2011). 1.2.2 Protein and peptide separation After the initial prefractionation or enrichment steps, proteomic samples might still contain hundreds or thousands of distinct proteins making direct MS analysis of these complex samples challenging. When multiple peptides are introduced to the mass spectrometer simultaneously, the instrument may not have enough time to fragment and analyze all of them. Additionally, simultaneous ionization of multiple different peptides may result in signal suppression based on different ionization properties of peptides causing signal losses for some of the peptides (Horvatovich et al. 2010). Therefore, efficient separation of peptides prior to MS is important to decrease the number of different peptides entering the mass spectrometer simultaneously and to increase the dynamic range of analysis. Increased separation efficiency can also result in more peptide identifications and thus more and better quality protein identifications per sample. The most common separation methods used in proteomics are two-dimensional gel electrophoresis (2-DE), sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDSPAGE) and liquid chromatography (LC). Complexity of samples introduced to the mass spectrometer varies clearly between these methods (Figure 3). Simple peptide mixtures originating from one or few proteins can be extracted from 2D gels and introduced to the mass spectrometer. SDS-PAGE and LC separations, on the other hand, result in highly complex peptide mixtures where links between original proteins and resulting peptides are mostly or completely lost prior MS analyses. Therefore, each of these separation methods has its own requirements for MS analysis and the following protein identification. 4 Figure 3. Protein/peptide separation methods commonly used in proteomics. A) 2-DE separation of proteins results in protein spots containing only one or a few proteins. The spots are excised from the gel individually followed by MS analysis. B) SDS-PAGE gives rough separation of proteins and is often followed by LC separation prior MS analysis. C) In LC separation-based approaches, the whole sample is digested into peptides prior separation. Thus, all LC fractions analyzed by MS contain peptides from multiple proteins. 1.2.2.1 Gel-based methods In the early days of proteomics, two-dimensional gel electrophoresis was the most important method for protein separation (Wilkins et al. 1996, Görg et al. 2004). In 2-DE, proteins are first separated based on their isoelectric points (pI) using isoelectric focusing followed by size-based separation using SDS-PAGE. As a result, a two-dimensional map of protein spots is created in the gel. The protein spots are then visualized using different staining methods, for example fluorescent dyes (Ünlü et al. 1997, Berggren et al. 2000) or silver staining (O´Connel et al. 1997). Quantitation of protein spots is done based on the intensity of staining in each spot. Finally, the protein spots of interest can be picked from the gel for identification, in-gel digested, and the resulting peptides can be analyzed by MS. 2-DE aims at complete separation of proteins resulting in gels where each spot represents only one protein. In a routine experiment, even 2000 protein spots can be separated in a single gel providing adequate resolution for many proteomic experiments (Görg et al. 2004). 2-DE is a valuable method to study for example protein degradation since protein fragments can be easily 5 detected based on their vertical positions in a gel (Bredemeyer et al. 2004). 2-DE can also be used to visualize changes in protein posttranslational modifications since, for example, different phosphorylation and glycosylation states of a protein can often be seen as a horizontal series of spots in a 2D gel (Koponen et al. 2011, Di Michele et al. 2010). However, 2-DE has limitations in the analysis of certain protein classes: the limited solubility of hydrophobic proteins hinders their analysis with 2-DE (Rabilloud et al. 2010). Also, proteins with extreme pIs can often not be detected in a 2D gel because of the limited pI range in isoelectric focusing. Another method for protein separation in a gel is one-dimensional SDS-PAGE. It is a robust method for protein separation and is more universal than 2-DE because it does not suffer from limited solubility of hydrophobic proteins or limited protein pI range in isoelectric focusing. In SDS-PAGE, proteins are separated in a gel based on their size, and only partial separation of proteins in a complex mixture is achieved. SDS-PAGE is often used as a first separation step for complex protein samples and is usually followed by slicing of the gel into pieces, in-gel digestion of proteins in each gel piece and finally identification of the extracted peptides by LCMS/MS (Fang et al. 2010, Boisvert et al. 2012, Savijoki et al. 2011). For quantitative analysis, SDS-PAGE separations can also be easily combined with protein labeling using stable isotopes (Ong et al. 2003, Boisvert et al. 2012). 1.2.1.2 Chromatographic methods In addition to gel-based separation, liquid chromatography (LC) is often used in proteomics to simplify complex samples prior MS analysis. Since proteins are very diverse in their chemical properties, it is usually easier to optimize an LC separation for a set of peptides with more uniform characteristics (Motoyama and Yates 2008). Several different LC methods can be used for peptide separation. Reversed-phase liquid chromatography (RPLC) separates peptides based on their hydrophobicity, strong cation exchange chromatography (SCX) based on their ionic character, hydrophilic interaction liquid chromatography (HILIC) based on peptide hydrophilicity and size exclusion chromatography (SEC) based on peptide size. Unlike many other LC methods, SEC separations can also be easily optimized for a heterogeneous mixure of intact proteins and it is therefore used in proteomics to separate both peptides and proteins (Lee et al. 2011, Wisniewski et al. 2010). Finally, affinity chromatography is used for peptide or protein separation based on certain specific characteristics like posttranslational modifications (Thingholm et al. 2009). 6 Digestion of protein mixtures into peptides increases sample complexity significantly resulting in tens or hundreds of thousands of different peptides with a concentration range over several orders of magnitude. With such complex samples, a single separation step does usually not give sufficient separation. The power of multidimensional liquid chromatography in proteomics was recognized in the beginning of 21st century when Washburn et al. used a combination of RPLC and SCX to identify almost 1500 yeast proteins (Washburn et al. 2001). Since then, multidimensional LC has been used extensively for various proteomic purposes (Table 1). For maximal separation efficiency, retention mechanisms of each LC separation step should be as independent from each other as possible. Orthogonality studies of Gilar et al. showed that fairly good orthogonality for peptide separation can be achieved by various combinations of LC methods (Gilar et al. 2005). However, RPLC is usually chosen as the last separation method due to the compatibility of RPLC eluents with MS analysis. An alternative approach for multidimensional chromatographic analyses is COFRADIC (combined fractional diagonal chromatography), where peptides fractionated in the first dimension are enzymatically modified based on a specific characteristic (e.g. N-terminal peptides) followed by second separation step and peptide sorting (Gevaert et al. 2005). Here, the properties of peptides are modified instead of chromatographic conditions to yield a good separation for a specific peptide class. During the last 10 years, liquid chromatography has become an increasingly popular separation technique in proteomics. Due to the very limited amount of sample available for most experiments, development of efficient nanoscale LC separation methods has been important for proteomics (Shen et al. 2002). Additionally, multidimensional LC separations are easily automated, minimizing the amount of manual work in each analysis. Ultra high pressure LC systems have shown an increase in separation performance as well as a decrease in separation time, both features being important for high-throughput proteomic experiments (Nagaraj et al. 2012). Flexibility, variability and separation power of LC and its compatibility with many other primary separation methods such as SDS-PAGE and peptide isoelectric focusing (Boisvert et al. 2012, Martins-de-Souza et al. 2009) have made LC a central method in proteomics. 7 Table 1. Examples of different multidimensional LC separations used in proteomic experiments. pepSEC = size exclusion chromatography for peptides. 1st dimension HILIC 2nd dimension RPLC pepSEC study reference phosphoproteomics for estrogen-induced transcriptional regulation Wu et al. 2011 RPLC biomarkers for hepatocellular carcinoma Lee et al. 2011 RPLC (pH 10) RPLC (pH 2) human NK cell proteome Dwivedi et al. 2008 SCX RPLC iTRAQ quantitation of Saccharomyces cerevisiae proteome Ross et al. 2004 SCX RPLC yeast proteome Washburn et al. 2001 1.2.2 Mass spectrometry in proteomics Mass spectrometry is a technique used to measure mass-to-charge ratios (m/z) and abundancies of gas phase ions that are introduced into a mass spectrometer. The technique can be used to analyze any molecule that can be converted into a sufficiently stable gas phase ion. In 1980s, development of electrospray ionization (ESI) (Yamashita and Fenn 1984) and matrix assisted laser desorption ionization (MALDI) (Karas et al. 1985) enabled the efficient ionization and subsequent MS analysis of large biomolecules like proteins and peptides (Fenn et al. 1989, Hillenkamp et al. 1990). A few years later, nanoESI was developed for the efficient analysis of small sample amounts (Wilm & Mann 1996). These developments in biological mass spectrometry started a new era of protein analysis and were also crucial for the development of modern MS-based proteomics. Sensitivity, variability and resolution of MS are nowadays utilized in many fields of proteomics, for example in protein identification and quantitation (Boisvert et al. 2012, Bluemlein et al. 2011, Luber et al. 2010), characterization of posttranslational modifications (Kim et al. 2011, Rajimakers et al. 2010) and study of protein complexes and protein-protein interactions (Li et al. 2011, Rees et al. 2011). Different types of instruments have been built to meet the requirements for different types of proteomic experiments (Table 2). Quadrupole mass analyzers (Q), time-of8 flight (TOF) analyzers, fourier transform ion cyclotron resonance instruments (FT-ICR), ion traps and various hybrid instruments have been utilized in proteomics, each of them having their own advantages and disadvantages (Domon and Aebersold 2006). One of the newest inventions, Orbitrap mass analyzer with high mass accuracy and high resolution (Makarov 2000, Hu et al. 2005), has pushed the boundaries of proteomics by increasing the amount of information that can be collected from the complex samples. Table 2. Recent examples of proteomic experiments utilizing different mass analyzers. large-scale proteome characterization quantitative proteomics LTQ-Orbitrap Geiger et al. 2012, Nagaraj et al. 2011, Trost et al. 2009 Boisvert et al. 2012, Monetti et al. 2011, Luber et al. 2010 Q-TOF Savijoki et al. 2011, Dwivedi et al. 2008 Bewley et al. 2011 mass analyzer TOF-TOF FT-ICR Pounds et al. 2008 QQQ posttranslational modification analysis Kim et al. 2011, Monetti et al. 2011, Lemeer et al. 2012 Rajimakers et al. 2010, Lemeer et al. 2012 Holland et al. 2011 Holland et al. 2011, Lemeer et al. 2012 Collier et al. 2010 Wang et al. 2011 Bluemlein et al. 2011 1.2.2.1 Protein identification by mass spectrometry In bottom-up proteomics, mass spectrometer is used to detect and identify peptides rather than proteins (Kelleher et al. 1999). Protein identifications are then retrieved based on peptide identification data using different computational tools. MS-based peptide identification and protein inference can be done by peptide mass fingerprinting (PMF) or by utilizing tandem mass spectrometry (MS/MS). 9 In PMF analyses, all peptides from one or a few proteins are ionized and introduced to the mass spectrometer simultaneously. Mass spectrometer measures all the masses of ionized peptides, and this combination of peptide masses is considered to be a characteristic “fingerprint” of a protein that can be searched for (Figure 4A) (Gevaert and Vandekerckhove 2000). In PMF, protein identifications are based solely on unique sets of peptide masses and no information about peptide sequences is collected. Therefore, several peptide masses have to be detected for each protein to be able to uniquely assign these masses to a certain protein. PMF analyses are typically used for samples that contain peptides from only one or a few proteins. Therefore, 2DE separation of proteins followed by MALDI-TOF analyses of protein spots is the most common workflow for PMF analyses. Figure 4. Principles of protein identification based on A) PMF and B) MS/MS analyses. In MS/MS-based analyses, m/z-values of the ionized peptides are measured first, followed by fragmentation of selected m/z-values and detection of the resulting fragment ions (Figure 4B) (Domon and Aebersold 2006). Fragment ion masses contain information about peptide sequence and thus, both intact peptide masses and sequence information can be retrieved resulting in more reliable protein identifications. MS/MS analyses are used especially in shotgun proteomic experiments where complex proteomic samples are first digested followed by LC separation and ESI-MS/MS analysis of the resulting peptides. 10 1.2.3 Database search engines in protein identification High-throughput LC-MS/MS experiments can produce even one million mass spectra per experiment making manual data interpretation impossible. Therefore, various database search engines have been developed to process raw MS data. Sequence searching is the most common method for MS-based protein identification in proteomics. Mascot (Perkins et al. 1999), Sequest (Eng et al. 1994), X!Tandem (Craig et al. 2004) and Paragon (Shilov et al. 2007) are commonly used sequence database search engines for proteomic purposes. In sequence searching, data analysis consists of two consecutive steps: peptide identification and protein inference (Deutsch et al. 2008, Nesvizhskii 2007). In the first step, in silico digestion of all proteins in the protein sequence database is performed. The peptides are created and studied based on user-defined criteria like enzyme specificity, mass tolerance and potential posttranslational modifications. Most database search engines try first to find matches between in silico peptides and experimental data based on intact peptide masses. In silico fragment ion spectra are then created for candidate peptides. After this, a list of potential peptide-spectrum matches is created and qualities of each match are evaluated based on different scoring schemes. In the second step of analysis, peptide identifications are grouped to yield protein identifications. Most database search engines have their own grouping algorithms that handle peptide identification data in slightly different ways resulting in partially different protein identification results for the same set of data. However, a common principle in most search engines is to try to find a minimum set of proteins that can explain all the identified peptides. Sequence searching is a suitable method to study already sequenced organisms since only peptides and proteins whose sequence is present in a database can be detected (Nesvizhskii 2007). Another type of search engines identifies peptides based on spectral matching of previously observed and identified MS/MS spectra with the collected MS/MS spectra (Lam 2011, Craig et al. 2006, Lam et al. 2007). At present, vast numbers of MS/MS spectra are stored in different data repositories, and it is possible to build extensive spectral libraries from these data (Lam et al. 2011). Comparison between spectral matching search engine SpectraST and sequence database search engine Sequest showed that spectral matching can be a faster and more accurate method for peptide identification than traditional sequence database search engines (Lam et al. 2007). However, quality of MS/MS spectra included in libraries and limitations in library coverage have to be considered when identifying peptides based on spectral matching. 11 A third type of search engines are de novo sequencing algorithms which try to read the peptide sequence directly from MS/MS spectra (Deutsch et al. 2008, Nesvizhskii 2007). This method requires no prior knowledge of peptide sequences that are identified. However, de novo sequencing is a computationally heavy process and requires good-quality MS/MS spectra for peptide identification. Additionally, problems with protein inference may appear with complex samples. Since protein identifications can be performed in numerous different ways using various tools, it is important to be able to evaluate the quality of protein identifications retrieved from a search engine. Most database search engines like Mascot, Sequest and Paragon use a statistical scoring mechanism to assess the reliability of protein identifications. In addition, false discovery rates (FDRs) based on for example target-decoy searches are often used to evaluate the reliability of identifications (Elias and Gygi 2007). In target-decoy strategies, database searches are performed against a composite database of target protein sequences and decoy sequences, the reversed or randomized counterparts of the target protein sequences. Based on the assumption that an incorrect peptide assignment is equally likely to originate from a target or a decoy database, the number of decoy identifications can be used to estimate the total number of incorrect assignments. However, the identities of these false positive assignments can not be determined. 1.2.4 Quantitative proteomics In proteomics, it is often necessary to study changes in protein levels in different conditions. Although proteomic methods for both relative and absolute quantitation have been developed, absolute quantitation is rarely performed in proteomic experiments (Elliott et al. 2009). Peptide´s physicochemical properties affect its ionization efficiency and thus the detected signal intensity in a mass spectrometer. Therefore, a reference ion with known concentration is always required to determine the absolute amount of interesting peptides in a sample. This requirement of reference compounds limits the use of absolute quantitation in proteomics. Instead, most proteomic experiments utilize relative quantitation to study changes in protein levels between different samples. This relative quantitation of proteins can be achieved using either gel-based or MS-based quantitation methods. 12 In gel-based quantitative proteomics, proteins are usually separated using 2-DE and quantitation is performed based on protein spots detected from the gel. Gels from different samples can be matched and quantitation done based on the intensities of corresponding protein spots in distinct gels. Here, fluorescent dyes (Berggren et al. 2000, Ünlü et al. 1997) or silver staining (Chevallet et al. 2006) are often used for spot detection. Two samples can also be labeled with different fluorophores and separated in the same gel using two-dimensional differential gel electrophoresis (2D DIGE) (Ünlü et al. 1997). Then, relative quantitation is done based on the intensities of the different fluorophores in the same spot. The development of 2D DIGE has improved the quantitation accuracy as well as sensitivity of gel-based quantitative proteomics (Marouga et al. 2005). However, dynamic range of different staining methods used in 2-DE is limited compared to the huge differences in protein abundances in real biological samples (Rabilloud et al. 2010). Sensitivity of staining methods is also sometimes limited hindering the detection of low abundance proteins from the gel and thus their identification and quantitation. In addition, complete separation of proteins is required for quantitative analysis because if more than one protein is present in a single spot, quantitation data cannot be assigned to either of them. In MS-based quantitative proteomics, different labeling methods or label-free approaches can be used for protein quantitation. Some of the most common labeling strategies in proteomics are stable isotope labeling of amino acids in cell culture (SILAC) (Ong et al. 2002), isotope-coded affinity tags (ICAT) (Gygi et al. 1999), isobaric tag for relative and absolute quantitation (iTRAQ) (Ross et al. 2004) and tandem mass tags (TMT) (Dayon et al. 2008). In SILAC-based quantitation, protein labeling occurs in cell culture when heavy or light isotopes of common amino acids like arginine and lysine are incorporated metabolically into proteins (Figure 5) (Ong et al. 2002). The labeling is done prior any treatment of the samples minimizing technical variations in sample preparation and analysis. If both lysine and arginine are used in labeling, at least one amino acid in each tryptic peptide should be labeled resulting in the detection of multiple labeled peptides per protein. A mass shift of a few Daltons is detected between the differentially labeled forms of a peptide and relative quantitation of peptides is then performed by comparing MS peak areas of these differentially labeled forms of each peptide. Although SILAC is often used to compare only two or three parallel samples, 5plex SILAC experiments have also been published (Molina et al. 2009). However, multiplexing increases the SILAC sample complexity significantly making the MS analysis more difficult. Finally, for adequate incorporation of labels, viable cell lines that can be cultured long enough are required. 13 Figure 5. SILAC-, ICAT- and iTRAQ-based quantitation in proteomics. In each of these methods, labeling of proteins/peptides is done at a different stage (first coloured boxes). In SILAC and ICAT experiments, quantitation is based on MS data whereas in iTRAQ, quantitation is based on MS/MS data. ICAT is a protein labeling method where “heavy” or “light” biotinylated tags are attached to cysteine residues of proteins (Figure 5) (Gygi et al. 1999, Hansen et al. 2003). The proteins are then digested and labeled peptides are enriched using affinity chromatography. Since cysteine is a rare amino acid, enrichment of labeled peptides simplifies the sample mixture significantly. In traditional ICAT method, differentially labeled forms of each peptide show an 8 Da mass difference in MS spectra (Gygi et al. 1999) whereas in the newer, cleavable ICAT method, the corresponding mass difference is 9 Da (Hansen et al. 2003). Quantitation in ICAT can be performed by comparing the peak areas of differentially labeled peptides. Due to the low number of cysteine-containing peptides, ICAT quantitation of a protein is often based on only one or two peptides making the results prone to errors. iTRAQ and TMT are chemical labeling methods where isobaric tags are attached to peptides after protein digestion (Figure 5)(Ross et al. 2004, Dayon et al. 2008). Both tags are structurally very similar and react with free amino groups of peptides, i.e. N-termini and lysine residues. Each isobaric tag contains a cleavable reporter ion group with a specific mass. Differentially labelled forms of each peptide can be distinguished from each other only after peptide 14 fragmentation in mass spectrometer when the reporter ion groups are cleaved from the peptide. The quantitation is then performed based on the reporter ion peak areas in MS/MS spectra. Both iTRAQ and TMT are multiplexed methods allowing the analysis of four (4plex iTRAQ), six (6plex TMT) or even eight (8plex iTRAQ) samples in parallel (Ross et al. 2004, Dayon et al. 2008, Pierce et al. 2008). The structure of an iTRAQ label and the principle of iTRAQ-based quantitation are shown in Figure 6. Figure 6. 4plex iTRAQ labeling. A) Structure of an iTRAQ label and attachment of the label into peptide. B) Differentially labelled forms of one peptide elute simultaneously from RPLC and have the same total mass but can be separated based on MS/MS spectra. Label-free quantitation can be performed based on peptide peak areas in MS spectra (Bondarenko et al. 2002) or based on spectral counts at MS/MS level (Liu et al. 2004). In both approaches, all the samples are analyzed individually and data analysis and comparison is done computationally after MS analyses. Thus, an unlimited number of samples can be compared with each other. In signal intensity-based measurements, peptide peaks from different runs are matched based on retention times and peptide masses (Bondarenko et al. 2002). Relative quantitation of peptides and subsequently proteins is then performed based on differences in peptide peak areas between runs. Technical reproducibility of LC-MS/MS analyses and minimal overlap of peptides are extremely important because quantitation relies completely on matching the MS data between runs. Spectral counting, on the other hand, is based on the idea that in MS/MS experiments performed using data-dependent acquisition more abundant peptides will 15 be selected for fragmentation more often (Liu et al. 2004).The method can be refined by taking into consideration for example the number of detectable tryptic peptides for a protein (Rappsilber et al. 2002) and the properties of these peptides (Lu et al. 2007). Both spectral counting- and peptide peak intensity-based label-free quantitation methods have been successfully applied in large-scale proteomic studies (Luber et al. 2010, Mosley et al. 2009, Old et al. 2005). 1.2.5 Data analysis Database searches of large proteomic datasets result in lists containing thousands of protein identifications. In addition, quantitative data and information about protein posttranslational modifications are often included in these lists. It is extremely difficult to deduce potentially relevant biological processes by the manual inspection of collected data. Thus, numerous bioinformatics tools have been developed to help data interpretation. Functional classification of the identified proteins is often one of the first data analysis steps after database searches. Gene Ontology (GO) database comprises of a well-standardized set of biological processes, molecular functions and cellular compartments associated to different gene products (Ashburner et al. 2000). Currenly, more than 500 000 gene products from several different organisms are annotated in the database. These annotations are often utilized in the initial characterizations of proteomic datasets. AmiGO is the official GO database browsing tool that can be used for example to retrieve GO annotations for a single protein or for simple visualizations of the database´s hierarchical structure (Carbon et al. 2009). GO analyses of large proteomic datasets can be performed using several different bioinformatic tools, such as GeneTrail (Backes et al. 2007) and GOMiner (Zeeberg et al. 2003). These tools utilize different statistical methods to find GO categories that are over- or underrepresented in the dataset of interest compared to a reference dataset such as the genome of the selected organism (Backes et al. 2007, Zeeberg et al. 2003). GO database is manually annotated, and each annotation has an evidence code describing the type of evidence supporting the annotation (Dimmer et al. 2008). This allows the user to evaluate the reliability of the GO analysis results. Although GO classification is a valuable tool in proteomics, it does not provide detailed mechanistic information about the cellular events. Therefore, pathway analyses and protein-protein interaction analyses are used to retrieve more detailed information about the cellular events. 16 Pathway analyses can be used to study the role of the identified proteins in well-defined biomolecular reactions. The Kyoto Encyclopedia of Genes and Genomes (KEGG) PATHWAY database contains several maps of biochemical pathways, especially metabolic pathways (Ogata et al. 1999). Reactome is another large biological pathway database, which is focused especially on human proteins (Joshi-Tope et al. 2005). Pathway databases contain information about physical and functional interactions between proteins and the annotations in these databases are usually manually curated (Malik et al. 2010). KEGG and REACTOME pathway databases can be mined directly using protein identification data from proteomic experiments. Additionally, some of the GO classification tools, such as GeneTrail and PANTHER, can be used for overand underrepresentation analysis of biological pathways (Backes et al. 2007, Thomas et al. 2003). Also some additional bioinformatics tools, such as ExPlain, have been developed for biological pathway analyses (Zubarev et al. 2008). Protein-protein interactions can provide important information about the functional complexes and intermolecular associations in a cell. Protein-protein interaction databases contain both experimentally determined and computationally predicted information about physical and functional interactions between proteins (Malik et al. 2010). Various tools such as String (Jensen et al. 2009), PINA (Wu et al. 2009) and Cytoscape (Shannon et al. 2003) mine the existing protein interaction data stored in these databases and can be used to create proteinprotein interaction networks based on these data. PINA and Cytoscape are also capable of integrating functional data into protein-protein interaction networks making them extremely useful and efficient tools in proteomic data analyses (Wu et al. 2009, Shannon et al. 2003). Cytoscape can also be used to incorporate quantitative data into networks and the numerous visualization and analysis possibilities available with Cytoscape make it the most comprehensive visualization tool available for complex proteomic data (Shannon et al. 2003). Since the reliability of annotations in protein-protein interaction databases varies widely, it is important to be aware of the quality of interaction data used in the networks. Filtering out lowquality interactions prior analyses can also simplify the interpretation of the data. Combinations of Gene Ontology classification, pathway analyses and protein-protein interaction networks are often used when analyzing complex proteomic data. These computational analyses can help building biological hypotheses based on proteomic data. However, different functional experiments are required to verify these hypotheses. 17 1.3 INNATE IMMUNE SYSTEM Immune system comprises of various cells and molecules that work to protect an organism against pathogens. It can be divided into two categories: innate immune system and adaptive immune system. Innate immune system is organism´s first line of defence against pathogens. It is responsible for the early detection of invading pathogens and launching of the first immune reactions to eliminate the pathogen (Murphy et al. 2008). The conserved defence mechanisms of innate immune system are triggered immediately after the pathogen has been detected and they protect the organism until the adaptive immune responses against the pathogen have been developed. In addition, innate immune system is required for the development of adaptive immune responses that are fully activated only several days after infection. Adaptive immune responses are targeted specifically against the detected pathogen and are thus more efficient than the unspecific innate immune responses. In addition, adaptive immune system is capable of generating immunological memory that ensures faster and more efficient adaptive immune responses if the same pathogen is re-encountered later. When organisms encounter with a pathogen, epithelial cells of skin and mucous form the first physical barrier between the pathogen and internal parts of an organism (Medzhitov 2007, Kupper and Fuhlbrigge 2004). Infections can occur only when the pathogen passes this physical barrier. Epithelial cells recognize invading pathogens with their pattern-recognition receptors (PRRs) resulting in the secretion of antimicrobial peptides, chemokines and cytokines. These molecules function as signals of infection to immune cells. Different types of immune cells are activated as a result of infection (Janeway and Medzhitov 2002, Murphy et al. 2008). Macrophages and dendritic cells are important phagocytes present in most tissues. They are activated at early stages of infection when pathogens cross the epithelial barrier. One of their important tasks is to engulf and digest pathogens. In addition, they orchestrate immune responses by inducing inflammation and secreting cytokines and chemokines that activate other immune cells and recruit them to the site of infection. Finally, macrophages and dendritic cells can also function as antigen presenting cells helping the development of adaptive immune responses. Other types of phagocytes working in the innate immune system are neutrophils, eosinophils and basophils that can be activated by inflammatory cytokines and chemokines (Janeway and Medzhitov 2002). 18 1.3.1 Innate immune recognition of pathogens Host defence against invading pathogens is initiated when host cells recognize specific microbial components called pathogen-associated molecular patterns (PAMPs) (Janeway and Medzhitov 2002, Akira et al. 2006). PAMPs contain various structures that are essential for microbes, for example viral RNA, DNA and bacterial lipopolysaccharide. Host PRRs are germline-encoded receptors designed to differentiate between self and non-self structures and thus to detect foreign microbial structures invading the cell. Several distinct PRRs are found in mammals, each with their own specificities and roles in the innate immune recognition of pathogens. Toll-like receptors (TLRs) are the best characterized group of PRRs. They are integral membrane glycoproteins with extracellular domains for the recognition of PAMPs and intracellular domains for signaling (Kumar et al. 2011). Human TLRs 1, 2, 4, 5 and 6 are expressed on cell surface whereas TLRs 3, 7, 8 and 9 are found from endolysosomal membranes. Endolysosomal TLRs are specialized for the recognition of viral nucleic acids (TLR3 for viral dsRNA, TLR7/8 for viral ssRNA and TLR9 for viral DNA) whereas cell surface TLRs recognize mostly bacterial and fungal structures. Recognition of PAMPs by TLRs results in the recruitment of different adaptor molecules such as MyD88 and TRIF. This initiates signaling events resulting in the activation of transcription factors, for example NF-B, IRF3/7 and MAP kinases, and the production of proinflammatory cytokines and type I interferons (Figure 7). Another group of PRRs is RIG-I-like receptors (RLRs) found in the cytoplasm of host cells (Kumar et al. 2011). These receptors recognize viral RNA in the cytoplasm of infected cells. There are three receptors belonging to this family: RIG-I and MDA-5, both recognizing different types of viral RNA, and LGP2 which is a positive regulator of RIG-I- and MDA-5mediated signaling. When RIG-I or MDA-5 are activated with viral RNA, they interact with mitochondrial-antiviral signaling protein (MAVS) located on mitochondrial outer membrane and peroxisomes (Figure 7) (Dixit et al. 2010). Interactions with peroxisomal MAVS result in rapid interferon-independent expression of interferon-stimulated genes whereas mitochondrial MAVS activates type I interferon production with slower kinetics. In addition, activation of RLRs results in the production of pro-inflammatory cytokines. 19 NOD-like receptors (NLRs) are the third main group of PRRs sensing a wide range of ligands in the cytoplasm (Kumar et al. 2011). NOD1 and 2, for example, are cytoplasmic receptors for bacterial cell wall structures. NOD1- and NOD2-mediated recognition of PAMPS results in the activation of NF-B or MAP kinases inducing the production of proinflammatory cytokines. Another group of NLRs are the inflammasome components, whose activation results in the assembly of a protein complex called the inflammasome (Martinon et al. 2002). Inflammasome activation results in caspase-1 cleavage followed by the activation of proinflammatory cytokines IL-1 and IL-18 (Figure 7). One of the inflammasome components, NLRP3, is activated by bacterial and viral RNA as well as some endogenous danger signals (e.g. danger signal proteins) and environmental pollutants like asbestos. In addition, NLRP1 and NLRC4 are known inflammasome components recognizing various structures. Figure 7. Innate immune recognition of viral nucleic acids results in the production of interferons and pro-inflammatory cytokines. MAPK = MAP kinases. 20 1.3.2 Innate immune responses against viral infection Innate immune responses against viral infection are initiated when host PRRs recognize the invading virus. The most important innate immune responses against viral infection are antiviral responses, inflammation and apoptosis of the infected cells. Interferons (IFNs) are proteins produced and secreted by virus infected cells. They are antiviral agents helping infected cells to fight against invading viruses. From the three different classes of IFNs (type I, II and III interferons) type I IFNs (interferon and ) are the most central in innate immune responses against viral infection (Randall and Goodbourn 2008). Production of type I IFNs is initiated after host cell´s TLRs and RLRs have recognized the invading virus. TLRs 3, 7 and 9 as well as RIG-I and MDA-5 each activate type I IFN production via different signaling pathways resulting in the activation of interferon regulatory factors IRF3 and/or IRF7 (Akira et al. 2006). IFN produced by the infected cells are secreted and can be detected by interferon receptors on the surface of infected and neighboring cells (Randall and Goodbourn 2008). Activation of these IFN receptors initiates the production of interferon-stimulated genes. There are hundreds of interferon-stimulated genes involved in for example host cell transcription and translation, immune modulation, signaling and apoptosis (de Veer et al. 2001). The proteins encoded by these genes are the primary effectors of antiviral immune responses. Inflammatory responses triggered by viral infection aim at recruitment of leukocytes to the sites of infection and to the elimination of infectious agents. Macrophages residing in infected tissues are important triggerers of inflammatory responses (Medzhitov 2008). Inflammatory responses are initially triggered by PRRs when they recognize PAMPs or specific virulence factors. Additionally, at later phases of infection, inflammation can be induced by endogenous danger signal proteins like HMGB1 and S100A9 secreted by infected cells (Bianchi 2007). Activation of inflammatory pathways results in the production of proinflammatory cytokines (e.g. TNF- and IL-1) and chemokines that activate other immune cells and attract them to the site of infection as well as production of proteolytic enzymes like caspases and matrix metalloproteinases for host defence (Medzhitov et al. 2008). 21 Transcriptional and MAP kinase-mediated activation of pro-inflammtory cytokines can occur via various different pathways depending on the virus (Takeuchi and Akira 2010). TLRs 7 and 9 use MyD88-dependent pathways and TLR3 a TRIF-dependent pathway to activate NF-B transcription factor and thus the production of proinflammatory cytokines. RIG-I/MAVS interaction activates pathways resulting in, for example, NF-B activation. To complete the work of TLRs and RLRs in activating transcription of pro-inflammatory cytokines, a multiprotein complex called the inflammasome functions to activate proinflammatory cytokines IL-1 and IL-18 (Martinon et al. 2002). Inflammasomes consist of a cytoplasmic receptor, an adaptor protein ASC (apoptosis-associated speck-like protein containing a CARD) and caspase-1. There are different types of inflammasomes recognizing different viruses, for example NLRP3 inflammasome recognizing RNA viruses like influenza A virus and AIM2 inflammasome recognizing DNA viruses (Martinon et al. 2009, Hornung et al. 2009). These inflammasomes work as caspase-1 activating platforms (Martinon et al. 2002, 2009). The active caspase-1 can then cleave inactive pro-IL-1 and pro-IL-18 into their active forms which are secreted from the cell to induce inflammation (Pirhonen et al. 1999, 2001). If viral infection cannot be resolved through antiviral and inflammatory immune responses, programmed cell death, apoptosis, is activated to eliminate the infected cell (Lamkanfi and Dixit 2010). Apoptosis is a caspase-dependent, non-inflammatory form of programmed cell death (Zimmermann et al. 2001, Ting et al. 2008). It can be initiated intracellularily by the release of cytochrome c or other apoptogenic proteins from mitochondrial intermembrane space into the cytosol or via cell-death receptors on cell surface. Initiation of apoptotic events results in the activation of several apoptotic caspases. Caspases are the most central effector proteins activated during apoptosis. They are a group of cysteine proteases that are synthetized as inactive zymogens and can be activated by proteolytic cleavage of the protein (Crawford and Wells 2011). Caspases mediate their effects via aspartatespecific cleavage of their target proteins. Caspase-3 is one of the central molecules in apoptosis having several hundreds of known target proteins, and its activation is often held as a hallmark for apoptosis. In addition to caspase-3, several other caspases (caspase-2, -6, -7, -8, -9 and -10) are involved in apoptotic signalling. In addition to the apoptotic caspases, the human caspase family also includes the inflammatory caspases-1, -4 and -5 as well as caspase-14 involved in cellular remodelling. 22 Programmed cell death is an innate immune response that host cells can use to inhibit viral replication and thus to prevent the spread of virus in the infected organism (Best 2008, Lamkanfi and Dixit 2010). However, several viruses have evolved mechanisms to interfere with host cell death pathways (Lamkanfi and Dixit 2010, Kaminskyy and Zhivotovsky 2010). Some viruses have found ways to modulate the activity of caspases, central molecules in apoptosis. Other viruses, such as some herpes viruses, can inhibit apoptosis by encoding proteins that are homologous to cellular anti-apoptotic proteins of Bcl-2 family. Finally, apoptosis is not always beneficial for host. HIV-1 virus, for example, triggers apoptosis in infected host immune cells like dendritic cells and macrophages hindering development of proper immune responses (Kaminskyy and Zhivotovsky 2010). In addition to apoptosis, necrosis and pyroptosis can also be considered as forms of programmed cell death (Lamkanfi and Dixit 2010). However, the mechanisms related with these pro-inflammatory modes of cell death are still rather unclear. 1.3.3 Influenza A virus Influenza A viruses are negative-stranded RNA viruses belonging to the Orthomyxovirus family. They are highly pathogenic respiratory viruses capable of infecting avian and mammalian species. Annual epidemics of influenza A virus cause severe illnesses in millions of people worldwide. The severity of the infections varies from mild symptoms to severe illness and even death. During the last century, influenza A viruses have triggered four pandemics causing morbidity and mortality around the world. Influenza A virus genome consists of 8 RNA segments that encode 11 distinct proteins (Ludwig et al. 2003) (Figure 8). Each of these proteins has been studied extensively to elucidate their roles in influenza A virus pathogeneity. NS1 protein of influenza A virus has been often associated with viruses pathogenity. This protein interferes with RIG-I-mediated type I IFN production in many different ways. NS1 protein, for example, blocks RIG-I ubiquitination which is important for RIG-I/MAVS interaction (Gack et al. 2009). Another example of influenza A virus protein interfering with host immune system is matrix protein 2 (M2) which blocks autophagosome fusion in the infected cells (Gannagé et al. 2009). Finally, extensive interactions between influenza A virus and host cell proteins are likely to affect the 23 consequences of infection. Influenza-induced innate immune responses in host cells can be triggered by endosomal TLR3 and TLR7 recognizing viral dsRNA and ssRNA as well as by RIG-I recognizing viruses cytosolic RNA (Wu et al. 2011). The virus replicates in host cell nucleus and for replication, it has to hijack several host nuclear factors (Josset et al. 2008, König et al. 2010, Karlas et al. 2010). Therefore, several influenza A virus proteins interact with host cell nuclear machinery affecting for example nuclear structure and host cell splicing machinery. Several other interactions between influenza A virus and host cells have also been reported and they might have an impact on the state of host cells after viral infection (Shapira et al. 2009). Figure 8. Sturcture of influenza A virus particle. PB1 = polymerase basic protein 1, PB2 = polymerase basic protein 2, PA = polymerase acidic protein. Viruses NS1 protein is important for replication but is not included in virus particles. (Adjusted from Ludwig et al. 2003) Only few studies have utilized proteomic methods to study influenza A virus-induced changes in host cell proteomes. In 2006, Baas et al. utilized MS-based proteomics to study influenza A virus-induced changes in macaque lung tissues resulting in 3548 protein identifications (Baas et al. 2006). Although the proteomic study was not quantitative, comparisons between proteomic data and parallel mRNA-level studies indicated inconsistencies between protein and mRNA levels of some proteins in the lung tissues. This shows the importance of protein level data in studies of cells physiological state. Later on, few 2-DE-based (Liu et al. 2008, Vester et al. 2009, van Diepen et al. 2010) and MS-based (Coombs et al. 2010, Emmott et al. 2010a) quantitative proteomic studies of host responses against influenza virus infection have been performed. However, most of these studies have resulted in the identification of only few differentially expressed proteins and the large-scale analyses about influenza-host-interplay have been performed using genome-wide screening techniques (e.g. Hao et al. 2008, Shapira et 24 al. 2009, Karlas et al. 2010, König et al. 2010). Therefore, more in-depth quantitative proteomic analyses and data interpretation would be needed to find cellular signalling pathways that are potentially activated by influenza A virus infection. Successful proteomic studies of host-virus interplay have already been published for other viruses showing the potential of this technique (Emmott et al. 2010b, Naji et al. 2012). 25 2. AIMS OF THE STUDY Macrophages and keratinocytes have an important role in the activation innate immune responses after viral infection. The aim of this study was to develop and utilize proteomic and bioinformatic methods to characterize host responses to viral infection. The more detailed aims were: - To characterize cytosolic viral RNA-induced innate immune responses in HaCaT keratinocytes (I) - To set up a quantitative subcellular proteomics workflow and to utilize it for global characterization of influenza A virus-induced changes in human primary macrophages (II) - To study virus-induced protein secretion from human primary macrophages (II, III) - To develop a computational tool to help the comparison and analysis of protein identification results from different database search engines (IV) - To develop a computational tool for large-scale caspase cleavage site predictions and to utilize it for high-throughput mapping of potential caspase targets based on mass spectral data (V) 26 3. MATERIALS AND METHODS 3.1 CELLS AND STIMULATIONS In this project, two different types of human cells, HaCaT keratinocytes (I, IV) and human primary macrophages (II, III), were used. HaCaT keratinocytes (American Type Culture Collection) were cultured in DMEM (Dulbecco´s Modified Eagle Medium) supplemented with 10% Fetal Calf Serum, L-glutamate and antibiotics (I, IV). Human primary macrophages were differentiated from blood monocytes obtained from leukocyte-rich buffy coats of healthy blood donors (Pirhonen et al. 1999). Differentiation was done by maintaining the monocytes in Macrophage serum-free medium supplemented with 10 ng/ml of Granulocyte-macrophage colony-stimulating factor (GM-CSF) and antibiotics. After five days of culturing the macrophages were used in the experiments. Each macrophage sample was a pool of separately cultured and stimulated cells from three different blood donors. To study viral RNA triggered immune responses in macrophages and HaCaT keratinocytes, cells were transfected with polyinosic-polycytidylic acid (polyI:C) (I, III) or infected with influenza A virus (II). PolyI:C is a mimetic of dsRNA and it has been used to mimic RNA virus infections. Transfections were done using 10 μg/ml of polyI:C (Sigma-Aldrich). Lipofectamin 2000 (Invitrogen) was used as a transfection reagent. PolyI:C transfected cells were studied at different timepoints between 1h and 18h. For viral infections of human macrophages, human pathogenic influenza A virus strain Udorn/72/H3N2 was used with viral dose of 2,56 hemagglutination U/ml. The cells were studied at 6h, 9h, 12h and 18h post-infection. In addition, other RNA viruses, influenza A virus strain Beijing/353/89/H3N2 (II), vesicular stomatitis virus (I, III) and encephalomyocarditis virus (I, III) were used for infections to confirm parts of the results. 3.2 SUBCELLULAR FRACTIONATION AND SECRETOME ANALYSIS For subcellular fractionations, approximately 10 million cells were used. Mitochondrial and cytoplasmic fractions of HaCaT keratinocytes (I, IV) and macrophages (II) were isolated by QProteome Mitochondria Isolation Kit (Qiagen). After isolation, cytoplasmic fractions were further purified using 2-D Clean-Up Kit (GE Healthcare). Nuclear fractions of macrophages 27 were isolated using QProteome Nuclear Protein Isolation Kit (Qiagen) (II). The resulting soluble and insoluble nuclear protein fractions were combined before analysis. The enrichment of mitochondrial, cytoplasmic and nuclear proteins in corresponding fractions was confirmed using Western blots (II). To study influenza A virus or polyI:C induced changes in protein secretion, macrophage growth media were collected and analyzed (II, III). The cells grown in complete Macrophage-SFM medium were washed three times with PBS after which the cells were stimulated in RPMI growth media supplemented with 1 mM HEPES, L-glutamine and antibiotics (GIBCO). The growth media were collected and concentrated with Amicon Ultra centrifugal filter devices (Millipore) with 10 000 nominal molecular weight cutoff. The concentrated media were either used directly for western blot analyses or purified with 2-D Clean-Up Kit (GE Healthcare) for proteomic analyses. 3.3 GEL-BASED METHODS USED IN PROTEOMIC EXPERIMENTS Two-dimensional gel electrophoresis (2-DE) was used for the separation of mitochondrial and cytoplasmic protein fractions of HaCaT keratinocytes (I). 11 cm pI 4-7 IPG-strips (Bio-Rad) were used as the first dimension and Criterion Tris-HCl 8–16% precast gels (Bio-Rad) as the second dimension. The gels were stained using SYPRO Ruby protein gel stain (Bio-Rad or Sigma-Aldrich) according to manufacturer´s instructions. Spot detection, matching and intensity-based quantitation were done using Image Master 2D Platinum version 6.0 (GE Healthcare). Spots with at least 2-fold difference in expression between control and polyI:C transfected samples were considered differentially expressed and were picked for mass spectral analysis. Finally, protein spots in the gels were visualized using silver staining (O´Connel et al. 1997). Sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) was used for protein separation in the analysis of polyI:C transfected macrophage growth media (III) and mitochondrial protein fraction of HaCaT keratinocytes (IV). In addition, SDS-PAGE was used as a protein separation method in western blot analyses (I, II, III). With intracellular fractions, equal amounts of protein were loaded on the gel. For secretomes, equal amounts of media were taken for the analyses. 28 3.4 QUANTITATIVE ANALYSIS USING ITRAQ 4plex iTRAQ (AB Sciex) labeling was used for relative quantitation of proteins in influenza A virus infected macrophages (II). Changes in mitochondrial, cytoplasmic and nuclear proteomes and secretomes of influenza A virus infected macrophages were studied as a function of time. The infected cells were studied at three different timepoints and protein amounts in these samples were compared with uninfected control cells (Figure 9). With intracellular fractions, equal amounts of protein from each sample were taken for the analyses based on silver stained gels. For secretome analyses, equal amounts of cells were taken for the analyses and the whole samples were labeled. Cysteine reduction, alkylation and protein in-solution digestion was done for each sample followed by iTRAQ labeling of the resulting peptides. Digestion and labeling were done according to manufacturer´s instructions. Figure 9. Labeling of the samples for iTRAQ analyses. Two biological and two technical replicates of each sample were analyzed. After labeling, the peptide mixtures were prefractionated by SCX. The SCX separations were performed with Ettan HPLC system (Amersham Biosciences) using a PolySULFOETHYL A column (200 x 2,1 mm, PolyLC). The LC was operated at 0,2 ml/min and 20 mM KH2PO4 buffer (pH 3) was used with a gradient of 0-0,4 M KCl in 35 min. The eluting sample was collected in 1 min fractions and the fractions containing peptides were analyzed using nanoLCESI-MS/MS. 29 3.5 MASS SPECTROMETRY NanoLC-ESI-MS/MS analyses of tryptic peptides were performed with Ultimate 3000 nanoLC system (Dionex) combined with QStar Elite hybrid quadrupole time-of-flight mass spectrometer (AB Sciex) (I, II, III, IV). In the analyses, the sample was first injected into ProteCol C18 trapping column (0,15x10mm; 3 μm; 120Å)(SGE) and the column was washed with 0,1% trifluoroacetic acid (TFA) for 10 min. After that, the peptides were separated in a PepMap100 C18 analytical column (0,075x150mm; 5μm; 100Å) (LC Packings/Dionex) using a linear gradient of 0-40% acetonitrile in 0,1% formic acid. The length of the gradient varied between 20 min and 120 min depending on the sample. Mass spectral analyses were performed on positive ion mode. MS data were acquired using Analyst 2.0 software (AB Sciex) and an information-dependent acquisition method. The method consisted of a 0,5 s TOF-MS survey scan of m/z 400-1400 followed by MS/MS scans of two most abundant ions with charge states +2 to +4. Once an ion was selected for fragmentation, it was put on an exclusion list for 60 s. MALDI-MS was used for the identification of the most intense protein spots from 2D gels (I). Prior MALDI analyses proteins from each spot were in-gel digested with trypsin and the resulting peptide samples were desalted using μC18 Zip Tips (Millipore). -cyano-4hydroxycinnamic acid was used as the crystallization matrix. Mass spectra for MALDI analyses were acquired using an Ultraflex TOF/TOF instrument (Bruker Daltonik) in positive ion reflector mode. The collected spectra were processed using FlexAnalysis version 3.0 (Bruker Daltonik). 3.6 DATABASE SEARCHES Two different database search engines, Mascot (Matrix Science) (PMF: publicly available Mascot; MS/MS: in-house Mascot version 2.2) (I, III, IV, V) and Paragon from ProteinPilot version 2.0.1 (AB Sciex) (II, IV), were used for protein identifications based on MS data. Searches were done against NCBI (I, IV) or UniProt/SwissProt (II, III, IV, V) databases. Similar search parameters were used in all the searches. False discovery rates for the identifications were calculated using a target-decoy strategy based on searches against concatenated normal and reversed protein sequence databases (Elias & Gygi 2007). 30 For iTRAQ-based quantitative analyses, only Paragon was used (II). Raw MS data from both technical replicates of an iTRAQ sample set were processed together to improve the quality of quantitation data. Quality of quantitations was also improved by manually excluding all the MS/MS spectra with all reporter ion peak hights below 10 counts from quantitation. Finally, proteins with more than 1.5-fold difference (intracellular fractions) or more than 3-fold difference (secretomes) between control and infected sample were considered differentially expressed. 3.7 PROTEIN CLASSIFICATION, INTERACTION NETWORKS AND CLUSTERING ANALYSIS After retrieving a list of protein identifications (and quantitation data) from a database search engine, additional data analysis steps were performed to help interpreting the results. Three different types of data analyses were done: protein classification based on their functions and localizations, protein-protein interaction network studies and expression profile-based clustering analyses. Functional classification and subcellular localization studies of proteins were done mainly based on their Gene Ontology annotations using GeneTrail (Backes et al. 2007) (II). Protein expression profiles were studied and clustering of similarily behaving proteins was performed using Chipster (Kallio et al. 2011). Protein-protein interaction networks for interesting groups of proteins were created using String (Jensen et al. 2009) (II). In more detailed analyses of the published secretome data (II, III), different forms of protein secretion were studied using ExoCarta (Mathivanan et al. 2012) and SignalP (Petersen et al. 2011). ExoCarta is a manually curated database of exosomal proteins, lipids and RNA (Mathaiavan et al. 2009, 2012). Protein entries in ExoCarta database were compared with protein identifications from the secretomes of human primary macrophages (II, III) to detect potentially exosomal proteins from our datasets. SignalP is a prediction tool for signal peptides based on protein sequences (Nielsen et al. 1997, Petersen et al. 2011). It was used to find potential signal peptide containing proteins from our secretome data (II, III) and thus to estimate the contribution of classical, signal peptide-based protein secretion in our secretome datasets. 31 3.8 IMMUNOLOGICAL ANALYSES Immunological methods used in this study are listed in Table 3 and introduced in more detail in the original publications. Table 3. Immunological methods used in the publications. Method Publication APOPercentage apoptosis assay cathepsin B inhibition cathepsin D siRNA confocal microscopy ELISA P2X7 receptor inhibition P2X7 receptor siRNA quantitative RT-PCR src tyrosine kinase inhibition western blotting II, III II, III III I I, II, III II II I, III II I, II, III 3.9 REAGENTS Several different antibodies were used to verify the proteomic data and to further characterize the innate immune responses in the experiments. All the antibodies used in the publications are listed in Table 4. 32 Table 4. Antibodies used in the publications. Antibody Manufacturer 14-3-3 -tubulin -amyloid Actin Annexin A1 APOE ASC Bid Caspase-1 p10 Caspase-1 p20 Caspase-3 (H-277) Caspase-3, cleaved (#9661) Cathepsin B Cathepsin D Cathepsin Z Cytochrome c Cytokeratin 18 eIF4A Galectin-3 GAPDH Histone H1 HMGB1 HSP27 HSP90 IFIT3 IL-18 Influenza A virus (H3N2) LAMP-1 P2X7 receptor pan14-3-3 phospho-Ser 14-3-3 binding motif S100-A9 VDAC1 Santa Cruz Biotechnology Cell Signaling Santa Cruz Biotechnology Santa Cruz Biotechnology Santa Cruz Biotechnology Santa Cruz Biotechnology Millipore Cell Signaling Santa Cruz Biotechnology Sigma-Aldrich Santa Cruz Biotechnology Cell Signaling Calbiochem Santa Cruz Biotechnology Santa Cruz Biotechnology Santa Cruz Biotechnology Santa Cruz Biotechnology Santa Cruz Biotechnology Santa Cruz Biotechnology Santa Cruz Biotechnology Santa Cruz Biotechnology Santa Cruz Biotechnology Santa Cruz Biotechnology Cell Signaling BD Transduction Laboratories Pirhonen et al. 1999 provided by prof. Ilkka Julkunen Santa Cruz Biotechnology Santa Cruz Biotechnology Santa Cruz Biotechnology Cell Signaling Santa Cruz Biotechnology Santa Cruz Biotechnology 33 Publication I I II II II II III III II, III III I II, III II, III II, III II II I I II II II II I I, II II II, III II II II I I II II 4. RESULTS 4.1 PROTEOMICS IS AN EFFICIENT METHOD TO STUDY INNATE IMMUNE RESPONSES IN HUMAN KERATINOCYTES AND MACROPHAGES Combination of proteomics and bioinformatics has emerged as a valuable approach to study cellular signaling mechanisms (Choudhary et al. 2010). In this project, three different proteomic methods were utilized to study virus-induced changes in human primary macrophages and HaCaT keratinocytes (Figure 10). Figure 10. Proteomic methods used in publications I-III. A) 2-DE and MS were used in publication (I). B) iTRAQ-based quantitative subcellular proteomics and secretome analysis were performed in publication (II). C) In publication (III), secretomes were analyzed using SDS-PAGE separation of proteins followed by in-gel digestion and LC-MS/MS analysis of the resulting peptides. In our simplest proteomic approach, we used SDS-PAGE combined with LC-MS/MS analyses to study secretomes of polyI:C transfected human primary macrophages (III). PolyI:C is a mimic of viral dsRNA and can be used to model viral infections. This study resulted in the 34 identification of 595 distinct proteins secreted from human primary macrophages. Although the method was not quantitative, clear differences between protein identifications from the secretomes of Lipofectamine-treated control cells and polyI:C transfected cells were detected. For example, several Ras-related proteins and vacuolar ATPases were identified only from the secretomes of polyI:C transfected cells indicating that the detection of cytoplasmic dsRNA affects vesicular trafficking in host cells. In addition, the secretion of cysteine protease inhibitors cystatin-A and B were detected only from the secretomes of polyI:C transfected macrophages. The effects of cytoplasmic dsRNA recognition on intracellular host proteomes was studied using quantitative 2-DE combined with mass spectrometry (I). Mitochondrial and cytoplasmic protein fractions of untreated and polyI:C transfected human HaCaT keratinocytes were separated using 2-DE and differential expression of proteins was studied based on these gels. This study resulted in protein identifications from 176 differentially expressed protein spots. The value of subcellular fractionation was clearly seen since only 15% of protein identifications were common for mitochondrial and cytoplasmic fractions. Comparisons of 2D gels from control and transfected cells showed clear polyI:C-induced changes in 14-3-3 protein expression. In addition, 2D gel analysis showed that polyI:C transfection triggered significant fragmentation of cytoskeletal proteins such as cytokeratins. Although similar cytoskeleton fragmentation has probably been present in influenza A virus infected human primary macrophages (Öhman et al. 2009) it could not be detect in our iTRAQ analyses using non-gelbased separation methods. In publication (II), we used non-gel-based quantitative subcellular proteomics to study influenza A virus-induced changes in mitochondrial, cytoplasmic and nuclear fractions and secretomes of human primary macrophages. Our quantitative subcellular proteomic experiments resulted in altogether 3477 protein identifications including 1321 differentially expressed proteins in the intracellular fractions and 544 upregulated proteins in the secretomes. Non-gel-based proteomics is considered to be the most universal method for proteomic analyses. Accordingly, comparisons between protein identifications from mitochondrial and cytoplasmic fractions in publications (I) and (II) show that 22% of proteins identified in the non-gel-based experiment fall outside the pI and molecular weight range of our 2D gels and would therefore not have been identified using 2-DE. On the other hand, comparisons of proteomic data from SDS-PAGE (III) and iTRAQ (II) secretome experiments showed that these two approaches give comparable results and could be used to support and complement each other. Bioinformatics analyses of our 35 iTRAQ data indicated that influenza A virus infection results in extensive changes in host cell nuclear proteomes and distracts the mitochondrial structure. Finally, our data analyses revealed several signs about activation of antiviral responses, apoptosis and various inflammatory mechanisms in influenza A virus infected cells. In conclusion, our experiments show that quantitative subcellular proteomics and secretome analysis is an efficient approach to study virus-induced protein-level changes in host cells. Spatial (I, II, III) and temporal (II, III) information about multiple proteins can be collected in a single experiment. Finally, analysis of proteomic data using various bioinformatics tools helps creating hypotheses about biological mechanisms activated as a result of viral infection (I, II). 4.2 MASCOT AND PARAGON GIVE COMPARABLE PROTEIN IDENTIFICATION RESULTS In MS-based proteomic experiments, protein identification and quantitation relies on database search engines. Several different database search engines have been developed for proteomic purposes, providing both confirmatory and complementary information (Searle et al. 2008, Yu et al. 2010). Two different database search engines, Mascot (I, III, V) and Paragon (II), were used in our proteomic experiments. A computational tool, Compid, was developed to simplify the comparison of Mascot and Paragon protein identification results and it was used to confirm that the results from both database search engines were comparable (IV). Compid can be used for peptide and protein level comparisons between two sets of raw data from database search engines and also to regroup these protein identification results. Our comparisons of Mascot and Paragon protein identification results with Compid showed that there are some differences in the results from these two database search engines. The biggest differences were detected on peptide level where only 20% of the peptide identifications were common for both search algorithms. Here, the most important reasons for low overlap were probably the low-quality MS/MS spectra resulting in low-confidence peptide spectrum matches. The overlap between Mascot and Paragon protein identifications was significantly better, 4165%. The differences in protein identifications were partially caused by the more efficient protein grouping algorithm in Paragon. In addition, the size and redundancy of protein sequence database had a clear impact on the overlap between Mascot and Paragon database search results. 36 However, most of the good quality protein identifications were common for Mascot and Paragon showing that these database search algorithms gave comparable protein identification results. 4.3 VIRUS-INDUCED RESPONSES IN HACAT KERATINOCYTES AND HUMAN PRIMARY MACROPHAGES Our studies showed that both influenza A virus infection and polyI:C transfection trigger significant changes in the proteomes of human primary macrophages (II, III) and HaCaT keratinocytes (I). Activation of antiviral immune responses was seen as the upregulation of several interferon-inducible proteins in influenza A virus infected human primary macrophages (II). Different mechanisms of inflammation, such as inflammasome activation and secretion of inflammatory danger signal molecules, were also detected (II, III). Viral stimulation clearly activated caspase-mediated apoptosis in HaCaT keratinocytes (I) and human primary macrophages (II, III). Finally, influenza A virus and polyI:C triggered significant protein secretion from human primary macrophages (II, III). 4.3.1 Several inflammatory pathways are activated in human primary macrophages NLRP3 inflammasome is a caspase-1-activating platform that has an important role in inflammatory responses triggered by influenza A virus infection (Allen et al. 2009, Thomas et al. 2009). Although inflammasome activation is currently under extensive research, the exact mechanisms leading to its activation have remained unresolved. Two out of three NLRP3 inflammasome components, caspase-1 and ASC, were identified from the cytoplasmic fraction of our iTRAQ data (II). In addition, caspase-1 activation and secretion of mature IL-18 were detected in influenza A virus infected and polyI:C transfected human primary macrophages indicating virus-induced activation of the inflammasome. Lysosome destabilization and especially leakage of lysosomal protease cathepsin B has often been associated with NLRP3 inflammasome activation (Hornung et al. 2008, Allen et al. 2009). At 6h post-infection, our proteomic data showed the upregulation of several lysosomal proteins in the cytoplasmic fraction (II). Since this could be a sign of lysosomal rupture in the infected cells (II), we studied whether lysosomal cathepsins are involved in inflammasome activation. 37 Both ELISA (II, III) and western blot (II) analyses of IL-18 showed that cathepsin B inhibitor Ca-074 Me abolished IL-18 secretion from the stimulated cells almost completely. Additionally, cathepsin D siRNA decreased the secretion of IL-18 from polyI:C transfected cells (III). Finally, western blot analyses showed that secretion of caspase-1 p10 fragment was abolished by cathepsin B inhibition (II) indicating a role for cathepsins in inflammasome activation upstream of caspase-1 activation. In addition to cytosolic cathepsin B, several other intracellular signals have been suggested to trigger inflammasome activation. To study other possible routes for inflammasome activation, a protein-protein interaction network was created from all inflammation-related proteins identified from the intracellular fractions of influenza A virus infected macrophages (II). This protein-protein interaction network showed that plasma membrane ATP receptor P2X7 is directly associated with all NLRP3 inflammasome components. Therefore, we studied the role of P2X7 receptor in influenza A virus infection triggered inflammasome activation. Inhibition of P2X7 receptor with a specific inhibitor, AZ11645373, as well as P2X7 receptor siRNA experiments resulted in a clear decrease in IL-18 secretion from human primary macrophages implying that P2X7 receptor has a regulatory role in inflammasome activation after influenza A virus infection. NADPH oxidase complex has also been previously related with NLRP3 inflammasome activity (Dostert et al. 2008). Two subunits of NADPH oxidase complex, CYBA and CYBB, were overexpressed in cytoplasmic fraction of influenza A virus infected human primary macrophages at 6 h post-infection (II). In addition, src tyrosine kinase, an upstream regulator of NADPH complex (Giannoni et al. 2010), was identified from influenza A virus infected macrophages (II). Our functional studies showed that IL-18 secretion from influenza A virus infected macrophages was completely abolished by the inhibition of src tyrosine kinase with PP2 (II). Thus, in addition to cytosolic cathepsin B and P2X7 receptor activity, also src tyrosin kinase activity has a role in the regulation of inflammasome activity in influenza A virus induced human primary macrophages. In addition to inflammasome activation, also other signs of proinflammatory responses were detected in influenza A virus infected and polyI:C transfected human primary macrophages (II, III). Several inflammatory proteins like macrophage migration inhibitory factor (MIF), S100-A9 and HMGB1 were secreted from influenza A virus infected or polyI:C transfected macrophages (II, III). Finally, multiple inflammation-related proteins such as allograft inflammatory factor, 38 S100-A8 and S100-A9 were differentially expressed in the intracellular fractions of influenza A virus infected macrophages (II). 4.3.2 Viral infection triggers caspase-dependent apoptosis in human macrophages and keratinocytes Caspase-3 activation is a classical sign of apoptosis. Our studies showed several different signs of caspase-dependent apoptosis in the influenza A virus and polyI:C stimulated macrophages and keratinocytes (I, II, III). Western blot analyses of influenza A virus infected (II) and polyI:C transfected (III) macrophages showed the activation of caspase-3 in stimulated cells. In addition, classical signs of mitochondrial apoptosis: translocation of Bax from cytosol onto mitochondria as well as leakage of cytochrome c from mitochondria into cytosol, were clearly seen in our iTRAQ data (II). Cathepsin B inhibition experiments showed that caspase-3 activation in influenza A virus infected and polyI:C transfected macrophages was largely dependent on cathepsin B activity (II, III). Finally, based on our kinetics experiments, caspase-3 activation was shown to precede inflammasome activation and it occurred already few hours after infection (III). Initiation of apoptosis causes clear changes in the morphology of dying cells (Crawford and Wells 2011). A clear rearrangement of cytoskeletal proteins was seen in polyI:C transfected HaCaT keratinocytes (I). Several fragments of cytokeratins and other structural proteins were identified from the stimulated cells, especially from the mitochondrial fraction. In addition, polyI:C transfection-induced changes in the subcellular distribution of cytokeratin 18 were seen in the confocal microscopy images of HaCaT keratinocytes. Caspase-3 inhibition experiments showed that the cleavage of cytokeratin-18 was caspase-dependent and therefore related with apoptosis of the transfected cells. 4.3.2.1 Prediction and identification of potential caspase cleavage targets from proteomic data Determination of caspase target proteins can give important insights into the execution mechanisms of programmed cell death. Therefore, a computational tool, Pripper, was developed to predict potential caspase cleavage motifs based on protein sequences and known caspase 39 cleavage sites (V). Pripper was the first prediction tool capable of processing multiple protein sequences or even entire proteomes simultaneously making it extremely useful for proteomic applications. In addition, Pripper can create caspase cleavage product databases based on the predictions. These databases can be downloaded directly into a database search engine and utilized to identify potential caspase targets from MS-based proteomic experiments in a highthroughput way. Workflow for caspase target prediction based on Pripper and the following database searches is described in Figure 11. Figure 11. Workflow for caspase cleavage site and caspase target prediction in proteomics by using Pripper. We utilized Pripper to search our proteomic data from publications (I) and (II) for potential caspase cleavage targets. The analysis of MS data from polyI:C transfected HaCaT keratinocytes resulted in the identification of one cytokeratin-18 fragment as a potential caspase cleavage product (V). This was an additional confirmation for caspase-dependent cytokeratin fragmentation in polyI:C transfected HaCaT cells. In addition, Pripper analysis of proteomic data from publication (II) resulted in the identification of several cytoskeleton-associated proteins (e.g. actin) as potential caspase targets (V). This indicates that influenza A virus infection of human primary macrophages also resulted in caspase-dependent cytoskeletal rearrangement, although it was not detected from the proteomic data. Finally, the analysis of our iTRAQ data using a caspase cleavage product database resulted in the identification of a potential novel caspase cleavage motif in leukosialin. However, additional biological experiments are required to confirm these caspase cleavage sites. 40 4.3.3 Influenza A virus infection and polyI:C transfection trigger significant protein secretion from human primary macrophages Proteomic methods were used to study changes in protein secretion followed by influenza A virus infection and polyI:C transfection of human primary macrophages (II, III). Our analyses showed that both stimulations trigger significant protein secretion. GeneTrail analysis of the identified proteins showed that the secreted proteins are associated with several different cellular processes including gene expression, signaling and cell death (data not shown). Comparison of protein identifications in publication (III) and secretome data in publication (II) showed that only 30% of the protein identifications were common for both experiments. However, the functional roles of proteins identified in both experiments were very similar (Figure 12A). DAMP (damage-associated molecular pattern) proteins are endogenous proteins with various intracellular functions, acting as danger signals when secreted out of cells after stress or injury (Bianchi 2007). Several DAMPs (e.g. S100-A8, S100-A9, Galectin-3, HMGB1, HMGB2) were secreted from polyI:C transfected (III) and especially from infeluenza A virus infected (II) human primary macrophages. Secretion of several Ras-related proteins as well as other proteins involved in vesicle trafficking was also triggered by both stimulations. Finally, several lysosomal proteins, for example cathepsins and lysosomal hydrolases were identified from the secretomes of influenza A virus infected (II) and polyI:C transfected (III) macrophages. Classically, proteins are directed for secretion via ER/Golgi-pathway by signal peptides included in protein sequences (Nickel et al. 2010). However, based on SignalP (Petersen et al. 2011) analyses, only 16% of proteins identified from our secretomes contain a signal sequence (Figure 12B). This indicates that virus-induced human macrophages use other secretory routes to direct proteins out of cells. In addition to signal peptide-mediated protein secretion, proteins can be directed out of the cells via unconventional secretion mechanisms including plasma membrane-located channels, budding of plasma membrane microvesicles, exocytosis of secretory lysosomes and release of exosomes from multivesicular bodies (Théry et al. 2009, Nickel et al. 2010). Exosomes have been associated with cell-to-cell signaling related with, for example, immune responses. We used ExoCarta to determine the contribution of potentially exosomal proteins for our secretome data. More than 50% of the identified proteins were included in ExoCarta database (Figure 12B) suggesting that virus infection induces significant protein secretion via exosomes. Additionally, comparison of our secretome data with commonly identified exosomal proteins showed that most of the proteins that are generally identified from exosomes were also present in our data (Figure 13). 41 Figure 12. Functionally similar proteins are secreted from influenza A virus infected and polyI:C transfected human primary macrophages. A) Protein abundances in selected Gene Ontology classes are shown for secretomes of Influenza A virus- (II) and polyI:C-stimulated (III) macrophages. B) ExoCarta and SignalP classifications of proteins identified from secretomes of influenza A virus infected (II) and polyI:C transfected (III) macrophages. Figure 13. Proteins that are typically identified from purified exosomes (Adjusted from Théry et al. 2009 and Mathivanan et al. 2010). All the proteins that were identified from secretomes of influenza A virus infected or polyI:C transfected human primary macrophages are marked with bold letters. 42 5. DISCUSSION Proteomics is a powerful approach to characterize system-wide changes in protein expression. Several different methods have been developed for protein localization studies (Hartwig et al. 2009, Lee et al. 2010), quantitative analyses (Coombs 2011, Neilson et al. 2011), characterization of posttranslational modifications (Thingholm et al. 2009, Kim et al. 2011, Hjerpe et al. 2009) and mapping of protein-protein interactions (Rees et al. 2011, Li et al. 2011). Methods used in proteomic experiments have developed enormously during the last decades. Mass spectrometry is one of the fastest developing areas of proteomics. The new orbitrap instruments (Makarov 2000, Hu et al. 2005) have increased the amounts of data that can be collected in proteomic experiments significantly and introduction of new fragmentation techniques such as electron transfer dissociation has improved for example the analysis of protein posttranslational modifications (Zubarev et al. 1998, Kim and Pandey 2012). These new developments enable wider applicability of proteomic methods in solving biological questions thereby opening new possibilities for proteomic studies. The expectations for proteomic experiments have therefore constantly increased making it essential for scientists to keep up with the latest inventions. In this project, two different quantitative subcellular proteomic approaches and quantitative and qualitative secretome analyses were utilized to study virusinduced changes in human primary macrophages and HaCaT keratinocytes. With modern mass spectral techniques, huge amounts of raw data can be collected from a single proteomic experiment making manual interpretation and validation of these data impossible. Therefore, different database search engines are used to identify and quantify proteins based the collected MS data (Nesvizhskii 2007). However, differences in database search algorithms often result in slightly different protein identification results from the same dataset (Elias et al. 2005, Searle et al. 2008, Savijoki et al. 2011). Especially low quality MS/MS spectra with low signalto-noise ratio, minor peptide fragmentation or overlapping fragments from multiple distinct peptides are often problematic for database search engines (Salmi et al. 2009). The resulting low-confidence peptide-spectrum matches often differ between database search engines and have a higher probability for being false positive identifications. Large differences in low confidence peptide identifications were also seen in our comparisons between Mascot and Paragon. To overcome this problem, several strategies for filtering out low quality MS/MS spectra and low confidence peptide identifications have been developed (e.g. Keller et al. 2002, Helsens et al. 2008, Salmi et al. 2006). Additionally, parallel use of different database search engines can be used to increase the confidence of protein identifications as well as to increase 43 the number of proteins identified (Yu et al. 2010, Searle et al. 2008). For example, we have utilized Compid to combine Mascot and Paragon protein identifications resulting in a slight increase in the number of protein identifications and therefore higher coverage of Lactobacillus rhamnosus proteomes (Savijoki et al. 2011). After retrieving a list of protein identification, quantitation and modification data, further bioinformatics tools are required to perceive the potentially underlying molecular mechanisms. The repertoire of data analysis tools has expanded enormously and several tools for protein classification, pathway analysis and interaction studies as well as for various specific applications are currently available. In this project, we have used Gene Ontology-based protein classification, protein expression profiling and protein-protein interaction studies to characterize the collected data. Computationally built networks of proteins and their physical and functional relationships are useful when associating proteomic data with cell-wide processes (Goh et al. 2012, Albert 2005). However, one of the problems with proteomics is the incompleteness of the collected data. For example low abundance regulatory proteins present in the samples often remain undetected creating challenges for data analysis. Zubarev et al. have tried to solve this problem with a method called key-node analysis where the presence of undetected regulatory proteins in a pathway can be hypothesized based on the identification of their target proteins (Zubarev et al. 2008). Another challenge in proteomic data analysis is the incompleteness of several biological databases (Goh et al. 2012). Here, the best results can probably be achieved by combinig data from several parallel databases. Finally, the analysis of proteomic data usually results in extremely complex networks that are difficult to interpret. Therefore, the users often focus only on a certain part of the network and filter out the rest of the data limiting the results. Filtering out low quality data and incorporation of different types of data into one network can result in more easily interpretable networks and help, for example, in finding novel associations between protein clusters. In addition to using general classification and network tools, we created a new tool, Pripper, to study caspase-mediated protein fragmentation. Caspases are a family of proteins involved in the regulation and execution of various immune processes like apoptosis and inflammation (Crawford and Wells 2011). Caspase cleavage of a target protein may result in direct activation or deactivation of the target protein or may alter protein functions by for example affecting their localization. Therefore, identification of caspase targets may give important information about executionary mechanisms in for example apoptosis. Although there are several tools available for the prediction of caspase cleavage sites (Backes et al. 2005, Wee et al. 2009, Ayyash et al. 44 2012), Pripper was the first tool enabling the direct utilization of the collected MS data for large-scale caspase target predictions. These analyses resulted in the identification of several potential caspase targets, especially structural proteins. Together with our proteomic and immunological data, this indicates that caspases have an important role in virus-induced modulation of cell cytoskeleton. Combination of proteomic and bioinformatic analyses has been used to study host defense responses against different viruses (Zheng et al. 2011). Influenza A virus is a respiratory virus infecting annually millions of people worldwide. Although influenza A viruses have been studied widely for several decades, only few large-scale proteomic experiments of influenzainduced host responses have been published (Baas et al. 2006, Coombs et al. 2010, Emmott et al. 2010a). Virus infection usually results in the activation of various antiviral and inflammatory pathways and finally death of the infected host cell (Akira et al. 2006, Lamkanfi and Dixit 2010). We detected clear upregulation of several interferon-inducible proteins in the cytoplasmic fraction of influenza A virus infected human primary macrophages showing the activation of antiviral immune responses. Activation of inflammatory responses and caspasemediated apoptosis were also clearly seen in our data. One of the inflammatory responses triggered by influenza A virus, as well as some other RNA viruses and cytosolic RNA, is the activation of the NLRP3 inflammasome in the infected cells (Kanneganti et al. 2006). This inflammasome activation is crucial for the development of proper innate immune responses after influenza A virus infection (Allen et al. 2009, Thomas et al. 2009). However, the exact mechanisms leading to NLRP3 inflammasome activation are still unknown. Lysosomal rupture (Hornung et al. 2008, Allen et al. 2009), cytosolic cation imbalance (Qu et al. 2007, Ichinohe et al. 2010) and ROS production (Dostert et al. 2008, Allen et al. 2009) have often been related with inflammasome activation. Our data indicated that influenza A virus infection causes lysosomal damage in influenza A virus infected and polyI:C transfected human primary macrophages. Inhibition of lysosomal cathepsin B has been shown to block inflammasome activation and IL-1 secretion to some extent (Hornung et al. 2008, Allen et al. 2009). This was also seen in our data, since both caspase-1 activation as well as IL-18 cleavage and secretion were clearly blocked in influenza A virus and polyI:C stimulated macrophages treated with cathepsin B inhibitor. Additionally, cathepsin D siRNA decreased the secretion of IL-18 significantly. Cathepsins B and D can also cause significant increase in mitochondrial ROS production (Zhao et al. 2003), providing another possible link between lysosome rupture and inflammasome activation. Additionally, caspase-3 activation in human primary 45 macrophages was shown to be partially cathepsin dependent indicating that lysosomal rupture might also result in activation of apoptotic pathways at early stages of infection. Phagocytosis of microbial components and particles results in ROS generation and has been associated with, for example, monosodium urate- and asbestos-triggered inflammasome activation (Dostert et al. 2008). NADPH oxidase mediates ROS production in phagocytosis, and knockdown of NADPH oxidase subunit p22phox (CYBA) has been shown to decrease IL-1 secretion. Src tyrosine kinases regulate NADPH oxidase-mediated ROS generation (Giannoni et al. 2010) and SRC has been shown to localize into dsRNA containing endosomes and associate with TLR3 (Johnsen et al. 2006). In addition, lysosomal protein LAMP-1 was localized in these endosomes suggesting a potential link between src tyrosine kinase activity and lysosomes. We showed that inhibition of src tyrosine kinase blocked IL-18 secretion from influenza A virus infected human primary macrophages indicating that src tyrosine kinase- and NADPH oxidasemediated ROS has an important role in NLRP3 inflammasome activation. A recent publication showed that src tyrosine kinase might regulate extracellular vesicle formation (Choi et al. 2012) indicating that src tyrosine kinase-mediated inflammasome activation might also be involved in exosomal protein secretion. Finally, ROS-dependent interaction between thioredoxin interacting protein TXNIP and NLRP3 has been shown to result in NLRP3 inflammasome activation creating a potential link between ROS generation and inflammasome activation (Zhou et al. 2010). Additionally, activation of P2X7 receptor by ATP has been associated with ROS production, and it also increases K + efflux out of the cells (Pfeiffer et al. 2007). In our studies, inhibition of P2X7 receptor as well as silencing of P2X7 receptor expression decreased the amount of IL-18 secretion from influenza A virus infected human primary macrophages. Similar signs of of P2X7 receptor-related inflammasome activation after influenza A virus infection have also been shown previously (Ichinohe et al. 2010). Influenza A viruses M2 ion channel has an important role in influenza-induced inflammasome activation. M2 ion channel is capable of transporting H+ out of the vesicles in trans-Golgi network affecting the intracellular cation balance. These data indicate that also intracellular cation imbalance might have an important role in inflammasome activation. Extensive inflammatory responses can be destructive for host organism. A recent publication showed that NLRP3 inflammasome activity can be regulated via autophagosomes (Shi et al. 46 2012). Autophagy is a specialized system for degradation of intracellular substances. Inflammasome activation triggers autophagosome formation via RalB protein activation. The formed autophagosomes limit inflammasome activity by engulfing inflammasomes. Proteins and organelles that are sequestered into autophagosomes can be delivered to lysosomes for degradation (Behrends et al. 2010) or autophagosomes can fuse with the plasma membrane and release their contents into extracellular space in membrane-enclosed exosomes (Manjithaya and Subramani 2011). In a recent study, proteomics was used to study autophagosome-associated proteins in human breast cancer cells after various stimuli (Dengjel et al. 2012). 80 proteins identified in this study were also present in our secretome datasets implying that autophagosome-related protein secretion is one of the secretory mechanisms activated in human primary macrophages after viral infection. Based on current data, inflammasome activating and regulating processes are speculated in Figure 14. NLRP3 inflammasome activity has been related with unconventional protein secretion (Qu et al. 2007, Keller et al. 2008). Keller et al. have shown that caspase-1 activation affects the secretion of several proteins, for example galectin-3 and peroxiredoxin-1, that are secreted via nonclassical route (Keller et al. 2008). In addition, P2X7 receptor can stimulate exosome release and IL-1 secretion in an inflammasome-depentent manner (Qu et al. 2007). Finally, inflammasome components as well as active forms of IL-1 and IL-18 are secreted from activated cells via nonclassical mechanisms (Martinon et al. 2002, Qu et al. 2007). Microvesicles have been shown to transfer functional, signaling competent receptors from one cell to another (Mack et al. 2000). Exosomes are probably the most studied group of microvesicles that can be used to transmit signals between cells. Our secretome data showed that several exosome-related proteins, lysosomal proteins and vesicle-related proteins as well as inflammasome components Caspase-1 and ASC are secreted from influenza A virus infected and polyI:C transfected human primary macrophages. The unconventionally secreted proteins also included several DAMPs, for example galectin-3, S100-A8 and S100-A9. This indicates that inflammasome activation-related unconventional protein secretion and especially secretion of inflammasome components might have an important role in cell-to-cell signaling after viral infection. 47 Figure 14. Cellular processes potentially related with influenza A virus-induced inflammasome activity. 1) Src tyrosine kinase- and NADPH oxidase-mediated ROS is involved in inflammasome activation during phagocytosis. 2) Lysosomal rupture and cytosolic cathepsins B and D are associated with ROS and inflammasome activation. 3) P2X 7 receptor activation results in K+ efflux and participates in inflammasome activation 4) Influenza A virus protein M2 facilitates H+ efflux from Golgi to cytosol activating the inflammasome 5) Inflammasome activation triggers autophagocytosis of intracellular proteins (e.g. inflammasome components) therefore inhibiting inflammasome activation. Autophagocytosis might be followed by either degradation of infammasome components or by their secretion from autophagosomes. 48 6. CONCLUSIONS AND FUTURE PERSPECTIVES In this project we have combined proteomics, bioinformatics and immunological studies to characterize virus-induced changes in human primary macrophages and HaCaT keratinocytes. We show that influenza A virus infection triggers significant changes in intracellular proteomes of human primary macrophages. The production of several antiviral proteins is initiated as a result of infection. In addition, several inflammatory pathways, especially NLRP3 inflammasome activation, are activated in influenza A virus infected and polyI:C transfected human primary macrophages. Inflammasome activation in human macrophages was regulated by cathepsins, src tyrosine kinase and P2X7 receptor activities and was followed by the initiation of apoptotic events only few hours post stimulation. Caspase-3-dependent apoptosis was detected in polyI:C transfected HaCaT keratinocytes as well as in influenza A virus infected and polyI:C transfected human primary macrophages. Caspase activation resulted in, for example, cytokeratin-18 cleavage and changes in 14-3-3 protein expression in HaCaT keratinocytes. Finally, we show that influenza A virus infection and polyI:C transfection trigger extensive secretion of various different proteins such as inflammatory, vesicle-related and danger signal proteins from human primary macrophages which might have an important role in cell-to-cell signaling after viral infection. In conclusion, our studies show that quantitative subcellular proteomics and secretome analysis is a powerful tool for system-wide studies of cellular antiviral defense responses. Cellular signaling events are often regulated by protein posttranslational modifications such as phosphorylation and ubiquitination and especially protein ubiquitination has been recently associated with various innate immune signaling events. Therefore, large-scale studies of protein phosphorylation and ubiquitination could give deeper insights into regulation of host cell innate immune responses. Finally, to be able to utilize the full potential of large-scale proteomic studies, close collaboration between proteomics, bioinformatics and biology experts is required. 49 ACKNOWLEDGEMENTS This project was carried out at the Institute of Biotechnology, University of Helsinki in close collaboration with the Unit of Excellence in Immunotoxicology, Finnish Institute of Occupational Health and the Department of Information Technology, University of Turku. The Institute of Biotechnology is acknowledged for providing excellent facilities for the research. The work was funded by the Academy of Finland and the Helsinki Graduate Program in Biotechnology and Molecular Biology (GPBM). GPBM is also thanked for providing a variety of interesting courses for PhD studies. I am extremely grateful to my supervisors Docent Tuula Nyman and Docent Sampsa Matikainen for all the guidance and help during these years. It has been a pleasure to work under your supervision. Your research enthusiasm and expertise have been important for the progress of this project. I would like to thank the reviewers of this thesis, Docent Jaana Vesterinen and Docent Sampsa Hautaniemi, for their valuable comments. Docent Jaana Vesterinen and Professor Juho Rousu are thanked for the advises and discussions in the thesis committee meetings. I am grateful to Docent Nisse Kalkkinen, the former head of the Protein Chemisry Research Group, for the opportunity to work in his well-equipped laboratory. I express my gratitude to all the coauthors of the publications. I would especially like to thank Tiina Öhman: I couldn´n have hoped for a better teacher and lab partner. I am also thankful to Johanna Kerminen for all the hard work in immunology as well as all the encouraging words. Olli Nevalainen and Jussi Salmi are thanked for the fruitful collaboration and for sharing their expertise in bioinformatics with us. Tero Aittokallio is thanked for the valuable help in the analysis of complex proteomic data. I would also like to thank all the former and current colleagues in the Protein Chemistry Group and at FIOH for creating such a pleasant working environment. I would especially like to thank the other occupants of “Lastenhuone”, Pia, Juho and Wojciech, for all the humour and refreshing discussions. Finally, I would like to thank my family, friends and Kalle for their love, patience and support. Helsinki, April 2012 50 REFERENCES Akira S, Uematsu S, Takeuchi O (2006) Pathogen recognition and innate immunity. Cell 124, 783-801. Albert R (2005) Scale-free networks in cell biology. J Cell Sci. 118, 4947-4957. Allen IC, Scull MA, Moore CB, Holl EK, McElvania-TeKippe E, Taxman DJ, Guthrie EH, Pickles RJ, Ting JP (2009) The NLRP3 inflammasome mediates in vivo innate immunity to influenza A virus through recognition of viral RNA. Immunity 30, 556-565. Andreyev AY, Shen Z, Guan Z, Ryan A, Fahy E, Subramaniam S, Raetz CRH, Briggs S, Dennis EA (2010) Application of proteomic marker ensembles to subcellular organelle identification. Mol Cell Proteomics 9, 388-402. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G (2000) Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 25, 25-29. Ayyash M, Tamimi H, Ashhab Y (2012) Developing a powerful tool for the discovery of novel caspase-3 substrates: a preliminary screening of the human proteome. BMC Bioinformatics 13, 14. Baas T, Baskin CR, Diamond DL, García-Sastre A, Bielefeldt-Ohmann H, Tumpey TM, Thomas MJ, Carter VS, Teal TH, Van Hoeven N, Proll S, Jacobs JM, Caldwell ZR, Gritsenko MA, Hukkanen RR, Camp DG II, Smith RD, Katze MG (2006) Integrated molecular signature of disease: analysis of influenza virus-infected macaques through functional genomics and proteomics. J. Virol. 80, 10813-10828. Backes C, Kuentzer J, Lenhof HP, Comtesse N, Meese E (2005) GraBCas: a bioinformatics tool for score-based prediction of caspase- and granzyme B-cleavage sites in protein sequences. Nucleic Acids Res. 33, W208-W213. Backes C, Keller A, Kuentzer J, Kneissl B, Comtesse N, Elnakady YA, Müller R, Meese E, Lenhof HP (2007) GeneTrail – advanced gene set enrichment analysis. Nucleic Acids Res. 35, W186-192. Behrends C, Sowa ME, Gygi SP, Harper JW (2010) Network organization of the human autophagy system. Nature 466, 68-76. Berggren K, Chernokalskaya E, Steinberg TH, Kemper C, Lopez MF, Diwu Z, Haugland RP, Patton WF (2000) Background-free, high sensitivity staining of proteins in one- and twodimensional sodium dodecyl sulfate-polyacrylamide gels using a luminescent ruthenium complex. Electrophoresis 21, 2509-2521. Best SM (2008) Viral subversion of apoptotic enzymes: escape from death row. Annu Rev Microbiol. 62, 171-192. 51 Bianchi ME (2007) DAMPs, PAMPs and alarmins: all we need to know about danger. J. Leukoc. Biol. 81, 1-5. Bluemlein K, Ralser M (2011) Monitoring protein expression in whole-cell extracts by targeted label- and standard-free LC-MS/MS. Nat Protoc. 6, 859-869. Boisvert F-M, Lam YW, Lamont D, Lamond AI (2010) A quantitative proteomics analysis of subcellular proteome localization and changes induced by DNA damage. Mol Cell Proteomics 9, 457-470. Boisvert F-M, Ahmad Y, Gierlinski M, Charrière F, Lamont D, Scott M, Barton G, Lamond AI (2012) A quantitative spatial proteomics analysis of proteome turnover in human cells. Mol Cell Proteomics 11, M111.011429. Bondarenko PV, Chelius D, Shaler T (2002) Identification and relative quantitation of protein mixtures by enzymatic digestion followed by capillary reversed-phase liquid chromatographytandem mass spectrometry. Anal. Chem. 74, 4741-4749. Bredemeyer AJ, Lewis RM, Malone JP, Davis AE, Gross J, Townsend RR, Ley TJ (2004) A proteomic approach for the discovery of protease substrates. Proc Natl Acad Sci 101, 1178511790. Carbon S, Ireland A, Mungall CJ, Shu S, Marshall B, Lewis S, AmiGO Hub, Web Presence Working Group (2009) AmiGO: online access to ontology and annotation data. Bioinformatics 25, 288-289. Chevallet M, Luche S, Rabilloud T (2006) Silver staining of proteins in polyacrylamide gels. Nat Protoc. 1, 1852-1858. Choi D-S, Yang J-S, Choi E-J, Jang SC, Park S, Kim OY, Hwang D, Kim KP, Kim Y-K, Kim S, Gho YS (2012) The protein interaction network of extracellular vesicles derived from human colorectal cancer cells. J Proteome Res 11, 1144-1151. Choudhary C, Mann M (2010) Decoding signalling networks by mass spectrometry-based proteomics. Nat Rev Mol Cell Biol. 11, 427-439. Collier TS, Sarkar P, Franck WL, Rao BM, Dean RA, Muddiman DC (2010) Direct comparison of stable isotope labeling by amino acids in cell culture and spectral counting for quantitative proteomics. Anal. Chem. 82, 8696-8702. Coombs KM, Berard A, Xu W, Krokhin O, Meng X, Cortens JP, Kobasa D, Wilkins J, Brown EG (2010) Quantitative proteomic analyses of influenza virus-infected cultured human lung cells. J. Virol. 84, 10888-10906. Coombs KM (2011) Quantitative proteomics of complex mixtures. Expert Rev Proteomics 8, 659-677. Craig R, Beavis RC (2004) TANDEM: matching proteins with tandem mass spectra. Bioinformatics 20, 1466-1467. 52 Craig R, Cortens JC, Fenyo D, Beavis RC (2006) Using annotated peptide mass spectrum libraries for protein identification. J Proteome Res. 5, 1843-1849. Crawford ED, Wells JA (2011) Caspase substrates and cellular remodeling. Annu Rev Biochem. 80, 1055-1087. Dayon L, Hainard A, Licker V, Turck N, Kuhn K, Hochstrasser DF, Burkhard PR, Sanches J-C (2008) Relative quantification of proteins in human cerebrospinal fluids by MS/MS using 6plex isobaric tags. Anal. Chem. 80, 2921-2931. Dengjel J, HØyer-Hansen M, Nielsen MO, Eisenberg T, Harder LM, Schandorff LM, Farkas T, Kirkegaard T, Becker AC, Schroeder S, Vanselow K, Lundberg E, Nielsen MM, Kristensen AR, Akimov V, Bunkenborg J, Madeo F, Jäättelä M, Andersen JS (2012) Identification of autophagosome-associated proteins and regulators by quantitative proteomic analysis and genetic screens. Mol Cell Proteomics 11, M111.014035. Deutsch EW, Lam H, Aebersold R (2008) Data analysis and bioinformatics tools for tandem mass spectrometry in proteomics. Physiol Genomics 33, 18-25. Di Michele M, Macrone S, Cicchillitti L, Della Corte A, Ferlini C, Scambia G, Donati MB, Rotilio D (2010) Glycoproteomics of paclitaxel resistance in human epithelial ovarian cancer cell lines : towards the identification of putative biomarkers. J Proteomics 73, 879-898. van Diepen A, Brand HK, Sama I, Lambooy LHJ, van den Heuvel LP, van der Well L, Huynen M, Osterhaus ADME, Andeweg AC, Hermans PWM (2010) Quantitative proteome profiling of respiratory virus-infected lung epithelial cells. J Proteomics 73, 1680-1693. Dimmer EC, Huntley RP, Barrell DG, Binns D, Draghici S, Camon EB, Hubank M, Talmud PJ, Apweiler R, Lovering RC (2008) The Gene Ontology - providing a functional role in proteomic studies. Practical Proteomics 1. Dixit E, Boulant S, Zhang Y, Lee AS, Odendall C, Shum B, Hacohen N, Chen ZJ, Whelan SP, Fransen M, Nibert ML, Superti-Furga G (2010) Peroxisomes are signaling platforms for antiviral innate immunity. Cell 141, 668-681. Domon B, Aebersold R (2006) Mass spectrometry and protein analysis. Science 312, 212-217. Dostert C, Pétrilli V, Van Bruggen R, Steele C, Mossman BT, Tschopp J (2008) Innate immune activation through NLRP3 inflammasome sensing of asbestos and silica. Science 320, 674-677. Du R, Long J, Yao J, Dong Y, Yang X, Tang S, Zuo S, He Y, Chen X (2010) Subcellular quantitative proteomics reveals multiple pathway cross-talk that coordinates specific signaling and transcriptional regulation for the early host response to LPS. J Proteome Res 9, 1805-1821. Dwivedi RC, Spicer V, Harder M, Antonovici M, Ens W, Standing KG, Wilkins JA, Krokhin OV (2008) Practical implementation of 2D HPLC scheme with accurate peptide retention prediction on both dimensions for high-throughput bottom-up proteomics. Anal Chem. 80, 7036-7042. Elias JE, Haas W, Faherty BK, Gygi SP (2005) Comparative evaluation of mass spectrometry platforms used in large-scale proteomics investigations. Nat Methods 2, 667-675. 53 Elias JE, Gygi SP (2007) Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry. Nat Methods 4, 207-214. Elliott MH, Smith DS, Parker CE, Borchers C (2009) Current trends in quantitative proteomics. J Mass Spectrom. 44, 1637-1660. Emmott E, Wise H, Loucaides EM, Matthews DA, Digard P, Hiscox JA (2010a) Quantitative proteomics using SILAC coupled to LC-MS/MS reveals changes in the nuclear proteome in influenza A virus-infected cells. J Proteome Res. 9, 5335-5345. Emmott E, Rodgers MA, Macdonald A, McCrory S, Ajuh P, Hiscox JA (2010b) Quantitative proteomics using stable isotope labeling with amino acid in cell culture reveals changes in the cytoplasmic, nuclear, and nucleolar proteomes in vero cells infected with the coronavirus infectious bronchitis virus. Mol Cell Proteomics 9, 1920-1936. Eng JK, McCormack AL, Yates JR III (1994) An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J Am Soc Mass Spectrom 5, 976-989. Fang Y, Robinson DP, Foster LJ (2010) Quantitative analysis of proteome coverage and recovery rates for upstream fractionation methods in proteomics. J Proteome Res. 9, 1902-1912. Fenn JB, Mann M, Meng CK, Wong SF, Whitehouse CM (1989) Electrospray ionization for mass spectrometry of large biomolecules. Science 246, 64-71. Fields S (2001) Proteomics in genomeland. Science 291, 1221-1224. Gack MU, Albrecht RA, Urano T, Inn KS, Huang IC, Carnero E, Farzan M, Inoue S, Jung JU, García-Sastre A (2009) Influenza A virus NS1 targets the ubiquitin ligase TRIM25 to evade recognition by the host viral RNA sensor RIG-I. Cell Host Microbe 5, 439-449. Gannagé M, Dormann D, Albrecht R, Dengjel J, Torossi T, Rämer PC, Lee M, Strowig T, Arrey F, Conenello G, Pypaert M, Andersen J, García-Sastre A, Münz C (2009) Matrix protein 2 of influenza A virus blocks autophagosome fusion with lysosomes. Cell Host Microbe 6, 367380. Gardy JL, Lynn DJ, Brinkman FSL, Hancock REW (2009) Enabling a systems biology approach to immunology: focus on innate immunity. Trends Immunol. 30, 249-262. Geiger T, Wehner A, Schaab C, Cox J, Mann M (2012) Comparative proteomic analysis of eleven common cell lines reveals ubiquitous but varying expression of most proteins. Mol Cell Proteomics 11, M111.014050. Gevaert K, Vandekerckhove J (2000) Protein identification methods in proteomics. Electrophoresis 21, 1145-1154. Gevaert K, Van Damme P, Martens L, Vandekerckhove J (2005) Diagonal reverse-phase chromatography applications in peptide-centric proteomics: Ahead of catalogue-omics? Anal Biochem. 345, 18-29. 54 Giannoni E, Taddei ML, Chiarugi P (2010) Src redox regulation: Again in the front line. Free Radical Biology & Medicine 49, 516-527. Gilar M, Olivova P, Daly AE, Gebler JC (2005) Orthogonality of separation in two-dimensional liquid chromatography. Anal. Chem. 77, 6426-6434. Goh WWB, Lee YH, Chung M, Wong L (2012) How advancement in biological network analysis methods enpowers proteomics. Proteomics 12, 550-563. Gygi SP, Rist B, Gerber SA, Turecek F, Gelb MH, Aebersold R (1999) Quantitative analysis of complex protein mixtures using isotope-coded affinity tags. Nat Biotechnology 17, 994-999. Görg A, Weiss W, Dunn MJ (2004) Current two-dimensional electrophoresis technology for proteomics. Proteomics 4, 3665-3685. Hansen KC, Schmitt-Ulms G, Chalkley RJ, Hirsch J, Baldwin MA, Burlingame AL (2003) Mass spectrometric analysis of protein mixtures at low levels using cleavable 13C-isotopecoded affinity tag and multidimensional chromatography. Mol Cell Proteomics 2, 299-314. Hao L, Sakurai A, Watanabe T, Sorensen E, Nidholm CA, Newton MA, Ahlquist P, Kawaoka Y (2008) Drosophila RNAi screen identifies host genes important for influenza virus replication. Nature 454, 890-893. Hartwig S, Feckler C, Lehr S, Wallbrecht K, Wolgast H, Müller-Wieland D, Kotzka J (2009) A critical comparison between two classical and a kit-based method for mitochondria isolation. Proteomics 9, 3209-3214. Helsens K, Timmerman E, Vandekerckhove J, Gevaert K, Martens L (2008) Peptizer, a tool for assessing false positive peptide identifications and manually validating selected results. Mol Cell Proteomics 7, 2364-2372. Hillenkamp F, Karas M (1990) Mass spectrometry of peptides and proteins by matrix-assisted ultraviolet laser desorption/ionization. Methods Enzymol. 193, 280-295. Hjerpe R, Aillet F, Lopitz-Otsoa F, Lang V, England P, Rodriguez MS (2009) Efficient protection and isolation of ubiquitylated proteins using tandem ubiquitin-binding entities. EMBO Rep. 10, 1250-1258. Holland C, Schmid M, Zimny-Arndt U, Rohloff J, Stein R, Jungblut PR, Meyer TF (2011) Quantitative phosphoproteomics reveals link between Helicobacter pylori infection and RNA splicing modulation in host cells. Proteomics 11, 2798-2811. Hornung V, Bauernfeind F, Halle A, Samstad EO, Kono H, Rock KL, Fitzgerald KA, Latz E (2008) Silica crystals and aluminum salts activate NALP3 inflammasome through phagosomal destabilization. Nat Immunol. 9, 847-856. Hornung V, Ablasser A, Charrel-Dennis M, Bauernfeind F, Horvath G, Caffrey DR, Latz E, Fitzgerald KA (2009) AIM2 recognizes cytosolic dsDNA and forms a caspase-1-activating inflammasome with ASC. Nature 458, 514-518. 55 Horvatovich P, Hoekman B, Govorukhina N, Bischoff R (2010) Multidimensional chromatography coupled to mass spectrometry in analysing complex proteomics samples. J. Sep. Sci. 33, 1421-1437. Hu Q, Noll RJ, Li H, Makarov A, Hardman M, Graham Cooks R (2005) The Orbitrap : a new mass spectrometer. J Mass Spectrom. 40, 430-443. Ichinohe T, Pang IK, Iwasaki A (2010) Influenza virus activates inflammasomes via its intracellular M2 ion channel. Nat Immunol. 11, 404-410. Janeway CA, Medzhitov R (2002) Innate immune recognition. Annu. Rev. Immunol. 20, 197216. Jensen LJ, Kuhn M, Stark M, Chaffron S, Creevey C, Muller J, Doerks T, Julien P, Roth A, Simonovic M, Bork P, von Mering C (2009) STRING 8 – a global view on proteins and their functional interactions in 630 organisms. Nucleic Acids Res. 37, D412-416. Johnsen IB, Nguyen TT, Ringdal M, Tryggestad AM, Bakke O, Lien E, Espevik T, Anthonsen MW (2006) Toll-like receptor 3 associates with c-Src tyrosine kinase on endosomes to initiate antiviral signaling. EMBO J. 25, 3335-3346. Joshi-Tope G, Gillespie M, Vastrik I, D´Eustachio P, Schmidt E, de Bono B, Jassal B, Gopinath GR, Wu GR, Matthews L, Lewis S, Birney E, Stein L (2005) Reactome: a knowledgebase of biological pathways. Nucleic Acids Res. 33, D428-432. Josset L, Frobert E, Rosa-Calatrava M (2008) Influenza A replication and host nuclear compartments : Many changes and many questions. J Clin Virol. 43, 381-390. Kallio MA, Tuimala JT, Hupponen T, Klemelä P, Gentile M, Scheinin I, Koski M, Käki J, Korpelainen EI (2011) Chipster: user-friendly analysis software for microarray and other highthroughput data. BMC Genomics 12, 507. Kaminskyy V, Zhivotovsky B (2010) To kill or be killed : how viruses interact with cell death machinery. J Intern Med. 267, 473-482. Kanneganti TD, Body-Malapel M, Amer A, Park JH, Whitfield J, Franchi L, Taraporewala ZF, Miller D, Patton JT, Inohara N, Nuñes G (2006) Critical role for Cryopyrin/Nalp3 in activation of caspase-1 in response to viral infection and double-stranded RNA. J Biol Chem. 281, 3656036568. Karas M, Bachmann D, Hillenkamp F (1985) Influence of the wavelength in high- irradiance ultraviolet laser desorption mass spectrometry of organic molecules. Anal. Chem. 57, 29352939. Karlas A, Machuy N, Shin Y, Pleissner K-P, Artarini A, Heuer D, Becker D, Khalil H, Ogilvie LA, Hess S, Mäurer AP, Müller E, Wolff T, Rudel T, Meyer TF (2010) Genome-wide RNAi screen identifies human host factors crucial for influenza virus replication. Nature 463, 818-822. Kelleher NL, Lin HY, Valaskovic GA, Aaresund DJ, Fridriksson EK, McLafferty FW (1999) Top down versus bottom up protein characterization by tandem high-resolution mass spectrometry. J. Am. Chem. Soc. 121, 806-812. 56 Keller A, Nesvizhskii A, Kolker E, Aebersold R (2002) Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search. Anal. Chem. 74, 5383-5392. Keller M, Rüegg A, Werner S, Beer HD (2008) Active caspase-1 is a regulator of unconventional protein secretion. Cell 132, 818-831. Kim M-S, Pandey A (2012) Electron transfer dissociation mass spectrometry in proteomics. Proteomics 12, 1-13. Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb M, Harper JW, Gygi SP (2011) Systemic and quantitative assessment of the ubiquitin-modified proteome. Mol Cell 44, 325-340. Koponen J, Laakso K, Koskenniemi K, Kankainen M, Savijoki K, Nyman TA, de Vos WM, Tynkkynen S, Kalkkinen N, Varmanen P (2011) Effect of acid stress on protein expression and phosphorylation in Lactobacillus rhamnosus GG. J Proteomics 75, 1357-1374. Kumar H, Kawai T, Akira S (2011) Pathogen recognition by the innate immune system. Int Rev Immunol. 30, 16-34. Kupper TS, Fuhlbrigge RC (2004) Immune surveillance in the skin: mechanisms and clinical consequences. Nat Rev Immunol. 4, 211-222. König R, Stertz S, Zhou Y, Inoue A, Hoffmann H-H, Bhattacharyya S, Alamares JG, Tscherne DM, Ortigoza MB, Liang Y, Gao Q, Anrews SE, Bandyopadhyay S, De Jesus P, Tu BP, Pache L, Shih C, Orth A, Bonamy G, Miraglia L, Ideker T, García-Sastre A, Young JAT, Palese P, Shaw ML, Chanda SK (2010) Human host factors required for influenza virus replication. Nature 463, 813-817. Lam H, Deutsch EW, Eddes JS, Eng JK, King N, Stein SE, Aebersold R (2007) Development and validation of a spectral library method for peptide identification from MS/MS. Proteomics 7, 655-667. Lam H (2011) Building and searching tandem mass spectral libraries for peptide identification. Mol Cell Proteomics 10, R111.008565. Lamkanfi M, Dixit VM (2010) Manipulation of host cell death pathways during microbial infections. Cell Host Microbe 8, 44-54. Lee HJ, Na K, Kwon MS, Park T, Kim KS, Kim H, Paik YK (2011) A new versatile peptidebased size exclusion chromatography platform for global profiling and quantitation of candidate biomarkers in hepatocellular carcinoma specimens. Proteomics 11, 1976-1984. Lee YH, Tan HT, Chung MCM (2010) Subcellular fractionation methods and strategies for proteomics. Proteomics 10, 3935-3956. Lemeer S, Kunold E, Klaeger S, Raabe M, Towers MW, Claudes E, Arrey TN, Strupat K, Urlaub H, Kuster B (2012) Phosphorylation site localization in peptides by MALDI MS/MS and the Mascot Delta Score. Anal Bioanal Chem. 402, 249-260. 57 Li S, Wang L, Berman M, Kong YY, Dorf ME (2011) Mapping a dynamic innate immunity protein interaction network regulating type I interferon production. Immunity 35, 426-440. Liu H, Sadykov RG, Yates JR 3rd (2004) A model for random sampling and estimation of relative protein abundance in shotgun proteomics. Anal. Chem. 76, 4193-4201. Liu N, Song W, Wang P, Lee K, Chan W, Chen H, Cai Z (2008) Proteomics analysis of differential expression of cellular proteins in response to avian H9N2 virus infection in human cells. Proteomics 8, 1851-1858. Lu P, Vogel C, Wang R, Yao X, Marcotte EM. Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation. Nat. Biotechnol. 25, 117-124. Luber CA, Cox J, Lauterbach H, Fancke B, Selbach M, Tschopp J, Akira S, Wiegand M, Hochrein H, O´Keeffe M, Mann M (2010) Quantitative proteomics reveals subset-specific viral recognition in dendritic cells. Immunity 32, 279-289. Ludwig S, Planz O, Pleschka S, Wolff T (2003) Influenza-virus-induced signalling cascades: targets for antiviral therapy? Trends Mol. Medicine 9, 46-52. Mack M, Kleinschmidt A, Brühl H, Klier C, Nelson PJ, Cihak J, Plachý J, Stangassinger M, Erfle V, Schlöndorff D (2000) Transfer of the chemokine receptor CCR5 between cells by membrane-derived microparticles: a mechanism for cellular human immunodeficiency virus 1 infection. Nat Med. 6, 769-775. Makarov A (2000) Electrostatic axially harmonic orbital trapping : a high-performance technique of mass analysis. Anal. Chem. 72, 1156-1162. Malik R, Dulla K, Nigg EA, Körner R (2010) From proteome lists to biological impact – tools and strategies for the analysis of large MS data sets. Proteomics 10, 1270-1283. Manjithaya R, Subramani S (2011) Autophagy : a broad role in unconventional protein secretion ? Trends Cell Biol. 21, 67-73. Marouga R, David S, Hawkins E (2005) The development of the DIGE system: 2D fluorescence difference gel analysis technology. Anal Bioanal Chem 382, 669-678. Martinon F, Burns K, Tschopp J (2002) The inflammasome : a molecular platform triggering activation of inflammatory caspases and processing of proIL-beta. Mol Cell 10, 417-426. Martinon F, Mayor A, Tschopp J (2009) The inflammasomes: guardians of the body. Annu Rev Immunol. 27, 229-265. Martins-de-Souza D, Maccarrone G, Reckow S, Falkai P, Schmitt A, Turck CW (2009) Shotgun mass spectrometry analysis of the human thalamus proteome. J. Sep. Sci. 32, 1231-1236. Mathaiavan S, Simpson RJ (2009) ExoCarta: a compendium of exosomal proteins and RNA. Proteomics 21, 4997-5000. 58 Mathivanan S, Fahner CJ, Reid GE, Simpson RJ (2012) ExoCarta 2012: database of exosomal proteins, RNA and lipids. Nucleic Acids Res. 40, 1241-1244. Medzhitov R (2007) Recognition of microorganisms and activation of the immune response. Nature 449, 819-826. Medzhitov R (2008) Origin and physiological roles of inflammation. Nature 454, 428-435. Molina H, Yang Y, Ruch T, Kim JW, Mortensen P, Otto T, Nalli A Tang QQ, Lane MD, Chaerkady R, Pandey A (2009) Temporal profiling of the adipocyte proteome during differentiation using a five-plex SILAC based strategy. J Proteome Res 8, 48-58. Monetti M, Nagaraj N, Sharma K, Mann M (2011) Large-scale phosphosite quantification in tissues by a spike-in SILAC method. Nat Methods 8, 655-658. Mosley AL, Florens L, Wen Z, Washburn MP (2009) A label free quantitative proteomics analysis of the Saccharomyces cerevisiae nucleus. J Proteomics 72, 110-120. Motoyama A, Yates JR III (2008) Multidimensional LC separations in shotgun proteomics. Anal. Chem. 80, 7187-7193. Murphy K, Travers P and Walport M (2008) Janeway´s Immunobiology, Seventh edition (Garland Science, Taylor & Francis Group, LLC), 5-50. Nagaraj N, Wisniewski JR, Geiger T, Cox J, Kircher M, Kelso J, Pääbo S, Mann M (2011) Deep proteome and transcriptome mapping of human cancer cell line. Mol Syst Biol. 7, 548. Nagaraj N, Kulak NA, Cox J, Neuhaus N, Mayr K, Hoerning O, Vorm O, Mann M (2012) Systems-wide perturbation analysis with near complete coverage of yeast proteome by singleshot UHPLC runs on a bench-top Orbitrap. Mol Cell Proteomics 11, M111.013722. Naji S, Ambrus G, Cimermancic P, Reyes JR, Johnson JR, Filbrandt R, Huber MD, Veseley P, Krogan NJ, Yates JR III, Saphire AC, Gerace L (2012) Host cell interactome of HIV-1 Rev includes RNA helicases involved in multiple facets of virus production. Mol Cell Proteomics 11, M111.015313. Neilson KA, Ali NA, Muralidharan S, Mirzaei M, Mariani M, Assadourian G, Lee A, van Sluyter SC, Haynes PA (2011) Less label, more free: approaches in label-free quantitative mass spectrometry. Proteomics 11, 535-553. Nesvizhskii AI (2007) Protein identification by tandem mass spectrometry and sequence database searching. Methods Mol Biol. 367, 87-119. Nickel W (2005) Unconventional secretory routes: direct protein export across the plasma membrane of mammalian cells. Traffic. 6, 607-614. Nickel W (2010) Pathways of unconventional protein secretion. Curr Opin Biotechnol. 21, 621626. Nielsen H, Engelbrecht J, Brunak S, von Heijne G (1997) Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. Protein eng. 10, 1-6. 59 O´Connell KL, Stults JT (1997) Identification of mouse liver proteins on two-dimensional electrophoresis gels by martrix-assisted laser desorption/ionization mass spectrometry of in situ enzymatic digests. Electrophoresis 18, 349-359. Ogata H, Goto S, Sato K, Fujibuchi W, Bono H, Kanehisa M (1999) KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 27, 29-34. Old WM, Meyer-Arendt K, Aveline-Wolf L, Pierce KG, Mendoza A, Sevinsky JR, Resing KA, Ahn NG (2005) Comparison of label-free methods for quantifying human proteins by shotgun proteomics. Mol Cell Proteomics 4, 1487-1502. Ong S-E, Blagoev B, Kratchmarova I, Kristensen DB, Steen H, Pandey A, Mann M (2002) Stable isotope labeling by amino acids in cell culture, SILAC, as a simple and accurate approach to expression proteomics. Mol Cell Proteomics 1, 376-386. Ong S-E, Kratchmarova I, Mann M (2003) Properties of 13C-substituted arginine in stable isotope labeling by amino acids in cell culture (SILAC). J Proteome Res. 2, 173-181. Perkins DN, Pappin DJ, Creasy DM, Cottrell JS (1999) Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis 20, 3551-3567. Petersen TN, Brunak S, von Heijne G, Nielsen H (2011) SignalP 4.0 : discriminating signal peptides from transmemebrane regions. Nat Methods 8, 785-786. Pfeiffer ZA, Guerra AN, Hill LM, Gavala ML, Prabhu U, Aga M, Hall DJ, Bertics PJ (2007) Nucleotide receptor signaling in murine macrophages is linked to reactive oxygen species generation. Free Radic Biol Med. 42, 1506-1516. Phanstiel DH, Brumbaugh J, Wenger CD, Tian S, Probasco MD, Bailey DJ, Swaney DL, Tervo MA, Bolin JM, Ruotti V, Stewart R, Thomson JA, Coon JJ (2011) Proteomic and phosphoproteomic comparison of human ES and iPS cells. Nat Methods 8, 821-827. Pierce A, Unwin RD, Evans CA, Griffiths S, Carney L, Zhang L, Jaworska E, Lee C-F, Blinco D, Okoniewski MJ, Miller CJ, Bitton DA, Spooncer E, Whetton AD (2008) Eight-channel iTRAQ enables comparison of the activity of six leukemogenic tyrosine kinases. Mol Cell Proteomics 7, 853-863. Pirhonen J, Sareneva T, Kurimoto M, Julkunen I, Matikainen S (1999) Virus infection activates IL-1 beta and IL-18 production in human macrophages by caspase-1-dependent pathway. J Immunol. 162, 7322-7329. Pirhonen J, Sareneva T, Julkunen I, Matikainen S (2001) Virus infection induces proteolytic processing of IL-18 in human macrophages via caspase-1 and caspase-3 activation. Eur J Immunol. 31, 726-733. Pounds JG, Flora JW, Adkins JN, Lee KM, Rana GSJB, Sengupta T, Smith RD, McKinney WJ (2008) Characterization of the mouse broncoalveolar lavage proteome by micro-capillary LCFTICR mass spectrometry. J Chromatogr B 864, 95-101. 60 Qattan AT, Mulvey C, Crawford M, Natale DA, Godovac-Zimmermann J (2010) Quantitative organelle proteomics of MCF-7 breast cancer cells reveals multiple subcellular locations for proteins in cellular functional processes. J Proteome Res 9, 495-508. Qu Y, Franchi L, Nunez G, Dubyak GR (2007) Nonclassical IL-1 beta secretion stimulated by P2X7 receptors is dependent on inflammasome activation and correlated with exosome release in murine macrophages. J Immunol. 179, 1913-1925. Rabilloud T, Chevallet M, Luche S, Lelong C (2010) Two-dimensional gel electrophoresis in proteomics : Past, present and future. J Proteomics 73, 2064-2077. Rajimakers R, Kraiczek K, de Jong AP, Mohammed S, Heck AJ (2010) Exploring the human leukocyte phosphoproteome using a microfluidic reversed-phase-TiO2-reversed-phase highperformance liquid chromatography phophochip coupled to a quadrupole time-of-flight mass spectrometer. Anal Chem. 82, 824-832. Randall RE, Goodbourn S (2008) Interferons and viruses: an interplay between induction, signalling, antiviral responses and virus countermeasures. J Gen Virol 89, 1-47. Rappsilber J, Ryder U, Lamond AI, Mann M (2002) Large-scale proteomic analysis of the human spliceosome. Genome. Res. 12, 1231-1245. Rasheed S, Yan JS, Hussaiin A, Lai B (2009) Proteomic characterization of HIV-modulated membrane receptors, kinases and signaling proteins involved in novel angiogenic pathways. J. Transl. Med. 7, 75. Rees JS, Lowe N, Armean IM, Roote J, Johnson G, Drummond E, Spriggs H, Ryder E, Russell S, ST Johnston D, Lilley KS (2011) In vivo analysis of proteomes and interactomes using Parallel Affinity Capture (iPAC) coupled to mass spectrometry. Mol Cell Proteomics 10, M110.002386. Rhee SY, Wood V, Dolinski K, Draghici S (2008) Use and misuse of the gene ontology annotations. Nat Rev Genet. 9, 509-515. Ross PR, Huang YN, Marchese JN, Williamson B, Parker K, Hattan S, Khainovski N, Pillai S, Dey S, Daniels S, Purkayastha S, Juhasz P, Martin S, Bartlet-Jones M, He F, Jacobson A, Pappin DJ (2004) Multiplexed protein quantitation in Saccharomyces cerevisiae using aminereactive isobaric tagging reagents. Mol Cell Proteomics 3, 1154-1169. Salmi J, Moulder R, Filén JJ, Nevalainen OS, Nyman TA, Lahesmaa R, Aittokallio T (2006) Quality classification of tandem mass spectrometry data. Bioinformatics 22, 400-406. Salmi J, Nyman TA, Nevalainen OS, Aittokallio T (2009) Filtering strategies for improving protein identification in high-throughput MS/MS studies. Proteomics 9, 848-860. Savijoki K, Lietzén N, Kankainen M, Alatossava T, Koskenniemi K, Varmanen P, Nyman TA (2011) Comparative proteome cataloging of Lactobacillus rhamnosus strains GG and Lc705. J Proteome Res 10, 3460-3473. Schwanhäusser B, Busse D, Li N, Dittmar G, Schuchhardt J, Wolf J, Chen W, Selbach M (2011) Global quantification of mammalian gene expression control. Nature 473, 337-342. 61 Searle BC, Turner M, Nesvizhskii AI (2008) Improving sensitivity by probabilistically combining results from multiple MS/MS search methodologies. J Proteome Res 7, 245-253. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T (2003) Cytoscape : a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498-2504. Shapira SD, Gat-Viks I, Shum BOV, Dricot A, de Grace MM, Wu L, Gupta PB, Hao T, Silver SJ, Root DE, Hill DE, Regev D, Hacohen N (2009) A physical and reculatory map of hostinfluenza interactions reveals pathways in H1N1 infection. Cell 139, 1255-1267. Shaw ML, Stone KL, Colangelo CM, Gulcicek EE, Palese P (2008) Cellular proteins in influenza virus particles. PLoS Pathogens 4, e1000085. Shen Y, Zhao R, Berger SJ, Anderson GA, Rodriguez N, Smith DR (2002) High-efficiency nanoscale liquid chromatography coupled on-line with mass spectrometry using nanoelectrospray ionization for proteomics. Anal. Chem. 74, 4235-4249. Shi C-S, Shenderov K, Huang N-N, Kabat J, Abu-Asab M, Fitzgerald KA, Sher A, Kehrl JH (2012) Activation of autophagy by inflammatory signals limits IL-1 production by targeting ubiquitinated inflammasomes for destruction. Nat Immunol. 13, 255-263. Shilov IV, Seymour SL, Patel AA, Loboda A, Tang WH, Keating SP, Hunter CL, Nuwaysir LM, Schaeffer DA (2007) The Paragon algorithm: a next generation search engine that uses sequence temperature values and feature probabilities to identify peptides from tandem mass spectra. Mol Cell Proteomics 6, 1638-1655. Takeuchi O, Akira S (2010) Pattern recognition receptors and inflammation. Cell 140, 805-820. Théry C, Ostrowski M, Segura E (2009) Membrane vesicles as conveyors of immune responses. Nat Immunology 9, 581-593. Thingholm TE, Jensen ON, Larsen MR (2009) Analytical strategies for phosphoproteomics. Proteomics 9, 1451-1468. Thomas PD, Kejariwal A, Campbell MJ, Mi H, Diemer K, Guo N, Ladunga I, Ulitsky-Lazareva B, Muruganujan A, Rabkin S, Vandergriff JA, Doremieux O (2003) PANTHER: a browsable database of gene products organized by biological function, using curated protein family and subfamily classification. Nucleic Acids Res. 31, 334-341. Thomas PG, Dash P, Aldridge JR Jr, Ellebedy AH, Reynolds C, Funk AJ, Martin WJ, Lamkanfi M, Webby RJ, Boyd KL, Doherty PC, Kanneganti TD (2009) The intracellular sensor NLRP3 mediates key innate and healing responses to influenza A virus via the regulation of caspase-1. Immunity 30, 566-575. Ting JP, Willingham SB, Bergstrahl DT (2008) NLRs at the intersection of cell death and immunity. Nat Rev Immunol. 8, 372-379. Trost M, English L, Lemieux S, Courcelles M, Desjardins M, Thibault P (2009) The phagosomal proteome in interferon--activated macrophages. Immunity 30, 143-154. 62 Ünlü M, Morgan ME, Minden JS (1997) Difference gel electrophoresis : a single method for detecting changes in protein extracts. Electrophoresis 18, 2071-2077. Vandenborre G, Van Damme EJ, Ghesquiére B, Manschaert G, Hamshou M, Rao RN, Geavert K, Smagghe G (2010) Glycosylation signatures in Drosophila : fishing with lectins. J Proteome Res. 9, 3235-3242. de Veer MJ, Holko M, Frevel M, Walker E, Der S, Paranjape JM, Silverman RH, Williams BRG (2001) Functional classification of interferon-stimulated genes identified using microarrays. J Leukoc Biol 69, 912-920. Vester D, Rapp E, Gade D, Genzel Y, Reichl U (2009) Quantitative analysis of cellular proteome alterations in human influenza A virus-infected mammalian cell lines. Proteomics 9, 1-12. Vogels MW, van Balkom BWM, Kaloyanova DV, Batenburg JJ, Heck AJ, Helms JB, Rottier PJM, de Han CAM (2011) Identification of host factors involved in coronavirus replication by quantitative proteomics analysis. Proteomics 11, 64-80. Wang X, Stewart PA, Cao Q, Sang QX, Chung LW, Emmett MR, Marshall AG (2011) Characterization of the phosphoproteome in androgen-repressed human prostate cancer cells by Fourier transform ion cyclotron resonance mass spectrometry. J Proteome Res. 10, 3920-3928. Washburn MP, Wolters D, Yates JR 3rd (2001) Large-scale analysis of the yeast proteome by multidimensional protein identification technology. Nat Biotechnology 19, 242-247. Wee LJ, Tong JC, Tan TW, Ranganathan S (2009) A multi-factor model for caspase degradome prediction. BMC Genomics 10, S3-S6. Wilkins MR, Sanches J-C, Gooley AA, Appel RD, Humphrey-Smith I, Hochstrasser DF, Williams KL (1996) Progress with proteome projects : why all proteins expressed by a genome should be identified and how to do it. Biotechnol Genet Eng Rev. 13, 19-50. Wilm M, Mann M (1996) Analytical properties of the nanoelectrospray ion source. Anal. Chem. 68, 1-8. Wisniewski JR, Nagaraj N, Zougman A, Gnad F, Mann M (2010) Brain phosphoproteome obtained by a FASP-based method reveals plasma membrane protein topology. J Proteome Res. 9, 3280-3289. Wu CJ, Chen YW, Tai JH, Chen SH (2011) Quantitative phosphoproteomics studies using stable isotope dimethyl labeling coupled with IMAC-HILIC-nanoLC-MS/MS for estrogeninduced transcriptional regulation. J Proteome Res. 10, 1088-1097. Wu J, Vallenius T, Ovaska K, Westermarck J, Mäkelä TP, Hautaniemi S (2009) Integrated network analysis platform for protein-protein interactions. Nat Methods 6, 75-77. Wu S, Metcalf JP, Wu W (2011) Innate immune response to influenza virus. Curr Opin Infect Dis. 24, 235-240. 63 Yamashita M, Fenn JB (1984) Electrospray ion source. Another variation on the free-jet theme. J. Phys. Chem. 88, 4451-4459. Yu W, Taylor JA, Davis MT, Bonilla LE, Lee KA, Auger PL, Farnsworth CC, Welcher AA, Patterson SD (2010) Maximizing the sensitivity and reliability of peptide identification in largescale proteomic experiments by harnessing multiple search engines. Proteomics 10, 1172-1189. Zeeberg BR, Feng W, Wang G, Wang MD, Fojo AT, Sunshine M, Narasimhan S, Kane DW, Reinhold WC, Lababidi S, Bussey KJ, Riss J, Barrett JC, Weinstein JN (2003) GoMiner : a resource for biological interpretation of genomic and proteomic data. Genome Biol. 4, R28. Zhao M, Antunes F, Eaton JW, Brunk UT (2003) Lysosomal enzymes promote mitochondrial oxidant production, cytochrome c release and apoptosis. Eur. J. Biochem. 270, 3778-3786. Zheng J, Sugrue RJ, Tang K (2011) Mass spectrometry based proteomic studies on viruses and hosts – A review. Anal Chim Acta 702, 149-159. Zhou H, Ning Z, Starr EA, Abu-Farha M, Figeys D (2012) Advancements in top-down proteomics. Anal. Chem. 84, 720-734. Zhou R, Tardivel A, Thorens B, Choi I, Tschopp J (2010) Thioredoxin-interacting protein links oxidative stress to inflammasome activation. Nat Immunol. 11, 136-140. Zimmermann KC, Bonzon C, Green DR (2001) The machinery of programmed cell death. Pharmacol Ther 92, 57-70. Zubarev RA, Kelleher NL, McLafferty FW (1998) Electron capture dissociation of multiply charged protein cations. A nonergodic process. J Am Chem. Soc. 120, 3265-3266. Zubarev RA, Nielsen ML, Fung EM, Savitski MM, Kel-Margoulis O, Wingender E, Kel A (2008) Identification of dominant signaling pathways from proteomics expression data. J Proteomics 71, 89-96. Öhman T, Rintahaka J, Kalkkinen N, Matikainen S, Nyman TA (2009) Actin and RIG-I/MAVS signaling components translocate to mitochondria upon influenza A virus infection of human primary macrophages. J Immunol.182,5682-5692. 64