* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download and Rhizobiales-Like PPP-Family Protein Phosphatases from
Secreted frizzled-related protein 1 wikipedia , lookup
Ribosomally synthesized and post-translationally modified peptides wikipedia , lookup
Clinical neurochemistry wikipedia , lookup
Gene nomenclature wikipedia , lookup
Ultrasensitivity wikipedia , lookup
Lipid signaling wikipedia , lookup
Artificial gene synthesis wikipedia , lookup
Silencer (genetics) wikipedia , lookup
Signal transduction wikipedia , lookup
Gene regulatory network wikipedia , lookup
Metalloprotein wikipedia , lookup
G protein–coupled receptor wikipedia , lookup
Mitogen-activated protein kinase wikipedia , lookup
Point mutation wikipedia , lookup
Homology modeling wikipedia , lookup
Phosphorylation wikipedia , lookup
Ancestral sequence reconstruction wikipedia , lookup
Gene expression wikipedia , lookup
Paracrine signalling wikipedia , lookup
Magnesium transporter wikipedia , lookup
Bimolecular fluorescence complementation wikipedia , lookup
Interactome wikipedia , lookup
Protein structure prediction wikipedia , lookup
Expression vector wikipedia , lookup
Western blot wikipedia , lookup
Nuclear magnetic resonance spectroscopy of proteins wikipedia , lookup
Protein–protein interaction wikipedia , lookup
UNIVERSITY OF CALGARY Characterization of Novel Shewanella- and Rhizobiales-Like PPP-Family Protein Phosphatases from Arabidopsis thaliana by Richard Glen Uhrig A THESIS SUBMITTED TO THE FACULTY OF GRADUATE STUDIES IN PARTIAL FULFILMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY DEPARTMENT OF BIOLOGICAL SCIENCES CALGARY, ALBERTA July, 2013 © Richard Glen Uhrig 2013 Abstract Reversible protein phosphorylation catalyzed by protein kinases and phosphatases represents the most prolific and currently best-characterized post-translational modification. Over the last decade advancements in genome sequencing technologies has massively increased genomic databases, resulting in the identification of previously unannotated protein kinases and phosphatases from multiple organisms. The primary goal of the research presented here was to elucidate the evolutionary, biochemical, cellular and biological characteristics of two recently identified PPP-family protein phosphatase subclasses from the model photosynthetic Eukaryote Arabidopsis thaliana. These two subclasses included the Shewanella-like (SLP1 and 2) and Rhizobiales-like (RLPH2) phosphatases, which were named after their relatedness to phosphatase orthologs from Shewanella and Rhizobiales bacteria. Heterologous protein phosphatase expression in, and purification from, Escherichia coli revealed unique biochemical characteristics including a complete insensitivity to PPP-family protein phosphatase inhibitors okadaic acid and microcystin-LR, as well as diversity in their phosphorylated substrate specificities. Bioinformatics complemented cell biology also uncovered unique subcellular localizations for each enzyme, with AtSLP1, 2 and AtRLPH2 being chloroplastic, mitochondrial and nuclear / cytosolic, respectively. Further identification of the AtSLP1 and AtSLP2 protein interactome from both plant tissues and cell culture was accomplished by employing a tandem affinity purification protein isolation strategy, with specific protein interactor complements indicating independent regulatory roles within plant cells involving chloroplast energy biosynthesis and mitochondrial intermembrane space processes, respectively. Lastly, phenotypic analysis of atslp2 insertional mutant and 35S::AtSLP2 over-expression plants revealed biological involvement in regulating gibberellic acid-related processes during seed germination as well as ii influencing fatty acid and amino acid contents during seed maturation. Research presented here has made significant progress in our understanding of the previously uncharacterized SLP and RLPH PPP-family phosphatase subclasses from Arabidopsis thaliana, by resolving their molecular evolution, in addition to their subcellular, cellular and biochemical properties. A functional understanding of AtSLP1 and AtSLP2 was also achieved through the identification of their protein interaction partners, which, in conjunction with insertional knockout and protein over-expression plant lines, lays a solid foundation for future research endeavors looking to examine bacterial-like phosphatases from photosynthetic and non-photosynthetic Eukaryotes alike. iii Acknowledgements I would like to thank past and present members of the Moorhead lab for fruitful conversations throughout the duration of my degree. I would also like to thank undergraduate students Rolaine Bu, Keaton Colville and Arshpreet Deol for their contributions. Thanks also to Dr. Howard Ceri for use of his confocal microscope and Dr. Gordon Chua for use of his microplate reader. I would also like to extend my sincerest thanks to Dr. Greg Moorhead for giving me the opportunity to work in his lab and for being a supportive and encouraging mentor over these past years. Research was supported by National Science and Engineering Research Council of Canada (NSERC) Alexander Graham Bell CGS-D, Alberta Ingenuity Technology Futures (AITF) Graduate Student Scholarship, The Izaak Walton Killam Doctoral Scholarship and The Dean's Doctoral Scholarship. Collaborations with: Dr. Anne-Claude Gingras, Dr. Marcus Samuel, Dr. Edward Yeung and Dr. Alisdair Fernie iv Dedication To my Family for a lifetime of encouragement and support. v Table of Contents Abstract ............................................................................................................................... ii Acknowledgements ............................................................................................................ iv Dedication ............................................................................................................................v Table of Contents ............................................................................................................... vi List of Tables .......................................................................................................................x List of Figures and Illustrations ......................................................................................... xi List of Manuscripts and Contributions ..............................................................................xv List of Symbols, Abbreviations and Nomenclature ........................................................ xvii Epigraph .............................................................................................................................xx CHAPTER ONE: INTRODUCTION ..................................................................................1 1.1 Protein phosphorylation .............................................................................................1 1.2 Protein phosphatase families .....................................................................................3 1.3 Plant PPP protein phosphatases .................................................................................7 1.3.1 Type 1 (PP1) protein phosphatases ...................................................................9 1.3.2 Type 2A (PP2A) protein phosphatases............................................................14 1.3.3 Type 4 & 6 (PP4 & PP6) protein phosphatases ...............................................18 1.3.4 Type 5 (PP5) protein phosphatases .................................................................20 1.3.5 Type 7 (PP7) protein phosphatases .................................................................22 1.3.6 Novel plant bacterial-like PPP protein phosphatases ......................................23 1.3.7 Prokaryotic protein phosphorylation and PPP protein phosphatases ..............24 1.4 Research Objectives .................................................................................................26 CHAPTER TWO: EVOLUTION OF BACTERIAL-LIKE PPP PROTEIN PHOSPHATASES IN PHOTOSYNTHETIC EUKARYOTES FEATURES ANCESTRAL MITOCHONDRIAL ORIGIN OVERLAID BY HORIZONTAL GENE TRANSFER ..............................................................................................................28 2.1 Introduction ..............................................................................................................28 2.2 Materials and Methods.............................................................................................30 2.2.1 Multiple sequence alignments .........................................................................30 2.2.2 Candidate sequence search, retrieval and validation .......................................30 2.2.3 Phylogenetic tree inference .............................................................................32 2.2.4 Subcellular localization prediction ..................................................................33 2.2.5 Analysis of sequence motifs and gene architecture .........................................34 2.3 Results ......................................................................................................................34 2.3.1 Eukaryotic bacterial-like SLP and RLPH protein phosphatases are PPP protein phosphatases ....................................................................................................34 2.3.2 Distribution and interrelationships of bacterial-like PPP phosphatases ..........39 2.3.2.1 SLP phosphatases ..................................................................................39 2.3.2.2 RLPH phosphatases ...............................................................................44 2.3.3 Sequence motif identification ..........................................................................46 2.3.4 In silico gene structure analysis of bacterial-like PPP phosphatases from photosynthetic Eukaryotes ...............................................................................49 2.4 Discussion ................................................................................................................49 2.5 Conclusion ...............................................................................................................59 vi CHAPTER THREE: BACTERIAL-LIKE SLP PROTEIN PHOSPHATASES FROM ARABIDOPSIS THALIANA ARE HIGHLY CONSERVED PLANT PROTEINS THAT POSSESS UNIQUE PROPERTIES .........................................................................61 3.1 Introduction ..............................................................................................................61 3.2 Materials and Methods.............................................................................................62 3.2.1 Bioinformatics .................................................................................................62 3.2.2 Molecular cloning ............................................................................................62 3.2.3 Cell culture transfection and protoplast creation .............................................63 3.2.4 Transient AtSLP expression in Vicia faba epidermal leaf cells ......................64 3.2.5 Microscopy ......................................................................................................64 3.2.6 Heterologous protein expression and antibody production .............................65 3.2.7 Enzymatic analysis ..........................................................................................67 3.3 Results ......................................................................................................................68 3.3.1 AtSLP1 and AtSLP2 localize to different cellular compartments...................68 3.3.2 Temporal and spatial expression of AtSLP1 and AtSLP2 ..............................73 3.3.3 AtSLP phosphatase primary sequence and enzymatic properties ...................80 3.3.4 Characterization of purified untagged AtSLP1 ...............................................85 3.4 Discussion ................................................................................................................88 3.4.1 Subcellular targeting of the AtSLP phosphatases ...........................................88 3.4.2 Temporal and spatial differences in AtSLP phosphatase expression ..............92 3.4.3 Conservation of essential PPP protein phosphatase motifs in AtSLP1 and AtSLP2 ..........................................................................................................................94 3.4.4 Metal cation preferences of AtSLP phosphatases ...........................................95 3.4.5 Inhibition by classic PPP protein phosphatase inhibitors ................................96 3.5 Conclusion ...............................................................................................................98 CHAPTER FOUR: PROTEIN INTERACTOME ANALYSIS OF SLP PHOSPHATASES 1 AND 2 FROM A. THALIANA ..................................................................................99 4.1 Introduction ..............................................................................................................99 4.2 Materials and Methods...........................................................................................102 4.2.1 Plant growth conditions .................................................................................102 4.2.2 Molecular cloning and expression .................................................................102 4.2.3 A. thaliana cell culture transfection and TAP pull-downs ............................103 4.2.4 Mass spectrometry .........................................................................................104 4.2.5 Transient tobacco BY2 cell expression and imaging ....................................106 4.2.6 Isolation of A. thaliana cell culture mitochondria .........................................107 4.2.7 Enzymatic analysis ........................................................................................108 4.2.8 Antibody production ......................................................................................109 4.2.9 Western immunoblotting ...............................................................................109 4.2.10 Native polyacrylamide gel electrophoresis (PAGE) ...................................110 4.3 Results ....................................................................................................................110 4.3.1 AtSLP1 specifically interacts with chloroplast F1 ATP synthase subunits β and γ ........................................................................................................................110 4.3.2 AtSLP2 interacts with mitochondrial redox relay protein AtMIA40 ............113 4.3.3 Reciprocal interaction analysis supports a specific AtSLP2-AtMIA40 interaction ........................................................................................................................116 4.3.4 AtSLP2 and AtMIA40 co-localize to the mitochondria ................................117 vii 4.3.5 AtMIA40 activates the Ser/Thr phosphatase activity of AtSLP2 .................120 4.4 Discussion ..............................................................................................................127 4.4.1 AtSLP1 phosphatase interacts with chloroplast F1 ATP synthase subunits β and γ. ........................................................................................................................127 4.4.2 AtSLP2 phosphatase is a novel client of redox relay protein AtMIA40 .......130 4.4.3 Co-localization of both AtSLP2 and AtMIA40 to the mitochondria ............136 4.5 Conclusion .............................................................................................................137 CHAPTER FIVE: EXPLORING THE BIOLOGICAL SIGNIFICANCE OF ATSLP1 AND ATSLP2: A REVERSE GENETIC APPROACH ..................................................139 5.1 Introduction ............................................................................................................139 5.2 Methods .................................................................................................................142 5.2.1 Plant growth conditions and seed weight ......................................................142 5.2.2 Molecular cloning and expression .................................................................143 5.2.3 Seed extraction ..............................................................................................143 5.2.4 Plant DNA isolation and PCR genotyping ....................................................143 5.2.5 Immunoblot analysis .....................................................................................144 5.2.6 Quantitative PCR analysis .............................................................................144 5.2.7 Metabolite analysis ........................................................................................147 5.3 Results ....................................................................................................................147 5.3.1 Generation of homozygous insertional mutant and protein over-expression plant lines ................................................................................................................147 5.3.2 AtSLP2 and AtMIA40 function to negatively regulate seed germination ....148 5.3.3 An AtSLP2-AtMIA40 protein complex negatively regulates GA biosynthesis150 5.3.4 AtSLP2 influences seed fatty acid and amino acid content ..........................159 5.4 Discussion ..............................................................................................................162 5.4.1 Negative regulation of seed germination through GA metabolism ...............162 5.4.2 The AtSLP2-AtMIA40 protein complex functions to negatively regulate GA biosynthesis ....................................................................................................164 5.4.3 Influence of AtSLP on seed FA and AA composition ..................................166 5.5 Conclusion .............................................................................................................168 CHAPTER SIX: RHIZOBIALES-LIKE PHOSPHATASE 2 FROM ARABIDOPSIS THALIANA IS A NOVEL PHOSPHOTYROSINE-SPECIFIC PPP PROTEIN PHOSPHATASE ....................................................................................................169 6.1 Introduction ............................................................................................................169 6.2 Materials and Methods...........................................................................................171 6.2.1 Bioinformatics ...............................................................................................171 6.2.2 Molecular cloning of AtRLPH2 ....................................................................171 6.2.3 Transient expression of fluorescent AtRLPH2 constructs and microscopy ..172 6.2.4 Heterologous protein expression and purification .........................................173 6.2.5 Tandem affinity purification (TAP) isolation of AtSLP1 and AtRLPH2 .....174 6.2.6 Enzymatic analysis ........................................................................................174 6.3 Results ....................................................................................................................176 6.3.1 AtRLPH2 is a dual-localized nuclear and cytosolic protein .........................176 6.3.2 Purification of recombinant HIS6-tagged AtRLPH2 ....................................176 6.3.3 Enzymatic characterization of AtRLPH2 ......................................................180 viii 6.3.4 AtRLPH2 is a phosphotyrosine-specific PPP protein phosphatase ...............186 6.4 Discussion ..............................................................................................................192 6.4.1 Bridging the gap: AtRLPH2 is a phosphotyrosine-specific PPP protein phosphatase ....................................................................................................192 6.4.2 Function of AtRLPH2 protein phosphatase ..................................................195 6.4.3 AtRLPH2 maintains an ancient insensitivity to naturally occurring protein phosphatase inhibitors....................................................................................197 6.4.4 AtRLPH2 catalysis: Cysteine-mediated or metal-dependent catalytic mechanism ........................................................................................................................198 6.5 Conclusion .............................................................................................................199 CHAPTER SEVEN: PERSPECTIVES AND FUTURE DIRECTIONS ........................200 7.1 Summary ................................................................................................................200 7.2 Novel PPP phosphatase of the chloroplast: SLP1 phosphatases ...........................202 7.3 Connecting AtSLP2 to GA biosynthesis ...............................................................203 7.4 The SLP phosphatase of Schizosaccharomyces pombe .........................................204 7.5 Bridging the gap: the AtRLPH2 is a phosphotyrosine-specific PPP protein .........205 LITERATURE CITED: ...................................................................................................206 APPENDIX A.1. SLP PHYLOGENETIC TREE AND ALIGNMENT SEQUENCE INFORMATION.....................................................................................................225 APPENDIX A.2. RLPH PHYLOGENETIC TREE AND ALIGNMENT SEQUENCE INFORMATION.....................................................................................................230 APPENDIX B. PHYLOGENETIC TREE NODE SCORES ..........................................233 APPENDIX C.1. CLONING AND GENOTYPING PRIMERS ....................................234 APPENDIX C.2. PLANT GENOTYPING .....................................................................235 APPENDIX C.3. PRIMERS FOR QPCR OF GA RELATED GENES ..........................236 ix List of Tables Table 1.1: PPP protein phosphatase complement of A. thaliana and humans. ............................... 9 Table 2.1: In silico subcellular prediction data for eukaryotic SLP phosphatases used in phylogenetic tree and alignment construction. ..................................................................... 42 Table 2.2: In silico subcellular prediction data for eukaryotic RLPH phosphatases used in phylogenetic tree and alignment construction. ..................................................................... 46 Table 2.3: Intron quantity for photosynthetic Eukaryote SLP phosphatases. ............................... 51 Table 3.1: MALDI-TOF mass spectrometry identification of lower molecular weight polypeptides in the Superdex 200 pool post-digest. ............................................................. 88 Table 4.1: AtSLP1- and AtSLP2-specific protein interactors. ................................................... 114 Table 4.2: Phosphorylated peptides used in AtSLP2 enzyme assays. ........................................ 129 Table 6.1: Malachite green dephosphorylation assay examining AtRLPH2 phosphatase activity towards phosphorylated peptide substrates. ........................................................... 190 x List of Figures and Illustrations Figure 1.1: Reversible protein phosphorylation of serine, threonine and tyrosine. ........................ 2 Figure 1.2: PPP protein phosphatase inhibitors. ............................................................................. 6 Figure 1.3: Phylogeny and domain architecture of plant PPPs. ...................................................... 8 Figure 1.4: Tissue-specific transcript expression of the A. thaliana PPP protein phosphatases. ........................................................................................................................ 11 Figure 1.5: A. thaliana PP1: from non-specific catalytic subunits to specialized cellular functions. ............................................................................................................................... 13 Figure 1.6: Cellular events regulated by PP2A in plants. ............................................................. 16 Figure 2.1: Alignment of SLP phosphatases from both Prokaryotes and Eukaryotes. ................. 37 Figure 2.2: Alignment of RLPH phosphatases from both Prokaryotes and Eukaryotes. ............. 38 Figure 2.3: Orthogonal phylogenetic tree depicting SLP phosphatase distribution and interrelationships across both Eukaryotes and Prokaryotes. ................................................. 41 Figure 2.4: Orthogonal phylogenetic tree depicting RLPH phosphatase distribution and interrelationships across both Eukaryotes and Prokaryotes. ................................................. 45 Figure 2.5: Compiled bacterial-like phosphatase Motif 2 from SLP and RLPH phosphatases.... 47 Figure 2.6: Compiled bacterial-like phosphatase Motif 1 from SLP and RLPH phosphatases.... 48 Figure 2.7: Gene structure of eukaryotic SLP and RLPH phosphatases. ..................................... 50 Figure 2.8: Proposed model of RLPH phosphatase molecular evolution. .................................... 53 Figure 2.9: Proposed model of SLP phosphatase molecular evolution. ....................................... 54 Figure 3.1: Development of AtSLP1 and 2 stably-transfected A. thaliana cell culture. .............. 69 Figure 3.2: In vivo subcellular localization of AtSLP1 and 2 using stably-transfected A. thaliana cell culture. ............................................................................................................. 70 Figure 3.3: Co-localization of AtSLP1 with fluorescent marker constructs in Vicia faba epidermal leaf cells. .............................................................................................................. 71 Figure 3.4: Co-localization of AtSLP2 with fluorescent marker constructs in Vicia faba epidermal leaf cells. .............................................................................................................. 72 Figure 3.5: AtSLP transcript expression data. ............................................................................... 74 xi Figure 3.6: NCBI Gene Expression Omnibus (GEO) profile of diurnal AtSLP1 and AtSLP2 transcript cycling (www.ncbi.nlm.nih.gov/geoprofiles). ...................................................... 76 Figure 3.7: Colloidal blue stained 12 % SDS-PAGE of the HIS6-AtSLP1 and HIS6-AtSLP2 NiNTA eluates employed in producing anti-AtSLP1 and anti-AtSLP2 polyclonal antibodies. ............................................................................................................................. 77 Figure 3.8: Western blot assessment of affinity-purified (AP) anti-AtSLP1 and anti-AtSLP2 IgG detection limits and specificity. ..................................................................................... 78 Figure 3.9: Spatial and temporal western blot analysis of AtSLP1 and AtSLP2 protein expression. ............................................................................................................................ 79 Figure 3.10: Alignment of AtSLP1 and AtSLP2 full-length protein sequences with representative PP1 sequences from Arabidopsis thaliana (TOPP2; At5g59160), AtPP2A-1 (At1g59830) and Homo sapiens (HsPP1γ; NP_002701). ................................... 81 Figure 3.11: 12 % SDS-PAGE of NiNTA-purified HIS6-AtSLP1, HIS6-AtSLP2 and uninduced bacteria cell line control eluates stained with Colloidal blue. ............................. 83 Figure 3.12: Enzymatic analysis of HIS6-AtSLP1 and HIS6-AtSLP2......................................... 84 Figure 3.13: Size exclusion and anion exchange chromatography steps of AtSLP1 purification. ........................................................................................................................... 86 Figure 3.14: Analysis of the major end-point purification steps employed to completely purify AtSLP1. ...................................................................................................................... 87 Figure 3.15: Colloidal blue stained 12 % SDS-PAGE of purified HsPP1γ and HIS6-TOPP2. ... 89 Figure 3.16: Assessment of purified, untagged AtSLP1 sensitivity to small molecule inhibitors. .............................................................................................................................. 90 Figure 4.1: Schematic depiction of tandem affinity purification (TAP) protein isolation methodology. ...................................................................................................................... 105 Figure 4.2: Western blot analysis of dark-grown, wild-type A. thaliana cell culture................. 111 Figure 4.3: AtSLP-TAP pull-downs isolating AtSLP1- and AtSLP2-specific protein interactors. ........................................................................................................................... 112 Figure 4.4: Transcript and protein expression of AtSLP1 and AtCF1 ATP synthase subunits. . 115 Figure 4.5: Reciprocal TAP pull-downs verify specific interaction between AtSLP2 and AtMIA40. ............................................................................................................................ 118 Figure 4.6: Reciprocal non-denaturing PAGE verifies specific interaction between AtSLP2 and AtMIA40. ..................................................................................................................... 119 xii Figure 4.7: AtSLP2 and AtMIA40 are mitochondrial-targeted proteins. ................................... 121 Figure 4.8: Anti-AtMIA40 IgG production and testing. ............................................................. 122 Figure 4.9: AtMIA40 activates AtSLP2 under reducing conditions........................................... 124 Figure 4.10: Effect of reductant only on AtSLP1 and AtSLP2 activity. .................................... 125 Figure 4.11: Amino acid sequence alignment of photosynthetic Eukaryote SLP2 phosphatases against select SLP1 phosphatases. ................................................................ 126 Figure 4.12: Substrate specificity of AtSLP2-AtMIA40 complex. ............................................ 128 Figure 4.13: Model depicting how AtSLP1 may operate in vivo. .............................................. 131 Figure 4.14: Model depicting how AtSLP2 and AtMIA40 may operate in vivo. ....................... 133 Figure 5.1: AtSLP and AtMIA40 gene models depicting insertional mutant plant lines. ............ 145 Figure 5.2: PCR and immunoblot analysis of AtSLP1 and AtSLP2 knockout and overexpression plant lines. ......................................................................................................... 146 Figure 5.3: Silique size and seed weight..................................................................................... 149 Figure 5.4: Depiction of atslp2-2, wild-type NÖ and 35S::AtSLP2 seeds germinated on 0.5 x MS-Agar plates containing ABA or Uniconazole. ............................................................. 151 Figure 5.5: Depiction of atmia40 and wild-type Col-O seeds germinated on 0.5 x MS-Agar plates containing ABA or Uniconazole. ............................................................................. 152 Figure 5.6: Quantitative analysis of seed testa cracking on 0.5 x MS-Agar plates containing ABA or Uniconazole........................................................................................................... 153 Figure 5.7: Quantitative analysis of seed radicle emergence on 0.5 x MS-Agar plates containing ABA or Uniconazole. ........................................................................................ 154 Figure 5.8: Immunoblot analysis of AtSLP2 and AtMIA40 protein expression in imbibed and germinating seeds................................................................................................................ 155 Figure 5.9: Relative transcript expression of GA-related biosynthetic and signaling proteins from Biological Arabidopsis Resource (BAR). .................................................................. 157 Figure 5.10: Quantitative PCR analysis of GA biosynthetic and signaling genes from 0 h, 6 h and 12 h imbibed atslp2-2, wild-type and 35S::AtSLP2 seeds. .......................................... 158 Figure 5.11: Comparative analysis of atslp2-2 and 35S::AtSLP2 seed FA content and composition. ........................................................................................................................ 160 Figure 5.12: Relative levels of AAs in atslp2-2, WT NÖ and 35S::AtSLP2 dry seeds. ............. 161 xiii Figure 5.13: Model of AtSLP2-AtMIA40 protein complex function during seed germination. 163 Figure 6.1: AtRLPH1 and 2 transcriptional tissue expression profile. ....................................... 177 Figure 6.2: Subcellular localization of AtRLPH2....................................................................... 178 Figure 6.3: Purification of AtRLPH2-HIS6 from E. coli............................................................ 179 Figure 6.4: Calibrated Superdex 200 size exclusion chromatography of AtRLPH2-HIS6. ....... 181 Figure 6.5: AtRLPH2 pH activity profile, metal cation dependency, and sensitivity to metal chelators EDTA and EGTA. ............................................................................................... 182 Figure 6.6: ClustalX alignment of representative RLPH phosphatases from across photosynthetic Eukaryotes. ................................................................................................. 183 Figure 6.7: Cartoon depiction of PPP, PPM and PTP protein phosphatase motifs relative to AtRLPH2. ........................................................................................................................... 184 Figure 6.8: Effect of hydrogen peroxide (H2O2) and N-ethylmaleimide (NEM) on AtRLPH2 phosphatase activity. ........................................................................................................... 185 Figure 6.9: AtRLPH2 sensitivity to inhibition by phosphate-containing small molecules. ....... 187 Figure 6.10: AtRLPH2 sensitivity to classic PPP and PTP protein phosphatase small molecule inhibitors.............................................................................................................. 188 Figure 6.11: Phosphorylated substrate preferences of purified A. thaliana bacterial-like PPP protein phosphatases. .......................................................................................................... 191 Figure 6.12: Phosphorylated substrate preferences of TAP-purified A. thaliana bacterial-like PPP-family protein phosphatases. ....................................................................................... 193 xiv List of Manuscripts and Contributions Chapter 1 Uhrig, R.G., Labandera, A.M., and Moorhead, G.B. Arabidopsis PPP serine / threonine protein phosphatases: many targets but few engines. Trends in Plant Science. Accepted Manuscript PLANTS-D-13-00047R1. I was primary author on the manuscript Chapter 2 Uhrig, R.G*., Kerk, D*., and Moorhead, G.B. Evolution of bacterial-like PPP protein phosphatases in photosynthetic eukaryotes features ancestral mitochondrial or archaeal origin overlaid by lateral gene transfer. Manuscript in progress. I will be co-primary author on the manuscript I was responsible for all in silico subcellular localization, motif identification and gene structure work. HMM work was split between myself and collaborator Dr. David Kerk (University of Calgary). I initially formulated alignments and phylogenetic trees to trace the heritage of each protein subclass, which was further resolved by collaborator Dr. David Kerk. Chapter 3 Uhrig, R.G., and Moorhead, G.B. Two ancient bacterial-like PPP family phosphatases from Arabidopsis thaliana are highly conserved plant proteins that possess unique properties. Plant Physiology. 2011. 157(4): 1778-92. I was primary author on the manuscript Chapters 4 & 5 Uhrig, R.G., Liang, S., Goudreault, M., Yeung, E., Colville, K., Bu, R., Gingras, A.C., Samuel, M.A., and Moorhead, G.B. Activation of Arabidopsis mitochondrial serine/threonine protein phosphatase SLP2 by the oxidoreductase MIA40. Manuscript in progress. I will be primary author on the manuscript I was responsible for all work except the following. Mass spectrometry was performed by collaborators Dr. Anne-Claude Gingras and Marilyn Goudreault (University of Toronto). Quantitative PCR and atmia40 genotyping was performed by Siyu Liang (University of Calgary). Pictures of germinating siliques were taken by Dr. Edward Yeung (University of Calgary). Seed fatty acid and amino acid analysis was performed by the collaborating laboratory of Dr. Alisdair Fernie (Max Planck Institute of Molecular Plant Physiology). xv Chapter 6 Uhrig, R.G., Labandera, A.M., Colville, K., and Moorhead, G.B. Rhizobiales-like phosphatase 2 (RLPH2) from Arabidopsis thaliana is a novel phosphotyrosine-specific PPP-family protein phosphatase. Manuscript in progress. I will be primary author on the manuscript I was responsible for all work except the following. AtRLPH2- and AtSLP1-TAP phosphorylated peptide assays were performed in conjunction with A.M. Labandera. Other notable contributions during Ph.D. thesis Uhrig, R.G., and Moorhead, G.B. Plant Proteomics: current status and future prospects. Journal of Proteomics. doi: 10.1016/j.jprot.2013.01.018. I was primary author on the manuscript Skene-Arnold, T.D., Luu, A.H., Uhrig, R.G., De Wever, V., Nimick, M., Maynes, J., Fong, A., James, M., Trinkle-Mulcahy, L., Moorhead, G.B., and Holmes, C.F.B. Molecular mechanisms underlying the interaction of Protein Phosphatase-1c with ASPP proteins. Biochemical Journal. 2013. 449: 649-59. I was a contributing author on the manuscript Tran, H.T., Uhrig, R.G., Nimick, M., and Moorhead, G.B. Interfacing protein lysine acetylation and protein phosphorylation: Ancient modifications meet ancient proteins. Plant Signaling and Behavior. 2012. 7(8). I was a contributing author on the manuscript Tran, H.T*., Nimick, M*., Uhrig, R.G*., Templeton, G., Morrice, N., Gourlay, R., Delong, A., and Moorhead, G.B. Arabidopsis thaliana Histone Deacetylase 14 (Hda14) is an α-Tubulin Deacetylase that associates with Pp2a and enriches in the microtubule fraction with the putative Histone Acetyltransferase Elp3. The Plant Journal. 2012. 71(2): 263-72. I was co-primary author on the manuscript Uhrig, R.G., and Moorhead, G.B. Okadaic acid and microcystin insensitive PPP-family phosphatases may represent novel biotechnology targets. Plant Signaling and Behavior. 2011. 6(12). I was primary author on the manuscript Uhrig, R.G., Ng, K.K.S., and Moorhead, G.B. PII in higher plants: a modern role for an ancient protein. Trends in Plant Science. 14(9): 505-11. I was primary author on the manuscript xvi List of Symbols, Abbreviations and Nomenclature Symbol ABA ABI5 AGO1 ATP ADP AMP Asp BLASTP TBLASTN BIN2 BR BRI1 BSL1-3 BSU1 Ch CDG1 cTP cv Cyto DSP DTT EDTA EGTA ELP3 ER GA GEO GEM GFP Gly-3P Glu-6P GSH GST H2O2 HAD HDA14 HEAT Definition abscisic acid abscisic acid insensitive 5 argonaute protein 1 adenosine 5'-triphosphate adenosine 5'-diphosphate adenosine 5'-monophosphate Aspartate basic local alignment search tool - search protein database with a protein query basic local alignment search tool - search translated nucleotide database with a protein query Bri1 insensitive kinase 2 brassinosteroid brassinosteroid insensitive 1 Bri1 supressor-like 1-3 Bri1 suppressor 1 Chloroplast constitutive differential growth 1 chloroplast transit peptide column volume Cytosol dual specificity phosphatase Dithiothreitol ethylenedinitrilotetraacetic acid ethylene glycol tetraacetic acid elongator protein 3 endoplasmic reticulum gibberellic acid gene expression omnibus GL2 expression modulator green fluorescent protein glycerol-3-phosphate glucose-6-phosphate reduced glutathione glutathione S-transferase hydrogen peroxide haloacid dehalogenase histone deacetylase 14 Huntingtin, elongation factor 3, protein phosphatase 2A, and the yeast kinase target of rapamycin 1 xvii HEPES HGT HIS6 HMCR HMM Hs Hsp90 HTGS IC50 I-2 MCLR MIA40 mM mRNA MS Mt mTP MWCO NAA NaF NaOV NCBI NEM NIPP1 NiNTA nM nm NSCP Nuc OA PAGE PCR PDC (E1α mitochondrial) pAsp pHis pSer pThr pTyr Perox PHOT2 Pi PP1-7 PPi 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid horizontal gene transfer hexa-histidine 3-hydroxy-3-methylglutaryl coenzyme A reductase hidden Markov model homo sapiens (human) heat shock protein 90 high throughput genomic sequences half-maximal inhibitory concentration inhibitor-2 protein microcystin-LR mitochondrial import and assembly protein 40 Millimolar messenger ribonucleic acid Murashige Skoog Mitochondria mitochondrial targeting peptide molecular weight cutoff 1-naphthaleneacetic acid sodium fluoride sodium orthovanadate National Center for Biotechnology Information N-ethylmaleimide nuclear inhibitor of protein phosphatase-1 nickel nitrilotriacetic acid nanomolar nanometers non-specific co-purifying proteins nucleus okadaic acid polyacrylamide gel electrophoresis polymerase chain reaction pyruvate dehydrogenase complex (mitochondrial E1α) phosphorylated aspartate phosphorylated histidine phosphorylated serine phosphorylated threonine phosphorylated tyrosine peroxisome phototropin 2 inorganic phosphate protein phosphatase type 1 - 7 pyrophosphate xviii PMSF pNPP PP2C PPKL PPM PPP PRSL PTEN PTP PTPC PTS1 RCN1 RISC RFP RLPH RSS1 SBI1 SDS22 SDS SEX4 SHLP1 SLP Sp SPS SuSy TCEP TOPP2 TOR TPR µM WGS phenylmethylsulfonyl fluoride para-nitrophenyl phosphate phosphoprotein phosphatase 2C protein phosphatase kelch-like Mg2+-dependent protein phosphatase phosphoprotein phosphatase PP1 regulatory subunit 2-like protein phosphatase and tensin homolog protein tyrosine phosphatase plant-type phosphoenolpyruvate carboxylase peroxisomal targeting signal 1 roots curl in naphthylphthalamic acid 1 RNA induced silencing complex red fluorescent protein Rhizobiales-like phosphatase rice salt sensitive 1 suppressor of Bri1 suppressor of DIS2 sodium dodecyl sulfate starch excess 4 Shewanella-like phosphatase-1 Shewanella-like phosphatase signal peptide sucrose phosphate synthase sucrose synthase tris(2-carboxyethyl)phosphine Arabidopsis thaliana PP1 isoform 2 target of rapamycin tetratricopeptide micromolar whole-genome shotgun contigs xix Epigraph "Think off-center." - George Carlin xx Chapter One: Introduction 1.1 Protein phosphorylation Reversible protein phosphorylation is an ancient regulatory mechanism which has evolved to become one of the dominant means to control protein function, regulating essentially every cellular process across the domains of life (Figure 1.1A; (Hunter and Pawson, 2012)). Mass spectrometry-based phosphoproteomics studies have revealed that upwards of 70% of all human proteins are phosphorylated, and most on multiple sites within a protein (Olsen et al., 2006), while ongoing genome sequencing efforts have consistently found protein kinases and phosphatases to constitute approximately 2-4% of the protein-encoding genes of most Eukaryotes (Kerk et al., 2008; Hunter and Pawson, 2012). Given the expansive nature of protein kinase and phosphatase families across Eukaryotes, there is no reason to believe protein phosphorylation would be any less common in the Kingdom Plantae. Phosphorylated proteins have been found in abundance throughout mitochondria (Ito et al., 2009), chloroplasts (Reiland et al., 2009), nuclei (Olsen et al., 2006), cytosol (Olsen et al., 2006) and even extracellularly (Tagliabracci et al., 2012), with phosphoproteomics documenting plant proteomes to have a much higher abundance of phosphotyrosine than originally thought despite the absence of classic protein tyrosine kinases and phosphatases. This led to the discovery of phosphoserine (pSer), threonine (pThr) and tyrosine (pTyr) ratios much like other higher Eukaryotes (~84-86%, 10-12% and 2-4%, respectively) (Figure 1.1B; (Sugiyama et al., 2008; Nakagami et al., 2010; Nguyen et al., 2012)). 1 Figure 1.1: Reversible protein phosphorylation of serine, threonine and tyrosine. (A) Reversible protein phosphorylation is catalyzed by the opposing actions of protein kinases and protein phosphatases, which add a phosphoryl group and remove phosphate from a target protein, respectively. Protein kinases convert adenosine 5'-triphosphate (ATP) to adenosine 5'diphosphate (ADP), transferring the phosphoryl group to a protein, while phosphatases use water (H2O) to catalyze the removal of phosphate (Pi). (B) Structures of phosphorylated and dephosphorylated serine, threonine and tyrosine. These amino acids comprise the bulk of the phosphorylation events occurring in Eukaryotes. The estimated abundance of each phosphorylated amino acid species in humans and plants is listed above. Highlighted in pink is the attached phosphate molecule. 2 For historical and sometimes technical reasons, protein kinases are generally more well characterized than protein phosphatases (Brautigan, 2013). In vitro, protein kinases display substrate specificity based on protein primary sequence, while protein phosphatase catalytic subunits are typically non-discriminate in the absence of additional protein binding partners (Heroes et al., 2013). This has led to the notion that protein phosphatases lack specific regulation and simply maintain a ‘housekeeping’ function. However, studies across a range of model Eukaryotes has confirmed that in fact the phosphatases are not passive players in the (de)phosphorylation balance, but rather dynamic and as highly regulated as their partner kinases. Since the mid 1990s, biochemical and genetic studies have continued to uncover key roles for the plant protein phosphatases in a wide range of biological contexts. 1.2 Protein phosphatase families Protein kinases consist of one superfamily (Manning et al., 2002; Lehti-Shiu and Shiu, 2012), while the protein phosphatases are divided into four distinct families (Moorhead et al., 2007; Genoud et al., 2008; Kerk et al., 2008; Shi, 2009; Bollen et al., 2010; De Munter et al., 2013; Heroes et al., 2013) represented by the phosphoprotein phosphatases (PPP), Mg2+/Mn2+dependent phosphoprotein metallophosphatases (PPM/PP2C), phosphotyrosine phosphatases (PTP) and aspartic acid (Asp)-dependent family enzymes. These four families are categorized based on a combination of their catalytic mechanism, metal cation requirements, inhibitor sensitivities and phosphorylated target substrates (Shi 2009). All four protein phosphatase families possess at least one member which is capable of dephosphorylating pSer and pThr residues, while the PTP phosphatases, which largely dephosphorylate pTyr residues, have members which also dephosphorylate non-proteinaceous phosphorylated substrates (Tonks, 3 2006; Kerk et al., 2008; Silver et al., 2013). It is thought that the PPP protein phosphatases catalyze upwards of 90% of the overall protein dephosphorylation reactions in eukaryotic cells (Heroes et al., 2013); however, this number may be slightly lower in plants due to the immense proliferation of Ser/Thr-specific PP2C enzymes (Xue et al., 2008; Fuchs et al., 2013). Both the PPP and PPM protein phosphatases have been characterized to dephosphorylate pSer and pThr residues (Kerk et al., 2008; Shi, 2009; Fuchs et al., 2013), and both have been found to maintain a metal cation-based catalytic mechanism involving either Mn2+/Fe3+ (PPPs) or Mg2+/Mn2+ (PPM/PP2Cs) (Shi, 2009; Fuchs et al., 2013). These similarities, however, are a product of convergent evolution, as PPP and PPM/PP2C phosphatases possess distinct catalytic pockets comprised of GDxHG, GDxVDRG, GNHE, HGG and RxxxD, DGxxG, DG, GxxDN motifs, respectively (Shi, 2009; Fuchs et al., 2013). Furthermore, most PPM/PP2C protein phosphatases possess additional motifs that assist in defining their subcellular localization and substrate specificity in a similar manner to protein kinases (Ubersax and Ferrell, 2007; Xue et al., 2008; Shi, 2009), while PPP protein phosphatases are expressed as bare catalytic subunits that require interaction(s) with targeting subunits to define their intracellular specificity (Moorhead et al., 2009). Unlike PPM protein phosphatases, PPP protein phosphatases are sensitive to a number of small molecule inhibitors, such as microcystin and okadaic acid, which bind a surface loop near the active site (Figure 1.2; (Goldberg et al., 1995; Maynes et al., 2001)). These inhibitors are pharmacological tools for experimentation and were instrumental in the initial categorization of aforementioned PPP protein phosphatase subclasses based on differences in inhibitory sensitivity (MacKintosh et al., 1990; Sheppeck et al., 1997). Microcystin-LR (MCLR) and okadaic acid (OA) are naturally occurring compounds that were purified from the cyanobacterium Microcystis 4 aeruginosa (MacKintosh et al., 1990) and marine sponge Halichondria okadai, respectively (Bialojan and Takai, 1988). Microcystis aeruginosa can have severe ecological ramifications through its exogenous production of microcystins during large cyanobacterial blooms, which result in fish kills and water toxification (Cohen et al., 1990; MacKintosh and MacKintosh, 1994; Dawson, 1998). These two compounds are responsible for inhibiting both PP1 and PP2Alike (PP2A, PP4 and PP6) protein phosphatases, but at different concentrations, aiding in the differentiation between isolated phosphatases prior to the availability of sequenced genomes for phylogenetic comparisons. PTP phosphatases on the other hand, possess a completely different, but highly conserved, metal cation-independent dephosphorylation mechanism (Jia et al., 1995; Kerk et al., 2008; Tonks, 2013). Unlike either the PPP or PPM protein phosphatases, which maintain a tertiary folded catalytic cleft capable of coordinating metal co-factors, PTP phosphatases possess a structurally deeper, more compact HCx5R catalytic motif, which employs a deprotonated cysteine to facilitate dephosphorylation of target pTyr residues (Jia et al., 1995; Pannifer et al., 1998; Tonks, 2013). PTP phosphatases are divided into 2 main subclasses: tyrosine phosphatases and dual specificity phosphatases (DSPs) (Kerk et al., 2008). The DSP phosphatases were initially named based upon the ability of the first mitogen-activated protein kinase (MAPK) phosphatase to dephosphorylate both residues of the pTxpY motif in the MAPK activation loop (Caunt and Keyse, 2013); however, a number of DSP phosphatases possess the capability of dephosphorylating non-proteinaceous species such as phosphatidylinositol phosphates (PTEN), glucans (Laforin, SEX4) and mRNA (RNA-capping enzymes) (Kerk et al., 2008; Shi, 2009; Kotting et al., 2010; Gentry et al., 2013; Silver et al., 2013). 5 Figure 1.2: PPP protein phosphatase inhibitors. Depicted are the molecular structures of (A) Microcystin-LR (MCLR) and (B) Okadaic acid (OA). (C) Atomic structure of PP1-MCLR complex (1FJM; Goldberg et al., 1995). PP1 (green) has four key amino acid stretches (blue) comprising the active site. The C-terminal SAPNYC motif (red) contains the cysteine responsible for the covalent linkage of MCLR (yellow). Purple spheres depict two Mn2+ cations that position in the active site to assist in PP1 enzymatic activity. The upper panel depicts the PP1 atomic structure without microcystin, while the lower panels depict the positioning of MCLR in the active site from two different angles. 6 The fourth family of protein phosphatases are the Asp-based catalytic phosphatases (Kerk et al., 2008). These phosphatases are defined by an unique catalytic motif: the DxDxTV motif (Kerk et al., 2008; Seifried et al., 2013). The upstream Asp (D) residues function to coordinate the metal cation co-factor Mg2+ and are central to Asp-based phosphatase catalytic activity (Seifried et al., 2013). This family is so far defined by a small group of RNA polymerase II associating (FCP) and haloacid dehalogenase (HAD) protein phosphatases (Seifried et al., 2013). 1.3 Plant PPP protein phosphatases The PPP protein phosphatase family of plants is comprised of type one (PP1) through type 7 (PP7) phosphatases, which largely maintain both sequence and structural relatedness (Figure 1.3). This is highlighted by their conserved catalytic mechanism involving the same four canonical catalytic motifs found in other eukaryotic PPP protein phosphatases (Figure 1.3; (Moorhead et al., 2007; Shi, 2009; Heroes et al., 2013)). Plants, however, lack the PP3 (or PP2B) enzymes and, along with several other Eukaryotes, have additional unique PPP protein phosphatases; the Kelch-repeat domain-containing (PPKL) protein phosphatases, the Shewanella-like phosphatases (or SLPs) and the Rhizobiales-like phosphatases (or RLPHs), which are all absent in mammals (Table 1.1; (MacKintosh and MacKintosh, 1994; Andreeva and Kutuzov, 2004; Kerk et al., 2008)). When examining the tissue expression profile of A. thaliana PPP protein phosphatases, it becomes evident that PPP protein phosphatases are likely involved in regulating both cellular processes common across diverse tissue types as well as processes central to highly specialized tissues (Figure 1.4). This is potentially highlighted by the abundant AtPP1 phosphatases, which show wide-ranging transcriptional expression profiles outside of 7 Figure 1.3: Phylogeny and domain architecture of plant PPPs. The highly conserved core catalytic domain of each PPP subfamily is depicted in gray with signature aspects of each motif highlighted. Green and blue amino acids represent those involved in metal ion coordination and phosphate binding, respectively. Also described is the microcystin inhibition docking motif SAPNYC (purple), which includes the reactive cysteine (C) to which microcystin covalently attaches. PP7 maintains this motif, but lacks the reactive C, whereas SLP and RLPH phosphatases completely lack this motif. Within these motifs ‘x’ represents any amino acid. Unique features of each subfamily are also depicted: TPR (tetratricopeptide repeat), NLS (nuclear localization signal) and cTP (chloroplast transit peptide). The Arabidopsis sequences used to compile the phylogenetic tree are: PP1 (TOPP1; At2g29400), PP2A-1 (At1g59830), PP4-1 (At4g26720), PP5 (At2g42810), PP6-1 (At1g50370), PP7 (At5g58500), SLP1 (At1g07010), SLP2 (At1g18480), RLPH1 (At3g09960), RLPH2 (At3g09970) and PPKL (BSU1; At1g03445). No canonical PP2B (calcineurin-A) is encoded in plants. Neighbor-joining (NJ) phylogenetic tree was obtained using ClustalX 2.0.12 and was visualized using FigTree v1.3.1. Tree topology is consistent with other studies examining PPP protein phosphatases. The total number of amino acids for each enzyme is shown on the right and, for presentation purposes only, two (×3) of the six Kelch-repeat motifs of the Kelch-like domain are shown for BSU1. The number of genes encoding each subfamily of the PPP family in Arabidopsis are shown in square brackets. Although the architecture of a specific gene product is depicted (e.g. TOPP1), each additional protein maintains the same motifs and domains. 8 silique and seed tissues, contrary to AtPP4 phosphatases, which are almost entirely expressed in silique and seed tissues (Figure 1.4). Table 1.1: PPP protein phosphatase complement of A. thaliana and humans. PPP protein phosphatase subclasses found in only A. thaliana (light grey) and the total PPP protein phosphatases encoded by each respective organism (dark grey) are highlighted. Table was modified from Kerk et al., 2008., with labeled phosphatases (*) absent from this previous study. 1.3.1 Type 1 (PP1) protein phosphatases PP1 protein phosphatases are ~37 kDa monomeric proteins that are remarkably conserved and ubiquitously expressed across Eukaryotes (Kerk et al., 2008). In A. thaliana there are nine PP1 phosphatase isoforms which maintain 80 - 85 % identity amongst themselves and 76-90% identity to PP1 phosphatases from humans and fungi (Moorhead et al., 2009; Templeton et al., 2011). With PP1 substrate specificity determined through interactions with a variety of regulatory subunits, identification and characterization of these regulatory subunits has been of paramount importance. Of the 200 known PP1 binding partners in humans, the vast majority dock PP1 through a conserved binding site defined as 'RVxF', which maintains the consensus 9 (R/K)(R/K)(V/I)x(F/W) (Moorhead et al., 2007, 2007; Heroes et al., 2013). Numerous human PP1 interactors have orthologs in plants that also possess conserved RVxF motifs and thus likely associate with PP1 (Takemiya et al., 2009; Ogawa et al., 2011; Templeton et al., 2011; Takemiya et al., 2013). The amino acids of PP1 responsible for coordinating the RVxF motif are also conserved across Eukaryotes (Egloff et al., 1997), supporting this motif as an ancient proteindocking site. The prevalence of this motif contributes to PP1 involvement in a diversity of cellular processes ranging from mitosis to metabolism in non-photosynthetic Eukaryotes (Cohen, 2002). Recent biochemical evidence has demonstrated that plant PP1 regulatory proteins interact with PP1 catalytic subunits through the RVxF motif as predicted, similarly implicating plant PP1 in the regulation of a number of cellular processes (Ogawa et al., 2011; Takemiya et al., 2013). These binding partners function to either abolish PP1 activity by blocking access to the active site (e.g. Inhibitor-2 protein), or by recruiting substrates and/or controlling active site access (Peti et al., 2013). From a cellular localization perspective, the nine A. thaliana PP1 (AtPP1) phosphatases, like the three human PP1 isoforms, localize to the nucleus and cytosol (Templeton et al., 2011; Takemiya et al., 2013), and in plants are excluded from the plastid (MacKintosh et al., 1991). Transcriptional expression analysis shows that they are present in a wide range of tissues, but are largely excluded from siliques and seeds (Figure 1.4). As well, all AtPP1 phosphatases were found to be active phosphatases and to be differentially regulated by A. thaliana inhibitor-2 (AtI2) protein (Templeton et al., 2011). Through a combination of MCLRor PP1-Sepharose affinity chromatography, several PP1 interactors have been uncovered in A. thaliana, including NIPP1 (NUCLEAR INHIBITOR OF PROTEIN PHOSPHATASE 1), SDS22 (SUPPRESSOR OF DIS2), GEM (GL2-EXPRESSION MODULATOR) and I2 (Figure 1.5; 10 Figure 1.4: Tissue-specific transcript expression of the A. thaliana PPP protein phosphatases. Using the online database Genevestigator, the transcriptional expression profile of each PPP protein phosphatase from A. thaliana was obtained (www.genevestigator.com) across a range of tissues. These data were derived from compiled microarray expression analysis using the Affymetrix 22k A. thaliana chip. At2g29400 (AtPP1-1; TOPP1) At5g59160 (AtPP1-2; TOPP2), At1g64040 (AtPP1-3; TOPP3); At2g39840 (AtPP1-4; TOPP4), At3g46820 (AtPP1-5; TOPP5), At5g43380 (AtPP1-6; TOPP6), At4g11240 (AtPP1-7; TOPP7), At5g27840 (AtPP1-8; TOPP8), At3g05580 (AtPP1-9; TOPP9), At1g10430 (AtPP2A-1), At3g58500 (AtPP2A-2), At2g42500 (AtPP2A-4), At1g69960 (AtPP2A-5), At4g26720 (AtPP4-1), At5g55260 (AtPP4-2), At2g42810 (AtPP5), At1g50370 (AtPP6-1), At3g19980 (AtPP6-2), At5g63870 (AtPP7), At1g03445 (AtBSU1), At4g03080 (AtBSL1), At2g27210 (AtBSL2), At1g07010 (AtSLP1), At1g18480 (AtSLP2), At3g09960 (AtRLPH1), At3g09970 (AtRLPH2). Missing are At3g58500 (AtPP2A-3), and the AtPPKL At1g08420 (BSL3) which was not found in the Genevestigator database. Expression is indicated by a gradient of white (low expression) to red (high expression) as a percentage of the top 1% genes expressed in each tissue. The numbers describe the number of total arrays used in determining the displayed level of relative gene expression. 11 (Templeton et al., 2011)). Other studies employing genetic approaches have additionally identified RVxF-containing plant PP1 partners I3 (Inhibitor-3 protein) (Takemiya et al., 2009), RSS1 (RICE SALT SENSITIVE 1) (Ogawa et al., 2011) and PRSL (PP1 regulatory subunit 2like protein) (Figure 1.5; (Takemiya et al., 2013)). Of the plant PP1 protein regulators identified to date, I2 is thought to be one of the most ancient (Stubbs et al., 2001; Ceulemans et al., 2002). Mammalian I2 has been studied for over 3 decades and in 2011, the A. thaliana version (AtI2) was characterized, after being captured on a PP1-Sepharose matrix (Templeton et al., 2011). AtI2 was found to contain a conserved RVxF motif and to inhibit all nine A. thaliana PP1 isoforms. Utilizing the inhibitory properties of I2, Takemiya et al., (2006) demonstrated that PP1 regulates stomatal opening downstream of the blue light sensing kinase phototrophin (Takemiya et al., 2006). Yeast two-hybrid screening for PP1 interactors also identified several RVxF-containing proteins including PRSL1, which like GEM, binds PP1 in an RVxF-dependent manner (Takemiya et al., 2013). Although not completely resolved, these results suggest that PRSL1 targets PP1 to regulate blue light sensing. Additionally, through a combined approach of genetic screening for salt tolerance in rice and yeast two-hybrid, Ogawa et al., (2011) identified the RVxF-containing PP1 regulatory protein RSS1, loss of which, resulted in short-root and dwarf phenotypes under high-salt conditions. RSS1 accumulates through the progression of S-phase in the cell cycle where it is required for the maintenance of proliferative cells in meristematic tissues. 12 Figure 1.5: A. thaliana PP1: from non-specific catalytic subunits to specialized cellular functions. PP1 phosphatases are encoded in the genomes of all Eukaryotes as bare catalytic subunits that require the interaction of regulatory protein interactors to drive their specificity within the cell. Interaction between PP1 catalytic subunits and regulatory proteins is facilitated by the RVxF protein-binding motif. Depicted are documented PP1 protein interactors which possess an RVxF motif, and the corresponding cellular process they are implemented in regulating in complex with PP1. With human PP1 phosphatases maintaining upwards of 200 regulatory proteins, this list of documented plant PP1 regulatory proteins will likely expand significantly. 13 1.3.2 Type 2A (PP2A) protein phosphatases The PP2A holoenzyme is trimeric, consisting of a ~35 kDa catalytic (C), ~65 kDa scaffolding (A) and ~54 - 130 kDa regulatory (B) subunit (DeLong, 2006). A. thaliana encodes 5 catalytic, 3 scaffolding and 17 regulatory B subunits, which form a variety of combinations to exert different regulatory outcomes (Jonassen et al., 2011; Trotta et al., 2011). The 17 identified B-subunits group into B (55 kDa), B´ (54-72 kDa) and B” (72-130 kDa) subunit families, and coupled with recent structural analysis, confirm B-subunits control substrate access to the catalytic subunit active site (Shi, 2009). Several studies have identified the A. thaliana PP2A-A1 subunit RCN1 (ROOTS CURL IN NAPHTHYLPHTHALAMIC ACID 1) and several Bsubunits as direct players in stress signaling (B´γ) (Trotta et al., 2011), abscisic acid (ABA) insensitivity, guard cell responses, as well as hypocotyl and root elongation (B’’α) (Blakeslee et al., 2008). Metabolic links have also established B” subunits (α and β) as negative regulators of 3-hydroxy-3-methylglutaryl Coenzyme A reductase (HMGR), a key enzyme that regulates the isoprenoid biosynthesis pathway (Leivar et al., 2011) and B55 subunits (α and β) as necessary for nitrate reductase activation (MacKintosh, 1992; Heidari et al., 2011) as well as TAP46, a key regulator of cell growth and survival, autophagy, and protein synthesis (Figure 1.6A; (Ahn et al., 2011)). In vitro, PP2A has been suggested to regulate other metabolic proteins such as phosphoenolpyruvate carboxylase (PEPC) (Dong et al., 2001) and sucrose phosphate synthase (Siegl et al., 1990). The B subunits responsible for directing PP2A specificity towards these targets remain to be uncovered. Aspects of PP2A function also indicate a regulatory link between different posttranslational modifications, with phosphorylation-acetylation (Tran et al., 2012) and phosphorylation-methylation (Wu et al., 2011) connections being noted to date. It was found by 14 Tran et al., 2012 that PP2A co-purified with the histone deacetylase HDA14 and the histone acetyltransferase ELP3. HDA14 deacetylates α-tubulin and is also the ortholog of human HDAC6, which targets mammalian α-tubulin. The specific function of this interaction remains unknown; however, PP2A was previously implicated in the control of microtubule function (Farkas et al., 2007) and may regulate the binding and trafficking of kinesins (Tran et al., 2012). Current evidence suggests that HDA14 may represent a regulatory 'B' subunit due to its direct interaction with scaffolding A subunits of PP2A. Further experimentation is required to specifically resolve this hypothesis (Figure 1.6B). The phosphorylation-methylation link was made during the examination of brassinosteroid (BR) signaling, where the leucine carboxymethyltransferase SBI1 (SUPPRESSOR OF BRI1) was found to be responsible for methylating PP2A (Wu et al., 2011). Like PP1, PP2A has been implicated in regulating light signaling in plants. In particular, yeast two-hybrid and in vitro pull-down assays uncovered a direct interaction between RCN1 (PP2A-A1) and PHOT2 (Tseng and Briggs, 2010). This work, along with Tran et al. (2012) and others (Yu et al., 2008; Herzog et al., 2012; Tran et al., 2012), have highlighted that scaffolding A subunits do not just bind the B and C subunits of PP2A, but undoubtedly other protein partners as well. The association of RCN1 (PP2A-A1) and PHOT2 was found to down-regulate phototropism and stomatal opening through the dephosphorylation of PHOT2 under blue light conditions (Figure 1.6C; (Tseng and Briggs, 2010)). Furthermore, PP2A-C (isoform C2) has been implicated as a positive regulator in chloroplast movements mediated by PHOT2. Under blue light conditions PP2A was shown to dephosphorylate and activate the actin binding protein 15 Figure 1.6: Cellular events regulated by PP2A in plants. (A) Many key cellular signals converge on target of rapamycin (TOR) in plants as in humans and yeast. PP2A-C interacts with regulatory subunit Tap46 (α4/Tap42), a target of TOR, to modulate downstream cellular events: protein translation, autophagy, nutrient cycling and senescence in plants. Pointed and blunt arrowheads denote activation and inhibition of enzyme activity or cellular processes, respectively. (B) The plant PP2A-A1 (RCN1) scaffolding subunit forms a specific complex with HDA14 to deacetylate α-tubulin. Whether HDA14 represents a novel B targeting subunit or simply a protein interactor has yet to be determined. The role of histone acetyltransferase ELP3 (elongator complex protein 3) in conjunction with PP2A in modulating α-tubulin acetylation is also unknown. The identification of an HDA14–PP2A complex represents an interesting point of cross-talk between protein phosphorylation and acetylation. Broken arrows denote the possible influence of nearby post-translational modifications on the ability of a PP2A– HDA14 complex to bind tubulin. (C) PP2A specifically dephosphorylates light sensor PHOT2 to regulate phototropism and stomatal opening. The 16 regulatory B subunit directing the specificity of these functions remains unknown. (D) PINOID kinase and PP2A regulate polar auxin transport through auxin efflux carrier PIN via phosphorylation and dephosphorylation, respectively. (E) Central to BR signaling is methylated PP2A-mediated dephosphorylation of the brassinosteroid receptor BRI1, which controls either BRI1 cellular internalization at the plasma membrane or recycling from endosomes leading to degradation. PP2A methylation results from BR-induced SBI1 expression. Kelch-like phosphatase BSU1 is also involved in the BR signaling pathway by facilitating nuclear enrichment of BZR transcription factors via BIN2 kinase dephosphorylation. Evidence for the potential direct induction of SBI1 transcription by BZR remains to be determined. Solid and broken arrows represent the activation and subsequent inactivation of BR signaling, respectively. Abbreviations: Ac, acetylation; BAK1, BRI1-associated kinase 1; BKI1, BRI1 kinase inhibitor 1; BSK1, BR signaling kinase; CDG1, constitutive differential growth 1; BSU1, BRI1 suppressor 1; BIN2, BR-insensitive kinase 2; LST8, Lethal with Sec Thirteen 8; P, phosphorylation; RPS6, ribosomal protein S6; S6K, S6 protein kinase. Double black line represents the plasma membrane. Parts (a), (d) and (e) were adapted from (Ahn et al., 2011; Di Rubbo et al., 2011; Li et al., 2011). ADF / cofilin, leading to a rearrangement of the actin cytoskeleton and resulting in light stimulated chloroplast movements (Wen et al., 2012). PP2A is additionally involved in regulating aspects of plant hormone signaling, with recent evidence demonstrating PP2A influence over the auxin transport system and cell polarity (Li et al., 2011; Ballesteros et al., 2012). PP2A-C, more specifically sub-family 2, and the corresponding protein kinase (PINOID) were implicated in regulating the phosphorylation state of the auxin efflux PIN-FORMED (PIN) proteins in roots (Figure 1.6D; (Michniewicz et al., 2007)). Furthermore, it is hypothesized that PP2A-C (isoform 4) is co-expressed with, and dephosphorylates, PIN1 to fine tune normal auxin transport to the root tip (Ballesteros et al., 2012). Interestingly, this regulatory mechanism also seems to be conserved in the leaf epidermis controlling the differentiation of pavement cells (Li et al., 2011). Perhaps the most significant contribution to PPP protein phosphatase research in the past few years is the finding that PP2A is intimately involved in regulating intracellular responses to 17 brassinosteroids (BR). Several key works revealed the action of PP2A at two points in this pathway. First, dephosphorylation and inactivation of BR-coupled receptor BRI1 (BR-insensitive 1) (Figure 1.6E; (Wu et al., 2011)), and second, upon initiation of the intracellular BR signal cascade, activation of BR-responsive gene transcription through the dephosphorylation and subsequent nuclear accumulation of the transcription factors BZR1 and 2 (Figure 1.6E; (Tang et al., 2011)). The dephosphorylation of BZR1 and 2 specifically employs a PP2AB’ complex and is critical for the BR signaling cascade; however, it is not known yet if PP2A needs additional post-translational modifications to execute this function (Tang et al., 2011). In the initial process outlined above, a genetic screen for suppressors of bri1-5 identified a suppressor (sbi1) that accumulated BRI1 protein in the mutant plant. SBI1, a leucine carboxymethyltransferase, specifically methylates the PP2A catalytic subunit C-terminal YFL motif leucine. It is not clear if PP2A (and SBI1) acts at the plasma membrane or during receptor recycling through the endomembrane system to mark BRI1 for degradation, and thus switch off BR signaling (Wu et al., 2011). Although PP2A and SBI1 operate at this level in the pathway, it has yet to be established if BRI1 is a direct substrate of PP2A. 1.3.3 Type 4 & 6 (PP4 & PP6) protein phosphatases The PP4 and PP6 catalytic subunits are also conserved across Eukaryotes, including plants (Dai et al., 2012; Dai et al., 2013). These proteins form a phylogenetically distinct cluster of PPP protein phosphatases along with PP2A, which is suggestive of a common ancestral phosphatase (Figure 1.2; (Moorhead et al., 2009)). PP4 and PP6 phosphatases possess 60-65 % identity to each other in addition to PP2A, while maintaining a much higher ~90-95 % identity amongst respective subclass orthologs found in other Eukaryotes (Kloeker et al., 2003; Cohen et 18 al., 2005). Close evolutionary relatedness between PP4 and PP2A phosphatases also translates into a similar sensitivity to MCLR and OA inhibition (Kloeker et al., 2003; Hastie et al., 2005). As well, both PP4 and PP6 possess protein regulatory (R) proteins named PP4R1-R4 and PP6R1-R3, respectively, which exclusively bind each respective catalytic subunit and contribute to their functional specificity (Cohen et al., 2005; Chen and Gingras, 2007; Stefansson et al., 2008). Similar to the PP2A-A scaffolding subunits, which possess HEAT (Huntingtin, elongation factor 3 (EF3), protein phosphatase 2A (PP2A), and the yeast kinase target of rapamycin 1 (TOR1)) repeats, PP4R- and PP6R-associating subunits are comprised of "HEAT-like" and ankyrin repeats, respectively (Cohen et al., 2005; Chen and Gingras, 2007; Stefansson et al., 2008). Overall, the annotated differences in sequence identity and regulatory binding partners indicate that PP4 and PP6 largely maintain regulatory roles independent of each other and PP2A (Cohen et al., 2005; Chen and Gingras, 2007; Stefansson et al., 2008). Structurally, both PP4 and PP6 catalytic subunits (c) have been suggested to exist in either trimeric or dimeric protein complexes (Zhang et al., 2005; Douglas et al., 2010; Ahn et al., 2011). Although no atomic level protein structures have yet been resolved, this is best exemplified in humans by the identification of a PP4c-R2-HDAC3 (histone deacetylase 3) complex as well as a PP6c-R1/2/3-DNA-PK (DNA-dependent protein kinase) complex, accompanied independently by the ability of both PP4c and PP6c to both bind α4 (TAP42 in yeast; TAP46 in A. thaliana) (Zhang et al., 2005; Douglas et al., 2010). Facilitating protein interactor commonalities between PP4, PP6 and PP2A may be their maintenance of a C-terminal YFL motif that allows for C-terminal leucine methylation in PP2A (Sents et al., 2013). Consistent with this idea was the finding that A. thaliana PP2A, PP4 and PP6 each bind TAP46 with varying affinities independent of their other regulatory subunits (Ahn et al., 2011). As with 19 α4 and TAP42, TAP46 was found to be a substrate of the target of rapamycin (TOR) protein kinase (Ahn et al., 2011), with RNA-induced gene silencing demonstrating TAP46 to be crucial for cell growth and survival, autophagy, and protein synthesis (Figure 1.6A; (Ahn et al., 2011)). To date, no clearly defined roles for plant PP4 have been elucidated; however, with PP4 maintaining multiple regulatory roles in humans and yeast, there will likely be key roles for PP4 in plants. Conversely, PP6 affects PIN phosphorylation and auxin efflux (Dai et al., 2012) as well as abscisic acid (ABA) signaling in A. thaliana (Dai et al., 2013). Loss of function mutants, over-expression plant lines and protein interaction analysis suggested that when ABA levels decrease, PP6 dephosphorylates the transcription factor ABI5 (ABSCISIC ACID-INSENSITIVE 5) leading to its degradation. This allows the initiation of seed germination and post-germination growth (Dai et al., 2013). 1.3.4 Type 5 (PP5) protein phosphatases The PP5 subclass of the PPP protein phosphatases is also conserved across all Eukaryotes (Andreeva and Kutuzov, 1999; Shi, 2009). PP5 protein architecture consists of a conserved Nterminal tetratricopeptide repeat (TPR) domain and a C-terminal phosphatase catalytic domain (Figure 1.3; (Andreeva and Kutuzov, 1999; Shi, 2009)). Subcellular localization analysis indicated that PP5 phosphatase largely maintains a dual cytosolic / nuclear cellular location (Chen et al., 1994; de la Fuente van Bentem et al., 2003). Of the two PP5 phosphatase structural components, the TPR domain is implicated in facilitating protein-protein interactions as well as functioning as an auto inhibitory domain (Figure 1.2; (de la Fuente van Bentem et al., 2003; Iki et al., 2012)). In light of this, PP5 phosphatases are characterized by low basal levels of phosphatase activity, which increase in the presence of arachidonic acid or following in vitro 20 proteolytic truncation of the TPR domain (Meek et al., 1999; de la Fuente van Bentem et al., 2005). In the plants A. thaliana and Lycopersicon esculentum, PP5 is encoded by a single gene that produces two alternatively spliced transcripts, with the larger 62 kDa isoform maintaining the additional exon and an endoplasmic reticulum localization, while the smaller 55 kDa isoform exhibits the well conserved dual cytosolic / nuclear subcellular localization (de la Fuente van Bentem et al., 2003). In plants, PP5 functions in light detection (Ryu et al., 2005), disease resistance (de la Fuente van Bentem et al., 2005), thermotolerance (Park et al., 2011) and chloroplast development (Barajas-Lopez Jde et al., 2013). In particular, PP5 from Lycopersicon esculentum interacts with cytosolic members of the heat shock protein 90 (Hsp90) family and disease resistance R protein I-2 through the TPR domain to induce cell death (de la Fuente van Bentem et al., 2005). This observed interaction between PP5 and Hsp90 represents a phenomenon also observed in other Eukaryotes (Chen et al., 1996). Most recently, additional protein interactions between PP5 and Hsp90 were resolved in BY2 tobacco cell culture, where PP5 co-purified with Hsp90-AGO1 (Argonaute 1) RNA-induced silencing complexes (RISCs) in addition to other TPR-containing proteins cyclophilin 40 and others (Iki et al., 2012). In vitro binding assays found that PP5 competes with cyclophilin 40 in binding the Hsp90-AGO1 complex, resulting in reduced RNA loading onto the RISC complex, while cyclophilin 40 alone enhanced RNA loading onto the RISC complex. This was proposed as a regulatory mechanism for RISC complex RNA loading (Iki et al., 2012). Lastly, AtPP5 regulates both chloroplast development via the tetrapyrrole plastid signaling pathway (Barajas-Lopez Jde et al., 2013) and light signaling through phytochrome A (Ryu et al., 2005). Here, mutation in a potential subunit of Mg-protoporphyrin monomethyl ester 21 cyclase complex chl27/crd was rescued by the additional knockout of atpp5, suggesting AtPP5 functions by relaying the chloroplast generated tetrapyrrole signal to the nucleus to stimulate transcription of chlorophyll biosynthetic genes and other key photosynthesis associated, nuclearencoded genes (Barajas-Lopez Jde et al., 2013). Interestingly, AtPP5 was also found to interact with phytochrome A through a yeast two-hybrid screen (Ryu et al., 2005). Experiments employing both atpp5 mutant and AtPP5 over-expression plants related increased AtPP5 abundance to greater photo-responsiveness and enhanced expression of light-inducible genes (Ryu et al., 2005). 1.3.5 Type 7 (PP7) protein phosphatases PP7 protein phosphatases are unique to plants, where they maintain a domain architecture differing from other PPP protein phosphatases. Unlike the comparable class of mammalian protein phosphatases with EF-hand domains (PPEFs), PP7 protein phosphatases do not maintain any N- or C-terminal extensions capable of binding Ca2+(Andreeva and Kutuzov, 2009), but instead possess a charged insertional region of variable length and a C-terminal nuclear localization signal required for constitutive nuclear localization (Figure 1.3; (Andreeva and Kutuzov, 2001)). Despite the lack of EF-hand domains, Ca2+ likely still influences PP7 in vivo, as the insertional region positioned adjacent to the last key PPP protein phosphatase catalytic motif (HGG) has been shown to bind calmodulin in a Ca2+-dependent manner in vitro (Kutuzov et al., 2001). As well, this region has alternatively been proposed to be an auto inhibitory domain (Kutuzov et al., 1998). Across plants, only a single PP7 isoform is encoded by each plant genome, which, when expressed, is targeted to the nucleus (Andreeva and Kutuzov, 2001). 22 Functionally, PP7 protein phosphatases have been implicated in a number of sensory functions, in particular light sensing (Moller et al., 2003; Genoud et al., 2008; Sun et al., 2012), and exhibit preferential expression in the stomatal cells of leaves (Sun et al., 2012). PP7-silenced A. thaliana indicated AtPP7 involvement in light signaling by revealing that PP7 positively regulates the cryptochrome (Moller et al., 2003). Subsequently, it was found that atpp7 loss-offunction plants displayed red / far-red light sensitivity, indicating that PP7 also influences phytochrome signaling (Genoud et al., 2008). Furthermore, this study revealed that phytochrome regulation by PP7 occurs through a direct interaction with nucleoside diphosphate kinase 2 (Genoud et al., 2008). Most recently, PP7 has been directly tied to stomatal aperture control via cryptochrome signaling and the dephosphorylation of a nuclear, zinc finger-containing protein called HRB1 (Sun et al., 2012). 1.3.6 Novel plant bacterial-like PPP protein phosphatases Novel PPP protein phosphatases include the protein phosphatase kelch-like (PPKL), Shewanella-like (SLP) and Rhizobiales-like (RLPH) phosphatases (Figure 1.3; (Andreeva and Kutuzov, 2004; Kerk et al., 2008)). PPKL phosphatases are named after their tandem N-terminal kelch-repeats which likely facilitate protein-protein interactions (Kutuzov and Andreeva, 2002; Mora-Garcia et al., 2004), while SLP and RLPH phosphatases were named after their deduced phylogenetic relatedness to phosphatases from Shewanella and Rhizobiales bacteria, respectively (Andreeva and Kutuzov, 2004). In A. thaliana there are four PPKL phosphatases, BSU1 (Bri1 suppressor 1), BSL1 (Bri1 suppressor-like 1), BSL2 (Bri1 suppressor-like 2) and BSL3 (Bri1 suppressor-like 3) (Moorhead et al., 2009; Wu et al., 2011), along with 2 SLP phosphatases (SLP1 and SLP2) and 2 RLPH phosphatases (RLPH1 and RLPH2). 23 BSU1 represents the first, and remains the only, well-characterized PPKL protein phosphatase (Mora-Garcia et al., 2004; Kim et al., 2009; Kim et al., 2011). BSU1 is an OAsensitive and I2-insensitive protein phosphatase (Mora-Garcia et al., 2004). Investigation of the other A. thaliana PPKL phosphatases BSL1-3 through RNAi-mediated suppression revealed the function of these proteins to overlap (Mora-Garcia et al., 2004). Identified as a suppressor of the BR receptor BRI1, BSU1 was suggested to directly dephosphorylate downstream BR-induced transcription factors BZR1 and 2, resulting in their nuclear accumulation and activation of BRinduced gene transcription (Mora-Garcia et al., 2004). Subsequently, it was found that BSU1 indirectly fine-tunes the phosphorylation status of both BZRs through the dephosphorylation and inactivation of GSK3-like kinase BIN2 (Kim et al., 2009), with BZR1 and 2 dephosphorylation catalyzed by PP2A (Tang et al., 2011). Most recently it was revealed that BRI1 plasma membrane mediated BR signaling to BSU1 was facilitated by a cytosolic receptor-like kinase CDG1 (Kim et al., 2011). Transgenic plant experiments revealed that CDG1 positively regulates BR signaling whereby BR1 phosphorylates CDG1 on S234, which then in turn phosphorylates BSU1 on S764, resulting in BIN2 inactivation via its dephosphorylation (Figure 1.6E; (Kim et al., 2011)). Specific disruption of these phosphorylation sites through targeted site-directed mutagenesis impaired BR signaling (Kim et al., 2011). 1.3.7 Prokaryotic protein phosphorylation and PPP protein phosphatases Prokaryotic protein phosphorylation occurs on a number of residues including serine, threonine, tyrosine, histidine and aspartate (Pereira et al., 2011). Historically, prokaryotic protein phosphorylation was thought to occur solely as part of two component signaling systems involving pHis and pAsp residues. This required the need for dedicated kinases, but not 24 dedicated phosphatases due to the innate lability of these phosphorylated amino acids (Stock et al., 1990; Hoch, 2000). However, thanks to advancements in genome sequencing technologies, numerous eukaryotic-like protein kinases have been uncovered across a multitude of organisms, leading to a change in this perspective. As of 2008, 383 of the 577 sequenced bacterial genomes had been found to maintain at least 1 identifiable eukaryotic-like protein kinase, with 25 of those genomes having greater than 15 protein kinases (Perez et al., 2008). Myxobacteria had the largest percentage of sequenced genomes maintaining 15 or more protein kinases (~85 %), with Stigmatella aurantiaca and Plesiocystis pacifica encoding 194 and 230 putative protein kinases, respectively (Perez et al., 2008). Correspondingly, genome sequencing of Prokaryotes has revealed homologs from all four eukaryotic protein phosphatase families (Kennelly, 2002; Pereira et al., 2011). The identified prokaryotic protein phosphatases qualifying as PPP protein phosphatases have been characterized to dephosphorylate pHis and pTyr in addition to pSer and pThr in vitro, suggesting they are DSP phosphatases (Pereira et al., 2011). The prevalence of in vitro dual specificity amongst prokaryotic PPP protein phosphatases is highlighted by the distant eukaryotic SLP ancestor CAPTase from Shewanella, which was found to dephosphorylate pTyr residues in vitro despite maintaining a PPP protein phosphatase catalytic fold and coordinating Mn2+ ions in its active site (Tsuruta and Aizono, 1999, 2000). The function of regulatory protein phosphorylation in Prokaryotes seems to parallel that which is observed in Eukaryotes. In particular, Mycobacterium tuberculosis protein GarA was found to contain a forkhead-associated (FHA) domain which interacts with phosphorylated PknB protein kinase (Villarino et al., 2005). FHA domains bind phosphorylated threonine residues (Hammet et al., 2003), indicating that, even within Prokaryotes, phosphorylation-dependent 25 protein-protein interaction networks exist. With many protein kinases and phosphatases exerting influence over bacterial cell processes important to human health (e.g. cell division), understanding the in vivo targets of these regulatory proteins could be very revealing (Pereira et al., 2011). Much remains to be uncovered regarding the regulatory roles of prokaryotic protein kinases and phosphatases; however, given the diversity and distant relatedness of these proteins to those found in Eukaryotes, research in this area will likely uncover many novel findings. 1.4 Research Objectives With PPP protein phosphatases representing a key regulatory protein family across Eukaryotes, the focus of my Ph.D. research centered upon the unusual bacterial-like PPP protein phosphatases of A. thaliana: the SLP and RLPH phosphatase subclasses (Andreeva and Kutuzov, 2004; Kerk et al., 2008; Moorhead et al., 2009). Prior to the work presented in this thesis, little was known about either SLP or RLPH phosphatases beyond the existence of genes encoding them in select plants and other Eukaryotes (Andreeva and Kutuzov, 2004; Kerk et al., 2008). Andreeva and Kutuzov 2004, found the A. thaliana genome to encode two SLP phosphatases, AtSLP1 (At1g07010) and AtSLP2 (At1g18480), with preliminary analysis suggesting AtSLP1 may be chloroplastic and/or endoplasmic reticulum (ER) localized and AtSLP2 exclusively ER localized (Andreeva and Kutuzov, 2004, 2008). A. thaliana (At) was also found to encode two RLPH phosphatases, AtRLPH1 (At3g09960) and AtRLPH2 (At3g09970). Other than their initial annotation, and a suggested phylogenetic relatedness to phosphatases from Rhizobiales bacteria, no additional information was known about RLPH phosphatases (Andreeva and Kutuzov, 2004). 26 The goal of the first research chapter of this thesis (Chapter 2) was to resolve the molecular evolution of eukaryotic bacterial-like phosphatases using a number of bioinformatic and phylogenetic approaches. This also revealed sequence motifs and characteristics that are unique to the SLP and RLPH phosphatases, in addition to elucidating their close relatedness to prokaryotic PPP protein phosphatases. These intriguing findings stimulated additional questions regarding the cell biological and biochemical properties of AtSLP1 and AtSLP2. This was addressed in Chapter 3 where the objective was to experimentally verify proposed inferences about AtSLP1 and 2 subcellular localization and potential inhibitor sensitivities. The objectives of Chapters 4 and 5 were to explore the cellular and biological function of AtSLP phosphatases by identifying their protein interactors through the application of protein-protein interaction techniques, and by identifying plant phenotypes using atslp insertional mutant and constitutive 35S::AtSLP over-expression plant lines. Finally, Chapter 6 shifts from the examination of AtSLP phosphatases, to examining the AtRLPH phosphatases. The objective of chapter 6 was to fully characterize the cell biological and biochemical properties of AtRLPH2. Together, the findings presented throughout this thesis lay a solid foundation for future research endeavours involving eukaryotic bacterial-like PPP protein phosphatases. 27 Chapter Two: Evolution of bacterial-like PPP protein phosphatases in photosynthetic eukaryotes features ancestral mitochondrial origin overlaid by horizontal gene transfer 2.1 Introduction Reversible protein phosphorylation is a post-translational mechanism central to the proper function of living organisms (Cohen, 2000, 2002). Governed by two large groups of enzymes, protein kinases and protein phosphatases, this mechanism has been shown to regulate upwards of 70% of all eukaryotic proteins (Olsen et al., 2010). Protein phosphatases represent one half of this dynamic regulatory system and have been shown to be highly regulated proteins themselves (Roy and Cyert, 2009; Shi, 2009). Recently, the PPP protein phosphatase family has been expanded to include two novel subclasses which show enhanced similarity to PPP-like protein phosphatases of prokaryotic origin (Andreeva and Kutuzov, 2004). These bacterial-like phosphatase subclasses were annotated as Shewanella-like (Shelph; SLP) phosphatases, and Rhizobiales-like (Rhilph; RLPH) phosphatases based on their similarity to prokaryotic sequences from these respective sources (Andreeva and Kutuzov, 2004). Characterization of eukaryotic protein evolution can provide insight into individual protein or protein class conservation across the domains of life for biotechnological applications, in addition to furthering our understanding of how multicellular life evolved. In particular, investigation into the evolution of key signaling proteins, such as protein kinases and phosphatases from plants, can have wide-ranging agricultural biotechnology and/or medical potential. This can include the development of disease- or stress-resistant crops in addition to treatments for parasitic organisms such as Plasmodium (malaria) (Patzewitz et al., 2013) and other Chromoalveolates (Kutuzov and Andreeva, 2008), that are derived from photosynthetic Eukaryotes and maintain a remnant chloroplast (apicoplast) (Le Corguille et al., 2009; 28 Janouskovec et al., 2010; Kalanon and McFadden, 2010; Walker et al., 2011). The existence of proteins that are conserved across diverse eukaryotic phyla, but absent in Metazoa, such as the bacterial-like PPP protein phosphatases described here, presents unique research and, possibly, biotechnological opportunities. Conventional understanding of prokaryotic gene acquisition by Eukaryotes largely involves ancient endosymbiotic gene transfer events stemming from primary endosymbiosis of α-Proteobacteria and Cyanobacteria to form the eukaryotic mitochondria and chloroplast, respectively (Keeling and Palmer, 2008; Dorrell and Smith, 2011; Tirichine and Bowler, 2011). Over time, however, it has become apparent that alternative modes of eukaryotic gene and protein acquisition exist, such as independent horizontal gene transfer (HGT) events (Keeling and Palmer, 2008; Keeling, 2009). Targeted studies of protein evolution have seen a steady rise in documented HGT events across a wide variety of eukaryotic organisms, including photosynthetic Eukaryotes (Derelle et al., 2006; Raymond and Kim, 2012; Schonknecht et al., 2013), Nematodes (Mayer et al., 2011), Arthropods (Acuna et al., 2012), Fungi (Wenzl et al., 2005), Amoebozoa (Clarke et al., 2013) and Oomycetes (Belbahri et al., 2008). Each instance documents the integration of a bacterial gene(s) into a eukaryotic organism seemingly resulting in an adaptive advantage(s) important to organism survival. Utilizing a number of in silico bioinformatic techniques and available sequenced genomes, the molecular evolution of two bacterial-like PPP protein phosphatase subclasses found in Eukaryotes is revealed to involve ancient endosymbiotic gene transfer plus additional overlays of likely HGT events. Subcellular localization prediction algorithms reveal distinctive subsets of bacterial-like PPP protein phosphatases, which may correlate with altered functions, while the large sequence collections compiled here have allowed the elucidation of two highly 29 conserved carboxy-terminal domain motifs, which are specific to each eukaryotic bacterial-like PPP subclass. Together these findings substantially expand our knowledge of the molecular evolution of the bacterial-like PPP protein phosphatases, and point the way toward attractive future research avenues. 2.2 Materials and Methods 2.2.1 Multiple sequence alignments Protein sequences were aligned using MAFFT (ver 7) ((Katoh et al., 2002); http://mafft.cbrc.jp/alignment/server/), with the BLOSUM45 scoring matrix, using the E-INS-i option (very slow, multiple domains with long inserts). Alignments were visualized and handedited in GeneDoc ((Nicholas et al., 1997); http://www.nrbsc.org/gfx/genedoc/). As necessary for subsequent work, edited sequence alignment formats were converted using an online converter utility ((Futami et al., 2008); http://biotechvana.uv.es/servers/afc/main.php). 2.2.2 Candidate sequence search, retrieval and validation Initial eukaryotic sequences of the SLP and RLPH phosphatases were obtained from the literature (Andreeva and Kutuzov, 2004), and through database searching at NCBI with BLASTP and PSI-BLAST ((Altschul et al., 1997); http://blast.ncbi.nlm.nih.gov/). These were then used to generate initial multiple sequence alignments, as described above. Edited multiple sequence alignments were converted into Stockholm format, and used to generate Hidden Markov Models (HMMs) by the HMMER (v.3.0) software suite ((Eddy, 1998); http://hmmer.janelia.org/). Databases of protein sequences from completely sequenced eukaryotic and prokaryotic species were compiled locally. Eukaryotic sequences were obtained from the Joint Genomes Institute 30 (JGI) (http://www.jgi.doe.gov/), Phytozome (http://www.phytozome.net/), Metazome (http://www.metazome.net/), or individual genome project web sites. Prokaryotic sequences were obtained from UniProt (http://www.uniprot.org/). Databases were searched using HMMs, candidate sequences extracted, and then placed into further multiple sequence alignments as described above. Potential candidate sequences were evaluated from the full range of HMM hits for each sequence class, from the strongest (lowest E value) to the weakest (the statistical inclusion threshold [E ~ 0.01]). In some instances further iteration was performed with BLASTP and HHBlits ((Remmert et al., 2012); http://toolkit.tuebingen.mpg.de/hhblits) searching of the UniProt database comprising all bacterial sequences (HAMAP (High-quality Automated and Manual Annotation of microbial Proteomes)) ((Lima et al., 2009); http://hamap.expasy.org/). The rationale was to supplement previously identified candidate sequences from completely sequenced genomes with closely related homologues (~E<1e-50) from non-sequenced genomes. Candidate searching for each sequence class was supplemented by using two validated query sequences per class for NCBI TBLASTN searches against various databases: NCBI Genomes (chromosome); High throughput genomic sequences (HTGS); Whole-genome shotgun contigs (wgs). Only candidate sequences with full-length hits to the query and credible matches to conserved carboxy-terminal conserved sequence motifs (see below) were considered for further evaluation. The rationale was to search for previously unannotated sequences relevant to each target sequence class. Candidate sequence identity was confirmed through phylogenetic tree inference. All gene identifiers for sequences used are located in Appendix A.1. and A.2. 31 2.2.3 Phylogenetic tree inference ProtTest (v2.4) ((Abascal et al., 2005) (http://darwin.uvigo.es/ software/prottest2_server.html) was used with completed multiple sequence alignments to assess the optimal amino acid substitution model to use for subsequent work. In all instances the LG model with four gamma categories was optimal. Multiple sequence alignments were subjected to phylogenetic tree inference at both the CIPRES Science Gateway ((Miller et al., 2010); http://www.phylo.org/index.php/portal/) and locally. Maximum Likelihood analysis (RAxML (v 7.4.2); ((Stamatakis et al., 2008); http://www.exelixis-lab.org/) was run at CIPRES under the LG amino acid substitution model, using a maximum of 1000 rapid bootstraps, or until automatic convergence was reached. Bayesian analysis (Mr Bayes (v3.1.2); ((Ronquist et al., 2012); http://mrbayes.sourceforge.net/) was performed at CIPRES, using four independent chains, under the Mixed amino acid substitution model (the LG model is not available with this implementation), with four discrete gamma categories, running to a maximum of 7.5 million tree generations, or until automatic convergence (average standard deviation of split frequencies < 0.010) was achieved. Bayesian analysis (PhyloBayes_MPI (v1.3b); ((Lartillot et al., 2009); http://megasun.bch.umontreal.ca /People/lartillot/www/downloadmpi.html) was run on the WestGrid system of Compute Canada (https://computecanada.ca/index.php/en/), using two independent chains, under the LG amino acid substitution model, with four discrete gamma categories. Maximum Likelihood analysis (PhyML-aBayes (v3.0.1beta); ((Anisimova et al., 2011); http://www.atgc-montpellier.fr/phyml/versions.php) was run locally, under the LG model, with four discrete gamma categories, with all other parameters at defaults, through 25 random starts, employing an initial Parsimony input tree, and SPR moves. For each analyzed protein sequence class, trees were obtained by all the utilized inference methods which had concordant 32 topologies at all major nodes. Within the body of this report tree figures are presented which represent a typical topology, with branch support given for each method at the most critical nodes. The branch support for all the trees is summarized in Appendix B.1. For Bayesian methods (Mr Bayes and PhyloBayes_MPI), branch support represents the posterior probability (PP [max value = 1.00]). For the PhyML maximum likelihood method, branch support represents a Bayesian-like transformation of the approximate Likelihood Ratio Test value (aBayes [max value = 1.00]) (Anisimova et al., 2011). For the RAxML maximum likelihood method, branch support represents rapid bootstrap support (RBS [max = 100]). 2.2.4 Subcellular localization prediction A battery of methods (10 in total for plant or algal sequences with chloroplast potential, 9 for sequences from non-plant species) was used to infer the probable subcellular localization of the eukaryotic proteins described in this study. These included TargetP ((Emanuelsson et al., 2000); http://www.cbs.dtu.dk/services/TargetP/), http://wolfpsort.org/), PREDOTAR fr/predotar/predotar.html), ((Small Protein http://pprowler.imb.uq.edu.au/index.jsp), WoLFPSort et Prowler PredSL http://hannibal.biol.uoa.gr/PredSL/input.html), al., ((Horton 2004); ((Boden al., 2007); http://urgi.versailles.inra. and ((Petsalaki SLP-Local et ((Matsuda Hawkins, et al., et al., 2005); 2006); 2005); http://sunflower.kuicr.kyoto-u.ac.jp/~smatsuda/slplocal.html), iPSORT ((Bannai et al., 2002); http://ipsort.hgc.jp/), PCLR ((Schein et al., 2001); http://www.andrewschein.com/pclr/), MITOPROT ((Claros and Vincens, 1996); http://ihg.gsf.de/ihg/mitoprot.html) and ChloroP ((Emanuelsson et al., 1999); http://www.cbs.dtu.dk/services/ChloroP/). The top two in silico predicted subcellular localizations for each protein sequence are displayed on each respective 33 phylogenetic tree branch. A single subcellular localization is given for those protein sequences where the prediction methods provided a clear preponderance of that location (80% of methods used). Protein sequences where no subcellular prediction is given are those which are fragments lacking a native amino terminus (N-terminal "M") and which therefore could not be properly assessed. The majority of in silico techniques applied here also have their own internal thresholds for compartment predictions, and automatically convert the sequence score into a compartment prediction. 2.2.5 Analysis of sequence motifs and gene architecture SLP and RLPH sequences were identified by HMM, BLASTP, and TBLASTN analyses as detailed above, aligned using MAFFT, with each alignment visualized and hand-edited in GeneDoc. Highly conserved C-terminal regions were manually identified and an amino acid positional probability consensus was generated using WebLogo 3 ((Crooks et al., 2004); http://weblogo. threeplusone.com/). Gene models for each photosynthetic Eukaryote were downloaded from Phytzome v7.0 (www.phytozome.net). Models of representative SLP and RLPH genes are depicted. 2.3 Results 2.3.1 Eukaryotic bacterial-like SLP and RLPH protein phosphatases are PPP protein phosphatases Since the initial documentation of genes encoding SLP1 and SLP2 phosphatases in A. thaliana (Andreeva and Kutuzov, 2004), numerous genomes from a variety of higher and lower plants, as well as other organisms, have been sequenced. Preliminary BLASTP analyses of the 34 Arabidopsis thaliana NCBI database with either full-length AtSLP1 or AtSLP2 retrieved multiple PPP protein phosphatases, with PP1 and PP2A catalytic subunits offering the highest level of identity. AtSLP1 and AtSLP2 were found to be 31% identical to each other in addition to both being 10-14% and 9-13% identical to PP1 and PP2A, respectively, suggesting that these enzymes are PPP protein phosphatases. AtRLPH protein phosphatases, were also found to be most related to PPP protein phosphatases, maintaining 85% identity to each other in addition to 13% and 12-14% identity to PP6 (AtRLPH1) and AtPP7/AtSLP2 (AtRLPH2). Consistent with previous findings (Andreeva and Kutuzov, 2004), the vast majority of the SLP and RLPH phosphatases identified here were found to possess the entire complement of key catalytic motifs indicative of being PPP protein phosphatases (Figure 2.1 and 2.2; (Andreeva and Kutuzov, 2004)). These motifs are represented by GDxHG, GDxVDRG, GNHE and HGG (Chapter 1; (Shi, 2009)), and in some instances can possess conservative substitutions. In a typical sequence, all four of these motifs can be clearly identified upon individual inspection of the amino acid sequence or as part of larger computer assisted alignment (Figure 2.1 and 2.2). Of these sequences, a small proportion in each subclass lacks one or more of the conserved aminoterminal motifs: about 4 % of SLP phosphatases (7/163) and about 6 % of RLPH phosphatases (3/47). It is possible these represent incomplete or incorrect gene models, but a genuine lack of one or more amino-terminal motifs cannot be completely ruled out. 35 GDxHG MgloSLP UmSLP PgraSLP MlarpSLP RgluSLP BdenSLP TgSLP CparSLP LbSLP CneoSLP SpSLP Q1D163_MYX F8CJ73_MYX Q08Y53_STI H8MYK0_COR Q028W8_SOL B8KKW3_9GA A4ABU5_9GA A0Z563_9GA AlSLPa AtSLP1 CarSLPa BrSLPb ThalSLPb AcSLPa AcSLPb CpaSLPa CcSLPb CsSLP CsSLPb GrSLPb MdpSLPd PperSLPb MdpSLPe VvSLPb MeSLPb PtSLPa PtSLPb PtSLPc PtSLPd RcSLPb MgSLPa MgSLPb GmSLPb GmSLPc PvulSLPb MtSLPb EgSLPa EgSLPb CsatSLPa CsatSLPb LuSLPc BdSLPa PvSLPa PvSLPb PvSLPc SiSLPa SbSLPa ZmSLPb OsSLPa PpaSLPc PpaSLPa SmSLPa CrSLPa VcSLPb CvSLPb CocSLPa MpSLPc MpSLPd OlSLPa OrcSLPa OtSLPc MpSLPb MpSLPf OlSLPc OtSLPb OrcSLPb A9EUH1_SOR D0LQ64_HAL A6FXL1_9DE A6G215_9DE AlSLPb AtSLP2 CarSLPb ThalSLPa BrSLPa CpaSLPc CcSLPa CsSLPa MeSLPa RcSLPa PtSLPe GrSLPa CsatSLPc GmSLPa GmSLPd PvulSLPa MtSLPa MdpSLPa MdpSLPb MdpSLPc PperSLPa LuSLPa LuSLPb CpaSLPb VvSLPa AcSLPc MgSLPc BdSLPb OsSLPb PvSLPd SiSLPb SbSLPb ZmSLPa PpaSLPb SmSLPb MpSLPa MpSLPe OlSLPb OtSLPd OrcSLPc CocSLPb CvSLPa CrSLPb VcSLPa ZmSLPd ZmSLPf ZmSLPg CmSLP EhSLPa GtSLPe GtSLPd FcSLP TpSLP EsSLPa EsSLPb GtSLPb GtSLPc GtSLPa CpSLP AaSLPa TcSLPa LmSLPa TbSLPa TcSLPc CrSLPc VcSLPc MpSLPg MpSLPh OlSLPd OrcSLPd OtSLPa PmSLPa PmSLPb ZmSLPc ZmSLPe EhSLPb LmSLPc TbSLPb TcSLPb LmSLPb F4KVD8_HAL G8T712_NIA F9YU86_CAP B8CJM1_SHE A8FSQ7_SHE A0KU52_SHE A6WR58_SHE A6LG58_PAR A6L339_BAC PbSLPa PySLPb PchaSLPb PvxSLPa PbSLPb PySLPa PchaSLPa PvxSLPb C7JGL0_ACE G2I4P6_GLU A9HIJ4_GLU Q5FPF8 GLU : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : GDxVDRG GNHE HGG Motif 1 MVVGHT Motif 2 IDVG * 20 * 40 * 60 * 80 * 100 * 120 * 140 * 160 * 180 * 200 * 220 * 240 * 260 HRRIVALGDIHGDYEHATSILRAAGILHSWAGGSTIFVSTGDTVDRGDDTIRLYRLPQDLREQSRRVGGNVINVLGNHEMMNA---------------MMD---WRYVTPGDMASFGGPRRQAMSLHGWLGMEWMQNVTMNVQRASFVHGGITPAFAIGVDAMNKDAHSLLTKALWLS----D-GPFWYRGYALD-ACATANRAIDALG-----VSSLIMGHTPHSG-IHARCHGQIFIIDTGMSRAGRLSALEIDSYAYSAVYVPEG SRRTVAVADLHGDLDHALNVLSMASIVSTWIGGHDTLVSTGDIVDRGDDTIALYRLPVSLRQQARLAGGEVKNCLGNHEVMNA---------------IGD---WRYVTKADVESFGGVRRHAMSDQGWIGQEWLHNVTHTIPRVSFVHGGITPQYALGIDFINTVGHSLLLKGLWSE----H-GPLWYRGYATD-ACANAEKATKSLA-----VNQLVMGHTPHDG-FVTRCNNTILLIDTGISRAGQQSALIIDFDLLTALYKARP SSRIVAVGDLHGDLDHAVRVLRMAGVVDQWIGGPSILVQTGDIVDRGKATILLYKWMDALRTEAQAAGGAVVSLLGNHEYMNA---------------LGD---WRYVTKEDIETFGSARRKVMSTQGWIGKTWEASVTARIAGTVFVHGGITPEYALGISEINRIGHSLLHRALYAE----H-GPLWERSYALEDICRQIEIATQRLH-----VRRMVMGHTPQKG-ISSRCGGKILLIDTGISSAGPLTALEINYSLIQELADKKK KQRVVAIGDLHGDLPHAVRVLRLAELIDKWIGKKTVLVQTGDIVDRGRDTIVLYQLMDRLRNEAKAAGGAVVSLLGNHEYMNA---------------LGD---WRYVTEEDIETFGGKRRKLMSSEGWIGESWLKNTTARVSAIAFVHGGITPEYAIGISEINRIGQSFLNRSLYSS----H-GPLWERSYALEEICDQIEKTINLLN-----VRRLVMGHTPQKG-ILGRCQGKILLIDTGISSAGALSALEIQYTLVHGLQEGKL RQRIVAMGDIHGDLPAATKILRRAEVVDQWIGGDTILVQTGDIVDRGPDTIALYRFFQSLRPQAERAGGAVVSLLGNHEMMNC---------------LGD---YRYVTKEDIASFGGERREAFLH-GWIGQEFRASVTARVSAISFVHGGITPEYSSPITDINRIGHSILESLLWSE----R-GPMWNRDWALE-ICERVEKALEVLN-----VRRMVMGHTPQEG-ILSRCDGKILLIDTGISRAGAHSSLSLTYTLVEALYTAGR PSRIIAVGDLHGDLAQALKTLKMARIMNEWIAGSSIFVQTGDVVDRGPDTIKLYKMLYDLKVQAEEHGGQVIQLLGNHEVMNM---------------AED---LRYVTEGDYSSFGGHRRKAFDKDGWPG-NYLRNITTWVNGTVFFHGGAHPQWALGIDGMNLRAHNGLIGRSFGG----S-GPLWFRGYAED-ICKQLDKALADMN-----AVRMVVGHTPQDGSVLRRCNGKLYVIDVGISRVGNSAALEI------------GSRILAVGDLHGDIGNTMLLLYGAGVVDNWIAGDTLLIQTGDVVDRGPDGKRIYDYFASLSAQATEQGGKIIQLLGNHDVMNI---------------CGD---FRYAHPSETIEFGGARRRQFMDGGHYG-NMLRPLSIKANGVIFSHAGIPSDFALGLSKLTQQLREELANDCAGS----Q-GPLWTRVYSMG-ICEELDKTLGILD-----SEKMVIGHTVQSGNIEVYCGGRLLLIDTGVSRYDSPRMLEIRDGS--------KGRVLVIGDIHGDLKSLITSLFLSGVINDWIAKNTLLIQLGDVVDRGSHALQIYKLFNKLKSQAPSLGSKFVGLLGNHEVMNL---------------CGQ---LHYVTDEDFQTYGGRRTFEWSKEGFVG-KYLRKLAIRVNDSLYVHAGLLPKYALGLDKLDKLSNDLLEGDFFVE----D-GPLWTRDISLG-ACKLVDETLQILG-----LSRMVVGHTIQDNRINIKCDNKLILADTGFSEAGKPCMLEILYHN--------TRHIVAVGDLHGDLPNARKVLQFSGITDDWTGDVDFFVQTGDIIDRGDDTIPLFFWMDKLRSQAAAVGGTVLSHLGNHEWMNV---------------IGD---WRYVYPTEIKTFGSVRQQMLTT-GRIGRSWAATTASRLSAISFVHGGLSPTYTLTPSRINQISDSLLAKLQYDG----N-GPLWYRGWATD-VCADVDRVLAKTG-----TRRMIMGHTPDKN-IKARCGGKIIIIDTGISHAGALSALSLYYSLVSAIYQHRR RQRLVAVGDLHGDIDNAKKTLQMARIIDKWVASTDILVQTGDIVDRGAYADDIYRLMQSLRGQAASQGGKVVSILGNHEVMNA---------------IGD---WRYVTKGDIARFGGTRQHALSAEGWLGQEWLASTTALVPTFSFTHGSLRPSYALTPAAINDLGHSLLTKALYAE----G-GPLWWRGLAER-VCEWAKNLKQKIG-----ARRIIGGHTPNEK-IVARCNASVIIIDTGISSAGVLSALEIVYTLVHAIYEHSR PTVVYAIGDIHGDFPNALDVLSAAGVVSEWTAGNATLVQTGDVVDRGPDTRKLFRWFNDLHKQAEKHGGRVVRLLGNHEFMNA---------------KGD---WRYVHPGDKASYPEPRIIDWGHSGEIGNLLLSNVTYKDNTSHFMHAGLSPEWAYREETVNELGKELLSHFMWAI----E-GPMWYRGLAQL-ACEVALNVTKTLN-----VNRLVMGHTPQHG-IVSRCEGRILLIDTGLCSAGERAVLRISQNDVEAVYRGKI VERVVAVGDVHGDVDALKEVLRLAGLIDRWIGGKTHLVQTGDVPDRGDQTRAAFDLLMRLEQEALAAGGRVHALLGNHEAMNM---------------LGD---LRYVNPGEMASFADQHRVAYSLQGRYG-QWLRAAVVRINDTLFVHGGVAPGVPGNLAELNRWVRQDFFPDHRDA----Q-GPLWFRGYALG-AEPALDAVLKRYG-----ARRMVMGHTTNDGKVKVRFNGKALLIDTGLSTGRNLAALELRGGKVNALYREGP VERVVAVGDVHGDVDALKEVLRLAGIIDRWIGGKTHLVQTGDIPDRGDQTRAAFDLLMRLEQEALAAGGRVHALLGNHEAMNM---------------LGD---LRYVSPGEMASFADQHRVAYSLQGRYG-QWLRAAVVRINDTLFVHGGVAPGVPTDLSALNRWVRQDFFPGQRNP----Q-GPLWFRGYALG-AGPALDAVLQRYG-----ARRMVMGHTTNDGKVKVRFNGKALLIDTGLSTGRNLAALELRGGKVNALYREGP VERVVAVGDVHGDVEALKEVLRLAGLIDQWTGGKTHLVQTGDIADRGARTREAFELMMRLEREALAAGGRVHLLLGNHEVMNM---------------RGD---LRYVTPEELASFAGLHRAAYGLEGRYG-RWLRPAVVRIDGTLFLHGGLHPEVPKTLGALNRWTRQDLFPDATDA----K-GPLWFRGYAQE-WSQGLDAVLERFG-----ARRMVMGHTPTDGRIGVRFGGRAVLIDTGLSTYRHLAALEIRGDRLTALYPDGR VERIVAVADVHGDVDALKEVLRLAGLIDHWIGGKAHLVQTGDLPDRGDHTRDAFELLMRLETEARKAGGRVHPLLGNHELMNM---------------RGD---LRYVTPGEFASFADQHAAAYAADGRYG-KWLRPAVIRINDTLFLHGGLAPTVPTTLEEVNRWVWQDLTPGQVDP----Q-GPVWFRGYAID-WDAGLTQVLERFS-----ARRMVMGHTPSDGRLSIRFGGRVIVIDTGLSTHRHLAALELRGDRLTALYPEGR VPRVVAVGDVHGDYNGFVEVLRSAGVIDHWAGGKTHLVQTGDVPDRGPDTRKAMDLLMQLEKEADKAGGHVHALVGNHEAMNV---------------YAD---LRYTTPAEFAAFVDNQRIAFSSEGKYG-KWIRNALVKINDTIYLHGGISPRYAMTVKQINEAVAAELNDLPMGT----D-SPLWYRGIALE-IAAHVDLVLKTND-----AQRIVISHTVTGA-IIERFGGKVVMIDTGMTAVGHRACLLLEGDKVYAIHRGEK IDRVVAIADLHGDYQSYITVLEQAGLVNRWTGGTTHLVQLGDVPDRGPDTARIIEHLMKLETQAKKAGGKVHALIGNHEVMNM---------------TGD---LRYVHPGEYEALKSRHRQHWLPNGEFG-AWVANTVVRINRSLFVHAGISPDYLRSIGDINDAVRAELAKPNVDE----R-GPLWYRGLALGDVAVEVDALLKYFD-----VDRIVVGHTPGGT-VVPRYDARVIAADSGLAQHGHLASLLIEGGKVYTLQRGQQ Q Q Q Q Q Q Q QQ VERVVAVADLHGDYDNYITVLRQAGVIDRWDAGKTHLVQLGDVPDRGPDSDKIIRHLMKLEEQAEKAGGKVHPLIGNHEVMNI---------------TGD---LRYVHPGEYEALTSRHRQHWHPEGQFG-AWVANTVIRINRSLFVHGGIGPDYLASMEDINDTVREELRAPDEDE----E-GPLWYRGLIMGEIAAHVDALMERFD-----VDRIVMGHTPGGT-VVPRYHGRVLAADSGIAEYGNLASVLIENGQAFTLQSGER SETVIAIGDVHGDHEQFVKLLQATKLIDRWTGGETHLVQLGDLPDRGPDTRKTMDLLLELQTSSIEQGGAVTTLIGNHDMMNV---------------MND---LRYVDPGEYKAFRSRHRLAWAPTGTYG-EWVLPTVAVVGDSLFVHGGISPDYIMSIEDINTAVHSAMGEGYYDP----L-GPLWFRGWSQDLNTMALNSILNKYG-----VARMVVAHTPVPV-VVPRYDGKIIMVDVGLSQHHGFSALKIENGQTMAMVGDRW ARRIVAVGDLHGDLGKARDALQMAGVLSQWIGQDSVLVQVGDILDRGDDEIAILSLLRSLDGQAKANGGAVFQVNGNHETMNV---------------EGD---FRYVDARAFDECTDFRSILFRPGGRLACE-LAGVILRVNNWIFCHGGLLPHHVYGVERINREVSTWM-RSPRGY----D-SVVWSRLYSRETVNKILHDTLEAVG-----AKAMVVGHTPQSG-VNCEYGCSIWRVDVGMSSGSRPEVLEIRGDKARVIRSNRD ARRIVAVGDLHGDLGKARDALQLAGVLSQWVGQDTVLVQVGDILDRGDDEIAILSLLRSLDDQAKANGGAVFQVNGNHETMNV---------------EGD---FRYVDARAFDECTDFRSVLLRPGGRLACE-LSGVILRVNNWLFCHGGLLPHHVYGIERINREVSTWM-RSPRGY----D-SVVWSRLYSRETVNKILHDTLEAVG-----AKAMVVGHTPQSG-VNCEYGCGIWRVDVGMSSGSRPEVLEIRGDKARVIRSNRD ARRIIAVGDLHGDLSKARDALQLGGVLSQWIGQDT----VGDILDRGDDEIAILSLLRSLDVQAKANGGAVFQVNGNHETMNV---------------EGD---FRYVDARAFDECTDFRSVLFRPGGRLACE-LAGVVLRINNWVFCHGGLLPHHVYGVERINREVSTWM-RSARGY----D-SVVWSRLYSRETVSKILRDTLDAVG-----AKAMVVGHTPQSG-VNCEYGCGIWRVDVGMSSGSRPEVLEIIGDKARVIRSNRD SRRIIAVGDLHGDLSKARDALQIAGVLSQWVGEDTVVVQVGDILDRGEDEIAILSLLRLLDEQAKANGGAVFQVNGNHETMNV---------------EGD---FRYVDTRAFDECIDFRSVLFRPGGRLACE-LSGVVLRVNNWVFCHGGLLPHHVYGVERINREVSTWM-KSSRGQVESSK-TVKSSSAWIECVVGKILRDSLEAVG-----AKAMVVGHTPQSG-VNCEYGCGIWRVDVGMSSGSRPEVLEIRGDKARVIRSNQD ----LAV-------------------------SSQVLVQVGDILDRGDDEIAILSLLRLLDEQAKANGGAVFQVNGNHETMNV---------------EGD---FRYVDARAFDECTDFRSVLFRPGGRLACE-LAGVILRVNDWVFCHGGLLPHHVYGVERINREVATWM-RSYRGY----D-SVVWSRLYSRETVSKILRDTLEAVG-----AKAMVVGHTPQSG-VNCEYGCSIWRVDVGMSSGSRPEVLEIRGDKARVIRSNQD GRRIVAVGDLHGDLAQTRCALELAGVLSLWTGEETVLIQLGDILDRGEDEIAILSLLRSLSNQAKSKGGAVFQLNGNHETMNV---------------EGD---FRYVDPGAFNECGDFRSALLRPGGPLACE-LAAVVLKVDDWVFCHGGLLPRHVYGIERMNREVSHWM-KGLRGY----D-SVVWNRLYSRDLIQAVLEDTLDAMG-----AKAMVVGHTPQAG-INSKYSSSIWRIDVGMSSGSRPEVLEIIDNEARVIRSNRQ GRRIVAVGDLHGDLAQTRCALELAGVLSLWTGEETVLIQLGDILDRGEDEIAILSLLRSLSNQAKSKGGAVFQLNGNHETMNV---------------EGD---FRYVDPGAFNECGDFRSALLRPGGPLACE-LAAVVLKVDDWVFCHGGLLPRHVYGIERMNREVSHWM-KGLRGY----D-SVVWNRLYSRDLIQAVLEDTLDAMG-----AKAMVVGHTPQAG-INSKYSSSIWRIDVGMSSGSRPEVLEIIDNEARVIRSNRQ GRRIIAVGDLHGDLDQARFALQMAGVLSLWTGGETVLVQLGDILDRGEDEIAILSLLRLLDVQAKDVGGAVFQVNGNHETMNV---------------EGD---FRYVEAGAFDECGAFRTALFRPGGPLACE-LASVVLRVNDWVFCHGGLLPNHVYGIERMNKEVSSWM-RGFRGY----D-SVVWNRLYSRDNINAILQDTLQAVG-----AKAMVVGHTPQEG-VNCKYNCSIWRVDVGMSSGSRPEVLEISDDKARVIRSKRE GRRIVAVGDLHGDLDQARCALAMAGVLSLWTGGESVLIQLGDVLDRGEDELAILSLLRSLDMQAKAEGGAVFQVNGNHETMNV---------------EGD---FRYVDSGGFDECSDFRSVLFRPGGPMACE-LAGVVLKVNDWVFCHGGLLPHHVYGLERMNNEVSLWM-KGLRGY----D-SVVWNRLYSRDIINAVLQDTLRAVG-----AKAMVVGHTPQAG-ANCEYNCSIWRIDVGMSSGSRPEVLEITDNKARVISSKRD GRRIVAVGDLHGDLDQARCALEMAGVLSLWTGGESVLIQLGDVLDRGEDELAILSLLRSLDMQAKAEGGAVFQVNGNHETMNV---------------EGD---FRYVDSGGFDECSDFRSVLFRPGGPMACE-LAGVVLKVNDWVFCHGGLLPHHVYGLERMNNEVSLWM-KGLRGY----D-SVVWNRLYSRDIINAVLQDTLRAVG-----AKAMVVGHTPQAG-ANCEYNCSIWRIDVGMSSGSRPEVLEITDNKARVISGKRD GRRIVAVGDLHGDLDQARCALEMAGVLSLWTGGESVLIQLGDVLDRGEDELAILSLLRSLDMQAKAEGGAVFQVNGNHETMNV---------------EGD---FRYVDSGGFDECSDFRSVLFRPGGPMACE-LAGVVLKVNDWVFCHGGLLPHHVYGLERMNNEVSLWM-KGLRGY----D-SVVWNRLYSRDIINAVLQDTLRAVG-----AKAMVVGHTPQAG-ANCEYNCSIWRIDVGMSSGSRPEVLEITDNKARVISGKRD GRRIVAVGDLHGDLDQARYALEMAGVLSLWTGEDAVLVQLGDVLDRGDDEIAILSLLRSLDIQAKAKGGAVFQVNGNHETMNV---------------EGD---FRYVESGAFDECADFRSVLFRPGGPLACE-LAAVALKVNDWIFCHGGLLPHHVYGIEKMNKEVSQWM-RGLKGY----D-SVVWNRLYSRNIINAILQETLQALG-----AKAMVVGHTPQSG-ANCEYNCSIWRIDVGMSSGSRPEVLEIRDDKARVIRSKRN ----MSVGDLHGDLGQTRCALETAGVLSVWIGGETVLIQLGDILDRGEDEIAILSLLRSLDMQAKAEGGAVFQVNGNHETINV---------------EGD---FRYVDTGGFDECLDFRSILLRPGGPLARE-LAAVVLKVNDWVFCHGGLVPQHVYGVERMNREVSDWM-RGLRGY----D-SVVWNRLYSRDTINYILQQTLQAVG-----AKGMVVGHTPQTG-ANCEYDCSIWRIDVGMSSGSRPEGLSSLNM---------GRRIVAVGDLHGDLDQTRCALEMAGVLSLWTGGETVLIQLGDILDRGEDEIAILSLLRSLDIQAKDEGGAVFQVNGNHETMNV---------------EGD---FRYVETGGFDECIDFRSILLRPGGPLACE-LAAVVLKVNDWVFCHGGLIPNHVYGVERMNREVSHWM-RGLRGY----D-SVVWNRLYSRDTIHSILEETLHAVG-----AKAMVVGHTPQTG-ANCKYNCSIWRIDVGMSSGSRPEVLEIKDNKARVIRSKKD YRSVLCVGDLHGDLGQTRCALETAGVLSVWIGGETVLIQLGDILDRGEDEIAILSLLRSLDMQAKAEGGAVFQVNGNHETINV---------------EGD---FRYVDTGGFDECLDFRSILLRPGGPLARE-LAAVVLKVNDWVFCHGGLVPQHVYGVERMNREVSDWM-RGLRGY----D-SVVWNRLYSRDTANKLHSSTDATSG-----NGGGAYSSTYGKL-VRCEYDCSIWRIDVGMSSGSRPEVLEIKDNKARVIRSKKD GRRIVAVGDLHGDLAQARCALEMAGVLSLWTGGDTVLVQLGDILDRGEDEIAILSLLRSLDIQAKASGGAVFQVNGNHETMNV---------------EGD---FRYVDSGAFDECTDFRSILLRPGGLLACE-LAAVVLKIDDWVFCHGGLLPHHVYGIERMNREVSHWM-RGLRGY----D-SVVWNRLYSRDVIYSLLEETLQAVG-----AKAMVVGHTPQVG-VNCKYDCSIWRIDVGMSSGSRPEVLEIRDNTARVIRSKRD GRRIVAVGDLHGDLDQARCALEMAGVLSLWIGGETVLIQLGDILDRGEDEIAILSLLRSLDIQAKAQGGAVFQINGNHETMNV---------------EGD---FRYVDSGAFDECTDFRSILLTPGGPLACE-LASVILKVNDWIFCHGGLLPQHVYGIERMNSKVSQWM-KGLKGY----D-SVVWNRLYSRDVILSILEETLQAIG-----AKAMVVGHTPQAG-VNCKYNCSVWCVDVGMSSGSRPEVI--------------GRRIVAVGDVHGDLDQARCALEIAGVLSLWTGGETVLIQLGDVLDRGEEEIAILSLLRSLDIQAKAQGGAVFQVNGNHETMNV---------------EGD---FRYVDSGAFDECSDFRSILLRPGGPLACE-LAAVVLKINDWVFCHGGLLPQHVYGVERMNYEVSHWM-RGLKGF----D-SVVWNRLYSRDMIQSVLEETLQLLG-----AKAMVVGHTPQTG-VNCKYNCSIWCIDVGMSSGSRPEVLEIVENKARVIRSKRD GRRIVAVGDVHGDLDQARCALEIAGVLSLWTGGETVLIQLGDVLDRGEEEIAILSLLRSLDIQAKAQGGAVFQVNGNHETMNV---------------EGD---FRYVDSGAFDECSDFRSILLRPGGPLACE-LAAVVLKINDWVFCHGGLLPQHVYGVERMNYEVSHWM-RGLKGF----D-SVVWNRLYSRDMIQSVLEETLQLLG-----AKAMVVGHTPQTG-VNCKYNCSIWCIDVGMSSGSRPEVLEIVENKARVIRSKRD GRRIVAVGDVHGDLDQARCALEIAGVLSLWTGGETVLIQLGDVLDRGEEEIAILSLLRSLDIQAKAQGGAVFQVNGNHETMNV---------------EGD---FRYVDSGAFDECSDFRSILLRPGGPLACE-LAAVVLKINDWVFCHGGLLPQHVYGVERMNYEVSHWM-RGLKGF----D-SVVWNRLYSRDMIQSVLEETLQLLG-----AKAMV------------------------------------------------GRRIVAVGDVHGDLDQARCALEIAGVLSLWTGGETVLIQLGDVLDRGEEEIAILSLLRSLDIQAKAQGGAVFQVNGNHETMNV---------------EGD---FRYVDSGAFDECSDFRSILLRPGGPLACE-LAAVVLKINDWVFCHGGLLPQHVYGVERMNYEVSHWM-RGLKGF----D-SVVWNRLYSRDMIQSVLEETLQLLG-----AKAMVVGHTPQTG-VNCKYNCSIWCIDVGMSSGSRPEVI--------------GRRIVAVGDLHGDLDQARCALEMAGVLSLWIGGETVLVQLGDILDRGEDEIAILSLLRSLDIQAKAEGGAVFQVNGNHETMNV---------------EGD---FRYVDSGAFDECTDFRSVLLRPGGPLACE-LASVILRINDWIFCHGGLLPHHVYGIERINREVSQWM-KGLRGY----D-SVVWNRLYSRDMILSILEETLQSAG-----AKAMVVGHTPQAG-VNCKYNCSVWRIDVGMSSGSRPEVLEIQNDKAKVIRSRKD GRRIIAVGDLHGDLDKARYALQMAGILSLWIGGQTVLVQLGDILDRGEDEIAILSLLKSLNIQAKLNGGAVFQVNGNHETMNV---------------EGD---FRYVDSGGFDECADFRSVLLRPGGPLASE-LAAVVLKVNDWIFCHGGLLPHHVYGIERINREVSYWM-EGLRGY----D-SVVWSRLYSRDTIESILRETLQAVG-----AKAMVVGHTPQTG-VNCDFNCSIWRIDVGMSSGSRPEVLEIREGRARAIRSKRD GRRIIAVGDLHGDLDKARYALQMAGILSLWIGGQTVLVQLGDILDRGEDEIAILSLLKSLNIQAKLNGGAVFQVNGNHETMNV---------------EGD---FRYVDSGGFDECADFRSVLLRPGGPLASE-LAAVVLKVNDWIFCHGGLLPHHVYGIERINREVSYWM-EGLRGY----D-SVVWSRLYSRDTIESILRETLQAVG-----AKAMVVGHTPQTG-VNCDFNCSIWRIDVGMSSGSRPEVLEIREGRARAIRSKRD GRRILAVGDLHGDLKQARSALEMAGVLSLWTGGETVLVQLGDILDRGEDEIAILSLLRSLDKQAKAKGGAVFQVNGNHETMNV---------------EGD---FRYVDSGGFDECNDFRSILFRPGGLLARE-LAAVVLKVNDWVFCHGGLLPHHVYGLERMNKEVSEWM-RGQRGY----D-SVVWNRLYSRDTVCSILDETLQAVG-----AKAMVVGHTPQIG-VNCKYNCSIWRVDVGMSSGSRPEVLEIIDDKARVIRCKRD GRRILAVGDLHGDLKQARSALEMAGVLSLWTGGETVLVQLGDILDRGEDEIAILSMLRSLDRQAKEKGGAVFQVNGNHETMNV---------------EGD---YRYVESGGFDECNDFRSILFRPGGLLARE-LAAVVLKVNDWVFCHGGLLPHHVYGLERMNKEVSEWM-RGQRGY----D-SVVWNRLYSRDTVCSILDETMQAVG-----AKAMVVGHTPQIG-VNCKYNCSIWRVDVGMSSGSKPEVLEIIDDKARVIRCKRD GRRILAVGDLHGDLKQARSALEMAGVLSLWTGGETVLVQLGDILDRGEDEIAILSLLRSLDKQAKAKGGAVFQVNGNHETMNV---------------EGD---FRYVDSGGFDESNDFRSVLFRPGGLLARE-LAAVVLKVNDWVFCHGGLLPHHVYGLERMNKEVSEWM-KGQRGY----D-SVVWNRLYSRDAVCSILDETLQAVG-----AKSMVVGHTPQTG-VNCKYNCNIWRIDVGMSSGSRPEVLEIIDNQAKVIRCKRD GRTILAVGDLHGDLKQARYALEMAGVLSLWTGGENVLIQLGDILDRGEDEIAILSLLRSLDKQAKSKGGAVFQVNGNHETMNV---------------EGD---FRYVDSGGFDECSDFRSILFRPGGPLACE-LAGVALKVNDWVFCHGGLLPHHVYGLERMNKEVSEWM-RDPRGY----D-SVVWNRLYSRDSVCSVLEETLQAVD-----AKAMVVGHTPQIG-VNCKYNCSIWRVDVGMSSGSRPEVLEIIDDKARVIRSKTD GRRIVAVGDLHGDLEQARRAFQMAGVLSTWTGKETVLVQLGDILDRGENEIAILSLLRSMDKQAQEDGGAIFQINGNHETMNV---------------EGD---FRYVESGAFEECAGFRSVLLRPGGPLACE-LAAVVLKVNDWIFCHGGLLPHHVYGIERMNFEVSNWM-RGLRGY----D-SVVWSRLYSRDVIQSILEETLQAVN-----AKAMVVGHTPQAG-VNCKYNCSIWRIDVGMSSGSRPEVLEIVDGKARAIKGRRD GRRIVAVGDLHGDLEQARRAFQMAGVLSTWTGKETVLVQLGDILDRGENEIAILSLLRSMDKQAQEDGGAIFQINGNHETMNV---------------EGD---FRYVESGAFEECAGFRSVLLRPGGPLACE-LAAVVLKVNDWIFCHGGLLPHHVYGIERMNFEVSNWM-RGLRGY----D-SVVWSRLYSRDVIQSILEETLQAVN-----AKAMVVGHTPQAG-VNCKYNCSIWRIDVGMSSGSRPEVLEIVDGKARAIKGRRD ARRIVAVGDLHGDLKQTRLALEMAGVLGLWTGGQTVLVQLGDILDRGEDEIAILSLLRSLDVQARAQGGAVFQVNGNHETMNV---------------EGD---FRYVDSGAFDECLNFRSILFRPGGRLARE-LAAVVLKVNDWVFCHGGLLPHHVYGIERMNREVSQWM-KGLRGY----D-SVVWNRLYSRDFINSILKDTLEAVG-----AKAMVVGHTPQAG-VNCKYNCSIWRVDVGMSSGSRPEVLEIRDDKVRVIRSKRD ARRIVAVGDLHGDLKQTRLALEMAGVLGLWTGGQTVLVQLGDILDRGEDEIAILSLLRSLDVQARAQGGAVFQVNGNHETMNV EGD FRYVDSGAFDECLNFRSILFRPGGRLARE LAAVVLKVNDWVFCHGGLLPHHVYGIERMNREVSQWM KGLRGY D SVVWNRLYSRDFINSILKDTLEAVG AKAMVVGHTPQAG VNCKYNCSIWRVDVGMSSGSRPEVLEIRDDKVRVIRSKRD ARRIVAVGDLHGDLKQTRLALEMAGVLGLWTGGQTVLVQLGDILDRGEDEIAILSLLRSLDVQARAQGGAVFQVNGNHETMNV---------------EGD---FRYVDSGAFDECLNFRSILFRPGGRLARE-LAAVVLKVNDWVFCHGGLLPHHVYGIERMNREVSQWM-KGLRGY----D-SVVWNRLYSRDFINSILKDTLEAVG-----AKAMVVGHTPQAG-VNCKYNCSIWRVDVGMSSGSRPEVLEIRDDKVRVIRSKRD GRRIVAVGDLHGDLDQTRSALRIAGVLSRWTGGQTVLVQLGDVLDRGGGEIAILSLLRSLDVQARLQGGAVFQINGNHETMNV---------------EGD---FRYVDSTGFDECLDFRSTLLRPGGALACE-LAGVLLKINDWVFCHGGILPHHVHGVERMNREVSQWM-KGLRGY----D-SVVWNRLYSQDLIEAALYDTLGSLD-----AKGMVVGHTPQRG-ANCECNGRIWRVDVGMSKGSRPEVLEINESKTRVLRLGRD GRRIVAVGDLHGDLYQTRAALVMAGVLSLWTGGQTVLVQVGDILDRGEDEIAILSLLSSLNMQAKSQGGAVFQVNGNHETMNV---------------EGD---FRYVDPGSFDECMRYRASLLKQGGPLACE-LAPVVLVVNDWIFCHGGLLPHHVYGLERINKDVSNWM-QGSRGY----D-SVVWTRFYSQDSSSLVAEQTLKSVG-----AKGMVVGHTPQRG-VNCKCDGKVWCVDVGMSYGSRPEVLEIINDRARVIKDRRG GRRIVAIGDAHGDLSQTRAALVLAGVLSVWTGGRTVLVQVGDILDRGEDEIAILSLLSSLSVQAKSQGGAVFQVNGNHETMNV---------------EGD---FRYVYPGGFDECIRFRSSLFKRGGPLACE-LAPVVLKINDWIFCHGGLLPHHVYGIERMNIEVSTWM-KSSRGY----D-SVVWSRMYSQDSSSIIAEETLKSVG-----AKGMVVGHTPQRG-VNCKCDGKVWCIDVGMSYGSRPEVLEIVNDRPRVLKKRRD GRRIVAIGDAHGDLSQTRAALVLAGVLSVWTGGRTVLVQVGDILDRGEDEIAILSLLSSLSVQAKSQGGAVFQVNGNHETMNV---------------EGD---FRYVYPGGFDECIRFRSSLFKRGGPLACE-LAPVVLKINDWIFCHGGLLPHHVYGIERMNIEVSTWM-KSSRGY----D-SVVWSRMYSQDSSSIIAEETLKSVG-----AKGMVVGHTPQRG-VNCKCDGKVWCIDVGMSYGSRPEVLEIVNDRPRVLKKRRD GRRIVAIGDVHGDLSQTRAALVLAGVLSVWTGGRTVLVQVGDILDRGADEIAILSLLSSLNVQAKPQGGAVFQVNGNHETMNV---------------EGD---FRYVDPGGFDECMRFRSSLFKRGGPLACE-LAPVVLKINDWIFCHGGLLPHHVYGIGRMNREVSRWM-KSSRGY----D-SVVWSRLYSQDSSSIVAEETLESVR-----AKGMVVGHTPQHG-VNCKCDGKVWCIDVGMSYGSRPEVLEIVNDSPRVLKKRRD DRRIVAIGDVHGDLSQTRAALVLAGVLSVWTGGRTVLVQVGDILDRGKDEIAILSLLSSLNLQAKSQGGAVFQVNGNHETMNV---------------EGD---FRYVDPGGFDECIRFRSSLFKRGGPLACE-LAPVVLKINDWVFCHGGLLPHHVYGIERMNREVSMWM-KCSRGY----D-SVVWSRLYSHDPSSIVAEQTLKSVG-----AKGMVVGHTPQGG-VNCKCDGKVWCVDVGMSYGSRPEVLEIVNDRPRVLKKRRD GRRIVAIGDVHGDLSQTRAALVLAGVLSLWTGGRTVLVQVGDILDRGEDEIAILSLLSSLNMQAKSQGGAVFQINGNHETMNV---------------EGD---FRYCDPGGFDECVRFRSSLLKRGGPLACE-LAPVVLKINDWIFCHGGLLPHHVYGIERMNREVSIWM-KCSRGY----D-SVVWSRLYSQDPSSIVAEQTLEAVG-----AKGMVVGHTPQHG-VNCKCDGKVWCVDVGMSSGSRPEVLEIVNDRPKVLKKRRE GRRIVAIGDAHGDLSQTRAALVLAGVLSVWTGGRTILVQVGDILDRGEDEIAILSLLSSLNMQAKSQGGAVFQVNGNHETMNV---------------EGD---FRYCDPGGFDECMRFRSSLLKRGGPLAHE-LAPVVLKINDWIFCHGGLLPHHVYGIERMNREVSMWM-KCSRGY----D-SVVWSRMYSQDPSSIAAEQTLEAVG-----AKGMVVGHTPQHG-VTCKCDGKVWCVDVGMSCGSRPEVLEIVNDRPRVLKERRE GRRIVAVGDLHGDLNQTRAALVMAGLLSVWTGGQTVLVQVGDILDRGEDEIAILSLLSSLNMQAKSQGGAVFQVNGNHETINV---------------EGD---YRYVDPGAFDECIRFRSSLFKRGGPLACE-LAPVVLSVNDWIFCHGGLLPHHVYGIERMNREVSVWM-KSSRGY----D-SVVWSRLYSQGPSSVVAERTLKSVG-----AKGMVVGHTPQRG-VN-------WYLL--FSLG--------------------DRRIIAIGDLHGDFERAQWALQLAGVMSRWTGGTTVLVQIGDVLDRGEDEITILSLLAWLGKQARSKGGAVFQILGNHETMNV---------------AGD---FRYVAPGGFQEAEAFRTKLLSPGGPLARQ-LAGVVLKVNDWLFAHGGVLPHHVYGLERMNREVSLWM-KNIKGF----D-SIVWSRLYSKENACGILTAALSATN-----SRGLIVGHTPQIG-ANSKCDGRIWRIDVGMSSGALPEVLEIVDDKVRILSVAEE NRRIVSIGDLHGDFERAQEALELAGVLSRWTGGTTILVQIGDVLDRGEDEIAILSLLALLGREARSKGGAVFQLLGNHEAMNV---------------AGD---FRYVAPGGFLEAEVFRTKLLSPGGPLATH-LAGVVLKVNDWLFAHGGVLPYHVYGLERMNREVSLWI-QKEKGY----D-SVVWSRLYSKENACGTLAAALSATN-----SRGLVVGHTPQIG-GNSKCDGRVWRIDVGMSSGALPEVLEIVDDKVRILCAREE ERRIVAVGDLHGDLRQTKRALRVAGVLSNWTGGTTVLVQVGDILDRGPDEIAILSLLWHLTEQARSKGGAVFQVHGNHETMNV---------------SHE---FNDTPDCGFEDSQHFRVLLFSPGGPLALE-LSGVVLKINDWLFAHGGLLPHHVYGLEKMNSEVSNWM-KGGKGF----D-SIVWSRLYSRKTACAVLQQTLDAVK-----AKGLVIGHTPQNG-ANSECDGKVWRIDVGMSRGATPQVIEIVGDEVKVLSCQKE TGRIIAIGDLHGDLDKAVEALKLGRVISSWVGGDTVVVQLGDVLDRGDVEIGIINLLRYLDTEARKQGGAVYMLNGNHESLNV---------------CGD---FRYVTPGAFAESALYRYSLYKPGGDLARE-FSPTVLVVNDTVFAHGGLLPTHVYGIERLNSEVAAWM-RGDGDA----N-SVMWNRTLSKERACNALKQALAKVR-----GKRLVVGHTPQGG-VNCECENQVWRIDVGMSYGRPVQVIEIVPPEVRVIRNTPN TGRIVAIGDLHGDLDKAVEALKLGRVIRAWVGSDTVVVQLGDVLDRGDVEIGIINLLRYLDSEARKCGGAVYMLNGNHESLNV---------------CGD---FRYVTPGAFAESAMYRYSLFKPGGDLAKE-FAPTVLVVNDTVFAHGGLLPIHVYGIERLNSEVAAWM-RGDGDA----N-SVMWNRTLSKERACRALQQALAKVH-----GKRLVVGHTPQSG-VNCECENQVWRIDVGMSYGRPVQVLEITQHEVRIIRSTPT TGRIIAIGDIHGDLQKALSCLEMAGVLAKWVGGDTVVVQLGDVLDRGDCEIGSVLLLRELDRQARQQGGAVYMLNGNHESLNV---------------AGD---FRYVTPGAFFESARARWYLYQPGGAMARE-LAATVLVVNDVVFAHGGLLPHHLYGLQRINDEVAEWM-NGAGDS----S-SVQWNRTFGKERMNFQLRATLESLN-----ARAMVVGHTPQSG-VNCECDGRVWRVDAGMSSGAAPQV---------------SGRIIAIGDIHGDVQKAITSLKLGGVLVVWCGGNTVVVQLGDVLDRGDSEIGAIILLRELDRQARLQGGAVYMLNGNHESLNL---------------LGAAMWRRYVTPGGFRESGLVRLHLYSPGGRLACE-LAPTVLIINDTVFAHGGVLPNHVYGLERLNAEVAAWM-RGAGGP----D-SVMWNRQFGHERACMDLDATLRLLG-----ARQLIVGHTPQNG-ANCECNGRVWRMDVGMSKGAAPQVLEIEPEDIRLLCPPSV KGRVVALGDVHGDIGQARRALTIAGVLGTWIGGDTVVVQVGDVLDRGDDEIAILILLQKLHKAAQADGGAVYILNGNHEVLNV---------------SGD---FRYVTQGAFQESTRFRVGLFSPGGPLAQQ-LASTVLIVNDTVFAHGGLMPRHVFGLEKLNNAVADWM-RGAGSK----D-SVVWNRTFGQENACETLGKTLDAID----GAKRLVVGHTPQGG-CNAECDGRIWRIDVGMSFGADPEVIEIVGDEVRVLTSRVP PGRVVALGDVHGDIGQARRALTIAGVLGEWTGGNTVVVQVGDVLDRGDDEIAILILLQKLHKQAEAQGGAVYILNGNHEVLNV---------------SGD---FRYVTQGAFQESARFRVGLFSPGGPLAQQ-LASTVLIVNDTVFAHGGLMPRHVFGLERLNNAVADWM-RGAGSK----D-SVVWNRTFGTENACEVLGNTLDAISEKYGAAKRLVVGHTPQGG-CNGECDGRIWRIDVGMSFGADPEVIEIDGDEVRVLNSRVP -GRVVAIGDLHGDIGQARRALRMAGVLDRWVGGDTTLVQLGDILDRGDDEIGILILLQKLDKEAQKQGGRVYVMNGNHEVLNV---------------SGD---FRYVSRGAFGESTRFRIGLFSPGGPLAQQ-LSHTVLIVNDTVFAHGGLVPRHVFGLDKLNRAVTDWM-RGKGGR----D-SIVWHRAYGTEKSCELLGKTLGMID----GVSRLVVGHTPQGG-ANCECNGQIWRIDVGMSFGAEPQVLEIDGDEVRVLSSSSV PGRVIAIGDLHGDIGQARRALRLAGVLGTWVGGDTTVVQVGDILDRGDDEIGILILLQKLEKEARKQGGGVYVLNGNHEVLNV---------------SGD---FRYVSRGAFGETMRFRIGLFSPGGPLAQQ-MSHTVLIVNDTVFAHGGLVPRHVFGLDKLNRAVTDWM-RGKGGR----D-SIVWHRGYGTERSCALLGETLGMID----GANRLVVGHTPQGG-ANCECDGRIWRIDVGMSFGADPQVLEIDGQEVRVLTPTNM FGRTVEVSDLAGAMGQARRLPHMRRYRAAWVGGNATLVQLGDILDRGDDEIGILILLQKLDKEAKKAGGAVYVMNGNHEVLNI---------------SGD---FRYVTRGAFGETTRFRIGLFSPGGPLAQQ-LSHTVLIVNDTVFVHGGLVPRHVYGLDKLNRAVSDWM-RGKGGR----D-SIVWHRGYGTENSCELLEKTLGMID----GANRMVVGHTPQGG-ANCECDGKIWRIDXXXXXXAEPQVLEIDGGDVRVITSASS -----------------MKALECAKVMDKWCGGDTVVVQVGDILDRGDNELAIMRKFRALAKDARAAGGDFIVINGNHEIMNV---------------LGD---FRYVTKGAYGECARYRRALFLPGGEMAVK-MAPTVLQVDDTVFAHAGIDESHVYGFARINAEVSQWM-AGSLEE----K-GVVWTREYGGADACRRLEKALDATG-----AKRIVIGHTPQKG-VNSGCGGKVWRADVGASRGNTPQVIEIVNGRVRVLSA--GRRLVAVGDIHGDFAQAMKALELSKCMNKWIGGTTVLVQVGDILDRGDNELAIMRKFQKLAKEAKEAGGDVVVMNGNHEIMNV---------------MGD---FRYVTKGAFGECRRWRRDLFLPGGEMAVK-MAPTVLQVGDTVFAHAGITENHVYGFQRLNNEVAAWM-VGKLEE----K-GVVWTRDYGGAEACKRLTEALDATG-----AKRLIVGHTPQKG-INSGCGGKVWRSDTGMSRGNTPQVIEIVNGRVRIL----GRRLIAIGDLHGDFLAMVKSLELAQVIDKWNGGDAVVVQVGDVLDRGDSEIAILRKLRSLAKQAKEAGGDLITMNGNHEIMNV---------------MGD---FRYATPGAFEECRRYRQQLFKPGGEIAKW-LAPTVLVVDDTVFAHAGIDLSHVYGFERINKEVAMWM-EGKLES----D-GVVWTRDYGGKDSCRRLASALEGAD-----AKRLVIGHTPQTG-VTSGCKGTVWRVDVGASRGNSAQVIEIIGGKVRVLA---GRRLVAVGDLHGDFLAAVRALELAGVMDKWVGGDAVVVQVGDVLDRGDSEIAIMRKLRALRKQARQVGGDVVCMNGNHEIMNV---------------MGD---FRYATPGAFEECRRYRQQLFKPGGELAMW-LAPTVLVVDDTVFAHAGIDLSHVYGFERINKEVSMWM-QGKLES----D-GVVWTRDYGGKDACRKLNTALDAAG-----AKRLVIGHTPQTG-VTNGCKGALWRVDVGASRGHDVQVIEIVNGRTRVLA---GRRLIAIGDLHGDFLAMVRSLELAGVCDKWIGGNAVVVQVGDVLDRGDSEIAILRKLRALKRQAKEQGGDVITMNGNHEIMNV---------------MGD---FRYATPGAFEECRLYRQKLFSPGGEIAKW-LAPTVLVVDDTVFAHAGIDLSHVYGFERINREVSLWM-QGKLES----D-GVVWTRDYGGKDACRRLGAALDAAE-----AKRLVIGHTPQTG-VTSGCKGAVWRVDVGASRGNAAQVIEVIGGRVRVLA---ADRVVAIGDVHGDLAATRAALRIAGAIDRWIGGKLVLVQTGDELDRGDDEQTIVDLFDRLAVEARAAGGAVYALNGNHEVMNV---------------QLD---FRYVTEGGFKDFEDVRAAAFAPGGLYARK-LADVIIAVGDTVFTHGGVLPAHTYGISRINREVRAWM-EGKAAE----D-SPVWARLYSSDPPCAALDQTLAAMS-----AKRMVVGHTVQHG-ATSACGDKIWRIDVGMASHGKPAALEIVGDTVRVLTPEAT PARVVAIGDLHGDLDAALRALSLAGAIALWTGGDMVLVQTGDVLDRGDDEQAILDLLARLEREARAAGGAVHALLGNHETMNA---------------AGD---FRYVTPGGFSDFADARAAAFLPGGVYAER-LANTVVVVGDTVFAHGGVVPRWAYGIERVNAELRCWL-AGELAE----D-NPMWSRDFSREPACDALERALAALD-----ATRMVVGHTPQDG-ITSACDGRVWRIDTGMAAHGAVQALELRGDSARILSEDE--------------------------MDKWIGGSLVIVQTGDQLDRGDGEQAILELLARLQHEAKAAGGAIHILNGNHEFMNA---------------MGD---LRYVTPGGLVDFADARVAAFLPGRPWAKE-LGNTVVVVGDTAFVHGGVLPAYAGDITTLNREARAFL-NGEVDP----D-GPVWSRHYSDE-DCALLDQALAKLE-----VKRMVVGHTVHEG-IQSACDEKVWMVDVGMAAHGPTQVLVIAGAEVSVLAG--VPHIIAFGDVHGDFVAMAEVLLGAGIVDHWIAGETWVVQVGDQLDRGYQEEEIMNLFEQLRVEAAEAGGRFLALNGNHEIMQA---------------EGR---MDYVFD--LEAFGGLRVEAFAPGGEWALV-LANVIVKVGRTVFVHGGALPEHALGIENMNDAAKAWL-VGDDGS----G-SIVWDRTYSDDDRCELLTAALAAMD-----ADRIVVAHTIQPG-INAICDDRAWRIDTGMADYGPIEALEITGDVASVIQME-PERLVAVGDLHGDLDKSREAFKIAGLIDRWTGGSTMVVQVGDVLDRGGEELKILYFLEKLKREAERAGGKVLTMNGNHEIMNI---------------EGD---FRYVTKTGLEEFQVWRIAALRPEGPISKRFLTQTVAVVGDSVFVHGGLLAEHIYGLERINEEVRGWI-NGLRGG----N-SVVWLRKFSEEMDCAALEHALSTIG-----VKRMIMGHTIQAG-INGVCNDKAIRIDVGMSKGGLPEVLEIRR-DVRILTSNPL PERLVAIGDLHGDLEKSREAFKIAGLIDRWTGGSTMVVQVGDVLDRGGEELKILYFLEKLKREAERAGGKILTMNGNHEIMNI---------------EGD---FRYVTKKGLEEFQIWRIAALRPDGPIAKRFLTQTVAVVGDSVFVHGGLLAEHIYGLERINEEVRGWI-NGFRGG----N-SVVWLRKFSEEMDCAALEHALSTIG-----VKRMIMGHTIQAG-INGVCNDKAIRIDVGMSKGGLPEVLEIRR-DVRIVTSNPL PERLVAIGDLHGDLEKSREAFKIAGLIDRWTGGSTMVVQVGDVLDRGGEELKILYFLEKLKREAERAGGKILTMNGNHEIMNI EGD FRYVTKKGLEEFQIWRIAALRPDGPIAKRFLTQTVAVVGDSVFVHGGLLAEHIYGLERINEEVRGWI NGFRGG N SVVWLRKFSEEMDCAALEHALSTIG VKRMIMGHTIQAG INGVCNDKAIRIDVGMSKGGLPEVLEIRR DVRIVTSNPL PERLVAIGDLHGDLEKSKESFRIAGLIDRWTGGSTVVVQVGDVLDRGGEEIKILYFLEKLKREAERSGGKVLTMNGNHEIMNV---------------EGD---FRYVTKKGLEEFQIWRIAALRPEGPIAKRFLTQTVAVVGDSVFVHGGLLAEHIYGLERINEEVKGWI-NGLRGG----N-SVVWLRKFSEEMDCAALEHALSTIG-----VKRMIMGHTIQAG-INGVCNDKAIRIDVGMSKGGLPEVLEIRR-DVRIVTSNPL PERLVAIGDLHGDLDKSKEAFRIAGLIDKWTGGSTMVVQVGDVLDRGGEELKILYFLEKLKREAEKSGGKILTMNGNHEIMNV---------------EGD---FRYVTEKGLEEFQIWRIAALRPEGPIAKRFLSQTVAVVGDSLFVHGGLLAEHIYGLERINEEVTCWI-NGLRGG----N-SVVWLRKFSDEMDCSALEHALSTIG-----VKRMIMGHTIQAG-INGVCDNKAIRIDVGMSKGGLPEVLEIRK-DVRIVTSNPL PDRLVAIGDLHGDLEKSKEAFRIAGLIDRWTGGSTMVVQVGDVLDRGGDELKILYFLERLKREAEREGGRVVTMNGNHEIMNI---------------EGD---FRFVTKEGLEEFRVWRIAALRPQGPIAKRFLSQTVAVVGDSVFVHGGLLAEHVYGLERMNEEVTSWI-NGLRGG----N-SVVWLRKFSDERDCAALEHALSTIG-----VRRMIMGHTIQAG-INGVCGDKAIRIDVGMSKGGLPEVLEIRK-DVRIVTSNPL PERLIAIGDLHGDFDKSKQAFRIAGLIDKWIGGSTTVVQIGDVLDRGGNELKILYFIEKLKREAAKTGGKIITMNGNHEIMNM---------------EGD---FRFVTKEGLDEFRVWRIAALRPEGPISRRFFTVTVLVVGDSIFVHGGLLPEHVYGLERINQEVRNWI-NGSRGR----D-GVVWVRRFSREQDCSALEHVLATVG-----VKRMIMGHTIQSG-INGACNNRAIRIDVGMSSGGLPEVLEIDR-NLQVLTSNTI VDRLIAIGDLHGDLEKSKQALRLAGLINQWTGGTATVVQIGDVLDRGDDEIKILYLLEKLKREAEKSGGKFITMNGNHEIMNI---------------EAD---FRYATEMGLKEFEDWRIAALRPDGPIARRFLSTTVLVVGDSVFVHGGLLKQHVYGLERINREVRDWI-NGLKGR----H-AVVWLRKFSDE-DCSALEHALATLG-----VKRMIMGHTIQKG-INAVCDNRAIRIDVGLSRGGLPEVLEING-NLLVLTANPL VDRLIAIGDLHGDLEKSKQALRLAGLINQWTGGTATVVQIGDVLDRGDDEIKILYLLEKLKREAEKSGGKFITMNGNHEIMNI---------------EAD---FRYATEMGLKEFEDWRIAALRPDGPIARRFLSTTVLVVGDSVFVHGGLLKQHVYGLERINREVRDWI-NGLKGR----H-AVVWLRKFSDE-DCSALEHALATIG-----VKRMIMGHTIQKG-INAVCDNRAIRIDVGLSRGGLPEVLEING-NLLVLTANPL PDRLIAIGDLHGDLEKSKQALRLAGLIDRWFGGSATVVQIGDVLDRGDDELKILYFLEKLKREAVKSGGNLITMNGNHEIMNV---------------EGD---FRYMTKLGLEEFTNWRIAALRPNGPIANRFLSLTVLVVGDSVFVHGGLLAEHVYGLARMNKEVSDWI-TGLRGR----N-AVVWLRKFSDEVDCAALEHVLATIG-----VKRMIMGHTIQAG-INGVCENKAIRIDVGMSRGGLPEVLEING-NLRVLTSNPL PNRLVAIGDLHGDLEKSKQAFRLAGLIDRWSGGSATVVQIGDVLDRGGEELKILYFLEKLKREAVKSGGNLITMNGNHEIMNV---------------ESD---FRYVTKVGLEEFSNWRIAALRPDGPIANKFLSVTVLVIGDSVFVHGGLLAKHVYGLERMNQEVRDWI-TGLRGR----E-AVVWLRKFSDEVDCSALEHVLATIG-----VKRMIMGHTIQAG-INAVCDNRAIRIDVGMSKGGLPEVLEING-KLRVLTSNPW PDRLIAIGDLHGDLEKSKQALRLAGLIDKWAGGSATAVQVGDVLDRGDDEIQILYFLEKLKREAMKDGGNFITMNGNHEIMNI---------------EGD---FRYVTKLGLKEFEDWRIAALRPNGPIANKFLSVTVLVVGDSIFVHGGLLAQHVYGLERINEEVRDWI-SGLRGR----N-AVVWLRKYSDV-DCSMLEHVLATVG-----VKRMIMGHTIQDG-INVACNNRAVRIDVGMSKGGLPEVLEINQ-NLRVLTSNPL PNRLVAIGDLHGDLEKSKQAFRLAGLIDRWCGGSTTVVQIGDVLDRGGEELKILYFLEKLKREAVKSGGQLITLNGNHEIMNI---------------EGD---FRYVTKKALEEFDVWRIAALRPDGPISRRFLALTVLVVGDSVFVHGGLLEKHVYGLQRINDEVRDWI-NGLRGR----N-AVVWLRKFSDEFDCSLLEHVLATIG-----AKRMIMGHTIQIG-INGACDNRAIRIDVGMSKGGLPEVLEIDR-NLRVLTSNPM PDRLIAVGDLHGDLSKSKEALRLAGLIDRWIGGSATVVQIGDVLDRGGDELKILYFLEKLKREAAKDGGMIITMNGNHEIMNV---------------EGD---FRYVTKEGLEEFRAWRIAALHPNGPISGRFLSTTVLVVGESVFVHGGLLPGHVYGLQRINEEVRDWI-KGLRRS----N-AVVWLRKFSDESDCSLLKHVLDTIG-----AKRMIMGHTIQAG-INGVCNNQAIRIDVGMSKGGFPEVLEFVG-NPRILTSNPY PPRLVAIGDLHGDLEKSKEALRLAGLIDRYTGGSATVVQIGDVLDRGGDELKILYFLEKLKREAARRGGRIITMNGNHEIMNV---------------EGD---FRFATLPGVEEFRVWRVAALRPNGPIAKRFLSVTVLVVGDSIFVHGGLLPQHTYGLEKINEEVRDWV-NGSRGA----D-GLVWVRKFSRGDDCSALEHVLSTVG-----VKRMVMGHTIQVG-INGVCDDKAIRIDVGLSKGGLPEVLEISG-SLRILTGNPL PPRLVAIGDLHGDLEKSKEALRLAGLIDRYTGGSATVVQIGDVLDRGGDELKILYFLEKLKREAARRGGRIITMNGNHEIMNV---------------EGD---FRFATLPGVEEFRVWRVAALRPNGPIAKRFLSVTVLVVGDSIFVHGGLLPQHTYGLEKINEEVRDWV-NGSRGA----D-GLVWVRKFSRGDDCSALEHVLSTVG-----VKRMVMGHTIQVG-INGVCDDKAIRIDVGLSKGGLPEVLEISG-SLRILTGNPL PPRLVAVGDLHGDLEKSKQALRLAGLIDRYIGGSATVVQVGDVLDRGGEELKILYFLEKLKREAARSGGRIVTMHGNHEIMNV---------------EGD---FRFATESGVEEFRVWRVAALRPNGPIATRFLSVTVLVVGDSIFVHGGLLPEHSYGLEKINEEVRDWV-KGLRSA----D-GLLWLRKFSRSKDCSALEHVLATVG-----VKRMVMGHSIQVG-INGACDDKAIRIDVGLSKGGLPEVLEISR-NLRILTEDPL PSRLIAIGDLHGDLKKSKEALSIAGLIDNYTGGSATVVQIGDVLDRGGDEIKILYLLEKLKRQAAIHGGNFITMNGNHEIMNA---------------EGD---FRFATKNGVEEFKVWRVAALRPNGPISKRFFTVTVLVVGDSIFVHGGLLKEHVYGLEKINGEVSDWY-KGLRGR----N-ALVWLRKFSD--DCSSLEHVLSTIG-----VKRMIMGHTIQEG-INGVCENKAIRIDVGMSKGGLPEVLEIDR--VRILTSNPL QDRLVAIGDLHGDLEKTKESLRLAKLIDKWVGGSTTVVQIGDVMDRGGDEIKILYYLEKLKREAARCGGTVITMHGNHEIMNV---------------EGD---FRFVTRKGLEEFRVWRIAALRPSGPISSRFLSVAVLVVGDSVFVHGGLLSQHVYGLEKINEEVRDWI-HGLHGA----N-GVVWLRKFSLGLDCSALEHVLATIG-----AKRMIMGHTIQFG-INGVCNERAIRIDVGMSKGGLPEVLEIKG-NMRILTSNPL QDRLVAIGDLHGDLEKTKESLRLAKLVDKWVGGSTTVVQIGDVLDRGGDEIKILYYLEKLKREAARCGGTVITMHGNHEIMNV---------------EGD---FRXVTRKGLDDFRVWRIAALRPSGPISSRFLSVAVLVVGDSVFVHGGLLGQHVYGLEKINEELREWI-HGLRGA----N-SVVWLRKFSHEYDCSALEHVLATIG-----AKRMIMGHTIQFG-INGVCNERAIRIDVGMSKGGLPEVLEIKG-NLRILTSNPL QDRLVAIGDLHGDLEKTKESLRLAKLVDKWVGGSTTVVQIGDVLDRGGDEIKILYYLEKLKREAARCGGTVITMHGNHEIMNV---------------XXD---FRXVTRKGLDDFRVWRIAALRPSGPISSRFLSVAVLVVGDSVFVHGGLLGQHVYGLEKINEELREWI-HGLRGA----N-SVVWLRKFSHEYDCSALEHVLATIG-----AKRMIMGHTIQFG-INGVCNERAIRIDVGMSKGGLPEVLEIKG-NLRILTSNPL QDRLIAIGDLHGDLEKTKESLRLAKLIDKWVGGSSTLVQIGDVFDRGGDELKILYYLEKLRREAARCGGTVITMHGNHEIMNV---------------EGD---FRFATHKGLDEFRGWRVAALRPNGPISSRFLALTVLVVGDSVFVHGGLLAAHVYGLEKINAEVRDWV-HGLRGR----N-AVVWLRNFSHEFDCSALEHVLSTIG-----AKKMIMGHTIQFG-INGVCEERAIRIDVGMSKGGLPEVLEING-NMRILTSNPL PSRLIAIGDLHGDLDKTKQAFHLAGLIDRWSGGSSTVVQVGDVFDRGGDELKILYFLEKLKREAAKSGGELITMNGNHEIMNV---------------EAD---FRFVTRSGLQEFEDWRIAALRPGGPVSTRFLTQTVVVVGDSVFVHGGLLSNHVYGLERINDEVREWL-SGSRGR----D-AVVWLRRFSEEADCSALEHVLATIG-----VKRMIMGHTIQSG-INGICGNKAIRIDVGLSKGGLPEVLEIDGTNLRILTANPL PSRLIAIGDLHGDLEKTKQAFRLAGLIDRWSGGSSTVVQVGDVFDRGGDELKILYFLEKLKREAAKSGGELITMNGNHEIMNV---------------EAD---FRFVTRSGLQEFEDWRIAALRPGGPVSTRFLTQTVVVVGDSVFVHGGLLSNHVYGLERINDEVREWL-SGSKGR----D-AVVWLRKFSEEADCSALEHVLATIG-----VKRMIMGHTIQGG-INGICENRAIRIDVGLSKGGLPEVLEIDGTNVRILTANPL PDRLIAIGDVHGDLEKTKEAFRIGGLIDRWIGGSTTVVQVGDVLDRGGEEIQILYFLEKLKREAARSGGKIITMNGNHELLNV---------------DGN---FKFTTADSLKEFKKWRIAALRPGGPITRRFLSATVLVIGDSIFVHGGLSSKHLYGLDRINEEVKDWM-NGL-GE----D-GVVWMRKFSVKKDCSALEHLLTTVG-----VQRMIMGHTPQ-R-INDECDGKAIRIDVGMSRGRLPEVLEITR-GLKILSSSPA PDRLIAVGDLHGDLQKSKEALRLAALIDRWTGRTATVVQIGDVLDRGGDELKILYFLEKLKREAEKSGGTIITMNGNHEIMNV---------------DGD---FRFVTQAGLDEFRVWRIAALRPEGPISVRFLSQTVVVVGDSVFVHGGLLPKHVYGLERINEEVRDWI-NGLRGK----H-SMVWLRKFSHELDCSTLEHVLATIG-----AKRMIMGHTIQTG-INGACGNRAIRIDVEADR---------------------PNRLVAIGDLHGDLDKSKQAFKIAGLIHRWIGGNTTVVQIGDVFDRGDDELKILYFLEKLKREALRSGGSIITMIGNHEIMNI---------------DGD---FRYITPSGLQEFKAWRIAALRPNGPISTRFLSPTALVVGDSVFVHGGLLQDHVYGLERINEEVRDWI-NGLRGR----N-SVVWLRKFSDQVDCSTLQHVLATIG-----AKRMIMGHTIQMG-INGVCDNQAIRIDVGMSKGGLPEVLEIIG-NVRILTSEKV PSRLIAIGDLHGDLPKAKQALRLAGLVGRWSGGSATVVQVGDIFDRGGDEIKLLYFFEKLKREAAGAGGLVITMNGNHEIMNV---------------EGD---FRFAAREGFEEFENWRIAALRPNGLISSRFLSQTVVVVGDSVFAHGGLLQNHVYGLDLINEEVRDWI-LGLRGR----D-SIVWLRSFSNELDCSMLEHVLETIG-----VKRMIMGHTIQNG-INTVCGNRAVRIDVGMSRLRFPEVLEIGE-NLKVLTSNPL PSRLVAIGDLHGDLAKSLAALRLAGLLPSWAAGPTLAVQLGDILDRGGDELRLLYFLRRLSISAAAQGGALLPILGNHEVMNV---------------AGD---YRFVTPEGLREFSAWRLAALRPEGPIARRFLAPTVLVVGDSVFVHGGLLEAHVYGIERINAEVSDWI-RGGSGR----D-AVVWLRRFSEGFDCQRLQGVLGMLG-----TKRMVMGHTIQEG-INAVCGAQAVRVDVGLSRGGLPEVLEINGGGVRVITTDPA PTRLVAIGDLHGDLPKSLSALRLAGLVPSWSAGPTLAVQLGDILDRGGDEIRLLYLIRRLAISAAGQGGALLPIMGNHEVMNV---------------SGD---FRFATPQGLREFSAWRLAALLPDGPIARRFLAPTVLVVGDSVFVHGGLLEANVYGLERINAEVSEWI-RGERGR----D-AVVWLRRFSDGVDCQRLEGVLGMIG-----AKRMIMGHTIQEG-INAVCGAQAVRVDVGLSRGGLPEVLEINGGGVRVITTDPA PSRLVAIGDLHGDLPKSLSALRLAGLVPSWAAGPTLAVQLGDILDRGGDELRLLYLLRRLALSAEARGGALLPILGNHEVMNV---------------SGD---FRFATPQGLQEFSAWRLAALRPDGPIARRFLAPTVLVVGDSVFVHGGLLEANVYGLERINAEVSEWI-RGERGR----D-AVVWLRRFSDGFDCQKLEGVLGMIG-----AKRMVMGHTIQVG-INAVCGAQAVRVDVGLSKGGLPEVLEINGGGVRVITTPPS PSRLVAIGDLHGDLPKSLSALRLAGLVPSWAAGPTLAVQLGDILDRGGDELRLLYLLRRLALSAEARGGALLPILGNHEVMNV---------------SGD---FRFATPQGFQEFSAWRLAALRPDGPIARRFLAPTVLVVGDSVFVHGGLLEANVYGLERINAEVSEWI-RGERGR----D-AVVWLRRFSDGFDCQRLEGVLGMIG-----AKRMVMGHTIQVG-INAVCGAQAVRVDVGLSRGGLPEVLEINGGGVRVITTPPS PSRLVAIGDLHGDLPKSLSALRLAGLVPSWAAGPTLAVQLGDILDRGGDELRLLYLLRRLALSAEARGGAFLPILGNHEVMNV---------------SGD---FRFATPQGLQEFSAWRLAALRPDGPIARRFLAPTVLVVGDSVFVHGGLLEANVYGLERINAEVSEWI-RGERGR----D-AVVWLRRFSDGFDCKRLEGVLGMIG-----ARRMVMGHTIQVG-INAVCSAQAVRVDVGLSRGGLPEVLEINAGGVRVITTPPS LSRLVAIGDLHGDLPKSLSALRLAGLVPSWAAGPTLAVQLGDILDRGGDELRLLYLLRRLSLSAESRGGAFLPILGNHEVMNV---------------SGD---FRFATPQGFQEFSAWRLAALRPDGPIARRFLAPTVLVVGDSVFVHGGLLEANVYGLERINAEVSEWI-RGERGR----D-AVVWLRRFSDGFDCKRLEGVLGMIG-----AKRMVMGHTIQVG-INAVCSAQAVRVDVGLSRGGLPEVLEINAGGVRVITTPPS PKRLIAVGDIHGDLAKARAALHVAEVIDHWIGGETVVVQVGDLLDRGGEEIKVIYLLEKLRGEAQKVGGNVHIMNGNHEIMNI---------------EGD---FRYATPLGLDEFQRWRIAALRPGGPLASRFLAPTVLVVGSSVFVHGGLLPVHVHGLERINQEVSEWM-LGTHGG----N-ALVWLRKYSDVKDCDLLKRCLGSIG-----AKRMVVGHTIQIG-LNGACDNKVIRVDVGLSKGGMPQVLEIRG-DLRILSSRLP VERLVAIGDIHGDLAKTRESLRIAGVVDRWIGGKTVVVQVGDVLDRGGQELKVLYLLERLKGEAQRHGGDLHIMNGNHEVMNI---------------EGD---FRYATKEALDEFSAWRYAALRPGGPISSRFLAPTILIVGSTVFAHGGILPSHLYGLEKINAEVRDWI-QGRRGK----D-AVVWARQYSIPVDCKLLDEVLTGIG-----ASRMVVGHTIQFG-INSACQNQVFRVDVGMSSGTAPEVLEIRH-DVRVLS---VERLVAIGDIHGDLAKTRESLRIAGVVDRWIGGKTVVVQVGDVLDRGGQELKVLYLLERLKGEAQRHGGDLHIMNGNHEVMNI EGD FRYATKEALDEFSAWRYAALRPGGPISSRFLAPTILIVGSTVFAHGGILPSHLYGLEKINAEVRDWI QGRRGK D AVVWARQYSIPVDCKLLDEVLTGIG ASRMVVGHTIQFG INSACQNQVFRVDVGMSSGTAPEVLEIRH DVRVLS VRRLVAVGDLHGDLAKTKEALVAGGLMDRWSGGTTTAVQVGDQLDRGGDEVAILLLLERLRKEARDAGGELVVMNGNHETLNV---------------AGR---FRYATADGLEDFRRWRLAALAPGGPMARRFLAPLVVAVGSTVFAHGGVLPHHVYGLERINKETSDWV-RGNSGR----E-SVVWARDYSMPEDCDVLEASLSKMG-----MKRVVVGHTIQHG-VNAACDGKVLRVDVGMSKGGKPEVLEILDDGGVFRLSLDE -ERLVAIGDLHGDLNKATRAFRAGGLIDKWIGGATVAVQVGDQLDRGGDEVAILYMLERLRKEARDAGGELIVMNGNHETLNV---------------AGR---FRYAFEPGIEDFRRWRLAALAPGGPMAKRFLAPVVVAVGSTLFAHGGVLPHHVYGLERINQETSEWI-RGDSGG----R-SVVWARDYSWPQDCGVLRRALDGLG-----IDRVVVGHTIQEG-VNAACDGRVLRVDVGMSEGSEPEVLEIL-----------VERLVAIGDVHGDLAKTREAFRAAKLTNEWVGGTTTCVQVGDQLDRGKDEVAILHFLERLRGEARAAGGELVVMNGNHETLNV---------------SGR---YRYSLAEGNADFTRWRWRALAPGAPLALRFLAPVVVAIGSTLFVHGGVLPEHVYGLDTLNEEISQWI-KRGQGR----D-SVVWARDYSHVQDCDLLEETLRMIG-----VERVVVGHTIQHG-ITSACDGKVIRIDVGMSKGAKPEALEILKDGIAKMKLGDD VERLVAIGDVHGDLSKVREAFRAAGMTNEWVGGSATCVQVGDQLDRGKDEVAILHFLERLRGEARDAGGELVVMNGNHETLNV---------------SGR---FRYSLPEGRADFARWRWRALAPGSPMTMRFLAPVVVAIGSTLFVHGGVLPEHVYGLDTLNAEISAWM-QRGQGR----D-SLVWARDYSHVQDCDLLKETLAMIG-----IERVVVGHTIQYG-ITKACDGQVIRIDVGMSKGAKPEALEILNDGI-------VERLVAIGDVHGDLDKTREAFRAARLTNEWIGGTATCVQVGDQLDRGKDEVAILHFLERLRTEARNAGGELIVMNGNHETLNV---------------AGR---YRYALTEGSADFARWRWRALAPGAPLTLRFLAPVVVAVGSTLFVHGGILPEHVYGLDSLNAEISTWM-KMGQGR----D-SLVWARDYSHVQDCDLLKETLKMIG-----VERIVVGHTIQHG-ITSACDGKVIRVDVGMSKGAKPEALEILRDGLTKLKVGAD VPRLVAIGDLHGDLKKARRAFRLGGLTDRWIGGTTTAVQVGDQLDRGNDEVRILYFLERLEREAERAGGKLHILNGNHETMNV---------------AGR---FNYATLPGLADFYHHSEALHCSAGAVTARFLAPIALQIGSTVFVHGGVLPQHALGLERINQETRAWM-RGETGR----Q-AIVWAREYSAGADCAALEQVLGRLG-----AERMVVGHTIQAG-INSACEDRVFRIDVGLSKGGEPQV---------------VPRLVAIGDLHGDLRKARRAFRLAGLIDRWAGGTTTV--VGDLLDRGDQELPLLFFLERLQGQAAAAGGALHVLNGNHETMNV---------------AGQ---FRYVTHRGMHSFADWRILALQPGGEVTRRFLAPVVLQVGSTLFVHGGVLRQHVYGLERINRETQEWMLKGSSGR----G-AVVWSRHYSAAADCGQLAEVLGRVG-----AQRMVVGHTIQGG-INSACQGRVLRIDVGLSRGGSPEVLEIVN-DVRRLSESQQ PPRLVAIGDIHGDYHKAVRALRLAGLMDRWAGGSTVAVQVGDILDRGDHEIRILILLERLAAEAAAAGGRLYLLNGNHETMNV---------------MGD---HRYATPGANLEFLGFR--ALMPGSELARRFFAPTVLQLGGNVFVHGGVLPAHVYGLEKINSETQSWM-LAPRGG----S-AIVWARAFSASDDCDTLKSVLESVG-----AQRMVVGHTIQRG-INSACESRVVRVDVGMSHGGPVEVLEVLK-DVLRLREHTP ADRLVAIGDIHGDYHKAVRAFRLAGLTDRWSGGSTVAVQVGDILDRGDHEIRTLIMLERLAREAEAAGGRLYLLNGNHETMNV---------------MGD---HRYATPGANLEVLGLR--ALRPGSEISRRFFAPTVLQVGDNLFVHGGVLPAHVYGLERINRETQSWL-LGGLGG----S-AIVWARAFSASDDCETLQNVLESIG-----ARRMVVGHTIQRG-INSACESRVIRIDVGMSHGFPVEVLEVLK-DVRRLREHGP -----------------------------------------------------------------------------------------------------------------------CLTVLRPDGPISRRFMAPTVLVVGDSVFVHGGLLEANVYGLERINAEVSEWI-RGERGW----D-IVVWLRRFSDGFDCKRLEGVLGMIG-----AKRMVMGHTIQVG-INTVGRGPPPSTSVSVGRGAMAQTLATVG---RTLPQRGG -----------------------------------------------------------------------------------------------------------------------RLTALRPDGPISRRFLAPTVLVVGDSVFIHGGLLEANIYGLERINAEVSEWI-RGERGR----D-TVVWLRRFFDGFDCKRLEGVLGMIG-----AKRMVMGHTIQVE-INTVGRGPPPSTSVSVGRGAMAQTLATTG---RTLPQRGG -----------------------------------------------------------------------------------------------------------------------RLTALRPDGPISRRFLAPTVLVVGDSVFIHGGLLEANIYGLERINAEVSEWI-RGERGR----D-TVVWLRRFFDGFDCKRLEGVLGMIG-----AKRMVMGHTIQVE-INTVGRGPPPSTSVSVGRGAMAQTLATTG---RTLPQRGG VRRLLALGDVHGDLASLRKCLSLANVINEWIGGDAVVVQVGDVFDRGDAERAVLHFLDTLDRQARERGGAVYRLLGNHEVMNV---------------DLD---FRYVTPGGFAEFRRRRAMALAPGCPTARLLAEQLVLIIGDTLFVHGGLRPDHVYGLERLNAETRAWMERRAKGS----R-SPVWMRTYSAPHACRELEETLRRAG-----VRRMVVGHTPQ--VINGACRGRVWRVDTGMSAAGACEVLEISETHIRIFTDHGV PGRVVALGDLHGDAAAAVTALTAAGLIDHWSGGDATLVQLGDVLDRGGEEFALWGLLERLRLEARSAGGRVVTLLGNHEVLNA---------------LGKA--ATYVHPAGHAQFGDDRCAAFRPGGALAARLADPVVAVVGDSVFAHASLPRGATGSLARLNAETRAWL-LGEGGR----D-SPVWDRTFSAPGDCEALRDTLRGLG-----ASRLVVGHTPQ--RVNGACGGAVWRCDTGCSRWGPVEALEIQGGAVRVLRRGTA PKRIVAVGDLHGDIHATRQALRLAGVLHEWIGGDTFLVQVGDQVDRGDDEIQVMSLLHRLGKQARAEGGRVEVLVGNHELMSA---------------QGN---FRYATKGAMRNFQRWRYLSLRSGGTFASKFLAKAILIVGDTVFVHAGLLHKHFEAINVLNQEVAAFL-KGEWGN----D-GILWTRRFSKIDTCKQLKKTLNRLN-----VRRMVVGHSVQDG-ISPACDASVWRIDVGMSKGTSPQVLEIKQDGVNILGKSIE PDRLVAIGDLHGDIVATRRALRLAEVLHEWVGGKTVVVQVGDQLDRGDDELQILSLLRKLAVQARQHEGALHVLIGNHEILS----------------THT---ARYATRGALENFFRWRYMALHPGGMISRTILATTALIVGQTLFVHAGIDLDHIEALKKINEEVSAFF-RGESTP----D-SMVWMRRYGGTHTCATLNATLKALD-----VRRMVIGHTIQSKTINSACDGAVWRVDIGMSSGTEPQILEMWKNGVNVKKEAAM NQRLICIGDVHGDLWALQDFLEISGVYDEWIGGNTIVVQCGDILDRGVEELACYKLLCKLSQQAIHTDGKVILLVGNHEAMNS---------------MGL---FQYAINDLEHERKIGRWSSYEPGGLLATSLLAKVAIRVGRTVCVHAGLRPSHLGGIEGMNKQFRDWITLET---------GPIWMRDYSYPHTTRMIEETLLLLD-----CDRMIMGHTIQ--QINCVLHGRAWRIDVGASRGGTPE-IKIKYRY--------NQRIVSFGDVHGDITALRTFLITARVLDRWSGGDTICVQTGDVLDRGDDELACFRLLATLSRQAKESGGALLLLYGNHESLNA---------------AGL---FQYANPGGNAEFESTRWASFEPGGLLAENMLGLVACVVGRTVFVHAGLQAVHLNGISKLNMEARDWILKDCIGA----S-SPVWMRDYSQPNKNPYAKTMIGTLN-----VQRMVMGHTPQ---INAAMNGRAWRIDVGASQGGTPEVLEI------------AERLVVIGDVHGDIDAFRSCLQMADLVDKWAGGETVVVQMGDIFDRGDDDLPIQEWVYKLAQEAGRANGALYSVMGNHEMLNA---------------MGD---HSMATRKAFVPFLALRLAAMRPGGPVA-RLMAAVSMKVGDNLFVHAGLLPEHIGGMERLNADTCAWM-LGKWQP----E-GPLWTRVYSTPDARAQLEEVLRLTG-----TKRMVVGHTPQAG-INSAADGQVWRVDTGMTAMGRPEALEIRGEEMTILTEYKS --MIPFIGNRVLD--AFAETLLAAGLVDNWAAGETVLVQAGDVFDRGDADLEVEEWLWTLQEQATESGGAVYHLLGNHEIMNA---------------MGD---HSTASPNSFKPFQDLRLAAMAPGGPVS-KMLASVAMKIGDTLLVHGGLRPVN-FDLESLNRWTHEWL-VGQWNR----D-SPVWTRFYSSPGAEADLQKVLDATG-----TVRMIVGHTPQAG-INSALGGRLWRTDTGMTAMGQPEALEIVDGVATIITAGGK AERIIAIGDVHGDVDALHECLKVANLVDNWIGGAAHLVQVGDILDRGMEERRCLQSLLDLKQKAADAGGAVHVLLGNHEVMNV---------------DLD---FRYVSPRENAWFGWERVYAFRPGGGAA-RCLSPIAIQIGDSVFVHGGLRLQHVYGLEKLNQETAAWL-YGSDDI----N-SPIWSRIYSVPSAELELEQVLEKLN-----AKRMVVGHTPQRG-INAFVGYEVWRTDTGMSQGGPIECLEILEDGVHVLTQAGI PKRIIALGDIHGDVRALATSLHMSHLIDNWIGKDSVLVQLGDVLDRGPNDYWCMRLLIKLQEQARASGGDVICLLGNHEVMNV---------------QLD---FRYVDPAAWAGWELERVNALRRGGTLS-KTLSPVCVVIGDTVFCHGGLMPEAVYGLQRLNDETSNWL-AGENP-----QFSPIWHRAFATANTLSAADCTMRLLH-----VSRMVIGHTPQEG-VNCLVGREIWRVDAGMSTGGPVECLEILAAKVNILSEDGV AARIIAIGDVHGGFNELHHALKISGCADRWTGGDTVVVQMGDFLDRGSDESRSIEMLRDLKVQAKDAGGDVITLLGNHEIMNA---------------DLD---FSYAP--DIYSFDSWRAQLMQRGGKLA-MLMSPVVVQIGDNVFVHGGLTRETIHGIDKLNQEVADWLRKDADPPKGGRTVSPLWERVYGMPIALETLDGMLESMD-----AKRMIVGHTPQYGGVGTEKEKEVWRIDTNLNDKGRVECLEILTDLVRVLSEDGR ARRIIAIGDL--------ASLRLAGVIDKWIGGDTVVVQVGDVLDRGDEESAVYDLLRRLASSAARKGGAVHMLLGNHEVMNV---------------GWD---FRYCEMYEKVDGDCPERGGDRPGGPVARK-LGNTVLVIGENVFVHAGLHGRHVYGLERINREVRDWM-RGT---------DPDATKPACVT-NCAKL--TLEDIP-----AKRMIVGHTDN-G-INCACSGAVWRLDVGMSRVAAAEVLEIRGDTVTALRRGVR TDRVVAIGDVHGDLEALRSCLRLSGLVDAWVAAGATLVSCGDVLDRGDGDWDCLTYLADLKTRATDAGGDVHLVLGNHEVLNV---------------LGD---VRFASRGALVRC---RKRAFAPGSGEGARLLAPVARVCGDTLFCHGGLHAR--GGLDALNGDAARWL-GGAPSP----S-SPVWSRSYSHPAQCGDARSALDALG-----CARMVVGHTPQQG-INACCDDSVFRIDTGLSAYGPKEVLELRPGR--------EPRLVAFGDIHGDVFRCRQILQLANITDRWIAGSSIVVQLGDIADRGLHPHEIYDLFASLERQAMKAGGEFIFLVGNHELMNI---------------LGI---FYYVHPDVMSAFGGEYAAAFGPEGPYGLYILQPVTVVREGVVFAHAGITPEYAKGVEGINAELMNGF-RGESNS----S-SPLWSRAVLDE-NCSLLMESLRLLP-----VRVMVGGHTVQGGVMAVECNGSLVGADVGLSRFGYVAYVEFLLDD--------VHRIIAVGDVHGDADNFLKILRIANLIERWTSVRTTLVQVGDLIDRGEQDLEALNIAISLQEQTAQSGDEVVLLIGNHELLNI---------------QGH---YHYVNKHNYGGFLSKRAEGMKATGAFGKYIVDKAAHMDEGVLFIHAGIETSMNKDVEALNADVREALRQGILGS----S-GPLWTRKMIIE-ECSDVRAALKQLN-----ATRIVVGHTPQSGHIGQHCDGQVLAIDVGMSRWDKVAALELVFSK--------IHRIVAVGDVHGDAERFRQILEMSGVISRWTRLRTTLIQTGDLIDRGEEDLEVLEMAVSLFNEVRTNYDKVVLLMGNHELLNL---------------QGH---FHYVHSKSMGGFLTRRKRAFELDGTFGGFILETVAYAVADTLFVHAGIDEHVVDGIERLNREAKQAIRTKNLGS----T-GPLWSRKMFLD-RCADTKKALASLG-----VKRVVVGHTPQSGRVETFCGGSVIAIDVGMSRWGNIAALEITVTS--------IHRIIAVGDVHGDTENFRQILSMAGIVSMWTKLRSTLIQMGDLIDRGEDDLGVLEMAFSLFEQVKSNHDNIVLLLGNHELLNL---------------QEQ---FYYVHPETMGGFLSKRKRAFEPSGTFGRFLLDNVLYFDAETVFVHAGIDQRFAVGVEMLNKKTMQAIREKDLGT----S-GPLWTRKMITD-RCTGIQKMLSFIG-----AKRIVVGHTPQSGHVEVFCNDSVIAIDVGLSRWGNLAALELMVTS--------HSRLVLLPDLHGDARQAARALQLAGLISQWTGGDAMLVQLGDVVDRGHDSLALLGLLQRLGQAAEAAGGRVQAILGNHELMLV---------------HHD---YRYVSREELRAIGAAWQAGLAPSQPLGALVRRPLALLAKGVLVVHAGAFPWMRAGVDAWNAVALEALVHCDLGG----E-GAVWSRRYIQA-VCAEVAAVAERLG-----VQRVVVGHTVQGGRVSSRCGGQLLMVDVGLSRAGEMAVLTCSNGV--------PSRLVVLPDIHGDGPQALRALHLAGLISQWSGGDTVLVQLGDMVDRGPDSLALLTWLESLRGQARATGGDVVALLGNHELMNI---------------HHD---FRYVAPAEIVALAEAWREALAADAPLGSLVRQPLAALIDAVLAVHAGAFPWMLGGVAAWNAATAGALRFCELGE------GAVWSRHLAHGPVCLEVAAVVSRLR-----VERLLVGHTIQGGRVSSRCGGQLLLADVGISRAGEMAVLQCSSGKTDRQTEEEQ FARIVAIPDIHGDLHHYRQSLRLAGVVDEWTAGSTHLVQTGDVVDRGQHSLLIMDMLANLTVRAKRVGGKVTALMGNHELMSL---------------MDD---TRYVHKDEILLLGTKWHRSFDPDAKQGRRLRRPLATVAGSSLFSHAGVRSRHLGGVDAMNEAAAAAI-QGKYDN----E-SPVWNRFYSREDVCDEVNRVLTAAR-----ANRMVIGHTVQGG-MRTKCGGKLHLIDVGMSSAGRGAAWVCEGGETELLEKEAD ARRLVALPDLHGDLDLARRSLILARVINAWSGGETILVQTGDVLDRGDASVALMRLLNKLARAAKDAGGEVVGLLGNHELMTL ARRLVALPDLHGDLDLARRSLILARVINAWSGGETILVQTGDVLDRGDASVALMRLLNKLARAAKDAGGEVVGLLGNHELMTL---------------QGD---LRYVSKRELVKLGKAWRDLFRAGADLGETLRFSMLHVAGESLFSHAGIRAHHL---EGVNGALRAASAGDGAFG----D-GPLWNRFWSTPRVCDELGAVLAKVG-----ARRMVIGHTIQRG-MATRCGGGLHLIDVGVSGKGRPAAWACEEGVVEVLDDGVE QGD LRYVSKRELVKLGKAWRDLFRAGADLGETLRFSMLHVAGESLFSHAGIRAHHL EGVNGALRAASAGDGAFG D GPLWNRFWSTPRVCDELGAVLAKVG ARRMVIGHTIQRG MATRCGGGLHLIDVGVSGKGRPAAWACEEGVVEVLDDGVE -------------MF---EALALAGVTRGWGAGRATCVQTGDLVDRGERSIEAVDEVERLTREANAVGDEFVSLLGNHELMTL---------------QGD---HRFASRDELTALGRSWRQTFAKGSTRGDVLREPWAAVRGRTAFSHAGLLPEHLFGVDELNARGAKLLAVDELLD----R-GPMWTRTISMG-ACALAAEVIRRLG-----VRRMVVGHTVTSGKIETRCDGLIHMIDVGMSKAGSPSVWMCTESESRVPLEN-RRKRLA------------VGRRLAGVVDSWGGGATTVVQTGDATDRGARSIETIDAIERLRVEAKEVGDEVVGLLGNHELMTL---------------QGD---YRFVAREELLALGRRWGKMFARGSERGEVVREPWAVVRGRTAFAHAGLLPEHLLGVDALNAEGKRLLAVECLVG----D-GPVWTREISTG-ACDAALEVTRRLG-----VRRLVVGHTVTSGRIETRCDGLIHMIDVGMSRLGSPSAWTCVESQGERVALE----------------------------------------------------------MDLGVRALREGGT----------------------------TGD------------------WMRAFARGSARGDEVRSPVAVTRGETVFVHAGLTSRHLFGVDALNARARELFDVDVTGG----D-GPLWTREISMG-VCKEVEETLARLN-----VKRMVVGHTPTSGSIETRCEGMVHMIDVGMSSAGVPSVWMCTESEGERVALE-LKRIVAVADVHGDRRNLMQALENGRVLVEWVMEGTQVVQLGDLVDRGPLGLQCYRLMQDLY--VAEGANEVVRVLGNHEVLNL---------------LGMA--GRYVTDEDVAEFGGERRESWSPGGEIWTILKDELVHVYGGHRFVHGGVMPALTRSIDELNEQASRMIKNGALS-----ESSPLWSRVYALG-ACPPLLNVLRHYG-----VARMVVGHTPSDGRMKVRCGGRAILADVALSRWGHPAALEMTLVN--------LKRIVAVADVHGDRRNLMQALENGRVLVEWVMEGTQVVQLGDLVDRGPLGLQCYRLMQDLY--VAEGANEVVRVLGNHEVLNL---------------LGMA--GRYVTDEDVAEFGGERRESWSPGGEIWTILKDELVHVYGGTLFVHGGVMPALTRSIDELNEQATRMIRDGALS-----ESSPLWSRVYALG-ACPPLLNVLRHYG-----VARMVVGHTPSDGRMKVRCGGRAILADVALSRWGHPAALEMTLVN--------PSRLVAIGDLHGDLPKSLSVLRLAGLVPSWPAGPTLAVQLGDILNRSDDELRLLYLLHRLSLSAETRGGAFLPILGNHEVMNV---------------SGD---FRFATPQGFHDRAVCKLPAFLPGGFLASNQTSQRLIKV---LFVHSGLLMKNRSDLEEYIQETGRAGRDGRLSCHLFLDSTTFYKSEYAMSKICSLVKESTSRKV-----LTQLEIGHLQQSGEISFGCGGRYLEVDPVVMEAAKAELVKLVGKG--------PSRLVAIGDLHGDLPKSLSALRLVGLVPSWTAGPTLTVQLGDILDRGGDELRLLYLL-RLSLSAETRGGAFLPILGNHEVKNV---------------SSD---FRFATPQGFHDRAVNRGGGLDPSRLCACAGFAGAVLIVSRDFNLHQAFKFECAYAYTEENTKPGKYI------TKPRTNNSLIW--------KCDQVLS-----------NNLFGLGYAIQ--------ERRVERASVCVTT---------------------PSSVCAVGDLHGDLQHALAALALCGAVDSWVGGAMTVVQTGDVLDRGNNSLGVLRALWRLQAEAEAAGGELVLLLGNHELMNM---------------QGK---VHYVHKAELAAEGGAWKRRMQPTGDLGAALLRDAAAVRGGTLFVHAGVRLSVAFGVERLNEA-----------------------------------------------------VGHNIV--WVSSRCGGTLALLDVGMSSAGLPAACDVGEEGYHGAATRGP ARRLVAVGDLHGDYEQTVSVLRLTRLIDHWIGEDALLVQLGDILDVGPDDILIVRLLMRLQQEAHAKGGDVIELLGNHELRNF---------------RGD---YKAVDKASLAASGGQRDVLLSNATDLGRY-LRKAVFHYGPFLFMHGGFSTATATSVEQFNSELTKALLNGTEDDVDDVA-NPILVRSILTV-KCNALSKVLDKKG-----IQSVVVGHVPHPRDFDGLCGGRLIDIDFGMSRWGHVAALEIEEATVQLIETSTA GRQIVVVGDLHGDLNQTLAILKITGLVDHWIGGDSFFVQLGDIFDVGPDDISIVKLLMKLEKEAQSVGGDVIELLGNHEIRNL---------------LGD---YSAVDPGSLAGSGGVRDRLLSNRTSVGMY-LRKAIFHHNEFLFMHGGLSTATATGIHEFNKDLRKALTNGTEGGGQEVV-NPILVRSILNV-RCKDLIKVLQNKG-----IKSVVVGHVPHHQDFKDLCGGHLIAIDFGLSRWGHVAALQIDDTTVQLLESTVR GRRIVAVGDLHGDLNQTLSILHLAGLVNHWIGKDTYFVQLGDILDVGPDDLMIVRLLMRLEKEAQAEGGDVIQILGNHEIRNL---------------LGD---FSAVDPVSLAQSGGKRRELLSNRTPLGIY-LRRAIFHHKEFLFMHGGLSTATGTGVEEFNKALRDTLINNTENEAKKVA-NPILVRSILNV-RCSELEKVLSKKG-----IKSVVVGHVPHTDDFSDLCGGRLIAIDFGLSRWGHVAALEWDDVSVQLMESTGK NQRIIAIGDLHGDIDRLRSTLRAANVLEAWRKGTDVVVQLGNVVGYGPDAPEMLQLLSGLKPQALASGGRLITLSGNQELLTL---------------SGV---LEYVHPRQLNLSAGYLRYLYGPGGRYGRMMAELAVAIVSDIVFVHGGLTAAYARGVNKLNAE---WY-EGAHDE----E-SPLWDHSVVRA-NCGPLSAGMAALD-----INLMVVGHTTMDGKVGTWCAGKLMTIDVAMSRYGGYEAFVSFLPMLVLGTAERV VEKLIAISDIEGNFEAFRRFLLSHKVIDNWIFGKGHLVCVGDFFDRGANVTECLWLIYDLEEKAKAAGGYVHFILGNHEIMNM---------------SND---YRYVSPKYMRNADLYYGNWFKSSTELG-RWLGNVVEKIGDMIFVHGGLAPDLNKTLIDINNAMRPFYFIQEFDA----N-GPLWNRSYVMANPESDVAESLRKFG-----GKRIIVGHTLV--HAKMLYGGKVIAIDTRHVDPNSTALFFEDGKL--------PKQIFVLSDIEGNFKSLFLLLRNYKIIDKWSFKDNHLVILGDCFDRGEEVMECLWLIYSLEERAQRDGGHVHFILGNHEIMNM---------------NGD---WRYIHPKYAITPSGSATALYDGNSELW-RWLRNIIEQIGDILFVHGGISEALLLSVTEINELARPYYTRANNS-----DESPFWYRGYYRAASEGQVDTILKKFD-----AKMIITGHTIV--KVTSFFNGKIINVDTNHAAGVSEALFIKGNNF--------PEKLIAISDIEGNFDAFCGFLQKNKIIDNWVFGKGHLVLNGDFVDRGQHVTPTLWLIYKLEEQALQQGGKIHYVLGNHEIMNF---------------QGN---HKYADAKYIKLAQIIYKILYSKKSEIG-RWLRNVVTKIGDYLFVHAGFSPEILLSLENINEITRKNWDKDLLG-----RKGPFWYRGLVKSYNAKDVQEILKTYQ-----AEKIVVGHSIV--AVSADYHGKVIRIDVKHGNE-QGLLVENGIEY--------IKKVVALSDVHGQFDVLITLLTNQKIIDNWAFGNGHMVMTGDIFDRGHQVNEVLWFMYKLDQQARDAGGMVHLLMGNHEQMVL---------------GGD---LRYVHKRYDVAAKLLYDKLYGADTEIG-QWLRNTIIKINDVLYMHGGISSEWILTLARANKLYRQHVDASKFFG----N-GPTWYRGYFSEETETELDTILNHFD-----VKRIVVGHTSQ--RILGLFNNKVIAVDTSIKKGGELLLLENKQLT--------IKKVVALSDVHGQYDILITLLRNQNIIDNWAFGDGHMVMTGDMFDRGHQVNEVLWFMYQLDQQAIEAGGKLHLLMGNHEQMVM---------------RGD---LRYVHERYQVAAKLLYDELYDKSSEIG-QWLRHTLVKINDTLFLHGGISGEWLLTLDVANDLYRKNIDRSKFFG----N-GPTWYRGYFKPGDEAELDKILSYFD-----VKHIVVGHTSQ--RVLGLYNNKVIAVDSSIKEGGELLLMEDNTLT--------AGKIVALSDVHGQFDVMINLLKAHKIIDHWAFGDGHMVMTGDMFDRGHQVNEVLWFLYELDKEAQAAGGRLHLLMGNHEQMVF---------------RGD---LRYINERYKTSAELLYDALYNKDTEIG-RWLRNTLVKINNLLFMHGGISPEWVLNISDANQLFRQHLDDKKFFT----N-GPTWYRGYFKDASEQEIDQILSYFK-----VDHIIVGHTSQ--RVLGLYHNKIIAIDSSIKNGGELLLIDKDKLT--------VSKIVALSDVHGQFDVMINLLKAHKIIDHWAFDDGHMVMTGDMFDRGHQVNEVLWFLYALDQEAQAAGGRLHLLMGNHEQMVF---------------RGD---LRYVNERYQVSADLLYDALYNKDTEIG-QWLRHSLVKIINILFMHGGISPEWVLHISEANQLFRQHLDDKKFFT----N-GPTWYRGYFKDEKESDIDSILSYFK-----VEHIVVGHTSQ--QVLGLYHNKIIAIDSSIKNGGELLLIDAGKLS--------PEKTFVLSDPHANWSCFASLLKAGKVIDNWIFGTNQLVIIGDVFDRGVDVLPIYWLIYKLEKEAEDAGGKVTFLIGNHETMVL---------------GND---LRYTKKKYTQLADTLYPELWQK-SELG-HWLKNSIQVVGDNLFVHAGLSKEFLYDIPTVNEIVSDGLFLTKFA-----TYGPIWYRGMVRSADRDDLRKILEKYN-----VNRIFVGHTIF--DITTFYHYKVIAVNVDNQENGRGVMIEKDGSM--------ADKVFVMSDPHGRLDCVISLLQGNHIIDKWSFGKNHLMIIGDIFDRGKDVPQIFWLFYKLEEEAAKAGGHVSFILGNHEPMVL---------------AND---LRYTKEKYKILAEKLYPRLFGPDTELG-RWLGNTMQMIGNDLYVHAGLGKDFYLSIPTVNEEMSKGLFMTKYG-----NSGPIWYRGLVRTDAKDSLEMIMDRYK-----AKHIIVGHTIF--DISTFYNGKVIGVNVDNKENGRAMLIENNQYF--------NGKIIAIGDIHGDIESLKLILRHSKLIGNWIGDNVLLVQNGDVFDRGIYGPIIYNFLFKLQKEAIKKNSRVILIMGNHEQLNL---------------CGY---FNYVNPKEIEMFFHNRYHSFVNPGEYHKRLIRPPMVKVNNIIFTHGGLNLLISLSINDINLKTRLQIENNCLS-----RDGVLWSDAMSRNVKCSELFQILDKYD-----AKYLVVGHTRQSHQIGSYCNNHYFLIDTGMSLFPYPNYLKIDDHK--------NGKIIAIGDIHGDIESLKLILRHSKLIDNWIGNNVLLVQNGDVFDRGIYGPIIYNFLFKLQKEAIKKNSRVILIMGNHEQLNL---------------CGY---FNYVNPKEIEMFFYNRYHSFVNPGEYHKRLIRPPMVKVNNIIFTHGGLNLLISFSINDINLKTRLQIENKCLS-----RDGVLWSDAISRSVKCSELFQILDKYD-----AKYLVVGHTRQSHQIGSYCNNHYFLIDTGMSLFPYPNYLKIDDHKPSHQIGSYC NGKIIAIGDLHGDIDSLKLILRHSELIDNWIGDNVLLVQNGDVFDRGIYGPIIYSFLFKLQKEAVKKNSRVILIMGNHEQLNL---------------CGS---FHYVNPMETKIFFDNRYYSFVSPGAYHKRLIRPPMVKVNNIIFTHGGLNSLISFSINDINLKTRLQIENNCLS-----RDGVLWSDVISRSVRCSELFHILDKYD-----AKYLVVGHTRQSHKIGSYCNNHYFLIDTGMSLFPYPSYLKIDDNK--------EGKIIAIGDIHGDIESLKLILRHANLINEWVAENVLLVQVGDVLDRGIYGPLIYDYLFKLQKEAPLKKSKVLLIMGNHEQLNL---------------CGY---FDYVNEKEVEVFFKKRLFHFVYSGEYFKKLIRPAIAKVNDILFVHAGISTQISLSLNTIRLKTRLQIENMCVS-----REGVLWHDHISRTAACSILSQIFNNYK-----AKHLVVGHTRQTHEISSYCNGGFFLIDTGMSLFPYPNYLVVQNGT--------EYDFYSIGDLHGDKDAFIRILLNESIIDKVIRNNVLTVITGDVLDPTYDDIDIILFIKNYNESGKSLNSKIILLLGNHEVNNL---------------CLK---FK-TPQSNIENY---RNDMFRKGETIYNYLIDPFVVNVNNIIFSHAGVLPFYSYGIDFINEEGKREIINNC-------EYGPTLNRYYSYVTICSSLYKSLGLLK-----SSRMVIGHTVQNKQVNSFCQDKLLLADTGISRWGVISYIQYFNDG--------EYDFYSIGDLHGDKDAFIRILLNESIIDKVIRNNVLTVITG 36 : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 238 238 239 239 237 225 229 229 237 238 238 238 238 238 238 237 238 238 238 237 237 233 241 208 237 237 237 237 237 237 237 223 237 237 237 222 237 237 189 222 237 237 237 237 237 237 237 237 237 237 237 237 237 237 237 237 237 237 237 207 237 237 237 237 237 221 240 238 242 237 238 238 217 232 233 233 233 237 236 207 233 237 237 237 237 237 237 236 236 237 237 236 237 237 237 237 237 234 237 237 237 237 238 238 235 216 237 237 238 238 238 238 238 238 237 233 238 225 238 230 238 222 236 235 235 134 134 134 238 238 238 238 224 224 237 232 237 237 242 221 224 229 230 230 230 230 239 238 236 Figure 2.1: Alignment of SLP phosphatases from both Prokaryotes and Eukaryotes. Candidate SLP phosphatase sequences identified as detailed in Materials and Methods were aligned using MAFFT. The alignment was visualized in GeneDoc and manually edited to remove the extreme N- and C-terminal regions outside the protein phosphatase domain, plus poorly aligned regions, prior to phylogenetic tree inference. Two conserved bacterial-like phosphatase motifs (red) were identified in addition to the known canonical PPP-family protein phosphatase motifs (blue). Of the two bacterial-like PPP-phosphatase motifs, Motif 1 represents a novel region of conservation. Motif 2 was previously reported (Andreeva and Kutuzov, 2004). The gene identifiers of each sequence used here are located in Appendix A1. 37 GDxHG AlRLPHc AlRLPHa AtRLPHb CarRLPHa ThalRLPH BraRLPH AtRLPHa CarRLPHb AlRLPHb CpaRLPH CcRLPHa CcRLPHb CsRLPH LuRLPH MeRLPH RcRLPH VvRLPH PperRLPH PtRLPHa PtRLPHc PtRLPHb CsatRLPHa MdpRLPHa GmRLPHa PvulRLPH GmRLPHb GrRLPHa GrRLPHb AcRLPHa AcRLPHb MgRLPH EgRLPH BdRLPH PvRLPH SiRLPH SbRLPH ZmRLPH OsRLPH CsatRLPHb PpaRLPH SmRLPH MdpRLPHb MpRLPH MpRLPHb SrRLPH BnRLPH NgRLPH F0SP41_PLA D5SSG4_PLA E8R2G2_ISO GammataRLF B0C4K2_ACA B2J0C0_NOS Q8YP31_NOS D7E399_NOS Q31LV8_SYN Q5N017_SYN C7QL34_CYA B1WQZ5_CYA B1X333_CYA B0JSQ6_MIC E0UA25_CYA A8ZSZ1_DES Q3A7H2_PEL A7HLU4_FER H9UCV4_FER C5CDU8_KOS D0MK25_RHO D5H4I3_SAL Q2S6K7_SAL B8DK70_DES E1YER6_9DE I2Q722_9DE G7Q752_9DE E1K2K4_DES H2J819_MAR A9BIC6_PET E3I2B3_RHO G8API0_AZO D3NZT3_AZO G7ZAB6_AZO H5YEZ7_9BR E8L7W5 _ 9RH F2IW59_POL E0TGJ0_PAR Q11UR6_CYT B7IWU6_BAC B7HV78_BAC C6GXU9_STR E8UQ78_STR D3QFL2_STA B9DJA7_STA Q3M5P0_ANA Q8YZT4_NOS F4B184_KRO E4NEL1_KIT F8JTJ0_STR G8WZV1_STR F2RD99_STR E4N448_KIT : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : GDxVDRG GNHE HGG Motif 1 VVSGH Motif 2 IDEG * 20 * 40 * 60 * 80 * 100 * 120 * 140 * 160 * 180 * 200 * 220 * RTVICVGDIHGYISKLNNLWLNPQSAIDPSEFSSALVIFLGDYCDRGPETGKVIDFLISLPEKHPDQTHVFLAGNHDFAFSGFLG----EGWYRGSVKGMAYKGSIYDLESVKLFVGLPLDDYNNV--NH----------------------------LKAITAGL----KALKLLGVEWGVVEKEAAGKLAVAETHH-TLRIT------------------------------RTVLCVGDIHGYISKLNNLWLNLQSAIDPSEFSSALVIFLGDYCDRGPETGKVIDFLISLPEKHPDQTHVFLAGNHDFAFSGFLG----EGWYRGSVKGMAYKGSIYDAGSTFESYGVPHGLMKAVPESHKKFLTNMVWVHEEDDVCIDTEEGLKHCKLIAVHAGLNNVEEQLKLLRAK--KIQHLSGRKTVVVSGHH-KLHIRLIIDEGGGFPPVAAIVLPSKKIIRDTDNLSS RTVICVGDIHGYISKLNNLWLNLQSAIDPSDFSSALVIFLGDYCDRGPETRKVIDFLISLPEKHPDQTHVFLAGNHDFAFSGFLG----EGWYTGSVKGMAYKGSIYDAGSTFESYGVPHGLMKAVPESHKKFLTNMVWVHEEDDVCIETEEGLKHCKLIAVHAGLNNVEEQLKLLRAK--KIQHLSGRKTVVVSGHH-KLHIRLIIDEGGGFPPVAAIVLPSKKIIRDTDNLSS RTVICVGDIHGYISKLSNLWLNLQSAVDPSDFNTALVIFLGDYCDRGPETRKVIDFLISLPEKHPDQTHVFLAGNHDFAFSGFLG----EGWYRGSVKGMPYKGSIYDAGSTFESYGVPHGLMKAVPESHKNFLTNMVWVHEEDDVCIETEDGLKHCKLIAVHAGLDNVEEQLKLLRAK--KVQHLSGRKTVVVSGHH-KLHIRLIIDEGGGYPPVAAIVLPSKKIIRDTDNLSS RTVICVGDIHGYISKLNNLWSNLQSSLNPSDFSSALVIFLGDYCDRGPETRKVIDFLISLPEKHPDQTHVFLAGNHDFAFAGFLG----EGWYRGSVKGMAYKGSIYDAGSTFESYGVPHGLMKAVPESHKKFLTNMVWVHEEEDVCIETEEGLKHCKLIAVHAGLNGIDEQLKLLRVK--KIQYLSGRKTIVISGHH-KLHIRLIIDEGGGYPPIAAIVLPSQKIIRDTDNVSS RTVICVGDIHGYITKLTNLWLNLQSALDPSQFTSALVIFLGDYCDRGPETRKVIDFLISLPEKHPEQTHVFLAGNHDFAFAAFLG----EGWYKGSVKGMAYKGSIYDAGTTFESYGVPHGLMKAVPESHKKFLTNMVWVHEEDDVCVETEEGVKRCKLIAVHAGLDSVEEQLKLLRAK--KVPYLSGRRTVLVSGHH-KLHMRLIIDEGGGYPPVAAIVLPSKMIIRDTDNVSS RTVICVGDIHGNISKLNKLWLNLQSDIQNSDFSSALVIFLGDYCDRGPETRKVIDFLISLPEKHPDQTHVFLAGNHDFAFAGFLG----EGWYKGSAYGVLYNGSIYDAASTFESYGVPHGLIKAVPESHKKFLTNMVWVHKEDDVCIETEEGLTHCKLIAVHAGLNNVEEQLKLLRDK--RIQPLTGRKTIVVSGHH-KLHIRLIIDEGGGYTPLAAIVLPSKKIIRDTDNFSN RTVICVGDIHGNISMLSKLWLNLQSALK-SDFSSALVIFLGDYCDRGPETRKVIDFLISLPEKHPNQTHVFLAGNHDFAFAGFLG----EGWYKGSAYGVVYNGSIYDAASTFESYGVPHGLMYAVPESHKNFLTNMVWVHEEDDVCIETEEGLKHCKLIAVHAGLNNVEEQLELLRAK--RIQPLVGRKTIVVSGHH-KLYIRLIIDEGGGYAPLAAIALPSKKIIRDTDDISN RTVICVGDIHGNISLLNKLWLNLQSDIEQSDFNSALVIFLGDYCDRGPETKKVIDFLISLPEKHPEQTPCLSCRKPRLCFCWVLG----EGWYKGSAYGVLYNGSIYDAASTFESYGVPHGLINAVSESHKKFLTNMVWVHEEEDVCIETEKGLKHCKLIAVHAGLNNVEEQLKLLRAK--RIQHLTGRKTLIVSGHH-KLHIRLIIDEGGGYAPLAAIVLPSKKIIRDTDNFSN RTVCCIGDIHGYFTKLKNLWSNLESIIDPPDFQSALVIFLGDYCDRGPDTRRVIDFLLSLPARYPNQTHVYLSGNHDLAFAAFVG----EGWYKGTVKGTDYKGSIYDAAPTFESYNVPHGLVKAVPDDHKKFLADMVWVHEEDDVCIETDEGIKHCRLIAVHGGLVKVEEQLEFLRAR--KVEALSGRKTIVVSGHH-ELHIRLIIDEGGGLEPVAAIVLPSMKVIRDTDNIAK RRVCCIGDVHGYISKLQNLWKNLETHIGPSDFNSAIIIFLGDYCDRGPNTREVIDFLISLPTKYPNQKHVFLSGNHDLGFAAFVG----EGWFKGAAKGTEYKGSIYDAAPTFESYGVAHGLVKAVPDHHKKFLADMLWVHEEDEVCVETNDGIKHCKLIAVHAGLKKVEEQLELLKAK--KVEALSGRKTIVVSGHH-KLHIRLIIDENGGLEPVAAIVLPSMKIVRDINNLVRRVCCIGDVHGYISKLQNLWKNLETHIGPSDFNSAIIIFLGDYCDRGPNTREVIDFLISLPTKYPNQKHVFLSGNHDLGFAAFVG----EGWFKGAAKGTEYKGSIYDAAPTFESYGVAHGLVKAVPDHHKKFLADMLWVHEEDEVCVETNDGIKHCKLIAVHAGLKKVGEQLELLKAK--KVEALSGRK--------------VII---------------------------RRVCCIGDVHGYISKLQNLWKNLETHIGPSDFNSAIIIFLGDYCDRGPNTREVIDFLISLPTKYPNQKHVFLSGNHDLGFAAFVG----EGWFKGAAKGTEYKGSIYDAAPTFESYGVAHGLVKAVPDHHKKFLADMLWVHEEDEVCVETNDGIKHCKLIAVHAGLKKVGEQLELLKAK--KVEALSGRK--------------VII---------------------------RTVICVGDIHGYITKLQNLWTNLEALID--DFPTALVIFLGDYCDRGPDTKKVIDFLITLPSRYPNQTHVFLAGNHDLAFSAFLG----EGWYEGVAKGTEYQGSIYDAAPTFESYGVPHGLMKAVPESHREFLANMVWVHEEEAVCVEKEGEIKHCKLIAVHAGLKDVQEQLKLLKAK--KIEALSGRKTVVVSGHH-KLHIRLIIDEGGGLAPVAALVLPSMKLVRDTDSLLS RVVICIGDIHGYLAKLQNLWSNLQSVINPEEFSSALVIFLGDYCDRGPDTKKVIDFLLNLPSAYPNQKHVFLSGNHDLAFAAFLG----EGWYKGATKGTEYKGSIYDAGPTFESYGVPHGLMKVVPDEHKRFLADLVWVHEEENVCIESEEGIKHCKLIAVHAGLKNVEEQLKFLKAK--KVEALSGRKTIVVSGHH-KLHIRLIIDEGGGLEPVAAIVLPSMKLVRDTDDLTK RVVICIGDIHGYFSKLQKLWSNLEAIVNPQDFNSALVIFLGDYCDRGPDTKKVLDFLIKLPSFYPNQKHVFLSGNHDLAFAAFLG----EGWYKGVAKGTEYKGSIYDAGPTFESYGVPHGLIKVVPDEHKKFLVDMVWIHEEEDVCIENEEGMKHCKLIAVHAGLKNVEEQLAFLKVK--KIEALSGRKTIVVSGHH-KLHIRLIIDEGGGLEPVAAILLPSMKLVRDTDNFNK -----------------------------------------------------------------------------------------------VTKGTEYQGSIYDARLTFESYGVPHGLVKAVPDEHKKFLADMVWVHEEDDVCIQTEEGIKHCKLIAVHAGLEAVDEQLKTLKAK--KVGPLSGRATIVVSGHH-KLHIRLIIDEGGGLEPVAAVVLPSMQVVRDTDKLAK RVVCCIGDTHGFYTKLQNLWSNLQTTIDPSDFNTALIIFLGDYCDRGPDTKKVLDFLISLPSKYPNQRHVFLSGNHDLAFAAFVG----EGWFKGAAKNTQYKGSIYDAEPTFASYGVPHGLVKAVPDEHKKFLADMVWVHEEDDVCVETEDGIKHCKLIAVHAGLKDVKQQLEFLRAR--RIEALSDRQTVVVSGHH-KLHIRLIIDEGGGKEPVAAVVLPTMKIVRNTDVLAN RLVICIGDIHGHITKLQNLWSNLETQFDPQHFNAATIIFLGDYCDRGPDTKKVLDFLIDLPSRYPNQKHVFLSGNHDFAFAAFVG----EGWYKGTIKGTECKGSIYDAGPTFASYGVPHGLLKAVPDDHKKFLADTVWVHEEDDVCIEDEEGIRHCKLIAAHAGLKNVGEQLRFLKAK--KIEALSGRKTIVVSGHH-KLHIRLIIDESGGFEPLAAIALPSMKLVRDTDNLTK RLVICIGDIHGYITKLQNLWSNLETQFDPQHFNAATIIFLGDYCDRGPDTKKVLDFLIDLPSRYPNQKHVFLSGNHDFAFAAFVG----EGWYKGTSKGIEYKGSIYDAGPTFTSYGVPHGLLKAVPDDHKKFLADTVWVHEEDDVCIEDEEGIRHCKLIAAHAGLKNVGEQLRFLKAK-----------IIVVSGHH-KLHIRLIIDEGGGFEPVAAIALPSMKLVRDTDNLTK RLVICIGDIHGHITKLQNLWSNLETQFDPQHFNAATIIFLGDYCDRGPDTKKVLDFLIDLPSRYPNQKHVFLSGNHDFAFAAFVG----EGWYKGTIKGTECKGSIYDAGPTFASYGVPHGLLKAVPDDHKKFLADTVWVHEEDDVCIEDEEGIRHCKLIAAHAGLKNVGEQLRFLKAK--KIEALSGRK-------------LLIFLSETFYPPFSPAFFSGKGLGC------RVVCCIGDVHGYITKLQNLWSNLESSIHPSDFNSALIIFLGDYCDRGSNTREVIDFLVNLPSKYPNQKHVFLAGNHDFAFAAFLG----EGWFRGLAKGTDYQGSIYDAGPTFESYGVPHGLVKAVPDEHKKFLSNMAWVHEEDDVCLDTEDGIKHYKLIAVHAGLKDVQEQLNSLKAK--KIECLSGRRTMVVSGHH-KLHIRLIIDEGGGLQPVAAVVLPPMKIVRDTDNMKQ ---------------ISNPPSTHQISPPPSSFSSAITATA------AXDTKKVIDFLVSLPSRYPNQKHVFLTGNHDLAFAAFVG----EGWFKGXAKGTEYKGSIYDAGPTFESYGVPHGLVKAVPDEHKKFLADMVWVHEEDDVCIETADGIKHXRLIAVHAGLKDVKQQLAFLRAR--KIQALSGRKTVVVSGHH-KLNIRLIIDEGGGMAPVAAVVLPSVKIVRDTDALTN RAVICVGDIHGFITKLQSLWKNLEGSLDRSEFETATLIFLGDYCDRGPATRQVIDFLISLPSRYPRQKHVFLCGNHDLAFAAFLR----EGWFKGTVKGTEYQGSIYDAGPTFESYGVPHGLVKAVPDDHKKFLADLVWVHEEDDVFVNTDDGVKCCKLIAVHAGLKDVKEQLKLLKAR--KVEALSGRKAIVVSGHH-KLHVRLIIDEGGGYKPVAAIILPSMKIIRDTDELAK RVVICVGDIHGFVTKLKNLWSNLERSIDRSEFETATVIFLGDYCDRGPDTRHVIDFLVALPSRYPRQKHVFLSGNHDLAFAAFLH----EGWFNGTVKKTEYQGSIYDAGPTFESYGVPHGLVKAVPDEHKKFLADLVWVHEEDDVFINTEDGGKCCKLIAVHAGL-DVNEQLKLLKAR--KVEALSGRRTIIVSGHH-KLHTRLIIDEGGGFKPMAAIVLPSQKVIRDTDVLAK RLVICMGDIHGFITKLQILWSNLEASLDRSEFETATLIFLGDYCDRGPGTRHLIEFLVSLPSRY------LLGG---------------------TVKGTEYQGSIYDAGPTFESYGVLHALVKAVPDDHKKFLADLVWVHEEDDVFVNTDDGVKCCKLIAVHAGLKDLKEQLKLLKAR--KVEALSGRKQKHLIRHH-KLHVRLIIDEGGGYKPVAAIILPSMKIIRDTDVLAK RTVICVGDIHGYVTKLLNLWSNLQSQIDPDSFNTATVIFLGDYCDRGPDTRKVIDFLITLPKRYSNQKHVFLSGNHDFAFGGFVG----EGWYKGAAKGIDYKGSIYDAAPTFESYGVAHGLMKAVPEAHKKFLADMVWVHEEDDVCIETQEGVKHCKLIAVHAGLKNVREQLEFLKAK--KVTDLSGRKTILVSGHH-KLHIRLIIDEGGGLEPVAAIVLPSMKIVRDTDNFSRTVICVGDIHGYVTKLLNLWSNLQSQIDPDSFNTATVIFLGDYCDRGPDTRKVIDFLITLPKRYSNQKHVFLSGNHDFAFGGFVG EGWYKGAAKGIDYKGSIYDAAPTFESYGVAHGLMKAVPEAHKKFLADMVWVHEEDDVCIETQEGVKHCKLIAVHAGLKNVREQLEFLKAK KVTDLSGRKTILVSGHH KLHIRLIIDEGGGLEPVAAIVLPSMKIVRDTDNFS RTVICLGDIHGYLTKLLNLWSNLQSQIDPDSFNTATIIFLGDYCDRGPDTRKVIDFLISLPKRYPNQKHVFLSGNHEFAFARFIG----EGWYKGVAQGIDCKGSIFDAAPTFGSYGVSHGLMKVVPEDHKKFLADVVWVHEEDDVCIETQEGVKHCKLIAVHAGLKNVREQLEFLKAK--QVTGLSGRKTIVVSGHH-KLHIRLIIDEGGGLEPLAAIVLPSMKIVRDTDNLSRVVCCIGDLHGYISKLQNLWSNLESEIDPVSFQSALIIFLGDYCDRGPDTHKVLDFLISLPSKYPNQNHVFICGNHDFAFSAFLR----EGWYKDHAKGTEYKGSIYDAGTTFESYGVLHGLVKAVPDEHKKFLANLVWVHEEDNVSIQTSEGKRNYKLIAVHAGLKGVEEQLKLLKGK--KVAALSGRATVVVSGHH-KLHIRLVIDQGGGLEPIAAIVLPSLKIIRDTDAKVA RVVCCIGDLHGYISKLQNLWSNLESEIDPVSFQSALIIFLGDYCDRGPDTHKVLDFLISLPSKYPNQNHVFICGNHDFAFSAFLR----EGWYKDHAKGTEYKGSIYDAGTTFESYGVLHGLVKAVPDEHKKFLANLVWVHEEDNVSIQTSEGKRNYKLIAVHAGLKGVEEQLKLLKGK--KVAALSGRATVVVSGHH-KLHIRLVIDQGGGLEPIAAIVLPSLKIIRDTDAKVA RLVICVGDIHGYITKLRNLWTNLETAVGPSDFRSALVVFLGDYCDRGPRTDEVIDFLISLPSKYPDQSHVFLCGNHDLAFAAFLG----EGWYKGHAKNTEYKGSIYDAGPTFEAYGVPHGLMKAVPEEHKKFLRDLVWVHEEDDVSIEMKEGLQKCKMIAVHAGLKGVEEQLNYLKNK--KVEPLSGRKTIVVSGHH-KVHVRLIIDRGGGLEPVAAIVLPSKKMVLDTDEFAT RTVVCVGDVHGHFTKLRALWANLESSIPPHAFRAALVIFLGDYCDRGPDTRRVIDFFLSLPSAYPGQTHVFLAGNHDFAFAAFVG----EGWYRGEAKGTEYKGSIFDAAPTFESYGVAHGLIKAVPEEHKKFLANMVLVHEEDDVCITTPEGTKHCRLIAVHAGLKDVGEQLRSLKAK--KIEAIAGRKTIVVSGHH-KLHIRLIIDEGGGYPPVAAVVLPSLKIIRDTDILPT RTVICVGDVHGFISKLESLWANLQSALPADAFATALVIFLGDYCDRGPHTRRVLDFLLALPARHPAQRHVFLCGNHDLAFAAFVG----EGWFHGPKKGLPYKGSIYDAQPTFESYGVAHGLTKAVPEEHKRFLHDLVWIHEGENVPIDTDEGQIVCKLIAVHAGLIDLNEQLRVLRTR--KVQMLSGRQTVIVSGHH-KLHIRFIIDEGGGYEPIAAVVFPSKQLIRSTEGTAP RTVICVGDVHGHITKLESLWSNLQSALPAGAFATALVIFLGDYNDRGPHTRGVLDFLLALPRRHPAQRLVFLCGNHDLAFAAFVG----EGWFRGPKKGLPYRGSIYDAQPTFESYGVAHGLAMAMPEEHKRFLRDLVWIHEEENVPIDTDEGQIICNLIAVHAGLIDLNEQLRVLRTR--KVPMLSGRQTIVVSGHH-QLHIRFIIDEGGGYEPIAAIVFPSKALIRSTEGTTS RTVICVGDVHGYIAKLESLWSNLQSALPADAFATALVIFLGDYNDRGPHTREVLDFLLALPGRHPAQRHVFLCGNHDLAFAAFVG----EGWFRGPKKGLPYRGSIYDAQPTFESYGVAHGLAKAVPEDHKMFLHDLVWIHEEENVPIDTDEGQIICNLIAVHAGLIDLNEQLRVLRTR--KVPMLSGRQTIVVSGHH-QLHIRFIIDEGGGYEPIAAIVFPSKTLIRSTEGAAS RTVICVGDVHGYVTKLESLWSNLEAALPADAFATALVIFLGDYNDRGPDTRRVLDFLLELPTRHPGQRHVFLCGNHDLAFAAFVG----EGWFRGPKKGLPYKGSIYDAQPTFESYGVAHGLAKAVPEEHKRFLHDLVWIHEEENVPIDTDEGQIICNLIAVHAGLIDLNEQLRVLRTR--KVQMLSGRQTIIVSGHH-KLHIRFIIDEGGGYAPIAAIVFPSKTLIRSSEAH-RTVICVGDVHGYITKLESLWSNLQAALPADAFATALVIFLGDYNDRGPHTRRVLDFLLALPTRYPAQRHVFLCGNHDLAFAAFVG----EGWFRGPKKGLPYKGSIYDAQPTFESYGVAHGLAKAVPEEHKRFLHDLVWIHEEENVPVDMDGGQIICNLIAVHAGLIDLNEQLRVLRTR--KVQMLSGRQTIIVSGHH-KLHIRFIIDEGGGYEPIAAIVFPSKTLIRSTEEAGT RVVICVGDVHGYISKLESLWANLQSALPPDAFATALVVFLGDYCDRGPSTREVIDFLLALPSRHPAQRHAFLCGNHDLAFAAFVG----EGWYRGPKRGSSYMGSIYDARPTFESYGVAHGLVKAVPEEHKKFLRELVWIHEEENVPIDTNEGQIICKLIAVHAGLIDLNEQFRILRTK--KVAMLSGRQTIVVSGHH-KLHIRFVIDEGGGYAPIAAIVFPSKELIRSTEGTSS GVLCCIGDIHGYFTKLQNLWRNLESAIGASDFASATVIFLGDYCDRGPNSREVIQFLVSLPFRYPDQKHVFLAGNHEFGLAGFLG----EGWYKG-AYGIEFMGSVYDAAPTFESYGVPHGLMNAVPDEHKKFLSNLVWVHEEDDVCLETKDGIKTYRLIAVHAGLKDIEEQLKFLKAK--KIMGLSGRKTILVSGHH-RLHMRFIIDEGGSAPPLAAIILPSIKIVRDTDLL-RMTICVGDIHGHLDRLKALWRNLELKLGSEVFASSTVIFLGDYNDRGPDTKGVLEFLVGLPERYPKQRHVFLCGNHDFAFAAFLG----EGWWSGTTRNEPYKGSIYDARPTFESYGVSHGLIAAVPDNHKEFLRNLAWVHEQEGEETEDPE-TSYTKLIAVHGGLNSLDNQMQMLRTK--RIEPLAGRKVLLVSGHH-KLHFRLIIDESGGFDPIAAMILPAREIVRDTDNFSE RPTVCIGDVHGHLIRLRALWRNLEAGLGA-LFASATVIFLGDYCDRGPDTAGVLDFLVALPASYPSQKHVFLCGNHDFGMSAFLG----EGWWSGVKKNMEYRGSIFDSEPTFSSYGVEHGLLGAVPEDHKAFLRKLVWIYEQ-------DGSESCGKLVAVHAGLRPLEEQLRVLREK--RHENMLGRSILSLSGHH-SLHMRLIIDESGGQDPLAAVVLPGRQIVRDTDSIGH SSRIS--------GPISNPPSTHQISPPPSSFSSGITAFPPDIRIRSTSSSPITTTWPSPPSWASCSR---------------------KGGLREATKGTEYKRSIYNDGPTFESYGVPHGLVKAXPDKHKXFLSDMVWVHEEDNVCIETADRIKHCRLIXVHAGLKDVKQQLAFLKAK--KIEAIFG--VVVLIVTD-ETNYKLTIDEGGGMAPVAAVVLPSMKIVLDTDALTN RATVLVGDVHGHRAKLVSLFENLEREIGAERLSTSRVVFLGDLVDRGPETSGVLDFIASLPAVHPSMEVRILGGNHDLGMATFCS----PLWVDA------GTRNAFDSEATFASFGVDPGLLSSLPESHAVLLSSLEFVVELDDV--EDDMHPEVNKLIAVHAGLRPIPEQLEALRER--WIEALQGRTVLVASGHH-ILKTWLVVDECAGLDQLAAVVLPERAVVKSH--------------------------LKLELGTERFETSTVIFLGDLCDRGPETSGVFNFLSSLKDKHPRQDVRHLAGNHDFAFATFLG----PLWEDS------DGINAFDSHTTFTSYSAAPGLLKKVPVEHKKILESLEFVVEVEDA--EEHG----KKLVAVHAGLCPFDVQMTALRQR--WVEALQGRAVIIASGHH-ILEMRIIIDGCGGHRRLAAVILPERKVVRSSS---DMCLCIGDLHGRYLRLQQLWTNASVYFGP-WMAGVDVVFLGDLCDRGSDTRQVVDFLLWLQRTRPNTTHI-IAGNHDFGLSSFLR-------------------SVFEAAATFESYGCKFPLLAAMPAAHLSFFRDLPWLVDL---------RFSFGRVICLHAGLKDMDPQMQLARER--FLEVIAGRGVIEVSGHF-FLDVRCIVDEGGGRDALAALVLPSRHILRHNEHVQT KRVVCIGDLHGNIDKITNLWANLQDQLGKEGLKKATVIFLGDYCDRGPDTKGVISWLIDLKKKRDASGTYFLCGNHDFSMAAYLG KRVVCIGDLHGNIDKITNLWANLQDQLGKEGLKKATVIFLGDYCDRGPDTKGVISWLIDLKKKRDASGTYFLCGNHDFSMAAYLG----PGYTSG--------CSIYNADSTFSSYGVKWGFIAAVPEAHKLFLKSLDWVCDI-------EIGFGPGRLIACHAGLGSLKDQLEGLKKR--RYFSFSARDAYLVSGHH-YREVRIICDRSGGYALLEAVILPSQPLNFVNDVREN PGYTSG CSIYNADSTFSSYGVKWGFIAAVPEAHKLFLKSLDWVCDI EIGFGPGRLIACHAGLGSLKDQLEGLKKR RYFSFSARDAYLVSGHH YREVRIICDRSGGYALLEAVILPSQPLNFVNDVREN KYIVAIGDLHGHLSALKLLMLKLEEALGNDLHQKYHLVFLGDYVDRGPKIKETIEYLIQLEKSRKPGTTHFLMGNHDFAMAMFLGLFDDELWKGN---------FTYSSFNTFYSYCTKDGLLRRMPESHKQFFRNMPWMLYN-------------EKNIFVHGGFSSLKEQMLPLMRRVPRIKPISERNRRIVSGHMQRVKVRALVDVSGGLTELACVVLPSCKVITTQPPVAA GPIAVVGDLHGQVDKLGVVLDAIQDA---PQFDDRWVVFVGDFVDRGPDTKACLDMVLEFMDYHPK--TTAVAGNHDFAMCSSLG----GRWL-----------DHYDSQATFRSYGVEFGLAAAMPLQHKKFLASLPWVVEH-------------PYTIIVHAGMTPTDMQLKILRQRLNRPQWLCEKKKTVVSGHVWRVTYRILCDTTGGLENLSAVLLPELQVVDSGNSSFP GPVAVIGDVHGQLDLLNFILSQLQQL---PNYDELWIVLIGDLVDRGPDPAGVIQRIVELREAHRR--TTVVFGNHELAMLGALG----PRWV-----------QFYDSEPTFRSYGVEPGLARAIPPEHREFLTTMPWVVEH-------------PENIFVHAGLQPTEMQLRILHAKLVRPPWLCDKARRIISGHVPEVVIRILTDTTGGLRELSCVILPECRILQSRGGVDR YPILAIGDLHGQSEEFDRLLDRVDRL---AEWPDCAVVFLGDFVDRGPDSRKVIDRVLEILERPPG--GAAVMGNHDLALVHAARLRSPHYWLER-------YLSVYDAATTVRSYVGRTWLRAAMPETHRRFLAELPWVVEA-------------SGHLFLHCGLADALAQVEAMKRRWDRPDYWLGADKIQVTGHCQRPDARIRIDTSGGYGPLTACLLPTDPPHFISSWDDL YPVVAIGDLHGRVEWLDKLVAKLRRR---PEWPHAKLVFLGDLVDRHFEVRALVSRVLELIAEKPG--STCVAGNHDLALVRAAG---LVEWAQR-------YGRNYDHQWTFRSYLGRTPLKAAMPAEHRAFLAGLPWVAEA-------------EGHVFLHNGLCPAAVQLECLHRKWDRPEYWLGADKVQVSGHVHAPDARIRIDTSGGVTPLTACVLPNAEPVFVFSNE-LRRFVIGDIHGHYQGLLDLVSLLEIG------ESDQIYFLGDLIDRGPDSSKVVDFVREQ-----S--YTCLLGNHEQLMVAALANLSPQMWL-Q---------A--GGRETLQSYSSK-----QHLWEHLSWIKTLPNHLDL-------------GDYWLVHAGILPLERQTAQEFC-WIRREFHNMP-RMIITGHTMFSGVWLGIDTGAYHPWLTALELSSSTVYQINVHSNV PRRIVIGDVHGHYEGLMTLLEAIAPT------SDDQIYFLGDLIDRGPHSSQVVNFVKRH-----N--YPCLLGNHEQMLLNILTNERTQAWL-Y---------S--GGQATVASYHEA-----SIPDDHLDWFKSLPTYLDL-------------GDIWLTHAGVKLVTEQTADQLC-WIREEFHSIE-KLIIIGHTILPGVWLDIDTGAYHPWLTGLDVTNNLVYQVNIFKNY QRRIVIGDVHGHYEGLVTLLAAIAPG------GDDQVYFLGDLIDRGPHSSYVIDLVRDN-----N--YFCLLGNHEQMLLNILSGDISQAWL-H---------S--GGHATIASYPDQ-----SIPKEHLEWLRSLPTYIDL-------------GDVWLTHAGLLPLSEQTSEQFC-WIREDFHSIP-KLVIIGHTIFPGVWLDIDTGAYHPWLTALDITNNLIYQINIFTNC PRRIIIGDVHGYYQGLMNLFDKIAPS------SADQVYFLGDLIDRGPQSAQVVDFVKQN-----D--YCCLLGNHEQMLLNVFGNRSHQAWL-Y---------S--GGQATITSYQAA-----QIPQEHIDWFLTLPTFIDL-------------GDYWLTHAGVKSVSEQTAEELC-WIREEFHSME-KLIIVGHTILPGVWLDIDTGAYHPWLTGLDITNNLVYQVNVFNNS GRRLIIGDVHGHYRSLYRLLELADPD------TDDQVYCLGDLIDRGPDSAKVVALVRSQ-----G--YTSLMGNHEHLLSQVLAFPDL-HWF-Q---------A--GGQATLASFGSW-----EALEELRPWLQQLPLYLDL-------------GEIWLVHAGVLPLSRQDANEFC-WIRDQFLARS-KTILVGHTIMAGIWLGLDTGVYHPWLTALDWDNQRVFQVQAQSNE GRRLIIGDVHGHYRSLYRLLELADPD------TDDQVYCLGDLIDRGPDSAKVVALVRSQ-----G--YTSLMGNHEHLLSQVLAFPDL-HWF-Q---------A--GGQATLASFGSW-----EALEELRPWLQQLPLYLDL-------------GEIWLVHAGVLPLSRQDANEFC-WIRDQFLARS-KTILVGHTIMAGIWLGLDTGVYHPWLTALDWDNQRVFQVQAQSNE MRTLAISDIHGCSIALETLLETVHLT------PKDTLITLGDYIDKGPDSKGVLDKLINLYKNY-R--LIPLKGNHEIKMLQARQSDPDCFWL-E---------F--GGRETLNSYS----TLANVPEEHWNFIQNLCVYWQT-------------DNHIFVHANLLSLKRQSEYDLF-WKKFNYQAP--KTMICGHTSKPINRICLDTWVCGGWLTCLELESGKIWQANQRGQI MRTLAVGDIHGCATAFDKLLEAIHLR------PQDRLITLGDYVDKGPNSKEVLERLLYLYENH-R--LVPLKGNHELMMLDAFQGKKKDFWV-I---------A--GGKTTLQSYP----PLLNVPDSHRDFIKQHCLWYET-------------DNHIFVHANLLPLHKQSNHDLF-WQKLYPRHG--KIVVCGHTSSPVNQICIDTWACGQWLTCLDVDSGKIWQTNQQGEV MRTLAVGDIHGCTTAFDQLLEAIDLR------PEDKLITLGDYVDKGPNSKGVLERLLFLYENH-Q--LIPLKGNHELMMLDAFQGKRKNFWL-L---------T--EGKTTLESYP----PLLNVPDSHWDFLKQCCLWYET-------------DNHIFVHANLLPLHKQSNHNLF-WQKLYPHHG--KIVVCGHTSCPVNQICIDTWACGQWLSCLDVDTGKIWQTNQQGEV MRTLAIGDIHGCSKALDHLLEIVNPK------PQDTLITLGDYVNKGRDSKGVIDRLISLHKQG-N--LIPLKGNHEIIMLQARNNPLKRLWL-E---------K--GGKATLKSYS----HLIKIPESHWDFLGKVCISYET-------------ANHLFVHASLLPLARQPEYKLF-WEKFQYPAA--KTMICGHTSKPINAICIDTWAYGQWLSCLDVETGKLWQVNQRGNF MRTLAIGDIHGCLEAFNLLLDVVALR------PNDTIITLGDYINKGPQSKGVIDQLIALHERG-Q--LVALKGNHELMILEARKSLEKFEWL-K---------K--YGTSTLLSYS----TLEDIPPHHWDFLQNICLWWET-------------KEHFFVHANILPLDQQLEDKLF-WAKFVDPQP--KVMICGHTSLPLNAICLDTWVCGEWLTCLDIDSGQIWQTNQQGKV GKIFAVGDIHGCYKKLRVLMDRIPINY-----KEDTLVFLGDYIDRGDESFEVVAYLAELRKKHPG--IVFLKGNHEELFFNYLSGEDEVSFL-F---------N--GGEQTLKGYMSPD-GDISVPKEHLNFFNALQLYYET-------------DDYIFVHAGLVPLEQQKPEDLL-WIRKPFIESS-KIVIFGHTPEVVVKIGVDTGAVYGKLTCIELPARKFYEA-----GRLLAIGDIHGCHRALRSLLARLSPN------RGDRMVFLGDYIDRGPDPRGVIDTLLAVRRRVMR--CTFLMGNHERMLLDVLEGRNLPLYL-A---------N--GGLVTLLAYMTE--GQLRLPASHRRFFNGLQRFHAT-------------GSHIFVHAGLCPLSQQTEEDLL-WIRDAFLASD-KTVVFGHTPKPLLRIGLDTGAVYGVLTACDLHTRRLWQAREPNRD -MLWAIGDIHGCLNALETLINEISPT------PNDKLVFLGDYIDRGPDSKGVVDFLIELSKRT-D--CIFLRGNHEQMLLDVIDNGDDYLWV-I---------N--GATATWRSYGNL--QNLLYNDEHLEFFRNTQYYFIE-------------DKYLFVHGGVIPIEKQDKRDLI-WIREEFISKK-YIVIFGHTPDVFIKIGIDTGCVYGKLTAYNLTLSKKVEVECLNGMGLWAIGDIHGCLRALERVIERISPS------EHDKLIFLGDYIDRGPDAKGVVDFLLLLSKMT-Q--CVFLRGNHEQMLLDVIDNNDDFLWN-L---------N--GAQATIRSYGNL--IQLETNEEHMSFYRNTKYYHIE-------------GKYLFVHGGVVPIEKQEPRDLI-WIREEFILKR-FVVVFGHTPEVYFKIGIDTGCVYGKLTAIEVNQKVVIQEDCRNVR -MIYAIGDIHGCLESLKKLIEKINPS------RTDMLIFVGDYIDRGPNSKGVVDFLIKLSRSR-K--CIFIRGNHEQMLLDYLNGGKTKLWH-F---------N--GMEYTINSYG----GIDKIPEKHIRFFEKTILFHET-------------DEYVFVHAGVIPLNKQKDGDLL-WIRDEFIYAE-KVVIFGHTPEPLIKIGIDTGCVYGKLTALRLDDMCFFSVKSIF-MALIAIGDIHGCARTLDALLDRLAPT------RDDQLVFIGDYIDRGPDARGVIERLLRLREEI-S--CVFLRGNHEALMLNYLDRGEADLWF-I---------N--GGLTTLNSYRND---GIRIPEEHEAFIRETLLYYDT-------------PDFFFVHGGLLTIAEAHTDLFL-WEREHLNAPR-KPVVCGHTPEPINLIAIDTGCVYPRLTAVRLPERTFISIPRQDLI MGLIAVGDIHGCLESLNALLDRLNPS------SDDHLLFVGDYIDRGPDSRGVIDRLLDLRESI-S--CTFLRGNHEAMMIDYLDSGAFSLWR-M---------N--GGVSTLQSYLKGEGSEVHIPAAHAEFVRDTKLYHET-------------DDFLFVHAGLLTVEEPDEEVLL-WERGHLEASG-KTVVCGHTPDPISLILIDTGCVYHRLTAVHLPQREFIDVPYADGMGLIAVGDIHGCLESLNALLDRLNPS------SDDHLLFVGDYIDRGPDSRGVIDRLLDLRESI-S--CTFLRGNHEAMMIDYLDSGAFSLWR-M---------N--GGVSTLQSYLKGEGSEVHIPAAHAEFVRDTKLYHET-------------DDFLFVHAGLLTVEEPDEEVLL-WERGHLEASG-KTVVCGHTPDPISLILIDTGCVYHRLTAVHLPQREFIDVPYADGTRTFAIGDIHGHRDRLEALLRRLPLDR-----ERDTVVFLGDYVNRGPDSRGVIDLLLGLQRTCPG--AVFLRGNHEQILLDYADTGDALLPL-L---------RFLGVEATLRSYGVD--GLACLPPEHAAFLRGLAFAHET-------------ATHVFVHAGLPRSETQRLDSRR-LVRRDMAPHD-RTVVFGHSSTPLVRIGIDTGAGGGMLTALELPALRFHHA-----GKIFAIGDIHGCYDKLISLMDRLPFDL-----HSDTLIFLGDYINRGSQSKEVIEYLLKLKQSSEN--IIFLKGNHEHALLEYAQTKDQYLRL-L---------RSMDVEPTLKSYSDA--DLGFLPPAHLDFLNGLLTYYKT-------------DDYLFIHAGVEDPDGCSLDRLL-SVRDTFLRYE-YTVVFGHTPMPLVMIGIDTGAVYGMLTAVLLPDLRFFHA-----GKLIAIGDIHGQSDALRRLLDDLPYRP-----GRDRLIFLGDYINRGPDTRGVLEVLSALRRDDPG--AVFCLGNHEEALLRYAAGGDPDLRL-L---------RTLGIETTLESYGDL--GLSFLPEEHREFLLGLESFRRI-------------GPYVFVHAGLLPPEACPPDCLL-SVRGAFLTGP-LTVVFGHTTTPLVKIGLDTGAAWGALTALVLPDMVFLHAPGDRFL GKLIAIGDVHGQSDALRRLLDDLPYTP-----GRDRLIFLGDYINRGPDTRGVLEILATIRRDDPG--AVFCLGNHEEALLRYAAGGDPDLRL-L---------RTLGIETTLESYGDL--GLAFLPEAHREFLLGLESFRRI-------------GPYVFVHAGLLPPEACPPDCLL-SVRGAFLTGP-LTVVFGHTTTPLVKIGLDTGAAWGALTALVLPDMVFLHAPGERFL ARLIAVGDIHGQADALRRLLDDLPYRP-----GTDRLIFLGDYINRGPDTRGVLDLLTDIARQDPG--AVFCLGNHEETLLRYADEGEPDLRL-L---------RGLGIETTLRCYGDP--GLSFLPPEHQAFLRALVPYARI-------------GDMLFVHAGLLPPEECPTDCML-SVRGAFLSGP-LTVVFGHTTTPLVKIGLDTGAAWGALTACILPDMELRHVFI----MIWAISDIHGMYDNLIKVLNQIP--------ENDKIIFLGDYVDRGPGSRQVLDLLFKLKNR-----AVFLKGNHEDMMLDYFENTGKGAWF-R---------N--GAQATLDSFNND------IEEKYLDFIRNLKLYHGT-------------QKYLFVHGGIISLSDQAERDLL-WIRDDFYMSD-YIVIHGHTPYITGSIDIDTGCVYGKLSALGISNDNKYDIISI---MIWAISDIHGMYDTLISLLKRIQIK------DSDTMIFLGDYVDRGPDSKKVLDLLITLSKQRN---RIFLKGNHDDMMVDYCQKTHEGVWF-Y---------N--GALSTIKSFNNN------IGEEYINFLKDLPLYYEN-------------EKYLFVHAGVKSLSKQDKWDLL-WIRDEFLSMS-YTVIHGHTPYLTGSVDIDTGCVYGKLTAFGITEDNRHVVLQAV-MRVYAVGDIHGRLDLLEIMLDLVDADQHERGIRPALIVFLGDYVDRGPSSKEVVDAFIGELGEALS--PVFIKGNHEALLMSFLYDATPNFWL-K---------N--GGDAALLSYGLDPDFREALPASHLSFYCNLKPSFRI-------------GGYFFVHGGVVPLDQQREEDLL-WIRDPFLTSD-AVVVHGHTPEPQLRIGVDTYAFKSRLTAVGLEGTERWILSTAS-VRVYAVGDIHGRLDLLEQLLAQIDRDAASGADLVKYLVFLGDYVDRGPDSAMVIERLCREPLPGFG--AIHLRGNHEAAMMDFIEKPEAPDWL-E---------Y--GGRATLASYGVPAPFAAALPPHHRTFLAGLRSSVAI-------------GDYLFVHAGILPLHRQRDEDLL-WIRREFLTSH-KLVVHGHTIQPDIRIGIDTGAYASRLTALVLEGGERRFLCTI--MRVYAVGDIHGRLDLLDQMMGQIARDAAQAPNLLKYIVFLGDYVDRGPDSRLVIERLACGLPPVLG--AVFLRGNHEDTLLGFLSDLRVPGWL-T---------Y--GGDATLESYGIPAPLNSLLPAHHRAFLTGLRCHLTI-------------GDYHFVHAGVVPLDRQEDKDRL-WIRDAFLASR-KVIVHGHTIEPELRIGIDTGAYATRLTALVLEGAERRFLCTV--MRVYAVGDIHGRLDLLDQMLGQIAHDAGQAPNLLKYLVFLGDYVDRGPDSRLVIERLACGLPPVFG--AVFLRGNHEDTMLSFLSDLRIPGWL-T---------Y--GGDATLESYGIPAPLNGLLPPHHRAFLTGLRNHLTI-------------GDYHFVHAGIIPLDRQEDKDRL-WIRETFLASR-KIVVHGHTIEPELRIGIDTGAYATRLTALVLEGTERRFLCTV--VRIYAVGDLHGCADLLAGAFDLIDADLARSRPEQAIQVFLGDYIDRGPDAKRTIDLLIDRGQRHE---TVFVRGNHEALLLRLLDDPKMPNWI-K---------L--GGMTTLMSYGITISLAAAIPSAHTDFLANLVPFFSC-------------GDYFFVHAGVVPLDKQSEEDLM-WIREPFLSSD-KVVVHGHTPEPEIRINIDTGAYATLLTVLSIQDDKVEFHSSSRSR LRLYVIGDVHGRLDLLDDLSKQIAADLDDAP-ADVLTIFVGDYIDRGRESAGVIDRLNRGEFPTP---FHALRGNHEEILLKFLEDESVESWR-K---------F--GGLETLHSYGVDVSLLELMPETHRRFLEGTDLFASF-------------GDYLFVHAGVVPVHLQNADDLL-WIREEFLRFR-KVVVHGHTPAPDVRINVDTGAFATVLTALVLEGADRRFLQTRAPQ Q Q TRVYAVGDIHGRFDLLWRLAGAIEADLAARPVARVVEVYLGDYVDRGPDSARVVDWLSRPAAGGRE--RICLMGNHEHALLSFLADPGLAQWR-Q---------F--GGGETLISYGIDLPLRAALPVSHLRFLRALPALHRI-------------GGYAFVHAGVVTLDAQVETDLL-WIRQEFLDYP-AVVVHGHTPTIDLRINIDTGAYMTRLTCAVLDADGVRFLQTTREG KILVAIGDIHGRLDLLDRLLPKISEHLDG---RAARLVFLGDYIDRGPDSAGVLDRLIGVRESFSD--CVFLKGNHEEALLDFLAAPEEASWL-G---------W--GGDATLLSYGLDTDFAAALPDDHFRFLLDLELFHRA-------------APYQFVHAGLLPFDRQIERDLL-WIRDRFFNAN-WVIVHGHTPAPDNRINVDTGAVWSQLTAVVLDGEDRRFLTS---KRTFVIADIHGCYKSLKALVENLQPT------HEDQFIFLGDYIDRGPDSKAVINYLIYIKEHYKN--CVFLRGNHEQMFLDAHQSIAAERWI-I---------N--GGEEVMISYGLK--NHDHVPYHVQDFISNTFYYFQD-------------ERFIFVHAGIAESPLKDYNSMM-WIRNAEITHT-RIVVHGHTPLDTIEICLDNGCVFKQRTGLGLHSNRLYVQNNIDFKRILVISDIHGEIEKFEQLLKEAQYDA-----KQDQLILLGDYVDRGPNARAVIEKVKELKEE--G--ALILKGNHEDMMIKALTTNEENHWVKR---------N--GGDKTLYSYGFLEELQSSVLEEHIKFIQELDHYIET-------------EEYIFVHAGVKRVSESEPYTLM-WIRNEFHNG--KVVVFGHTETLHSIIGIDGGAVYGQLNCLELPSKKVYVIKN---KRILVISDIHGEIEKFEQLLEEAQYDA-----KKDQLILLGDYVDRGPNARAVIERVKELKED--G--ALVLKGNHEDMMIKALTTDEENHWVKR---------N--GGDKTLYSYGFVEELQSHVLEEHVKFIQELDHYIET-------------EKYIFVHAGVKRVSECEPYTLM-WIRNEFHNG--KVVVFGHTETLHGIIGIDGGAVYGQLNCLELPSKKVYVIKS---KEFFVIGDVHGKFELLLDILKKWNE-------KRQQLIFLGDLIDRGENSKACLELVWSLVREK-G--AICLTGNHERMFLAWLRDPIDDHYR-R---------N--GGDTTINSLLGRPLLEQY--EDLIDFVKSCPYHLET-------------EDYIFVHAGVLPDWRTSNHDKV-WLRTPFHEAE-KMIVFGHTPNLYQKMGIDGGAVYGVLHGIVLDGTGLVQDYIVGAV KEFFVIGDVHGKFELLLDILKKWNE-------KRQQLIFLGDLIDRGENSKACLELVWSLVREK-G--AICLTGNHERMFLAWLRDPIDDHYR-R---------N--GGDTTINSLLGRPLLEQY--EDLIDFVKSCPYHLET-------------EDYIFVHAGVLPDWRTSNHDKV-WLRTPFHEAE-KMIVFGHTPNLYQKMGIDGGAVYGVLHGIVLDGTGLVQDYIVGAV QRILVASDIHGHGDALIKLLSEAVYNA-----ETDQLVLLGDYVNNGPDSVGTLRLVQKLVNKG----AIALVGNHELRWLERQD---------------------------------------SEAQAWHKFLSGLPYIKVL-------------DSYIFVHAGIVPIDKQQCEHVT-AHHSVALNQH-KTIIHGHVPEIHIMIDIDTGAGHNYLTLVDLTNACSYSVNVSHKD GRLFVASDIHGHGETLRQLLETAEYSP-----GNDQLVLCGDYVNNGPDSEGTLELIQKLQKD--G--AVVLFGNHELRWLEEIA----TEFIMH---------H--KMKVTYQRIQKD-----WGKLKWRSLLESFDKYYMT-------------DDFLFIHAGFVPLEKQTAAEVT-GYDKTKDLQH-KYMVHGHVPEIKVSVNIDTGAGHGYLTLLDLTNSKQYCQRVVMKE MRTIVVGDIHGCYAELLQLLAKVEIA------EEDCLVSLGDIVDRGIDSVKVYDFLKNRPN------TIVLMGNHERKHLRQTLSYSQ--------------------------------QFGDSYDEFLEWISHLPYYHET-------------ESAIFVHAAVIPIEEQREEVLC-WSQ--------KPVIFGHRVHPLITYGIDTGACHGTLTAIILPSFEIVQVQAAKDY MRTIIVGDIHGCYEELLQLLTKAQIT------KEDCLISLGDIVDRGVDSVRVYDFLKNRPN------TIVLMGNHERKHLRQTLSYSQ--------------------------------QFGDRYPEFLEWVSHRPYYHET-------------DAAILVHAAVIPIGEQRPEVLC-WSQ--------KPIIFGHHVIPLIIYGIDTGACHGKLTGLILPSFEVVQVQAPQDY MRTLVFGDIHGGYRALVQVMKRANVT------RDDHLIFLGDYVDGWSESDAVIDELMRIKKEYKK--VTLIRGNHDDLVQQWLEGKTMPKWL-Q---------H--GGQSTIDCYEKR---SAQEIQWHIEFYKTFEDYLDD-------------ENRFFCHAGFGPEYEWHSTAFY-WDRDPLYPKRLEEIYIGHTPVPVNIWNLDTGAAFKSLTVMDVDTKEFWQSDALPSL GPLFVIGDVHGYLDELLAALHQLIDTEGHWSAGRSRIWFLGDFTDRGPDGIGVIDLVMQLAAEAAAAGCRALMGNHELLFLGASR----AAWRLN------------GGQQH---------DLERLQPHHISWLSRLPAMALE-------------DGHLLLHSDTLEYGETIADVNDAW--DAFFTKRFYRVVHGHSPGPYIAIAMDGGVTMDRLLVARLPLN----------GPLYVVGDVHGYLDELKAALRELIDAEGHWSAGQSRLWFLGDFTDRGPDGVGVIDLVMQLSAEAAAAGAKALMGNHELLLIGAKR----AAWRLN------------GGQPT---------DMERLQDHHIAWMSRLDAIALE-------------DNHLLLHSDTLEYGSSIEEVNDAW--ELFFTKRFGRVVHGHSPGPHLALAMDGGVTMERLLVAQLPLRG---------GPLYVVGDVHGYLDELKAALRELIDAEGHWSAGQSRLWFLGDFTDRGPDGVGVIDLVMQLSAEAAAAGAKALMGNHELLLIGAKR----AAWRLN------------GGQPT---------DMERLQDHHIAWMSRLDAIALE-------------DNHLLLHSDTLEYGSSIEEVNDAW--ELFFTKRFGRVVHGHSPGPHLALAMDGGVTMERLLVAQLPLRG---------GPLYVVGDVHGYLDELVAALRALIDENGGWAAGNARLWFLGDFTDRGPDGIGVIDLVMRLSAEAAAAGCKALMGNHELLLIGAKR----AAWLLN------------GGQKH---------DMDRLQDVHLQWMSRLDAVVRE-------------DDHLLVHSDTLEYGQTIEDVNETW--DVFFTKRFHRIVHGHSPGPHVAIAMDGGVTMAKLLVVQLPLNG---------TPLYVAADIHGHRTEFRAVLRALADENGHWSGGRARLWLLGDHVDRGPDGIGVIEDVRRLAAQARDEGVGALLGNHEVQLLAAHR----GGWARF------------GGQEA---------DLRGLTPSHLEWLAALPAAALV-------------DGHLLVHSDTLELGTSVAAVNAAWL-EFSMSSRGSVLVHGHSTAALRVLAIDGGAYLGRILLTRL-------------- : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 165 228 228 228 228 228 228 227 228 228 227 187 187 226 228 228 137 228 228 219 209 228 207 228 227 205 227 227 228 228 228 228 228 228 228 226 228 228 225 227 220 201 215 190 202 213 213 202 202 210 205 190 190 190 190 189 189 194 194 194 194 194 194 198 195 197 192 196 198 198 195 195 201 201 197 185 190 204 203 203 203 205 203 206 199 197 195 195 196 196 172 191 169 169 198 184 185 185 185 182 Figure 2.2: Alignment of RLPH phosphatases from both Prokaryotes and Eukaryotes. Candidate RLPH phosphatase sequences identified as detailed in Materials and Methods were aligned using MAFFT. The alignment was visualized in GeneDoc and manually edited to remove the extreme N- and C-terminal regions outside the protein phosphatase domain, plus poorly aligned regions, prior to phylogenetic tree inference. Two conserved bacterial-like phosphatase motifs (red) were identified in addition to the known canonical PPP-family protein phosphatase motifs (blue). Of the two bacterial-like PPP-phosphatase motifs, Motif 1 represents a novel region of conservation. Motif 2 was previously reported (Andreeva and Kutuzov, 2004). The gene identifiers of each sequence used here are located in Appendix A.2. 38 2.3.2 Distribution and interrelationships of bacterial-like PPP phosphatases 2.3.2.1 SLP phosphatases After multiple sequence alignment and phylogenetic tree inferences using sequences from Shewanella-like phosphatases (SLPs), a final depiction of SLP phosphatase molecular evolution was obtained (Figure 2.3). Here, SLP representatives from numerous eukaryotic supergroups (Opisthokont, Chromalveolate, Rhizaria, Excavates, Plantae) were identified. It is clear from inspection of the sequence composition of this tree that organisms which are now photosynthetic (Green and Red Algae, Plants, diverse Chromalveolates), or which are thought to have been derived from photosynthetic ancestors (Apicomplexa, Oomycetes, possibly Euglenozoa), predominate. Fungi are the only non-photosynthetic group represented in strength. A single sequence was found in Apusozoa (an animal ally), but none in Animals. Thorough TBLASTN searching failed to reveal any additional SLPs amongst previously unannotated sequences in Animals, or any other eukaryotic group. The previously described SLP1 and SLP2 forms of plants and their associated Green Algae are seen here to represent the terminal, most derived aspect of a broad SLP radiation which spreads across Eukaryotes (Figure 2.3). The SLP1 and SLP2 sequences have presumably arisen by gene duplication and divergence from the deeper group of SLP sequences (which we here term SLP3) which are present as a distinct lineage in Green Algae. At the base of the SLP tree is a cluster of bacterial sequences from δ-Proteobacteria ("Outer Myxobacteria" sequences), plus γ-Proteobacteria (including the genus Shewanella). Within the structure of the eukaryotic SLP radiation itself is a second cluster of δ-proteobacterial sequences ("Inner Myxobacteria" 39 40 Figure 2.3: Orthogonal phylogenetic tree depicting SLP phosphatase distribution and interrelationships across both Eukaryotes and Prokaryotes. Phylogenetic tree inference was performed as outlined in the Materials and Methods. The most crucial nodes are labelled. Branch support values with the four inference methods (PhyML [aBayes]; RAxML [RBS]; MrBayes [PP]; PhyloBayes_MPI [PP]) are as follows (see Materials and Methods for details). Node A: (0.998, 75, 0.86, 1.00); Node B: (0.995, 36, 0.80, 0.52); Node C: (0.755, 58, 0.86, 0.52); Node D: (1.00, 80, 0.93, 1.00); Node E: (1.00, 100, 1.00, 1.00); Node F: (0.999, 88, 0.93, 1.00). Branch support values for all trees are summarized as Appendix B. Predicted in silico subcellular localizations are represented by Ch (chloroplast), Mt (mitochondria), Cy (cytosol), Nu (nuclear), ER (endoplasmic reticulum), Px (peroxisome) and SP (signal peptide). Gene identifiers for the sequences used in phylogenetic tree generation are listed in Appendix A.1, while compiled in silico subcellular localization data can be found in Table 2.1. sequences). Intensive searching revealed only four Archaeal SLPs, all in the family Halobacteriaceae. Finally, closely associated with the SLP sequence assemblage is a group of sequences from α-Proteobacteria (Figure 2.3). In another, more basal cluster, are representative distantly related sequences from a diverse group of bacterial phyla, including Cyanobacteria, Bacteroidetes, and Actinobacteria. Through the application of 10 different in silico localization programs, SLPs were found to maintain differing consensus subcellular localizations in plants, with SLP1s being chloroplastic and SLP2s being cytosolic (Table 2.1); however, the potential for tissue-specific subcellular localization differences still exists. SLPs in the Green Algae associated with each of these two groups show predicted localizations in accord with their related plant sequences. This suggests that these differing protein isoform localizations may have been established early in evolution, before the advent of land plants. Furthermore, it is interesting that in the group of green algal sequences deeper in the tree ("SLP3 phosphatases"), the predominant predicted localization is mitochondrial (Figure 2.3). This is also true of the sequences from other photosynthetic organisms in the deeper SLP radiation and suggests that protein retargeting may 41 Table 2.1: In silico subcellular prediction data for eukaryotic SLP phosphatases used in phylogenetic tree and alignment construction. Prediction consensus is highlighted in yellow. Tree labels, gene identifiers, organism group and organism information are also provided. SLP Phosphatases Predict ion Progr am Res ults Predotar Wolf pSORT Target P ChloroP Tree Designation Gene Identifier AlSLPa 470724|PACid:16040444 Streptophyta Arabidopsis lyrata 0.01mTP 0.92cTP 0.01ER 0.08other chlo: 6.0, cyto: 3.0, E.R.: 2.0, nucl: 1.0, mito: 1.0 0.929cTP 0.075mTP 0.031SP 0.073other RC=1 YES AlSLPb 472067|PACid:16060211 Streptophyta Arabidopsis lyrata 0.01mTP 0.00cTP 0.01ER 0.99other cyto: 8.0, chlo: 4.0, nucl: 1.0 0.171cTP 0.071mTP 0.282SP 0.572other RC=4 NO AcSLPa Aquca_030_00216.1|PACid:22035664 Streptophyta Aquilegia coerulea 0.02mTP 0.92cTP 0.00ER 0.07other chlo: 12.0, mito: 1.0 0.916cTP 0.062mTP 0.020SP 0.072other RC=1 YES AcSLPb Aquca_030_00216.1|PACid:22035665 Streptophyta Aquilegia coerulea 0.02mTP 0.92cTP 0.00ER 0.07other chlo: 12.0, mito: 1.0 0.916cTP 0.062mTP 0.020SP 0.072other RC=1 YES AcSLPc Aquca_042_00158.1|PACid:22033007 Aquilegia coerulea 0.01mTP 0.01cTP 0.00ER 0.99other nucl: 7.0, cysk: 6.0 Arabidopsis thaliana AtSLP1 Organism Group PredSL SLP-Local MitoProt PCLR iPSORT mTP, No cTP, No mTP, cTP, No No cTP mTP / cTP nucleus or cytosol/ chloro No No No Cyto 0.999914 0.000215 0.00014 chloro / SP mTP cTP mTP / cTP Chloro 0.999914 0.000215 0.00014 chloro / SP mTP cTP mTP / cTP Chloro 0.666418 0 0.621623 nucleus or cytosol/ chloro Consensus mTP, cTP, ER, Cyto At1g07010 Protein Prowler Organism Streptophyta 0.406TP 0.030mTP 0.114SP 0.567other RC=5 Yes/NO SP, mTP, cTP, other, PTS1 NO 0.00 0.01 0.98 0.01 0.00 cTP, mTP, SP, Other 0.999844 0 0.000418 chloro / SP 0.113195 0 0.092027 other 0.00 0.01 0.99 0.00 0.00 0.00 0.01 0.99 0.00 0.00 0.03 0.10 0.01 0.86 0.00 0.03 0.14 0.03 0.80 0.00 No No Chloro No Cyto cTP Chloro 0.01mTP 0.90cTP 0.05ER 0.10other chlo: 5.0, nucl: 2.0, cyto: 2.0, E.R.: 2.0 0.816cTP 0.067mTP 0.039SP 0.155other RC=2 YES 0.00 0.03 0.94 0.03 0.00 0.999876 0.000122 0.000309 chloro / SP AtSLP2 At1g18480 Streptophyta Arabidopsis thaliana 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 8.0, nucl: 3.0, chlo: 2.0 0.152cTP 0.072mTP 0.277SP 0.443other RC=5 NO 0.03 0.12 0.02 0.84 0.00 0.510013 0 0.222184 other nucleus or cytosol/ chloro No No No Cyto BdSLPa Bradi3g25370.2|PACid:21831821 Streptophyta Brachypodium distachyon 0.11mTP 0.11cTP 0.95ER 0.04other chlo: 13.0 0.751cTP 0.476mTP 0.104SP 0.006other RC=4 YES 0.51 0.11 0.32 0.06 0.00 0.512254 0.013118 0.579544 chloro / mito mTP cTP cTP Chloro Bradi4g20750.1|PACid:21811426 Streptophyta Brachypodium distachyon 0.022cTP 0.019mTP 0.257SP 0.438other RC=5 BdSLPb Streptophyta cTP, mTP, SP other 0.01mTP 0.13cTP 0.04ER 0.83other cyto: 7.0, pero: 4.0, chlo: 2.0 Bra015544|PACid:22693638 Streptophyta Brassica rapa 0.05mTP 0.12cTP 0.06ER 0.78other nucl: 4.0, chlo_mito: 4.0, chlo: 3.5, mito: 3.5 BrSLPa Bra025903|PACid:22696925 Streptophyta Brassica rapa 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 10.0, mito: 2.0, pero: 1.0 0.120cTP 0.380mTP 0.029SP 0.690other RC=4 CpaSLPa evm.model.supercontig_2.360|PACid:16412742 Streptophyta Carica papaya 0.01mTP 0.34cTP 0.02ER 0.64other chlo: 13.0 0.906cTP 0.032mTP 0.009SP 0.050other RC=1 CpaSLPb evm.model.supercontig_9.374|PACid:16428284 Streptophyta Carica papaya 0.01mTP 0.01cTP 0.00ER 0.99other CpaSLPc evm.model.supercontig_9.376|PACid:16428286 BrSLPb 0.727cTP 0.324mTP 0.005SP 0.016other RC=3 No cTP YES 0.33 0.15 0.03 0.49 0.11 YES 0.00 0.28 0.70 0.02 0.00 0.997418 0.00217 0.003871 chloro / mito NO 0.01 0.09 0.00 0.89 0.00 0.004621 0.000792 0.012195 other nucleus or cytosol / mito No No No Cyto YES 0.00 0.01 0.99 0.00 0.00 0.999675 0.000306 0.00068 chloro / SP mTP cTP cTP Chloro 0.10 0.09 0.00 0.80 0.00 0.001743 0.000256 0.040843other nucleus or cytosol / SP 0.15 0.12 0.02 0.71 0.00 0.000979 0.000309 0.0862 other NO 0.592426 0.001214 0.060819 chloro / SP No mTP No cTP No cTP mtp / cTP No No Cyto Chloro cyto: 12.0, cysk: 1.0 0.018cTP 0.048mTP 0.782SP 0.554other RC=4 0.01mTP 0.00cTP 0.12ER 0.87other cyto: 10.0, cysk: 3.0 0.142cTP 0.041mTP 0.333SP 0.475other RC=5 NO nucleus or cytosol/ chloro No No No Cyto CarSLPa Carubv10012402m|PACid:20892489 Streptophyta Capsella rubella 0.01mTP 0.82cTP 0.12ER 0.16other chlo: 12.0, vacu: 1.5, E.R._vacu: 1.5 0.934cTP 0.065mTP 0.064SP 0.047other RC=1 YES 0.999885 0.000128 0.000278 chloro / SP mTP cTP mTP / cTP Chloro CarSLPb Carubv10017403m|PACid:20887668 Streptophyta Capsella rubella 0.01mTP 0.00cTP 0.02ER 0.98other cyto: 10.0, chlo: 2.0, nucl: 1.0 0.089cTP 0.070mTP 0.472SP 0.421other RC=5 NO 0.03 0.08 0.00 0.89 0.00 0.04467 0 0.087162 other nucleus or cytosol/ chloro No No No Cyto CcSLPb clementine0.9_012529m|PACid:19282462 Streptophyta Citrus clementina 0.06mTP 0.94cTP 0.01ER 0.06other chlo: 14.0 0.947cTP 0.031mTP 0.010ER 0.100other RC=1 YES 0.00 0.01 0.99 0.00 0.00 0.999485 0.000595 0.000972 chloro / SP mTP cTP mTP / cTP Chloro Streptophyta Carica papaya 0.01 0.04 0.94 0.02 0.00 Cyto CcSLPa clementine0.9_028360m|PACid:19276388 Streptophyta Citrus clementina **partial sequence xxx xxx xxx xxx xxx xxx xxx xxx xxx xxx CsSLPb orange1.1g016381m|PACid:18111291 Streptophyta Citrus sinensis 0.06mTP 0.94cTP 0.01ER 0.05other chlo: 14.0 0.955cTP 0.032mTP 0.011SP 0.099other RC=1 YES 0.00 0.01 0.99 0.00 0.00 0.999757 0.000391 0.000455 chloro / SP mTP cTP mTP / cTP Chloro CsSLPa orange1.1g041373m|PACid:18124504 Streptophyta Citrus sinensis 0.01mTP 0.00cTP 0.08ER 0.92other cyto: 12.0, nucl: 1.0 0.046cTP 0.087mTP 0.257SP 0.634other RC=4 NO 0.02 0.06 0.00 0.92 0.00 0.00987 0 0.612498 nucleus or cytosol/ chloro No No No Cyto CrSLPa Cre03.g185200.t1.1 Chlorophyta Chlamydomonas reinhardtii 0.14mTP 0.02cTP 0.01ER 0.83other chlo: 10.0, mito: 4.0 0.846cTP 0.738mTP 0.000SP 0.013other YES 0.00 0.93 0.07 0.00 0.02 0.001124 0.999607 0.000346 mito / chloro mTP cTP mTP / cTP Chloro/Mito Chlorophyta Chlamydomonas reinhardtii CrSLPb Cre17.g718800.t1.1 0.041cTP 0.235mTP 0.250SP 0.689other RC=3 NO 0.05 0.11 0.00 0.83 0.00 0 0.82683 0.290684 nucleus or cytosol / mito No No No Cyto CocSLPa jgi|Coc_C169_1|30480|estExt_Genewise1Plus.C_140169 Chlorophyta Coccomyxa sp. 0.15mTP 0.30cTP 0.01ER 0.59other chlo: 10.0, mito: 4.0 0.763cTP 0.113mTP 0.006SP 0.085other YES 0.01 0.85 0.12 0.02 0.00 0.000474 0.999785 0.000608 mito / chloro mTP cTP mTP / cTP Chloro/Mito CocSLPb jgi|Coc_C169_1|35854|fgenesh1_pm.4_#_256 Chlorophyta Coccomyxa sp. 0.01mTP 0.00cTP 0.00ER 0.98other mito: 5.5, chlo_mito: 4.5, cyto: 4.0 0.066cTP 0.420mTP 0.029SP 0.706other RC=4 NO 0.02 0.65 0.01 0.32 0.00 0 0.102375 0.017014 other nucleus or cytosol / mito No No No Cyto CsatSLPa 0.01mTP 0.00cTP 0.01ER 0.98other cyto: 9.0, mito: 3.0, nucl: 1.0 Cucsa.151840.1|PACid:16964486 Streptophyta Cucumis sativus 0.949cTP 0.037mTP 0.010SP 0.068other RC=1 YES 0.00 0.00 0.99 0.00 0.00 0.999885 0.000287 0.000173 chloro / mito mTP cTP mTP / cTP Chloro CsatSLPb Cucsa.151840.2|PACid:16964487 Streptophyta Cucumis sativus 0.01mTP 0.96cTP 0.02ER 0.04other chlo: 12.0, nucl: 1.0 0.949cTP 0.037mTP 0.010SP 0.068other RC=1 YES 0.00 0.00 0.99 0.00 0.00 0.999885 0.000287 0.000173 chloro / mito mTP cTP mTP / cTP Chloro CsatSLPc Cucsa.075800.1|PACid:16955776 Streptophyta Cucumis sativus 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 7.0, chlo: 3.0, nucl: 2.0, cysk: 1.0 0.185cTP 0.055mTP 0.074SP 0.509other RC=4 NO 0.02 0.07 0.00 0.91 0.00 0.013107 0 0.246241 other nucleus or cytosol / chloro No No No Cyto CvSLPb jgi|ChlNC64A_1|50727|fgenesh3_pg.C_scaffold_4000226 Chlorella variabilis 0.01mTP 0.05cTP 0.00ER 0.94other chlo: 10.0, cyto: 4.0 0.721cTP 0.050mTP 0.047SP 0.274other RC=3 YES 0.05 0.26 0.31 0.38 0.00 0 0.998924 0.004293 CvSLPa jgi|ChlNC64A_1|9678|gw1.24.110.1 Chlorophyta Chlorophyta Chlorella variabilis 0.01mTP 0.96cTP 0.02ER 0.04other chlo: 12.0, nucl: 1.0 mito / chloro mTP cTP cTP Mito/Chloro **partial sequence xxx xxx xxx xxx xxx xxx xxx xxx xxx xxx EgSLPa Eucgr.B04000.1|PACid:23569453 Streptophyta Eucalyptus grandis 0.02mTP 0.78cTP 0.04ER 0.21other chlo: 10.0, vacu: 3.0 0.837cTP 0.089mTP 0.003SP 0.056other RC=2 YES 0.00 0.08 0.88 0.04 0.00 0 0.241401 other nucleus or cytosol / chloro mTP cTP mTP / cTP Chloro / Mito EgSLPb Eucgr.B04000.2|PACid:23569454 Streptophyta Eucalyptus grandis 0.02mTP 0.78cTP 0.04ER 0.21other chlo: 10.0, vacu: 3.1 0.837cTP 0.089mTP 0.003SP 0.056other RC=3 YES 0.00 0.08 0.88 0.04 0.00 0 0.241401 other nucleus or cytosol / chloro mTP cTP mTP / cTP Chloro / Mito GrSLPa Gorai.006G232000.1|PACid:26833651 Streptophyta Gossypium raimondii 0.01mTP 0.01cTP 0.07ER 0.92other extr: 4.0, chlo: 2.0, nucl: 2.0, cyto: 2.0 0.207cTP 0.098mTP 0.145SP 0.537other RC=4 NO 0.13 0.22 0.20 0.46 0.00 0.49021 0 0.517934 other mito / chloro No No No Cyto Streptophyta Gossypium raimondii 0.00 0.01 0.98 0.00 0.00 GrSLPb Gorai.013G271300.1|PACid:26789935 0.02mTP 0.52cTP 0.00ER 0.47other chlo: 12.0, nucl: 1.0 0.886cTP 0.059mTP 0.044SP 0.156other RC=2 YES chloro / SP mTP cTP mTP / cTP Chloro GmSLPc Glyma02g39710.1|PACid:26288354 Streptophyta Glycine max 0.01mTP 0.87cTP 0.05ER 0.12other mito: 6.0, chlo: 4.0, cyto: 2.0, nucl: 1.0 0.918cTP 0.143mTP 0.026SP 0.046other RC=2 YES 0.00 0.02 0.98 0.01 0.00 0.999915 0.000198 0.000159 chloro / SP mTP cTP mTP / cTP Chloro GmSLPa Glyma19g13655.1|PACid:26327286 Streptophyta Glycine max 0.01mTP 0.01cTP 0.04ER 0.95other cyto: 9.0, nucl: 2.0, cysk: 2.0 0.377cTP 0.028mTP 0.054SP 0.642other RC=4 NO 0.03 0.09 0.01 0.87 0.00 0.314858 0 0.392796 other nucleus or cytosol / chloro No cTP No GmSLPd Glyma19g13655.2|PACid:26327286 Streptophyta Glycine max 0.01mTP 0.01cTP 0.04ER 0.95other cyto: 9.0, nucl: 2.0, cysk: 2.0 0.377cTP 0.028mTP 0.054SP 0.642other RC=4 NO 0.03 0.09 0.01 0.87 0.00 0.314858 0 0.392796 other nucleus or cytosol / chloro No cTP No Cyto GmSLPb Glyma14g37780.1|PACid:26287143 Streptophyta Glycine max 0.01mTP 0.97cTP 0.03ER 0.03other chlo: 14.0 0.964cTP 0.019mTP 0.057SP 0.032other RC=1 YES 0.00 0.01 0.99 0.00 0.00 0.999914 0.000171 0.000181 chloro / SP mTP cTP mTP / cTP Chloro GmSLPe Glyma16g07840.1|PACid:26351113 Streptophyta Glycine max **partial sequence xxx xxx xxx xxx xxx xxx xxx xxx xxx xxx LuSLPc Lus10031284|PACid:23151137 Streptophyta Linum usitatissimum 0.14mTP 0.46cTP 0.00ER 0.46other chlo: 9.0, nucl: 1.0, mito: 1.0, vacu: 1.0, E.R.: 1.0 0.728cTP 0.102mTP 0.041SP 0.158other RC=3 YES 0.00 0.05 0.93 0.01 0.00 0.999835 0.000327 0.000265 chloro / SP No No mTP / cTP Chloro LuSLPa Lus10038865|PACid:23150278 Streptophyta Linum usitatissimum 0.01mTP 0.00cTP 0.04ER 0.95other nucl: 7.0, cyto: 4.0, cysk: 2.0 0.175cTP 0.066mTP 0.146SP 0.656other RC=3 NO 0.03 0.09 0.01 0.88 0.00 0.643592 0 0.4377 nucleus or cytosol / SP No No No Cyto / Nuc LuSLPb Lus10014985|PACid:23152752 Streptophyta Linum usitatissimum 0.01mTP 0.01cTP 0.02ER 0.97other nucl: 8.0, extr: 2.0, chlo: 1.0, cyto: 1.0, cysk: 1.0 0.087cTP 0.061mTP 0.161SP 0.695other RC=3 NO 0.02 0.07 0.00 0.91 0.00 0.560296 0 0.513057 nucleus or cytosol/ chloro No No No Cyto / Nuc MdpSLPa MDP0000769617|PACid:22639836 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 8.0, nucl: 2.0, E.R.: 2.0, mito: 1.0 0.748cTP 0.018mTP 0.153SP 0.434other RC=4 YES No Cyto / Chloro MdpSLPb MDP0000465067|PACid:22671458 Streptophyta Malus domestica Streptophyta Malus domestica 0.01mTP 0.00cTP 0.00ER 0.99other 0.06 0.24 0.21 0.50 0.00 0.999843 0 0.0004 0.045992 0.241069 0.015069other chloro / SP No No Cyto cyto: 9.0, E.R.: 3.0, nucl: 1.0 0.395cTP 0.029mTP 0.061SP 0.823other RC=3 NO 0.03 0.16 0.05 0.75 0.00 0.020244 0.000857 0.806373 chloro / nucleus or cytosol No No No Cyto MdpSLPc MDP0000899519|PACid:22657341 Streptophyta Malus domestica 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 9.0, E.R.: 3.0, nucl: 1.0 0.395cTP 0.029mTP 0.061SP 0.823other RC=3 NO 0.03 0.16 0.05 0.75 0.00 0.020244 0.000857 0.806373 chloro / nucleus or cytosol No No No Cyto MdpSLPd MDP0000941407|PACid:22628975 Streptophyta Malus domestica 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 5.0, nucl: 3.0, extr: 3.0, cysk: 3.0 0.081cTP 0.132mTP 0.140SP 0.852other RC=2 NO 0.01 0.05 0.00 0.93 0.00 0.000143 0.002346 0.060857 other nucleus or cytosol / SP No No No Cyto MdpSLPe MDP0000244675|PACid:22655284 Streptophyta Malus domestica 0.01mTP 0.50cTP 0.08ER 0.45other cyto: 9.0, nucl: 3.0, cysk: 1.0 0.209cTP 0.103mTP 0.171SP 0.633other RC=3 NO 0.09 0.11 0.00 0.80 0.00 0.010932 0 0.78466 nucleus or cytosol / SP No No No Cyto chloro / SP mTP cTP mTP / cTP Chloro MeSLPb cassava4.1_009955m|PACid:17994174 Streptophyta Manihot esculenta 0.03mTP 0.96cTP 0.02ER 0.04other chlo: 13.5, chlo_mito: 7.5 0.972cTP 0.116mTP 0.025SP 0.009other RC=1 YES 0.00 0.01 0.99 0.00 0.00 MeSLPa cassava4.1_008887m|PACid:17972799 Streptophyta Manihot esculenta 0.01mTP 0.00cTP 0.02ER 0.97other cyto: 7.0, cysk: 3.0, chlo: 2.0, nucl: 2.0 0.104cTP 0.128mTP 0.114SP 0.818other RC=2 NO 0.01 0.05 0.00 0.93 0.00 0.066881 0 0.412251 other nucleus or cytosol / mito No No No MgSLPa mgv1a007295m|PACid:17681918 Streptophyta Mimulus guttatus 0.01mTP 0.92cTP 0.01ER 0.08other chlo: 12.0, cyto: 1.0 0.592cTP 0.188mTP 0.004SP 0.058other RC=3 YES 0.01 0.26 0.63 0.09 0.00 0.859418 0.000801 0.208313 nucleus or cytosol/ chloro mTP cTP mTP / cTP Chloro MgSLPb mgv1a007297m|PACid:17681917 Streptophyta Mimulus guttatus 0.01mTP 0.92cTP 0.01ER 0.08other chlo: 12.0, cyto: 1.0 0.592cTP 0.188mTP 0.004SP 0.058other RC=3 YES 0.01 0.26 0.63 0.09 0.00 0.859418 0.000801 0.208313 nucleus or cytosol/ chloro mTP cTP mTP / cTP Chloro mgv1a007897m|PACid:17678561 Streptophyta NO MgSLPc Mimulus guttatus 0.01mTP 0.00cTP 0.01ER 0.99other 0.07 0.11 0.01 0.80 0.00 0.622945 0 0.613167 SP / chloro Micromonas pusilla 0.01mTP 0.00cTP 0.05ER 0.95other cyto: 11.0, pero: 2.0 0.039cTP 0.085mTP 0.126SP 0.928other RC=1 NO 0.02 0.09 0.00 0.89 0.00 0.000101 0.002354 0.428438other chloro / nucleus or cytosol jgi|MicpuC3|187449|gm1.4570_g Chlorophyta Micromonas pusilla 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 10.0, chlo: 2.0, pero: 2.0 0.039cTP 0.085mTP 0.126SP 0.928other RC=1 NO 0.01 0.05 0.00 0.93 0.00 0 0.015426 0.024612 other nucleus or cytosol / SP No No No Cyto MpSLPc jgi|MicpuC3|60158|MicpuC2.EuGene.0000080352 Chlorophyta Micromonas pusilla 0.91mTP 0.04cTP 0.02ER 0.08other chlo: 12.0, mito: 2.0 0.555cTP 0.735mTP 0.001SP 0.015other RC=5 YES 0.00 0.98 0.02 0.00 0.94 0.002625 0.999423 0.000124 chloro / mito mTP cTP mTP / cTP Chloro/Mito MpSLPd jgi|MicpuN3|62511|EuGene.1100010297 Chlorophyta Micromonas pusilla 0.92mTP 0.04cTP 0.05ER 0.07other chlo: 7.0, mito: 7.0 0.099cTP 0.959mTP 0.013SP 0.006other YES 0.00 0.99 0.00 0.00 0.26 0.001919 0.99919 0.000532 mito / chloro mTP cTP mTP / cTP Mito/Chloro MpSLPe jgi|MicpuC3|152093|e_gw1.3.1360.1 jgi|MicpuN3|69358|gw2.05.320.1 Chlorophyta Chlorophyta Micromonas pusilla **partial sequence 0.130cTP 0.028mTP 0.275SP 0.642other RC=4 xxx xxx xxx xxx xxx xxx No No xxx No No xxx No Cyto MpSLPb MpSLPa cyto: 8.0, chlo: 3.0, nucl: 3.0 0.999827 0.00018 0.000406 No xxx Cyto Cyto xxx MpSLPf jgi|MicpuN3|69434|gw2.13.202.1 xxx xxx xxx xxx xxx xxx xxx xxx MtSLPb Medtr5g081100.1|PACid:17467021 Streptophyta Medicago truncatula 0.01mTP 0.96cTP 0.01ER 0.04other chlo: 13.5, chlo_mito: 7.5 0.955cTP 0.065mTP 0.016SP 0.035other RC=1 YES 0.00 0.00 0.99 0.00 0.00 0.999925 0.000181 0.000159 chloro / nucleus or cytosol mTP cTP mTP / cTP Chloro MtSLPa Medtr7g005770.1|PACid:17472861 Streptophyta Medicago truncatula 0.01mTP 0.01cTP 0.14ER 0.84other cyto: y 9.0,, nucl: 2.0,, chlo: 1.0,, cysk: y 1.0 0.195cTP 0.044mTP 0.062SP 0.503other RC=4 YES 0.03 0.12 0.02 0.82 0.00 0.610576 0 0.66658 chloro / nucleus or cytosol No cTP No y / Chloro Cyto OlSLPc jgi|Ost9901_3|3839|gwEuk.4.274.1 Chlorophyta Ostreococcus lucimarinus **partial sequence xxx xxx xxx xxx xxx xxx xxx xxx xxx xxx jgi|Ost9901_3|38815|e_gwEuk.5.237.1 Chlorophyta Micromonas pusilla **partial sequence xxx Chlorophyta Ostreococcus lucimarinus 0.03mTP 0.00cTP 0.01ER 0.96other cyto: 7.0, pero: 4.0, mito: 2.0 0.185cTP 0.055mTP 0.074SP 0.509other RC=4 NO 0.03 0.31 0.01 0.65 0.00 0.000565 0.60464 0.001904 nucleus or cytosol/ chloro No OlSLPa jgi|Ost9901_3|89674|ost_20_004_020 Chlorophyta Ostreococcus lucimarinus 0.23mTP 0.35cTP 0.04ER 0.48other chlo: 9.5, chlo_mito: 7.5, mito: 4.5 0.740cTP 0.621mTP 0.013SP 0.009other RC=5 YES 0.01 0.36 0.62 0.01 0.00 0.988133 0.009078 0.012316 mito / chloro mTP cTP cTP Chloro/Mito OrcSLPa jgi|OstRCC809_2|31427|e_gw1.4.298.1 Chlorophyta Ostreococcus RCC809 0.20mTP 0.28cTP 0.02ER 0.56other chlo: 9.5, chlo_mito: 6.0, pero: 3.0 0.792cTP 0.569mTP 0.007SP 0.016other RC=4 YES 0.00 0.17 0.82 0.00 0.00 0.003046 0.99932 0.000136 mito / chloro mTP cTP mTP / cTP Chloro/Mito Chlorophyta Ostreococcus RCC809 0.43mTP 0.21cTP 0.05ER 0.43other 0.764cTP 0.514mTP 0.005SP 0.008other RC=4 YES 0.00 0.28 0.71 0.00 0.00 0.068505 0.906441 0.005721 Chlorophyta Ostreococcus RCC809 0.34mTP 0.00cTP 0.01ER 0.65other OlSLPb OrcSLPb jgi|OstRCC809_2|43148|e_gw1.20.557.1 OrcSLPc jgi|OstRCC809_2|57133|fgenesh1_pm.chr_5_#_202 chlo: 9.0, mito: 5.0 chlo: 9.0, mito: 3.0, plas: 1.0 0.214cTP 0.140mTP 0.008SP 0.796other RC=3 mito / chloro mTP No cTP No xxx cTP Cyto Chloro/Mito No 0.02 0.89 0.00 0.09 0.00 0.012245 0.996921 0 nucleus or cytosol / SP No cTP No OsSLPa Os10g25430.1 Streptophyta Oryza sativa 0.33mTP 0.49cTP 0.03ER 0.33other chlo: 14.0 0.831cTP 0.142mTP 0.010SP 0.021other RC=2 YES 0.01 0.11 0.85 0.03 0.00 0.603789 0.069789 0.002153 chloro / SP mTP cTP mTP / cTP Chloro OsSLPb Os11g15570.1 Streptophyta Oryza sativa 0.01mTP 0.07cTP 0.00ER 0.92other chlo: 7.0, cyto: 5.0, pero: 2.0 0.176cTP 0.039mTP 0.155SP 0.394other RC=4 YES 0.10 0.20 0.07 0.63 0.00 0.994859 0.000133 0.005704 chloro / nucleus or cytosol No cTP No Cyto/Chloro OtSLPc jgi|Ostta4|32574|0400010520 Chlorophyta Ostreococcus tauri 0.01mTP 0.00cTP 0.00ER 0.99other chlo: 6.0, nucl: 3.0, mito: 2.0, cyto: 1.0, pero: 1.0 0.513cTP 0.109mTP 0.005ER 0.482other RC=5 NO 0.03 0.24 0.11 0.62 0.65 0.005741 0.000825 0.007086other chloro / nucleus or cytosol No No No Chloro/Cyto Cyto/Mito OtSLPb jgi|Ostta4|21075|e_gw1.14.00.151.1 Chlorophyta Ostreococcus tauri 0.02mTP 0.00cTP 0.04ER 0.94other cyto: 6.0, chlo: 4.0, mito: 4.0 0.005cTP 0.626mTP 0.354SP 0.136other RC=4 NO 0.02 0.88 0.00 0.10 0.00 0 0.942125 0.045213 nucleus or cytosol / mito mTP No No Mito/Cyto jgi|Ostta4|3633|gw1.05.00.226.1 Chlorophyta Ostreococcus tauri **partial sequence xxx xxx xxx xxx xxx xxx xxx xxx PvSLPa Pavirv00009175m|PACid:23776527 Streptophyta Panicum virgatum 0.40mTP 0.06cTP 0.24ER 0.43other 0.603cTP 0.522mTP 0.004SP 0.023other RC=5 YES 0.01 0.60 0.38 0.02 0.00 0.363239 0.88343 0.000139 mito / chloro mTP cTP mTP / cTP Chloro / Mito PvSLPb Pavirv00009176m|PACid:23776528 Streptophyta Panicum virgatum 0.40mTP 0.06cTP 0.24ER 0.43other chlo: 12.0, plas: 1.0 0.603cTP 0.522mTP 0.004SP 0.023other RC=5 YES 0.01 0.60 0.38 0.02 0.00 0.363239 0.88343 0.000139 mito / chloro mTP cTP mTP / cTP Chloro / Mito PvSLPc Pavirv00010440m|PACid:23782081 Streptophyta Panicum virgatum 0.44mTP 0.23cTP 0.11ER 0.39other chlo: 12.5, chlo_mito: 7.3 0.702cTP 0.525mTP 0.004SP 0.011other RC=5 YES 0.00 0.61 0.38 0.01 0.00 0.790952 0.525919 0.000136 mito / chloro mTP cTP mTP Chloro / Mito Panicum virgatum 0.01mTP 0.40cTP 0.00ER 0.59other cyto: 7.0, chlo: 3.0, pero: 2.0, plas: 1.0 Chloro / Cyto OtSLPd xxx chlo: 12.0, plas: 1.0 xxx PvSLPd Pavirv00011040m|PACid:23782673 Streptophyta 0.479cTP 0.012mTP 0.114SP 0.189other RC=4 YES 0.11 0.21 0.40 0.29 0.00 0.563268 0.003873 0.184248 chloro / SP No cTP No PvulSLPb Phvul.008G256500.1|PACid:27155250 Streptophyta Phaseolus vulgaris 0.01mTP 0.86cTP 0.03ER 0.13other chlo: 13.0 0.910cTP 0.123mTP 0.049SP 0.069other RC=2 YES 0.00 0.02 0.96 0.02 0.00 0.999922 0.000177 0.00017 chloro / SP mTP cTP mTP / cTP PvulSLPa Phvul.005G085100.1|PACid:27150157 Streptophyta Phaseolus vulgaris 0.01mTP 0.01cTP 0.06ER 0.93other cyto: 5.0, chlo: 4.0, nucl: 4.0 0.388cTP 0.041mTP 0.033SP 0.765other RC=4 NO 0.03 0.10 0.01 0.87 0.00 0.477711 0 0.058615 other nucleus or cytosol/ chloro No cTP No Cyto PpaSLPa Pp1s194_144V6.1|PACid:18058638 Streptophyta Physcomitrella patens 0.01mTP 0.04cTP 0.00ER 0.95other cyto: 7.0, E.R.: 3.5, E.R._plas: 2.5, chlo: 1.0 0.202cTP 0.211mTP 0.009SP 0.476other YES 0.50 0.20 0.28 0.03 0.00 0.997764 0.002009 0 chloro / nucleus or cytosol No cTP cTP Chloro/Cyto PpaSLPc Pp1s412_38V6.1|PACid:18061202 Streptophyta Physcomitrella patens 0.01mTP 0.04cTP 0.00ER 0.95other cyto: 7.0, E.R.: 3.5, E.R._plas: 2.5, chlo: 1.0 0.202cTP 0.211mTP 0.009SP 0.476other YES 0.50 0.20 0.28 0.03 0.00 0.997764 0.002009 0 chloro / nucleus or cytosol No cTP cTP Chloro/Cyto Pp1s98 64V6 1|PACid:18070451 Pp1s98_64V6.1|PACid:18070451 Streptophyta Physcomitrella patens 0 01mTP 0.00cTP 0.01mTP 0 00cTP 0.01ER 0 01ER 0.99other 0 99other PpaSLPb 0 118cTP 0.153mTP 0.118cTP 0 153mTP 0.080SP 0 080SP 0.863other 0 863other RC=2 NO 0 02 0.08 0.02 0 08 0.00 0 00 0.90 0 90 0.00 0 00 0 0.060762 0 060762 0.458938 0 458938 other ppa025609m|PACid:17645981 Streptophyta Prunus persica 0.01mTP 0.94cTP 0.00ER 0.06other chlo: 13.0 0.948cTP 0.053mTP 0.030SP 0.078other RC=1 YES 0.00 0.02 0.97 0.01 0.00 0.999922 0.000189 0.000164 chloro / SP PperSLPa ppa007078m|PACid:17667068 Streptophyta Prunus persica 0.01mTP 0.01cTP 0.00ER 0.98other cyto: 8.0, nucl: 2.0, cysk: 2.0, chlo: 1.0 0.394cTP 0.082mTP 0.108SP 0.795other RC=3 NO 0.01 0.06 0.00 0.92 0.00 0.009578 0 0.123285 other nucleus or cytosol/ chloro No No No Cyto PtSLPa Potri.009G077900.1|PACid:26987030 Streptophyta Populus trichocarpa 0.03mTP 0.81cTP 0.03ER 0.18other chlo: 13.0 0.904cTP 0.158mTP 0.039SP 0.026other RC=2 YES 0.00 0.02 0.98 0.01 0.00 0.999738 0.000361 0.000527 chloro / mito mTP cTP cTP Chloro PtSLPb Potri.009G077900.2|PACid:26987029 Streptophyta Populus trichocarpa 0.03mTP 0.81cTP 0.03ER 0.18other chlo: 13.0 0.904cTP 0.158mTP 0.039SP 0.026other RC=2 YES 0.00 0.02 0.98 0.01 0.00 0.999738 0.000361 0.000527 chloro / mito mTP cTP cTP Chloro PtSLPc Potri.009G077900.3|PACid:26987029 Streptophyta Populus trichocarpa 0.03mTP 0.81cTP 0.03ER 0.18other chlo: 13.0 0.904cTP 0.158mTP 0.039SP 0.026other RC=2 YES 0.00 0.02 0.98 0.01 0.00 0.999738 0.000361 0.000527 chloro / mito mTP cTP cTP Chloro PtSLPd Potri.009G077900.4|PACid:26987031 mTP PtSLPe PperSLPb E R : 4.5, E.R.: 4 5 cyto: 4.0 4 0, E.R._vacu: E R vacu: 3.5 35 nucleus or cytosol / SP No Chloro mTP No No Cyto cTP mTP / cTP Chloro Streptophyta Populus trichocarpa 0.03mTP 0.81cTP 0.03ER 0.18other chlo: 13.0 0.904cTP 0.158mTP 0.039SP 0.026other RC=2 0.00 0.02 0.98 0.01 0.00 0.999738 0.000361 0.000527 mito / chloro cTP cTP Potri.013G127500.1|PACid:26995499 Streptophyta Populus trichocarpa 0.01mTP 0.00cTP 0.05ER 0.95other cyto: 10.0, nucl: 2.0, golg: 1.0 0.125cTP 0.083mTP 0.115SP 0.702other RC=3 NO 0.01 0.06 0.00 0.92 0.00 0.00149 0.0004 0.018489 other nucleus or cytosol / mito No No No Cyto RcSLPb 30128.m008860|PACid:16818446 Streptophyta Ricinus communis 0.01mTP 0.01cTP 0.99ER 0.01other chlo: 11.0, nucl: 1.0, plas: 1.0 0.108cTP 0.008mTP 0.920SP 0.004other RC=1 YES 0.99 0.01 0.00 0.00 0.00 0.015698 0.0001 0.992733 SP / chloro mTP cTP No SP / Chloro RcSLPa 29747.m001066|PACid:16807732 Streptophyta Ricinus communis 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 9.0, nucl: 2.0, cysk: 2.0 0.051cTP 0.121mTP 0.302SP 0.794other RC=3 NO 0.01 0.06 0.00 0.93 0.00 0.011124 0 0.28284 other nucleus or cytosol / SP No No No Cyto SbSLPb Sb05g006560.1|PACid:1969658 Streptophyta Sorghum bicolor 0.01mTP 0.41cTP 0.07ER 0.54other chlo: 7.0, cyto: 6.0 0.552cTP 0.018mTP 0.090SP 0.155other RC=4 YES 0.07 0.18 0.48 0.27 0.00 0.989283 0.000106 0.015257 chloro / nucleus or cytosol No cTP No Chloro SbSLPa Sb01g023200.1|PACid:1952078 Streptophyta Sorghum bicolor 0.32mTP 0.02cTP 0.62ER 0.25other chlo: 9.0, mito: 1.5, cyto_mito: 1.5, cyto: 1.0 0.515cTP 0.691mTP 0.009SP 0.020other RC=5 YES 0.02 0.73 0.24 0.02 0.00 0.14299 0.965856 0.000114 mito / chloro mTP cTP mTP Mito / Chloro YES Chloro SiSLPb Si026407m|PACid:19708378 Streptophyta Setaria italica 0.01mTP 0.34cTP 0.02ER 0.64other cyto: 6.0, chlo: 3.0, pero: 3.0, plas: 1.0 0.338cTP 0.007mTP 0.183SP 0.217other RC=5 YES 0.11 0.19 0.39 0.32 0.04 0.973411 0 0.070853 chloro / SP No Chloro SiSLPa Si036062m|PACid:19679889 Streptophyta Setaria italica 0.33mTP 0.07cTP 0.28ER 0.45other chlo: 9.0, mito: 2.5, cyto_mito: 2.0, plas: 1.0 0.436cTP 0.581mTP 0.005SP 0.027other RC=5 YES 0.04 0.68 0.25 0.02 0.00 0.101303 0.955894 0.000527 mito / chloro mTP cTP mTP Mito / Chloro SmSLPa 123773|PACid:15403392 Streptophyta Selaginella moellendorffii 0.01mTP 0.06cTP 0.26ER 0.69other chlo: 14.0 0.102cTP 0.298mTP 0.036SP 0.085other RC=5 YES 0.05 0.77 0.13 0.05 0.00 0.892789 0.511599 0 mito / SP mTP cTP mTP / cTP Chloro/Mito SmSLPb 65599|PACid:15406428 Streptophyta Selaginella moellendorffii **partial sequence xxx xxx xxx xxx xxx xxx xxx xxx xxx ThalSLPb Thhalv10009773m|PACid:20186509 Streptophyta Thellungiella halophila **partial sequence xxx ThalSLPa Thhalv10007840m|PACid:20186790 Streptophyta Thellungiella halophila 0.01mTP 0.00cTP 0.01ER 0.98other chlo: 5.0, cyto: 5.0, nucl: 1.0, E.R.: 1.0, cysk: 1.0 0.091cTP 0.079mTP 0.418SP 0.469other RC=5 NO 0.04 0.10 0.01 0.86 0.00 VcSLPb Vocar20007918m|PACid:23138954 Chlorophyta Volvox carteri 0.01mTP 0.01cTP 0.97ER 0.03other chlo: 12.0, mito: 2.0 0.404cTP 0.120mTP 0.319SP 0.004other RC=5 YES 0.28 0.46 0.08 0.18 0.00 Vocar20010810m|PACid:23129267 0.050cTP 0.233mTP 0.077SP 0.893other RC=2 GSVIVT01035133001|PACid 17840382 GSVIVT01035133001|PACid:17840382 VcSLPa xxx xxx Chlorophyta Volvox carteri 0.01mTP 0.00cTP 0.02ER 0.97other cyto: 4.0, nucl: 3.0, chlo: 2.0, mito: 2.0 Streptophyta Vitis vinifera 0.01mTP 0.04cTP 0.00ER 0.95other mito: 6.5, cyto_mito: 4.0, chlo: 2.0, nucl: 2.0 VvSLPb GSVIVT01024637001|PACid:17832792 Streptophyta Vitis vinifera 0.09mTP 0.14cTP 0.07ER 0.72other chlo: 14.0 0.702cTP 0.269mTP 0.010SP 0.030 RC=3 YES ZmSLPa GRMZM2G155058_T01|PACid:20861573 Streptophyta Zea mays 0.01mTP 0.61cTP 0.01ER 0.38other cyto: 9.0, chlo: 3.0, pero: 2.0 0.435cTP 0.009mTP 0.148SP 0.131other RC=4 YES VvSLPa 0.201cTP 0.145mTP 0.017SP 0.608other RC=3 NO NO xxx 0.01 0.06 0.00 0.92 0.00 0.03 0.17 0.04 0.76 0.00 0.00 0.19 0.80 0.01 0.00 0.07 0.19 0.48 0.25 0.00 xxx 0.001589 0.000436 0.020343 other 0.043505 0.000249 0.980277 xxx No xxx cTP xxx xxx xxx nucleus or cytosol/ chloro No No No Cyto SP / mito mTP cTP No Chloro/Mito 0 0.127076 0.095335 other nucleus or cytosol/ chloro No Cyto 0.945737 0 0.149592 chloro / nucleus or cytosol No No No Cyto / Chloro 0.132973 0.948294 0.000409 chloro / mito mTP cTP mTP Chloro / Mito chloro / SP No cTP No Chloro 0.986554 0 0.03539 No No xxx GRMZM2G125140_T01|PACid:20871059 Streptophyta Zea mays 0.42mTP 0.18cTP 0.28ER 0.34other chlo: 12.0, cyto: 1.0 0.752cTP 0.721mTP 0.004SP 0.011other RC=5 YES 0.01 0.38 0.61 0.01 0.00 0.355635 0.907809 0 mito / chloro mTP cTP mTP Chloro / Mito ZmSLPc GRMZM2G306348_T01|PACid:20872112 Streptophyta Zea mays 0.19mTP 0.36cTP 0.01ER 0.51other nucl: 6.0, chlo: 4.0, mito: 2.0, cysk: 1.0 0.169cTP 0.638mTP 0.002 0.135 RC=3 No 0.01 0.78 0.14 0.07 0.00 0 0.898969 0.144492 mito / nucleus or cytosol mTP cTP mTP Mito / Cyto ZmSLPd GRMZM2G459370_T01|PACid:20873602 Streptophyta Zea mays 0.12mTP 0.02cTP 0.09ER 0.78other chlo: 9.0, E.R.: 2.0, cyto: 1.0, mito: 1.0 0.051cTP 0.880mTP 0.023SP 0.213other RC=2 No 0.09 0.69 0.02 0.21 0.00 0 0.951541 0.223712 nucleus or cytosol / SP mTP No mTP / cTP ZmSLPe GRMZM2G364863_T01|PACid:20852560 Streptophyta Zea mays 0.01mTP 0.77cTP 0.00ER 0.23other chlo: 12.0, mito: 1.0 0.969cTP 0.022mTP 0.016SP 0.059other RC=1 YES 0.00 0.01 0.99 0.00 0.00 0.999886 0.000198 0.000157 chloro / nucleus or cytosol mTP cTP mTP / cTP Chloro ZmSLPf GRMZM2G313253_T02|PACid:20822007 Streptophyta Zea mays 0.26mTP 0.00cTP 0.02ER 0.73other cyto: 6.0, chlo: 5.0, vacu: 2.0 0.056cTP 0.905mTP 0.018SP 0.232other RC=2 No 0.01 0.91 0.00 0.07 0.00 0.00028 0.999636 0.001091 mito / nucleus or cytosol mTP No No Mito/Cyto ZmSLPg GRMZM2G313253_T02|PACid:20822006 Streptophyta Zea mays 0.26mTP 0.00cTP 0.02ER 0.73other cyto: 6.0, chlo: 5.0, vacu: 2.0 0.056cTP 0.905mTP 0.018SP 0.232other RC=2 No 0.01 0.91 0.00 0.07 0.00 0.00028 0.999636 0.001091 mito / nucleus or cytosol mTP No No Mito/Cyto Chlamydomonas reinhardtii 0.22mTP 0.01cTP 0.09ER 0.70other ZmSLPb Mito/Cyto Green Algae CrSLPc Cre02.g144900.t1.3|PACid:26900638 Chlorophyta chlo: 6.0, cyto: 6.0, nucl: 1.0 0.022cTP 0.507mTP 0.075SP 0.364other RC=5 0.000344 0.999784 0.000839 mito / SP No No mTP chlo: 6.0, plas: 5.0, E.R.: 2.0 0.005cTP 0.295mTP 0.006SP 0.866other RC=3 No 0.11 0.65 0.02 0.22 0.00 0 0.969472 0.138575 mito / nucleus or cytosol mTP No mTP / cTP Mito / Cyto 0.202cTP 0.891mTP 0.020SP 0.047other RC=2 YES 0.25 0.51 0.12 0.05 0.00 0.032756 0.765469 0.176738 mito / chloro mTP No mTP Mito / Chloro 0.001cTP 0.822mTP 0.446SP 0.005other RC=4 NO No 0.91 0.08 0.00 0.00 0.00 0.033852 0.006602 0.913536 mito / SP mTP No No Mito / Cyto NO 0.03 0.34 0.01 0.62 0.00 0.000303 0.999775 0.000972 mito / nucleus or cytosol No No No Cyto / Mito NO 0.02 0.11 0.00 0.87 0.00 0.001143 0.742107 0.000557 nucleus or cytosol / mito No No No Cyto / Mito NO 0.00 0.97 0.00 0.02 0.00 0.000354 0.999805 0.000814 mito / nucleus or cytosol mTP No mTP Mito 0.03 0.93 0.01 0.03 0.00 0.000401 0.999761 0.000983 chloro / mito mTP No mTP Mito/Chloro No mTP / cTP VcSLPc fgenesh4_pg.C_70252 Volvox carteri 0.32mTP 0.00cTP 0.04ER 0.66other MpSLPg jgi|MicpuN3|99756|fgenesh2_pg.C_Chr_03000537 Chlorophyta Micromonas pusilla 0.08mTP 0.03cTP 0.10ER 0.81other chlo: 10.0, mito: 3.0 MpSLPh jgi|MicpuC3|54428|MicpuC2.EuGene.0000170274 Chlorophyta Micromonas pusilla CCMP1545 0.22mTP 0.00cTP 0.99ER 0.01other mito: 4.0, vacu: 4.0, chlo: 3.0, nucl: 1.0 OlSLPd jgi|Ost9901_3|28347|eugene.1700010140 Chlorophyta treococcus lucimarinus CCE99 0.14mTP 0.00cTP 0.02ER 0.84other cyto: 7.0, chlo: 4.0, pero: 3.0 0.035cTP 0.831mTP 0.015SP 0.313other RC=3 OtSLPa jgi|Ostta4|14383|fgenesh1_pg.C_Chr_18.0001000104 Chlorophyta Ostreococcus tauri 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 11.0, chlo: 1.0, nucl: 1.0 0.255cTP 0.148mTP 0.008SP 0.791other RC=3 OrcSLPd jgi|OstRCC809_2|60729|fgenesh1_pm.chr_17_#_106 Chlorophyta Ostreococcus RCC809 0.32mTP 0.00cTP 0.17ER 0.56other chlo: 10.0, mito: 4.0 0.055cTP 0.930mTP 0.012SP 0.107other RC=1 Stramenopiles Aureococcus anophagefferens 0.22mTP 0.01cTP 0.44ER 0.43 other 12.0 chloro 2.0 mito 0.33cTP,0.912mTP,0.005SP 0.000other RC=3 YES 0.138cTP,0.257mTP,0.011SP,0.540other RC=4 Chlorophyta 0.02 0.87 0.01 0.10 0.00 Mito / Cyto Red & Brown Algae AaSLPa jgi|Auran1|18391 CMJ015C Rhodophyta Cyanidioschyzon merolae 0.01mTP 0.28cTP 0.02ER 0.69other cyto: 9.0, pero: 3.0, chlo: 2.0 NO 0.03 0.74 0.02 0.21 0.00 0.024699 0.964272 0 mito / chloro No CpSLP Contig7282 Glaucocystophyceae Cyanophora paradoxa 0.01mTP 0.00cTP 0.00ER 0.98other cyto: 8.0, chlo: 3.0, mito: 3.0 0.045cTP 0.681mTP 0.015SP 0.575other RC=5 NO 0.02 0.79 0.03 0.16 0.00 0 0.392988 0.011258 other mito / nucleus or cytosol mTP No No Mito/Cyto EsSLPa Esi0089_0101 Stramenopiles Ectocarpus silicuosus 0.82mTP 0.01cTP 0.57ER 0.08 other 10.0 chloro, 4.0mito 0.188cTP,0.789mTP,0.010SP,0.006other RC=2 YES 0.01 0.98 0.01 0.00 0.00 0.003188 0.99818 0.000906 mito / chloro mTP cTP mTP Mito/Chloro CmSLP 42 Mito/Cyto EsSLPb Esi0387_0013 Stramenopiles Ectocarpus silicuosus 0.09mTP 0.00cTP 0.00ER 0.90 other 8.5cyto, 3.0 chloro, 2.0nuc nucleus or cytosol / SP No No EhSLPa jgi|Emihu1|107003|fgeneshEH_pg.86__79 Stramenopiles Emiliania huxleyi CCMP1516 0.04mTP 0.01cTP 0.99ER 0.01other vacu: 4.0, chlo: 3.0, cyto: 3.0, E.R.: 3.0 0.006cTP,0.357mTP,0.688SP,0.207other RC=4 NO 0.99 0.00 0.00 0.01 0.00 0 0.993843 0.027233 SP / mito mTP No No SP/Mito EhSLPb jgi|Emihu1|220501|gm1.9100138 Stramenopiles Emiliania huxleyi CCMP1517 0.19mTP 0.01cTP 0.24ER 0.61other cyto: 11.0, chlo: 2.0 0.078cTP 0.165mTP 0.348SP 0.338other RC=5 NO 0.59 0.07 0.01 0.33 0.00 0 0.082223 0.979695 SP / mito No No mTP / cTP SP/Cyto 0.019cTP,0.122mTP,0.590SP,0.400other RC=5 NO 0.05 0.09 0.00 0.86 0.00 0.012091 0 0.536857 other No Cyto/SP EhSLPc jgi|Emihu1|199870|gm1.1100042 Stramenopiles Emiliania huxleyi CCMP1518 0.01mTP 0.02cTP 0.01ER 0.96other cyto: 5.0, extr: 4.0, chlo: 3.0, E.R.: 1.0 0.070cTP 0.150mTP 0.139SP 0.668other RC=3 NO 0.04 0.11 0.00 0.84 0.00 0 0.671407 0.745108 mito / nucleus or cytosol No No No Cyto FcSLP jgi|Fracy1|274756|estExt_fgenesh2_pm.C_40052 Stramenopiles Fragilariopsis cylindrus 0.02mTP 0.00cTP 0.00ER 0.98other pero: 7.0, mito: 4.0, nucl: 1.0, plas: 1.0 0.018cTP 0.957mTP 0.012SP 0.205other RC=2 NO 0.01 0.95 0.00 0.04 0.00 0 0.597124 0.83023 mito / nucleus or cytosol No No mTP Mito / Cyto GtSLPa jgi|Guith1|163734|estExt_fgenesh2_pg.C_440018 Stramenopiles Guillardia theta 0.03mTP 0.00cTP 0.99ER 0.01other chlo: 9.0, vacu: 2.0, E.R.: 2.0 0.412cTP 0.662mTP 0.068SP 0.003other RC=4 YES 0.70 0.29 0.01 0.01 0.00 0.023462 0.980096 0.001554 mito / chloro mTP cTP mTP Mito / Chloro GtSLPb jgi|Guith1|95409|estExt_Genewise1.C_460167 0.024cTP 0.445mTP 0.069SP 0.794other RC=4 NO 0.02 0.16 0.00 0.81 0.00 0 0.100823 0.036067 other nucleus or cytosol / mito GtSLPc Stramenopiles Guillardia theta 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 9.0, nucl: 2.0, mito: 1.0, plas: 1.0 No mTP / cTP jgi|Guith1|162103|estExt_fgenesh2_pg.C_170187 Stramenopiles Guillardia theta 0.01mTP 0.00cTP 0.98ER 0.02other chlo: 3.0, vacu: 3.0, E.R.: 2.0, nucl: 1.0, cyto: 1.0 0.082cTP 0.009mTP 0.741SP 0.042other RC=2 YES 0.98 0.01 0.00 0.01 0.00 0.003128 0 0.998129 mito / SP mTP No No SP/Mito GtSLPd jgi|Guith1|96181|estExt_Genewise1.C_630107 Stramenopiles Guillardia theta 0.01mTP 0.02cTP 0.99ER 0.01other chlo: 13.0 0.217cTP 0.336mTP 0.188SP 0.011other RC=5 YES 0.87 0.10 0.02 0.01 0.00 0.034939 0.000298 0.98338 chloro / SP mTP cTP No Chloro/SP GtSLPe jgi|Guith1|161633|estExt_fgenesh2_pg.C_120106 Stramenopiles Guillardia theta 0.02mTP 0.00cTP 0.99ER 0.01other cyto: 9.0, E.R.: 2.0, plas: 1.0, vacu: 1.0 0.066cTP 0.120mTP 0.662SP 0.176other RC=3 NO 0.99 0.00 0.00 0.01 0.00 0 0.004464 0.998495 mito / SP No No No SP/Cyto 0.08mTP 0.08cTP 0.00ER 0.84other No mTP PiSLP PITG_07305T0 | PITG_07305 Stramenopiles Phytophthora infestans T30-4 PrSLP jgi|Phyra1_1|94718|C_scaffold_22000047 Stramenopiles Phytophthora ramorum 0.01mTP 0.00cTP 0.99ER 0.01other chlo: 6.0, cyto: 2.0, mito: 1.0, plas: 1.0, extr: 0.007cTP 0.256mTP 0.776SP 0.018other RC=3 NO PsSLP jgi|Physo3|478619|e_gw1.2.4097.1 Stramenopiles Phytophthora sojae 0.01mTP 0.00cTP 0.99ER 0.01other chlo: 9.0, nucl: 1.0, plas: 1.0, vacu: 1.0, E.R.: 1.0 0.038cTP 0.616mTP 0.304SP 0.006other RC=4 NO PuSLP 0.01mTP 0.01cTP 0.05ER 0.94other cyto: 5.0, nucl: 4.0, chlo: 3.0, plas: 1.0 mito: 6.0, chlo: 5.0, nucl: 2.0 0.114cTP 0.539mTP 0.100SP 0.193other RC=4 NO 0.03 0.68 0.06 0.23 0.00 0.97 0.03 0.00 0.00 0.00 0.000475 0.985514 0.001242 mito / SP No mTP Cyto Mito 0 0.569835 0.81068 mito / SP mTP No No SP/Mito 0.49 0.47 0.01 0.03 0.00 0.000103 0.999244 0.003591 SP / mito mTP No No Mito/SP 0.03 0.26 0.00 0.70 0.00 0 0.337718 0.442579 other Cyto PYU1_T014025 Stramenopiles Pythium ultimum 0.111cTP 0.212mTP 0.088SP 0.444other RC=4 NO mito / nucleus or cytosol No No No TpSLP 649363167 XP_002296838 THAPSDRAFT_264391 Stramenopiles Thalassiosira pseudonana **partial sequence xxx xxx xxx xxx xxx xxx xxx xxx xxx LmSLPa LmjF.22.1490 Euglenozoa Leishmania major 0.01mTP 0.00cTP 0.83ER 0.17other vacu: 4.0, chlo: 3.0, E.R.: 3.0, nucl: 2.0, cyto: 1.0 0.000cTP 0.445mTP 0.966SP 0.055other RC=3 NO 1.00 0.00 0.00 0.00 0.00 0.000157 0 0.999557 SP / mito No No No SP LmSLPb LmjF.29.0440 Euglenozoa Leishmania major 0.35mTP 0.00cTP 0.03ER 0.63other plas: 8.5, cyto_plas: 5.0, chlo: 2.0, E.R.: 2.0 0.010cTP 0.494mTP 0.162SP 0.202other Rc=4 NO 0.54 0.33 0.01 0.12 0.00 0 0.023565 0.991472 mito / nucleus or cytosol mTP No mTP SP/Mito LmSLPc LmjF.30.3280 Euglenozoa Leishmania major 0.28mTP 0.00cTP 0.99ER 0.01other plas: 7.0, chlo: 2.0, E.R.: 2.0, extr: 1.0, vacu: 1.0 0.001cTP 0.134mTP 0.990SP 0.012other RC=1 NO 1.00 0.00 0.00 0.00 0.00 0 0.097454 0.982496 SP / nucleus or cytosol No No No SP NgSLP xxx xxx xxx Euglenozoa NW_003163286 Heterolobosea Naegleria gruberi **partial sequence xxx xxx xxx xxx xxx xxx xxx xxx TbSLPa Tb427.06.750 Euglenozoa Trypanosoma brucei TREU927 0.03mTP 0.00cTP 0.99ER 0.01other vacu: 4.0, E.R.: 3.0, nucl: 2.0, chlo: 1.0, cyto: 1.0 0.003cTP 0.269mTP 0.874SP 0.052other RC=2 NO 0.99 0.00 0.00 0.00 0.00 0 0.000255 0.999184 SP / mito mTP No No SP TbSLPb Tb427.06.4630 Euglenozoa Trypanosoma brucei TREU927 0.01mTP 0.01cTP 0.99ER 0.01other cyto: 6.5, cyto_nucl: 4.0, chlo: 2.0, E.R.: 2.0 0.031cTP 0.069mTP 0.748SP 0.284ER RC=3 NO 0.87 0.02 0.00 0.11 0.00 0 0.006315 0.998321 nucleus or cytosol / SP mTP No No SP mTP No No SP/Mito TcSLPb Tc00.1047053511731.30 Euglenozoa Trypanosoma cruzi 0.03mTP 0.00cTP 0.99ER 0.01other plas: 8.0, E.R.: 3.0, golg: 2.0 0.003cTP 0.407mTP 0.881SP 0.004other RC=3 NO 0.99 0.01 0.00 0.00 0.00 0 0.977022 0.123631 SP / nucleus or cytosol TcSLPc T 00 1047053508323 190 Tc00.1047053508323.190 Euglenozoa Trypanosoma cruzi 0.02mTP 0.00cTP 0.99ER 0.01other extr: 5.0, chlo: 3.0, vacu: 3.0, cyto: 1.0, plas: 1.0 0.004cTP 0.046mTP 0.986SP 0.057other RC=1 NO 1.00 0.00 0.00 0.00 0.00 0.000147 0 0.999557 SP / mito No No No TcSLPa Tc00.1047053508075.14 Euglenozoa Trypanosoma cruzi 0.01mTP 0.00cTP 0.99ER 0.01other plas: 9.5, cyto_plas: 5.5, E.R.: 3.0 0.005cTP 0.284mTP 0.928SP 0.025other RC=2 NO 1.00 0.00 0.00 0.00 0.00 0 0.532602 0.861938 SP / mito No No No SP CparSLP 639606472 cgd8_290 Apicomplexa Cryptosporidium parvum 0.08mTP 0.00cTP 0.97ER 0.03other vacu: 5.0, extr: 3.0, chlo: 2.0, plas: 2.0, E.R.: 1.0 0.003cTP 0.074mTP 0.973SP 0.029other RC=1 NO 1.00 0.00 0.00 0.00 0.00 0 0.067682 0.987443 mito / SP No No No SP PmSLPa 649397013 XP_002778164 Pmar_PMAR018604 Alveolata Perkinsus marinus ATCC 5098 0.02mTP 0.00cTP 0.99ER 0.01other chlo: 8.0, mito: 2.0, vacu: 2.0, nucl: 1.0 0.227cTP 0.079mTP 0.824SP 0.003other RC=3 NO 0.95 0.02 0.00 0.03 0.00 0.008994 0.007277 0.951841 chloro / mito mTP No No SP/Mito PmSLPb 649408011 XP_002787064 Pmar_PMAR006484 Alveolata Perkinsus marinus ATCC 5098 0.02mTP 0.00cTP 0.99ER 0.01other chlo: 6.0, mito: 3.0, E.R.: 3.0, vacu: 2.0 0.095cTP 0.145mTP 0.732SP 0.004other RC=3 0.98 0.01 0.00 0.01 0.00 0.000174 0.382049 0.844572 mito / chloro mTP No SP/Mito SP Apicomplexa NO No PbSLPa PBANKA_133240 0.04mTP 0.00cTP 0.99ER 0.01other extr: 7.0, vacu: 4.0, cyto: 1.0, mito: 1.0 0.095cTP 0.145mTP 0.732SP 0.004other RC=4 NO 0.99 0.00 0.00 0.01 0.00 0 0.041936 0.988473 SP / mito No No No PbSLPb PBANKA_060470 Apicomplexa Plasmodium berghei ANKA 0.01mTP 0.00cTP 0.99ER 0.01other chlo: 6.0, extr: 5.0, cyto: 2.0 0.005cTP 0.209mTP 0.928SP 0.028other RC=2 NO 0.99 0.01 0.00 0.00 0.00 0 0.513375 0.811642 SP / mito No No mTP PchaSLPa PCHAS_060650 Apicomplexa lasmodium_chabaudi_chabau 0.01mTP 0.00cTP 0.99ER 0.01other chlo: 6.0, extr: 5.0, cyto: 2.1 0.005cTP 0.209mTP 0.928SP 0.028other RC=2 NO 0.99 0.01 0.00 0.00 0.01 0 0.513375 0.811642 SP / mito No No mTP SP PchaSLPb PCHAS_133700 Apicomplexa lasmodium_chabaudi_chabau 0.05mTP 0.00 cTP 0.99ER 0.01other NO 0.98 0.00 0.00 0.02 0.00 0 0.060306 0.957058 SP / nucleus or cytosol No No No SP PvxSLPa PVX_117005 Apicomplexa SP / mito mTP No mTP SP/Mito PvxSLPb PVX_084335 Apicomplexa Plasmodium_vivax_SaI-1 0.07mTP 0.00cTP 0.99ER 0.01other cyto: 5.0, chlo: 4.0, extr: 2.0, nucl: 1.0, mito: 1.0 NO 0.98 0.00 0.00 0.01 0.00 0 0.004935 0.99743 SP / nucleus or cytosol mTP No No PySLPa PY00915 Apicomplexa lasmodium_yoelii_yoelii_17XN 0.01mTP 0.00cTP 0.99ER 0.01other chlo: 6.0, extr: 5.0, cyto: 2.0 0.005cTP 0.112mTP 0.971SP 0.023other NO 0.57 0.01 0.00 0.42 0.00 0 0.258925 0.652454 SP / nucleus or cytosol No No No PySLPb PY03645 Apicomplexa lasmodium_yoelii_yoelii_17XN 0.04mTP 0.00cTP 0.99ER 0.01other vacu: 4.0, extr: 3.0, cyto: 2.0, golg: 2.0 0.021cTP 0.087mTP 0.909SP 0.128other NO 0.97 0.00 0.00 0.02 0.00 0 0.081172 0.979407 SP / mito No No No SP TgSLP TGGT1_004320 Apicomplexa Toxoplasma gondii GT1 0.18mTP 0.14cTP 0.02ER 0.70other chlo: 6.0, nucl: 2.0, vacu: 2.0, cyto: 1.0, plas: 1.0 0.399cTP 0.297mTP 0.005SP 0.060other RC=5 NO 0.36 0.40 0.20 0.04 0.00 0.000349 0.999578 0.00084 chloro / mito No cTP mTP / cTP Chloro/Mito SP, mTP, other, PTS1 mTP, SP, Other MgloSLP A8Q735_MALGO Fungi Malassezia globosa 0.00mTP 0.01ER 0.99other mito: 8.0, cysk: 8.0, cyto: 5.5, cyto_pero: 4.0 0.116mTP 0.063SP 0.873other RC=2 xxx 0.01 0.03 0.96 0.00 0.004395 0.050324 other Mito/Cyto PgraSLP E3JUL7_PUCGT Fungi Puccinia graminis 0.31mTP 0.46ER 0.37other mito: 11.0, golg: 5.0, cyto: 2.0, plas: 2.0, extr: 2.0 0.555mTP 0.643SP 0.014other RC=5 xxx 0.67 0.11 0.22 0.00 0.989244 0.007792 Apicomplexa Plasmodium berghei ANKA Plasmodium_vivax_SaI-1 Fungi 0.41mTP 0.00cTP 0.99ER 0.00other 0.000cTP 0.025mTP 0.916SP 0.096other extr: 4.0, vacu: 3.0, cyto: 2.0, mito: 2.0 0.000 cTP 0.092mTP 0.970SP 0.014other cyto: 6.0, mito: 3.0, nucl: 2.0, plas: 2.0 0.005cTP 0.097mTP 0.948SP 0.067other mTP, ER, Other NO mTP, SP, Other 0.67 0.04 0.00 0.29 0.00 0 0.847531 0.421309 xxx No mTP xxx mTP BdenSLP F4NYP9_BATDJ Fungi Batrachochytrium dendrobatidis **partial sequence xxx xxx xxx xxx xxx xxx xxx xxx xxx xxx F4RN30_MELLP Fungi u g Melampsora e a pso a larici-populina a c popu a 0.06mTP 0 06 0.90ER 0 90 0.09other 0 09ot e mito: to 20.0, 0 0, nucl: uc 4.5, 5, cyto_ cyto nucl: uc 4.0 0 0.363mTP 0 363 0.589SP 0 589S 0 0.023other 0 3ot e RC=4 C xxx 0.43 0 30 0.18 80 0.39 39 0 0.00 00 0.978622 0 9 86 0 mito to / nucleus uc eus o or cytoso cytosol mTP xxx No o Mito/SP to/S RgluSLP G0T0W9_RHOG2 Fungi Rhodotorula glutinis 0.16mTP 0.99ER 0.01other extr: 19.0, E.R.: 4.0, mito: 1.0, plas: 1.0 0.332mTP 0.717SP 0.012other RC=4 xxx 0.98 0.00 0.02 0.00 0.085595 0.937863 mito / SP mTP xxx No SP/Mito LbSLP jgi|Lacbi1|189922|estExt_GeneWisePlus_worm.C_80340 0.022mTP 0.980SP 0.033other RC=1 extr: 21.0, plas: 2.0, golg: 2.0 extr: 13.0, E.R.: 6.0, golg: 3.0, mito: 2.0, cyto: 2.0 0.139952 0.933613 Fungi Laccaria bicolor xxx 0.95 0.00 0.05 0.00 YQJ7_SCHPO Fungi Schizosaccharomyces pombe 0.021mTP 0.943SP 0.060other RC=1 xxx 0.98 0.00 0.02 0.00 SP / mito No No SP/ER Q5KCT2_CRYNJ Fungi Cryptococcus neoformans 0.01mTP 0.99ER 0.01other extr: 23.0, plas: 3.0 0.094mTP 0.930SP 0.011other RC=1 xxx 0.96 0.00 0.04 0.00 0.140072 0.933536 SP / mito mTP xxx No SP/Mito UmSLP jgi|Ustma1|3316|UM03316 Fungi Ustilago maydis 0.03mTP 0.01ER 0.97other extr: 12.0, mito: 7.0, cyto: 4.5, cyto_nucl: 4.0 0.416mTP 0.088SP 0.508other RC=5 xxx 0.01 0.03 0.96 0.00 0.098582 0.030083 other nucleus or cytosol / mito No xxx No Cyto Aureococcus anophagefferens 0.01mTP 0.00 cTP 0.34ER 0.65 other 3.0 chloro, 3.0vac, 3.0ER, 2.5cyto, 2.0 nuc 0.03cTP,0.064mTP,0.853SP, 0.290other RC=3 NO 0.96 0.01 0.00 0.03 0.00 0 0.004694 0.990785 SP / nucleus or cytosol No No No SP/Cyto xxx 0.006074 0.960532 SP / mito No xxx xxx No Mito/SP CneoSLP SpSLP 0.02mTP 0.99ER 0.01other mito / SP mTP SP SP a pS MlarpSLP 0.01mTP 0.99ER 0.01other nucleus or cytosol / mito SP SP SP Chromalveolates AaSLPb gi|323451994 PcapSLP jgi|Phyca11|66148|gw1.23.354.1 Stramenopiles Phytophthora capsici **partial sequence xxx xxx xxx xxx xxx xxx PiSLP PITG_07305T0 Stramenopiles Phytophthora infestans T30-4 0.08mTP 0.08cTP 0.00ER 0.84other mito: 6.0, chlo: 5.0, nucl: 2.0 0.659mTP 0.058SP 0.224other RC=3 NO 0.03 0.68 0.06 0.23 0.00 0.000475 0.985514 0.001242 mito / SP mTP No mTP Mito/SP PrSLP jgi|Phyra1_1|94718|C_scaffold_22000047 Stramenopiles Phytophthora ramorum 0.01mTP 0.00cTP 0.99ER 0.01other chlo: 6.0, cyto: 2.0, mito: 1.0, plas: 1.0, extr: 1.0 0.099mTP 0.918SP 0.019other RC=1 NO 0.97 0.03 0.00 0.00 0.00 0 0.569835 0.81068 mito / SP mTP No No SP/Mito Phytophthora sojae 0.01mTP 0.00cTP 0.99ER 0.01other chlo: 9.0, nucl: 1.0, plas: 1.0, vacu: 1.0, E.R.: 1.0 0.124mTP 0.932SP 0.011other RC=1 mTP Pythium ultimum 0.01mTP 0.01cTP 0.05ER 0.94other cyto: 5.0, nucl: 4.0, chlo: 3.0, plas: 1.0 jgi|Physo3|478619|e_gw1.2.4097.1 Stramenopiles xxx xxx xxx NO 0.49 0.47 0.01 0.03 0.00 0.000103 0.999244 0.003591 mito / SP No No PutSLP PYU1_T014025 Stramenopiles 0.132mTP 0.088SP 0.757other RC=2 NO 0.03 0.26 0.00 0.70 0.00 0 0.337718 0.442579 Other mito / nucleus or cytosol No No mTP TpSLP 649363167 XP_002296838 THAPSDRAFT_264391 Stramenopiles Thalassiosira pseudonana **partial sequence xxx xxx xxx xxx xxx xxx xxx xxx xxx xxx TtSLP AMSG_02057T0 Apusozoa hecamonas trahens ATCC 500 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 4.0, chlo: 3.0, plas: 3.0, pero: 3.0 0.106mTP 0.070SP 0.878other RC=2 NO 0.01 0.06 0.00 0.93 0.99 0.024642 0 0.290571 other nucleus or cytosol / SP No No No Cyto / Perox PsSLP Stramenopiles Mito/SP Cyto/Mito have occurred during SLP sequence evolution. A clear example of this is provided by the group of sequences from Apicomplexa. This group contains the only other SLP protein which has been characterized in detail biochemically, the SHLP1 protein of Plasmodium berghei (the causative agent of malaria in the mouse) (Patzewitz et al., 2013). This protein (corresponding to sequence PbSLPa) has been shown to be localized in the ER membrane, which is consistent with the prediction of a signal peptide (SP). Analysis indicates this is a conserved feature of most of the SLP proteins from Apicomplexa. It is interesting in this regard that most of SLP sequences in the dataset from the parasitic Euglenozoa also manifest a predicted signal peptide. Furthermore, it should be noted that the non-pathogenic fungi Schizosaccharomyces pombe and Laccaria bicolor also possess SLP proteins with predicted signal peptides. Indeed, previous findings indicate that the Schizosaccharomyces pombe SLP phosphatase is also ER localized (Matsuyama et al., 2006). 43 2.3.2.2 RLPH phosphatases Data uncovered regarding the distribution and interrelationships of the Rhizobiales-like phosphatases (RLPHs) are also presented (Figure 2.4). Once again, we see that the species representation is heavily weighted toward photosynthetic organisms, with the only exceptions being the Heterolobosean Naegleria gruberi and the Choanoflagellate Salpingoeca rosetta. Amongst photosynthetic organisms the RLPH distribution is dominated by land plants. Despite intensive searching, RLPH sequences could only be detected in two different strains of a single green algal species, Micromonas pusilla. None of the photosynthetic Chromalveolates contained a RLPH sequence, with the sole remaining eukaryotic organism being the photosynthetic Rhizarian Bigelowiella natans. Intensive searching by TBLASTN also failed to reveal any additional RLPHs amongst previously unannotated sequences from other species of Green Algae, or any other eukaryotic group. At the base of the RLPH distribution is a closely related set of sequences from Planctomycetes bacteria. Closely associated with the RLPH sequence distribution is a set of sequences from αProteobacteria. Other, more distantly related bacterial sequences include representatives from a variety of groups including Cyanobacteria, δ-Proteobacteria, Bacteroidetes and Thermotogae. No RLPH sequences were detected from Archaea, either by HMM searching of protein databases derived from completely sequenced archaeal genomes, BLASTP searching of archaeal protein databases, or TBLASTN searching amongst archaeal nucleotide databases. The RLPHs have a distinctive predicted subcellular localization not shared by the SLPs, with most sequences maintaining a predicted cytoplasmic / nuclear localization (Table 2.2). This is true not only of the land plants, but also the Naegleria sequence, the most deeply 44 Figure 2.4: Orthogonal phylogenetic tree depicting RLPH phosphatase distribution and interrelationships across both Eukaryotes and Prokaryotes. Phylogenetic tree inference was performed as outlined in the Materials and Methods. The most crucial nodes are labelled. Branch support values with the four inference methods (PhyML [aBayes]; RAxML [RBS]; MrBayes [PP]; PhyloBayes_MPI [PP]) are as follows (see Materials and Methods for details). Node A: (0.999, 99, 0.98, 1.00); Node B: (0.575, 80, 0.95, 0.86); Node C: (0.999, 90, 0.93, 1.00); Node D: (1.00, 100, 0.98, 1.00); Node E: (0.999, 16, 0.83, 0.95); Node F: (0.999, 12, 0.90, 0.91). Branch support values for all trees are summarized as Appendix B. Predicted in silico subcellular localizations are represented by Mt (mitochondria), Cy (cytosol), Nu (Nuclear). Sequences used in tree generation are listed in Appendix A.2 and in silico subcellular localization data are listed in Table 2.2. 45 Table 2.2: In silico subcellular prediction data for eukaryotic RLPH phosphatases used in phylogenetic tree and alignment construction. Prediction consensus is highlighted in yellow. Tree labels, gene identifiers, organism group and organism information are also provided. RLPH Phosphatases Prediction Program Tree Designation Gene Identifier AlRLPHa Predotar Organism Group Organism 478324|PACid:16048263 Streptophyta Arabidopsis lyrata 0.01mTP 0.00cTP 0.00ER 0.99other Wolf pSORT TargetP ChloroP Protein Prowler PredSL cTP, mTP, SP, other Yes, No SP, mTP, cTP, other, PTS1 cTP, mTP, SP, Other nucl: 9.0, cyto: 3.0, mito: 1.0 0.046cTP 0.252mTP 0.213SP 0.536other RC=4 No 0.03 0.12 0.00 0.85 0.00 0 0.003997 0.638358 nucleus or cytosol / SP mTP, cTP, ER, Other SLP-Local MitoProt PCLR iPSORT mTP, No cTP, No mTP, cTP, No No No mTP cyto/nuc Consensus AlRLPHc 478323|PACid:16040833 Streptophyta Arabidopsis lyrata 0.01mTP 0.00cTP 0.00ER 0.99other nuc l: 6.0, cyto: 5.0, mito: 1.0, pero: 1.0 0.037cTP 0.254mTP 0.227SP 0.680other RC=3 No 0.02 0.08 0.00 0.91 0.00 0 0.024833 0.099683 other nucleus or cytosol / SP No No No cyto/nuc AcRLPHa Aquca_027_00420.1|PACid:22045939 Streptophyta Aquilegia coerulea 0.01mTP 0.00cTP 0.00ER0.99other cyto: 8.0, nucl: 3.0, cysk: 2.0 0.075cTP 0.062mTP 0.137SP 0.895other RC=2 No 0.02 0.06 0.00 0.93 0.00 0 0.000509 0.975568 nucleus or cytosol / SP No No No cyto/nuc AtRLPHa AT3G09960 Streptophyta Arabidopsis thaliana 0.02mTP 0.00cTP 0.01ER 0.98other nucl: 9.0, cyto: 3.0, mito: 1.0 0.075cTP 0.199mTP 0.101SP 0.756other RC=3 No 0.02 0.08 0.00 0.90 0.00 0.000119 0.000834 0.619835 nucleus or cytosol / SP No No No cyto/nuc AtRLPHb AT3G09970 Streptophyta Arabidopsis thaliana 0.01mTP 0.00cTP 0.00ER 0.99other nucl: 10.0, cyto: 2.0, mito: 1.0 0.051cTP 0.175mTP 0.223SP 0.628other RC=3 No 0.02 0.09 0.00 0.89 0.00 0 0.001327 0.512618 other nucleus or cytosol / SP No No No BdRLPH Bradi5g09180.1|PACid:21826565 Streptophyta Brachypod ium distachyon 0.01mTP 0.03cTP 0.13ER0.85other cyto: 6.0, chlo: 5.0, plas: 2.0 0.160cTP 0.158mTP 0.090SP 0.619other RC=3 No 0.06 0.13 0.02 0.79 0.00 0 0.009879 0.997885 nucleus or cytosol / chloro No No mTP/cTP cyto/nuc BnRLPH jgi|Bigna1|87196|estExt_fgenesh1_pg.C_170184 Rhizaria Bigelowiella natans CCMP2755 0.36mTP 0.10cTP 0.01ER 0.57other chlo: 7.0, mito: 3.0, nucl: 2.0, E.R.: 2.0 0.224cTP 0.706mTP 0.006SP 0.099other RC=3 No 0.01 0.95 0.02 0.03 0.00 0.000318 0.99905 0.001265 mito / nucleus or cytosol mTP No No mito/cyto nuc l: 7.0, cyto: 6.0 nucleus or cytosol / mito No BrRLPH Bra029809|PACid:22685889 Streptophyta Brassica rapa 0.047cTP 0.210mTP 0.185SP 0.591other RC=4 No 0.03 0.01 0.96 0.00 0 0.037468 0.983744 CruRLPHa Carubv10014233m|PACid:20900199 Streptophyta Capsella rubella 0.01mTP 0.00cTP 0.00ER 0.99other nucl: 9.0, cyto: 3.0, mito: 1.0 0.048cTP 0.183mTP 0.281SP 0.561other RC=4 No 0.02 0.02 0.97 0.00 0.00019 0.001662 0.090186 other nucleus or cytosol / SP No No No cyto/nuc CruRLPHb Carubv10014229m|PACid:20899239 Streptophyta Capsella rubella 0.01mTP 0.00cTP 0.02ER 0.97other nucl: 9.0, cyto: 3.0, mito: 1.0 0.091cTP 0.192mTP 0.141SP 0.609other RC=3 No 0.04 0.01 0.95 0.00 0 0.00413 0.945158 nucleus or cytosol / mito No No No cyto/nuc CpaRLPH Carica papaya 0.01cTP 0.00mTP 0.00ER 0.99other No No cyto/nuc cyto/nuc evm.model.supercontig_5.61|PACid:16421512 Streptophyta 0.01mTP 0.00cTP 0.07ER 0.92other cyto: 11.0, chlo: 2.0 0.055cTP 0.173mTP 0.094SP 0.927other RC=2 No 0.01 0.06 0.00 0.93 0.00 0 0.014847 0.134998 other nucleus or cytosol / mito No No mTP cyto/nuc CcRLPHa clementine0.9_016692m|PACid:19266957 Streptophyta Citrus clemntina 0.03mTP 0.00cTP 0.01ER 0.97other cyto: 11.0, chlo: 2.0 0.060cTP 0.440mTP 0.052SP 0.554other RC=5 No 0.03 0.29 0.00 0.68 0.00 0 0.921123 0.353943 nucleus or cytosol / mito No No mTP/cTP cyto/nuc CcRLPHb clementine0.9_019398m|PACid:19266958 Streptophyta Citrus clemntina 0.03mTP 0.00cTP 0.01ER 0.97other cyto: 11.0, chlo: 2.0 0.060cTP 0.440mTP 0.052SP 0.554other RC=5 No 0.03 0.29 0.00 0.68 0.00 0 0.921123 0.353943 nucleus or cytosol / mito No No mTP/cTP cyto/nuc CsRLPH orange1.1g025290m|PACid:18101020 Streptophyta Citrus sinensis 0.03mTP 0.00cTP 0.01ER 0.97other cyto: 10.0, mito: 2.0, chlo: 1.0 0.060cTP 0.440mTP 0.052SP 0.554other RC=5 No 0.03 0.29 0.00 0.68 0.00 0 0.921123 0.353943 nucleus or cytosol / mito No No mTP/cTP CsatRLPHa Cucsa.378400Cucsa.378400.1|PACid:16982048 Streptophyta Cucumis sativus 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 7.0, nucl: 3.0, chlo: 2.0, plas: 1.0 0.064cTP 0.152mTP 0.158SP 0.784other RC=2 No 0.02 0.07 0.00 0.91 0.00 0 0.001349 0.705375 nucleus or cytosol / SP No No mTP cyto/nuc CsatRLPHb Cucsa .338320.1|PACid:16978737 Streptophyta Cucumis sativus 0.01cTP 0.00mTP 0.01ER 0.98other cyto: 10.5, cyto_E.R.: 6.0, nucl: 1.0, cysk: 1.0 0.092cTP 0.122mTP 0.108SP 0.830other RC=2 No 0.02 0.02 0.96 0.00 0 0.003026 0.741126 nucleus or cytosol / SP No No No cyto/nuc 0.827 0 0.036403 nucleus or cytosol / chloro No cyto/chloro/mito EgRLPH Eucgr.A01531.1|PACid:23563318 Streptophyta Eucalyptus grandis 0.01mTP 0.01cTP 0.00ER 0.99other cTP No GmRLPHb Glyma10g07800.1|PACid:26346430 Streptophyta Glycine max 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 11.0, nucl: 1.0, E.R.: 1.0 0.085cTP 0.047mTP 0.384SP 0.671other RC=4 No 0.02 0.06 0.00 0.92 0.00 0 0.010259 0.048305 other nucleus or cytosol / mito No No mTP/cTP cyto/nuc GmRLPHa Glyma13g21640.1|PACid:26315536 Streptophyta Glycine max 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 9.0, chlo: 2.0, nucl: 1.0, plas: 1.0 0.072cTP 0.096mTP 0.140SP 0.797other RC=2 No 0.01 0.07 0.00 0.91 0.00 0 0.027197 0.090347 other nucleus or cytosol / chloro No No No cyto/nuc GrRLPHa Gorai.009G406100.1|PACid:26769011 Streptophyta Gossyp ium raimondii 0.01mTP 0.00cTP 0.04ER 0.96other cyto: 10.0, chlo: 2.0, nucl: 1.0 0.071cTP 0.181mTP 0.176SP 0.782other RC=2 No 0.01 0.06 0.00 0.93 0.00 0.000229 0.000929 0.196542 other nucleus or cytosol / chloro No No No cyto/nuc GrRLPHb Gorai.009G405800.1|PACid:26764307 cyto: 11.0, nucl: 1.0, cysk: 1.0 0.061cTP 0.170mTP 0.194SP 0.786other RC=3 cysk: 10.0, cyto: 2.0, plas: 1.0 0.877cTP 0.018mTP 0.075SP 0.477other RC=4 No cyto/nuc 0.02 0.09 0.01 0.89 0.00 Streptophyta Gossyp ium raimondii 0.01mTP 0.00cTP 0.06ER 0.93other No 0.02 0.07 0.00 0.91 0.00 nucleus or cytosol / SP No No cyto/nuc LuRLPH Lus10014383|PACid:23143669 Streptophyta Linum usitatissimum 0.01mTP 0.00cTP 0.02ER 0.98other cyto: 10.5, cyto_E.R.: 6.0, nucl: 2.0 0.047cTP 0.108mTP 0.300SP 0.661other RC=3 No 0.02 0.07 0.00 0.90 0.00 0 0.017295 0.530941 other nucleus or cytosol / mito No No No cyto/nuc MdpRLPHa MDP0000218397|PACid:22671674 Streptophyta Malus domestica 0.01mTP 0.65cTP 0.00ER 0.34other chlo: 6.0, cyto: 3.0, mito: 2.0, pero: 2.0 0.824cTP 0.036mTP 0.060SP 0.256other RC=2 YES 0.00 0.01 0.98 0.01 0.00 0.999909 0.000194 0.000199 chloro / nucleus or cytosol No cTP mTP/cTP MdpRLPHb MDP0000131572|PACid:22643486 Streptophyta Malus domestica 0.01mTP 0.88cTP 0.00ER 0.12other chlo: 11.0, mito: 2.0 0.968cTP 0.024mTP 0.001ER 0.083other YES 0.999845 0.000134 0.00026 chloro / nucleus or cytosol mTP cTp cTP chloro MeRLPH cassava4 .1_012125m.g Streptophyta Manihot esculenta 0.01mTP 0.00cTP 0.10ER 0.90other cyto: 11.0, chlo: 2.0 0.019cTP 0.181mTP 0.260SP 0.770other RC=3 No 0.02 0.07 0.00 0.91 0.00 0 0.012238 0.16485 other nucleus or cytosol / chloro No No No cyto/nuc MpRLPHa jgi|MicpuC3|19876|MicpuC2.e_gw1.9.342.1 Chlorophyta Micromonas pusilla CCMP 1545 0.04mTP 0.01cTP 0.02ER 0.94other chlo: 5.0, mito: 5.0, cyto: 2.0, nucl: 1.0 0.098cTP 0.621mTP 0.020 0.195 RC=3 No 0.02 0.82 0.02 0.14 0.00 0.000126 0.996704 0.003362 nucleus or cytosol / chloro mTP No No mito/cyto xxx xxx xxx xxx MpRLPHb xxx 0.00 0.01 0.99 0.00 0.00 0 0.004408 0.056452 other No xxx chloro jgi|MicpuN3|82680|e_gw2.06.480.1 Chlorophyta Micromonas pusilla CCMP1545 **Partial Sequence Only xxx xxx xxx xxx MgRLPH mgv1a008117m|PACid:17676001 Streptophyta Mimulus guttatus 0.01mTP 0.40cTP 0.00ER 0.60other cyto: 7.0, nuc l: 2.0, pero: 2.0, chlo: 1.0 0.195cTP 0.072mTP 0.119SP 0.593other RC=4 No 0.03 0.11 0.01 0.85 0.00 0.237309 0 0.467375 other nucleus or cytosol / chloro No No No cyto/nuc OsRLPH LOC_Os04g33470.1|PACid:21897059 Streptophyta Oryza sativa 0.01mTP 0.00cTP 0.02ER 0.98other chlo: 4.0, cyto: 3.0, nucl: 2.0, pero: 2.0 0.239cTP 0.188mTP 0.051SP 0.632other RC=4 No 0.02 0.09 0.00 0.89 0.00 0 0.005448 0.998691 nucleus or cytosol / chloro No No mTP/cTP cyto/nuc No mTP/cTP cyto/mito y PvRLPH Pavirv00063697 m|PACid:23762733 | Streptophyta p p y 0.264cTP 0.158mTP 0.046SP 0.711other RC=3 No 0.04 0.15 0.03 0.78 0.00 0 0.023443 0.993304 y / chloro nucleus or cytosol No PvuRLPH Phvul.007G222200.1|PACid:27159562 Streptophyta Phaseolus vulgaris 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 7.0, nucl: 5.0, chlo: 2.0 0.117cTP 0.120mTP 0.112SP 0.776other RC=2 No 0.01 0.07 0.00 0.92 0.00 0.000102 0.002787 0.090987 other nucleus or cytosol / chloro No No No PpaRLPH Pp1s118_177 V6.1|PACid:18069084 Streptophyta Physco mitrella patens 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 7.5, cyto_E.R.: 4.5, pero: 2.0, chlo: 1.0 0.093cTP 0.153mTP 0.168SP 0.794other RC=2 No 0.01 0.08 0.00 0.91 0.00 0 0.039665 0.090002 other nucleus or cytosol / mito No No No cyto/nuc PperRLPH ppa015436m|PACid:17668853 Streptophyta Prunus persica 0.01mTP 0.00cTP 0.00ER 0.99other cyto: 12.0, chlo: 1.0 0.050cTP 0.235mTP 0.112SP 0.847other RC=2 No 0.000109 0.001869 0.180797 other nucleus or cytosol / mito No No mTP cyto/nuc PtRLPHa Potri.002G156200.1|PACid:27024057 Streptophyta Populus trichocarpa 0.02mTP 0.00cTP 0.00ER0.98other cyto: 10.0, pero: 2.0, mito: 1.0 0.369cTP 0.069mTP 0.118SP 0.795other RC=3 No PtRLPHc Potri.002G156500.1|PACid:27023072 Streptophyta Populus trichocarpa 0.02mTP 0.00cTP 0.00ER0.97other cyto: 9.0, pero: 2.0, nucl: 1.0, mito: 1.0 0.397cTP 0.060mTP 0.139SP 0.800other RC=3 No RcRLPH 27496.m000096|PACid:16798510 Streptophyta Ricinus communis 0.01mTP 0.02cTP 0.01ER0.97other chlo: 6.0, mito: 4.0, cyto: 3.0 0.061cTP 0.169mTP 0.165SP 0.902other RC=2 No 0.02 0.09 0.01 0.88 0.00 0.347824 0 0.43965 other SbRLPH Sb06g015410.1|PACid:1972275 Streptophyta Sorghum bicolor 0.01mTP 0.00cTP 0.07ER 0.92other cyto: 7.0, chlo: 4.0, plas: 2.0 0.142cTP 0.192mTP 0.180SP 0.658other RC=3 No 0.05 0.11 0.01 0.82 0.00 SiRLPH Si010641m|PACid:19695892 Streptophyta Setaria italica 0.01mTP 0.00cTP 0.08ER 0.91other cyto: 8.0, chlo: 3.0, plas: 2.0 0.165cTP 0.213mTP 0.094SP 0.670other RC=3 No SmRLPH 130493|PACid:15421901 Streptophyta Selaginella moellendorfii 0.03mTP 0.00cTP 0.01ER 0.95other chlo: 5.0, mito: 4.0, vacu : 2.0, E.R.: 2.0 0.022cTP 0.771mTP 0.016SP 0.600other RC=5 ThalRLPH Thhalv10021191 m|PACid:20183521 Streptophyta Thellungiella halophila 0.01mTP 0.01cTP 0.00ER 0.98other cyto: 7.0, cysk: 5.0, nucl: 2.0 0.197cTP 0.115mTP 0.092SP 0.718other RC=3 Panicum virgatum g 0.01mTP 0.00cTP 0.01ER 0.98other chlo: 10.0,, mito: 2.0,, cyto: y 1.0 0.01 0.07 0.00 0.92 0.00 0.03 0.13 0.02 0.82 0.00 cyto/nuc 0.001104 0.000528 0.029679 other nucleus or cytosol / mito No No No cyto/nuc 0.003919 0.000115 0.0854 other nucleus or cytosol / mito No No No cyto/nuc nucleus or cytosol / mito No No mTP/cTP cyto/nuc 0 0.007303 0.963105 nucleus or cytosol / SP No No mTP/cTP 0.04 0.13 0.01 0.82 0.00 0 0.031257 0.992595 nucleus or cytosol / chloro No No mTP/cTP cyto/nuc No 0.03 0.71 0.01 0.25 0.00 0 0.998186 0.008288 nucleus or cytosol / mito mTP No mTP mito/cyto No 0.02 0.08 0.00 0.90 0.00 0.000527 0.000324 0.293576 other nucleus or cytosol / SP No No No cyto/nuc cyto/mito 0.03 0.15 0.05 0.76 0.00 cyto/nuc VvRLPH GSVIVT01033803001 |PACid:17839446 Streptophyta Vitis vinifera 0.06mTP 0.00cTP 0.00ER 0.94other cyto: 4.0, chlo_mito: 4.0, chlo: 3.5, mito: 3.5 0.010cTP 0.883mTP 0.045SP 0.260other No 0.01 0.87 0.00 0.12 0.00 0 0.995098 0.009557 mito / nucleus or cytosol No No No ZmRLPH GRMZM2G128399_T01|PACid:20872846 Streptophyta Zea mays 0.01mTP 0.00cTP 0.10ER 0.89other cyto: 5.0, chlo: 4.0, mito: 2.0, plas: 2.0 0.122cTP 0.236mTP 0.076SP 0.745other RC=3 No 0.05 0.13 0.01 0.81 0.00 0 0.017542 0.994634 nucleus or cytosol / chloro No No mTP/cTP cyto/nuc NgRL PH 649365958 XP_002669792 NAEGRDR AFT_59846 Heterolobosea Naegleria gruberi 0.01mTP 0.02cTP 0.02ER0.96other nucl: 11.0, chlo: 1.0, cyto: 1.0 0.086cTP 0.064mTP 0.038SP 0.840other RC= 2 No 0.50 0.08 0.00 0.42 0.00 0.6341 0.000658 0.000369 nucleus or cytosol / SP No No No cyto/nuc SrRLPH PTSG_01069T0 | PTSG_01069 Choanoflagellida Salpingoeca rosetta 0.01mTP 0.03cTP 0.01ER 0.95other mito: 13.0, nucl: 9.5, cyto_nucl: 8.5, cyto: 6.5 0.141mTP 0.154SP 0.699other RC=3 nucleus or cytosol / SP No xxx No cyto/mito Excavate Choanozoa mTP, SP, Other xxx SP, mTP, other, PTS1 mTP, SP, Other 0.03 0.01 0.96 0.00 0.004088 0.050907 other diverging RLPH sequence in the tree, which suggests that a distinctive targeting of RLPH class sequences may have occurred early in eukaryotic evolution. 2.3.3 Sequence motif identification Upon the classification of SLP and RLPH phosphatases, a novel C-terminal sequence motif I/L/V-D-S/T-G (labeled Motif 2 here) was revealed (Andreeva and Kutuzov, 2004). Data confirm the conservation of sequence Motif 2 across all eukaryotic bacterial-like phosphatases (Figure 2.1-2.2, 2.5), in addition to revealing a second C-terminal motif (Motif 1) (M/I/V)(I/L/V)-(V/S/F)-G-H-(T/H/D) upstream of Motif 2 (Figure 2.1-2.2, 2.6). Within both of these sequence motifs, each eukaryotic bacterial-like phosphatase subclass was found to maintain 46 Figure 2.5: Compiled bacterial-like phosphatase Motif 2 from SLP and RLPH phosphatases. Amino acid positional probability consensus within the bacterial-like motif 2 of (A) SLP and (B) RLPH phosphatases from eukaryotic organisms outlined in each respective phylogenetic tree and listed in Figures 2.3 and 2.4 as well as Appendix A.1 and A.2. Amino acid positional probability consensus within bacterial-like motif 2 of photosynthetic Eukaryote SLP (C) and RLPH (D) phosphatases only. Largest diversity was observed in motif position 3 where threonine (T), conserved amongst prokaryotic and eukaryotic SLP and RLPH phosphatases alike, was replaced with valine (V) and glutamic acid (E) in photosynthetic Eukaryote SLP and RLPH phosphatases, respectively. Amino acid colors represent polar (Green), Neutral (Purple), Basic (Blue), Acidic (Red) and Hydrophobic (Black) amino acids. Each amino acid positional probability consensus was constructed using MAFFT aligned sequences submitted to WebLogo 3 (http://weblogo. threeplusone.com/). 47 Figure 2.6: Compiled bacterial-like phosphatase Motif 1 from SLP and RLPH phosphatases. Amino acid positional probability consensus within the bacterial-like motif 1 of (A) SLP and (B) RLPH phosphatases from eukaryotic organisms outlined in each respective phylogenetic tree and listed in Figures 2.3 and 2.4, respectively, as well as Appendix A.1 and A.2. (C-D) Amino acid positional probability consensus within bacterial-like motif 1 of photosynthetic Eukaryote SLP and RLPH phosphatases only. Bacterial-like motif 1 exhibited largest diversity in motif positions 1 through 3, where predominantly a mixed variety of hydrophobic amino acids were observed. Similar to position 3 of motif 2, position 6 of motif 1 also exhibited conserved bacterial-like class diversity, with photosynthetic Eukaryote SLP and RLPH phosphatases predominantly maintaining threonine (T) and histidine (H) residues, respectively. Amino acid colors represent polar (Green), Neutral (Purple), Basic (Blue), Acidic (Red) and Hydrophobic (Black). Each amino acid positional probability consensus was constructed using MAFFT aligned sequences submitted to WebLogo 3 (http://weblogo.threeplusone.com/). 48 distinct diversity at specific motif positions which parallels their classification (Figure 2.5-2.6). This was most pronounced when examining these motifs from photosynthetic Eukaryotes (Figure 2.5-2.6). 2.3.4 In silico gene structure analysis of bacterial-like PPP phosphatases from photosynthetic Eukaryotes Using the online resource Phytozome v7.0 (www.phytozome.com), in silico annotated gene models of SLP and RLPH phosphatases ranging from green algae through to land plants were extracted and manually analyzed (Figure 2.7). Distinct differences in SLP1 and 2 phosphatase gene architecture was observed over evolutionary time between green algae and land plants. SLP1 and SLP2 phosphatases from the representative green alga Chlamydomonas reinhardtii maintained a similar number of introns, while in land plants, intron number increased in SLP1 phosphatases and decreased to being virtually intronless in SLP2 phosphatases (Figure 2.7A). This trend in SLP2 phosphatase gene architecture was conserved across all sequenced land plants (Table 2.3). Over a similar evolutionary time frame, RLPH phosphatases only lost a single intron (Figure 2.7B) and showed clear signs of a recent gene duplication in dicotyledonous plant species (e.g. Arabidopsis thaliana). Gene duplication was not observed in any monocotyledonous plant species (e.g. Oryza sativa). SLP3 phosphatases were not subjected to this assessment due to their restrictive evolutionary distribution and absence in land plants. 2.4 Discussion For each type of eukaryotic bacterial-like PPP phosphatase investigated here, a well supported group of sequences from α-Proteobacteria lies in close phylogenetic association. The 49 Figure 2.7: Gene structure of eukaryotic SLP and RLPH phosphatases. (A) SLP1 and 2 phosphatases maintain similar gene structures in lower Eukaryotes indicative of likely gene duplication. Exon to intron ratio increased in SLP1 phosphatases versus complete intron loss in SLP2 phosphatases. Representative SLP1 and 2 phosphatases are shown in green and blue, respectively. (B) RLPH phosphatases demonstrated only a slight reduction in their exon to intron ratio in higher plants. Some higher plant dicotyledonous species indicated likely RLPH phosphatase gene duplication (e.g. A. thaliana). Monocotyledonous plants completely lacked any indication of RLPH phosphatase gene duplication. Representative RLPH phosphatases are shown in yellow, while higher plant RLPH1 and RLPH2 phosphatases are depicted in yellow (original gene) and red (putative duplicated gene). 50 Table 2.3: Intron quantity for photosynthetic Eukaryote SLP phosphatases. Highlighted are Green Algae (grey), Selaginella moellendorffii (Lycophyte) and Physcomitrella patens (Moss) (green) and A. thaliana (blue) SLP1 and 2 phosphatases. Additional information includes their corresponding ‘SLP1’ and ‘SLP2’ tree identifiers, as well as organismal and gene identifier information. Gene models were obtained from Phytozome v7.0 (www.phytozome.net). most straightforward interpretation of this observation is that these bacterial-like PPP genes entered Eukaryotes very early in their history, with the advent of mitochondria. This is consistent with the broad eukaryotic distribution of these sequences. Current concepts of the origin of mitochondria and early Eukaryotes differ somewhat, with endosymbiosis occurring through an α-Proteobacterium within an amitochondriate eukaryotic host, or symbiogenesis combining an αProteobacterium with another Prokaryote (usually deemed to be an Archaeon) (Koonin, 2010). These concepts are embodied in competing and as yet unresolved models of early eukaryotic evolution (Embley and Martin, 2006; Poole and Penny, 2007). However, all are agreed that the advent of mitochondrial formation, with its large-scale genetic transfer to the eukaryotic nucleus, 51 together with intracellular retargeting of translated proteins, was a major driver of eukaryotic evolution. Classically, the donor α-proteobacterium was held to be an ancient Rickettsia-like organism (Gray, 1998; Lang et al., 1999). The data here fail to support this hypothesis. None of the deeply placed α-proteobacterial sequences found in any of the bacterial-like PPP phosphatase phylogenetic trees are from the order Rickettsiales. These findings are consistent with a recent review (Gray, 2012) which emphasized that the true identity of the ancestral α-proteobacterium has yet to be definitively established. Superimposed on a basic pattern of α-proteobacterial ancestry is a more complex picture of bacterial-like PPP phosphatase origins (Figure 2.8 and 2.9). In both subclasses studied here, there is a group of bacterial sequences which cluster closely with the radiation of each sequence type in Eukaryotes. In the case of the SLPs, these are from the δ-Proteobacteria (Myxobacteria), while in the case of the RLPHs, these are from the Planctomycetes. As befits their positioning in the phylogenetic trees, these sequences are much more closely related to their respective eukaryotic sequence group than are those of the presumably ancestral α-Proteobacteria. It is possible to interpret these results in different ways. One possibility might be HGT. A hallmark of this process is a "discordant" clustering of sequences from distant organismal sources in the same gene tree (Keeling and Palmer, 2008; Boto, 2010). Alternatively, given the likelihood of the αproteobacterial ancestry detailed above, a more attractive possibility becomes apparent. In this interpretation, the particular bacterial-like PPP sequence radiation observed in Eukaryotes (e.g. the SLPs) would be viewed as the "sister-group" of the closely related bacterial sequence cluster (e.g. the "Outer Myxobacteria" sequences). Both would derive from the same α-proteobacterial source (Figure 2.8 and 2.9). If this hypothesis is true, what is striking is the high degree of 52 Figure 2.8: Proposed model of RLPH phosphatase molecular evolution. The RLPH phosphatase molecular evolution model was derived from phylogenetic tree inference methodologies combined with consensus in silico subcellular localization data. Dashed arrows represent the potential HGT event delivering a Planctomycetes RLPH phosphatase to the nucleus of an ancestral non-photosynthetic Eukaryote and its subsequent targeting to the cytosol/nucleus. Through photosynthetic Eukaryote evolution the predominant predicted localization of RLPH phosphatases remained cytosolic/nuclear; however, the few identified RLPH phosphatases of Green Algae and S. mollendorffii seemed to also adopt a predicted mitochondrial localization. Solid black arrows depict primary endosymbiosis and the corresponding gene transfer events to the eukaryotic nucleus. Light grey arrows denote movement through evolutionary time, while dark grey arrows depict likely secondary gene loss events. The ‘OR’ notation represents an intersection in the model where two possibilities exist (HGT or mitochondrial endosymbiosis) for how RLPH phosphatase genes integrated into modern Eukaryotes. 53 Figure 2.9: Proposed model of SLP phosphatase molecular evolution. The SLP phosphatase molecular evolution model was derived from phylogenetic tree inference methodologies combined with consensus in silico subcellular localization data. Dashed arrows (coarse) represent the HGT-A event delivering the "outer" myxobacterial SLP3 phosphatase to the nucleus of an ancestral non-photosynthetic Eukaryote and its subsequent targeting to the mitochondria/cytosol. Dashed arrows (fine) represent the independent HGT-B event delivering the "inner" myxobacterial SLP phosphatase to the nucleus of an ancestral photosynthetic Eukaryote (i.e. glaucophyte) and its subsequent targeting to the mitochondria/cytosol. Dashed arrows (fine) also depict SLP phosphatase gene duplication in Green Algae which lead to distinct SLP1 (chloroplast targeted) and SLP2 (cytosolic targeted) subpopulations. Through photosynthetic Eukaryote evolution the respective chloroplast and cytosolic localizations of SLP1 and SLP2 phosphatases were conserved. Solid black arrows depict primary endosymbiosis and the corresponding gene transfer events to the eukaryotic nucleus. Light grey arrows denote movement through evolutionary time, while dark grey arrows depict likely secondary gene loss events. The ‘OR’ notation represents an intersection in the model where two possibilities exist (HGT or mitochondrial endosymbiosis) for how SLP phosphatase genes integrated into modern Eukaryotes. 54 conservation between the bacterial and the eukaryotic sequences, such that a strong relationship is apparent. The two mechanisms outlined above are sufficient to explain the structure of the RLPH tree, which is the simplest of the two (Figure 2.8 and 2.9). In the SLP tree, there is a further complication in that there is a second group of "Inner Myxobacteria" sequences nested within the overall eukaryotic radiation (Figure 2.3). This could be explained by a second application of the "sister-group" argument, where a more basal eukaryotic SLP sequence ancestor gave rise to both a further, more derived eukaryotic SLP radiation, and also a second side cluster of Myxobacteria sequences. However, given that the origin of the sequences would be eukaryotic and the destination bacterial, these would qualify as instances of HGT. Other hallmarks of HGT besides phylogenetically "discordant" clustering patterns are socalled "patchy distributions", where there is non-uniform sequence representation amongst a broad organismal phylogenetic group (Snel et al., 2002). Instances must be judged as individual situations, and sometimes it still remains difficult or impossible to establish an unambiguous mechanism. In the case of the "Inner Myxobacteria" sequences within the eukaryotic SLP sequence distributions, HGT is favoured, as it would be difficult to conceptualize this as a case of differential gene transmission and loss (Figure 2.9). This would require, for example, that SLPs would be widely distributed ancestrally within both Bacteria and Eukarya (perhaps by a universal common ancestor), then preferentially lost in such a way that lineages coincidentally wound up closely related in a segment of Eukaryotes and Myxobacteria. Given the apparent entry of SLP genes into Eukaryotes through α-proteobacterial endosymbiosis, this scenario seems to be very unlikely. 55 It is worth considering the nature of the two bacterial groups (Myxobacteria and Planctomycetes) whose sequences are most closely related to the eukaryotic bacterial-like PPP phosphatases. It is entirely possible that the ancestral α-Proteobacteria transmitted these sequences to a variety of bacterial offspring groups by vertical transmission, or perhaps to other groups by HGT. Why are they now found in these particular bacterial groups with such high degree of similarity to the eukaryotic sequences? Interestingly, both the Myxobacteria and Planctomycetes have been noted as being "eukaryotic-like" in terms of possessing features unusual for Bacteria. Myxobacteria possess a complex life cycle and social behaviour heavily dependent on intercellular signaling (Goldman et al., 2006; Perez et al., 2008), while Planctomycetes have shown signs of intracellular compartmentation (Fuerst and Sagulenko, 2011). It is possible that in each case the emerging adoption of unusually complex aspects of bacterial cellular physiology pre-adapted this organismal group to retain and profit from the acquisition of additional signaling protein phosphatases. Alternatively, it might be that the acquisition of these sequences was itself important in allowing such unusual cellular adaptations. It would seem that further research into the role of the SLPs in Myxobacteria, and RLPHs in Planctomycetes is warranted. The distribution of the bacterial-like PPP phosphatases with respect to Eukaryotes is truly remarkable. SLPs are present across the entire range of photosynthetic organisms (and greatly expanded in land plants), while being completely absent in Animals, while RLPHs are present in only a few non-photosynthetic organisms and one species of green alga, but are widely represented in land plants. In all cases the actual functions of the bacterial-like PPP phosphatases in photosynthetic organisms remain largely unknown and given the differential biases in their 56 distribution, it remains impossible to formulate a general hypothesis that would encompass them all. Interestingly, the SLPs underwent a large gene expansion in Green Algae, with the initial acquisition of an ancestral SLP3 phosphatase and subsequently an SLP form that gave rise to the SLP1 and SLP2 forms later inherited by land plants. This suggests that each related gene product might serve distinct cellular functions (Kutuzov and Andreeva, 2012). Supporting this inference are the distinct localizations described here by in silico subcellular localization prediction data. Finally, the RLPH gene lineage has become nearly extinct in living Green Algae, while being ubiquitous in land plants. This might suggest again co-option of gene function in algae. It is noteworthy that the RLPHs of land plants show a predicted cytoplasmic / nuclear localization, which is unique within the eukaryotic bacterial-like PPP phosphatases, suggesting a marked change in cellular function, which deserves further research exploration. It is ironic that at present the most is known about the function of an SLP protein from a non-photosynthetic organism. In Plasmodium berghei (the causative organism of malaria in mice), the SHLP1 protein has been shown to be necessary for a critical life cycle stage transition, and for development of ultrastructural features important for host cell infection (Patzewitz et al., 2013). It is well established that Plasmodium (as all Alveolates) were ancestrally photosynthetic, retaining an altered chloroplast remnant, the apicoplast (Kalanon and McFadden, 2010). This indicates that in this organism the SHLP1 gene, freed from possible previous functional constraints in a photosynthetic ancestor, evolved a novel role important to the pathogenic lifestyle. Since both the mouse and human hosts of malaria parasites lack any evidence of SLP genes and proteins, SHLP1 represents an attractive target for therapeutic drug development. 57 It is interesting that in the SLP phylogenetic tree there are several groups of sequences in the deep eukaryotic portion of the tree which have long branches (indicating probable rapid sequence evolution), and which are encoded by parasitic organisms. Most sequences in the Apicomplexa group (including several species of Plasmodium) have a predicted signal peptide, confirming previous findings (Kutuzov and Andreeva, 2008). This correlates with the discovery that plasmodial SHLP1 is localized to the ER membrane. There is a single sequence from the genus Perkinsus (a marine shellfish pathogen), a group from the Euglenozoa (including the genera Leishmania and Trypanosoma) and a group including Oomycete plant pathogens from the genera Phytophthora and Pythium. In the case of the Euglenozoa and Perkinsus, there is also a marked tendency toward possession of a signal peptide. The predicted localizations are more mixed for the Oomycete sequences, which may be due to the fact that they are the most divergent SLP sequences in the dataset and/or the amino termini may have been mis-annotated. In contrast, amongst SLP sequences from photosynthetic organisms, predicted signal peptides outside of mitochondrial- and chloroplast-targeting peptides were rare. Taken together, these observations suggest that the SLP sequences of pathogens of both plants and animals may have taken alternative evolutionary trajectories from those in currently photosynthetic organisms. These genes and proteins may thus represent attractive targets of further research efforts. The catalytic subunits of eukaryotic phosphoprotein phosphatases such as PP1 and PP2A are well known to combine with a variety of regulatory subunits to form holoenzymes, which provides for substrate specificity, subcellular localization, and enzymatic regulation (Virshup and Shenolikar, 2009). In PP1, for example, these interactions are mediated by small canonical motifs such as the RVxF motif (Templeton et al., 2011). It has been recently suggested that SLP phosphatases might also interact with a diverse set of regulatory proteins (Kutuzov and 58 Andreeva, 2012). The data on carboxy-terminal motifs presented here demonstrates that they maintain conserved class specific alterations, which are most pronounced in photosynthetic Eukaryotes. Amino acid substitutions in position 6 and 3 of Motif 1 and 2, respectively, would be expected to alter motif charge, polarity, and hydrophobicity. This could alter protein binding specificity without an overall change in phosphatase conformation, suggesting that a regulatory protein binding strategy might be a general feature of the bacterial-like PPP phosphatases. Exploration of this possibility represents an attractive option for future research. Another interesting characteristic of the bacterial-like SLP and RLPH phosphatases examined was their gene architecture. Of particular note was a complete loss of introns in the SLP2 phosphatase lineage over time from green algae to land plants. In A. thaliana, Schizosaccharomyces pombe, Mice and Teleostei (fish) genes involved in processes requiring rapid and abundant transcriptional expression have been found to possess a lower intron abundance (Jeffares et al., 2008; Kawaguchi et al., 2010). These include genes involved in stress response (Jeffares et al., 2008) and development (Kawaguchi et al., 2010), where transcriptional splicing may impede the required response time for gene expression in addition to incurring added energy costs associated with longer transcript production and splicing (Jeffares et al., 2008). Contrary to SLP phosphatases, the RLPH phosphatase lineage underwent no notable gene architecture changes; however, likely gene duplication in land plants was observed. 2.5 Conclusion Here, the evolutionary trajectories of two recently resolved eukaryotic bacterial-like PPP phosphatases was examined. SLP and RLPH phosphatase evolution was found to be unique, in that it possessed elements of classic endosymbiotic origins as well as HGT. Emphasizing the 59 likelihood of HGT are the SLP1 and 2 phosphatases of photosynthetic Eukaryotes, which were found to maintain closely related homologs in the distantly related δ-proteobacteria Myxobacteria, with which they phylogenetically cluster. Further intrigue stemmed from the revelation that SLP1 and SLP2 phosphatases are conserved across photosynthetic Eukaryotes and predicted to be located in completely different subcellular compartments. As well, they were found to maintain completely different gene architectures, which in conjunction with their predicted contrast in subcellular localization, likely indicates independent cellular functions. Similarly, the RLPH phosphatases were also predicted to possess a subcellular location differing from the SLP phosphatases and to have recently undergone a gene duplication event in dicotyledonous plants; however, no rationale for why this occurred was uncovered. Collectively, the comprehensive compilation of SLP and RLPH phosphatases revealed here provides insight into their potential application in biomedical and/or agricultural biotechnology. This is highlighted by the lack of SLP phosphatases in humans, but their presence in the parasitic protozoa Plasmodium. 60 Chapter Three: Bacterial-like SLP protein phosphatases from Arabidopsis thaliana are highly conserved plant proteins that possess unique properties 3.1 Introduction Since their discovery, protein phosphatases have emerged as key components of cellular regulation with roles in essentially all aspects of biology. Numerous phosphatases exist in nature, ranging from phosphate-scavenging acid phosphatases (Bozzo et al., 2002), to highly specific protein phosphatases, which remove covalently attached phosphate groups from amino acids phosphorylated by a specific protein kinase (DeLong, 2006; Moorhead et al., 2007). Reversible protein phosphorylation has been documented to occur on five amino acids in Eukaryotes, with most phosphorylation events occurring on serine (Ser), threonine (Thr) and tyrosine (Tyr) residues (Olsen et al., 2006; Sugiyama et al., 2008). Mass spectrometry-based phosphoproteomic studies have estimated that 70% of all proteins are regulated by reversible phosphorylation, emphasizing the prevalence of regulatory phosphorylation events in biological systems (Olsen et al., 2010). Despite having similar catalytic mechanisms and targeting the same phosphorylated residues, the PPP and PPM phosphatases differ significantly. Unlike the PPP enzymes, PPMs are Mg2+/Mn2+-dependent, generally lack associated regulatory subunits, and are insensitive to small molecule inhibitors, such as okadaic acid (OA) and microcystin (MCLR) (Shi, 2009). Most PPP protein phosphatase catalytic subunits lack accessory domains and associate with additional proteins (regulatory subunits) to direct their cellular function (Moorhead et al., 2009). In A. thaliana and other plants, the PPP family consists of PP1, PP2A (PP2), PP4, PP5, PP6 and PP7 mammalian protein homologs, while lacking PP2B (also known as PP3)-related phosphatases (Moorhead et al., 2009). In addition to these highly conserved PPP subclasses, Arabidopsis 61 thaliana also possess eight conserved PPP protein phosphatases that are absent in mammals (Chapter 2; (Kerk et al., 2008; Moorhead et al., 2009)). Four of these phosphatases are Kelchrepeat PPKL protein phosphatases shown to be involved in brassinosteroid signaling (MoraGarcia et al., 2004) and another two are uncharacterized Rhizobiales-like phosphatases. The last 2 phosphatases are distantly-related to several bacterial phosphatases from Shewanella and, as such, were termed Shewanella-like phosphatases, or SLPs (Andreeva and Kutuzov, 2004). Due to their ancient prokaryotic origins and horizontal gene transfer from Myxobacteria to early photosynthetic Eukaryotes (Chapter 2), as well as a number of unique structural elements within their primary amino acid sequence, characterization of the SLP protein phosphatases from Arabidopsis thaliana was undertaken to elucidate their biochemical properties. 3.2 Materials and Methods 3.2.1 Bioinformatics The online search engine Genevestigator (www.genevestigator.com), which contains pooled ATH1: 22k microarray data, and NCBI Gene Expression Omnibus (GEO) Profiles (http://www.ncbi.nlm.nih.gov/geoprofiles) were used to assess the transcriptional expression of At1g07010 (AtSLP1) and At1g18480 (AtSLP2) in various tissue types and across plant development. 3.2.2 Molecular cloning Fluorescent construct creation utilized full-length clones obtained from TAIR (http://www.arabidopsis.org/). Propagated by PCR with Gateway compatible primers, AtSLP1 and AtSLP2 were inserted into pDONR201 via BP ligation reactions (Invitrogen). Each 62 construct was further sub-cloned into the plant expression vector pB7RWG2 (http://gateway.psb.ugent.be/) via LR ligation reactions (Invitrogen) to create C-terminal RFP (cRFP) fusion constructs. All cloning done in conjunction with fluorescent construct creation was accomplished using DH5α E. coli. Agrobacterium tumefaciens strain GV3101 (Agrobacterium) was transformed with each fluorescent construct for wild-type A. thaliana cell culture transfection purposes. Heterologous protein expression employed full-length AtSLP1 (minus the predicted chloroplast transit peptide, amino acids 1 - 53) and AtSLP2 cloned containing EcoRI and NotI restriction sites. Each construct was initially TA cloned into pGEM-T (Promega) and subsequently sub-cloned into pET47b(+) for antibody production and pET48b(+) for purification and enzymatic characterization (EMD-Chemicals). 3.2.3 Cell culture transfection and protoplast creation Wild-type A. thaliana cell culture transfection was modeled from previous studies (Forreiter et al., 1997). Empirical testing of the cell culture transfection process found 3 d old A. thaliana liquid cell culture grown under 24 h light in 1 x Murashige Skoog (MS) medium along with 3% w/v sucrose, 0.5 mg/mL 1-naphthaleneacetic acid (NAA), 0.05 mg/mL kinetin (1x Growth Media) combined with transformed Agrobacterium grown to an OD600 = 0.5 at 28oC, 150 rpm re-suspended in 10 mL of 1 x growth media as ideal for successful A. thaliana cell transfection. Incubation of Agrobacterium and A. thaliana liquid cell culture was performed for 2 d at room temperature under 24 h light shaking at 150 rpm. The Agrobacterium-A. thaliana cell culture mixture was then pelleted at 300 x g for 1 min and re-suspended in 1 x growth media containing 100 µg/mL ampicillin (repeated 3 x). Cells were then spread on 0.8% w/v agar plates containing 1 x growth media and 10 µg/mL Basta and grown under 24 h light for 2 weeks to 63 select for positively transfected A. thaliana cells. Positive transfectants grown as callus cells were visually identified (green papules) and re-plated on 1x selective media then sub-cultured to liquid 1 x growth media and grown as above under 24 h light. Protoplasts were subsequently derived from 10 mL of positively transfected Arabidopsis thaliana cell culture as previously described (Lingard et al., 2008). Protein expression was confirmed by Western blotting with antiRFP IgG (Chromotek). 3.2.4 Transient AtSLP expression in Vicia faba epidermal leaf cells Transient co-expression of AtSLP1-RFP and AtSLP2-RFP with fluorescent marker constructs was conducted using equimolar amounts of gold coupled with DNA and particle bombardment. Plasmid DNA (10 µg) was coated onto 1 mg of gold microcarriers, washed, spotted on a macrocarrier and accelerated at the epidermal layer of Vicia faba leaves using a PDS-1000/He System (BioRad) as per (Russell et al., 1992). AtSLP1-RFP was co-bombarded with GFP constructs specifically designed to target the endoplasmic reticulum (ER), Golgi and peroxisome (Nelson et al., 2007), while AtSLP2-RFP was co-bombarded with established mitochondria, ER, Golgi (Nelson et al., 2007) and cytosolic/nuclear targeted GFP (vector p2FGW7; GFP with no targeting motif). AtSLP2-RFP was also co-bombarded with GFP-tagged nucleoporin protein 50a to illuminate the nucleus (Tamura et al., 2010). 3.2.5 Microscopy All imaging was conducted using a Leica DMIRE2 spectral confocal and multiphoton microscope with a Leica TCS SP2 acoustic optical beam splitter (AOBS) (Leica Microsystem, Richmond Hill, ON, Canada), with all cells and leaves visualized using the 63x water immersion 64 lens. The excitation / emission wavelengths (nm) respectively employed: GFP 488/505-515, RFP 594/610-650 and chlorophyll autofluorescence 488 & 594/685-715. Subsequent image processing was performed using the freely accessible MacBiophotonics ImageJ (http://www.macbiophotonics.ca/downloads.htm). 3.2.6 Heterologous protein expression and antibody production AtSLP1- and AtSLP2-HIS6 fusion proteins were expressed in BL21 (DE3) Codon(+)RIL E.coli grown in LB containing 1 mM MnCl2 and 1 % glucosamine at 8oC, 200 rpm by induction with 0.1 mM isopropyl β-D-1-thiogalactopyranoside (IPTG) for 24 h. Bacteria were pelleted at 4000 x g for 15 min and re-suspended in 1 x extraction buffer (50 mM Hepes-NaOH pH 7.5, 150 mM NaCl, 5 % (v/v) glycerol and 10 mM imidazole) followed by snap freezing and storage at -80oC prior to use. Day of purification saw the addition of 1 % (v/v) Tween-20, 20 mM imidazole, 1 mM phenylmethanesulfonylfluoride (PMSF), 1 mM benzamidine, 2 µg/mL leupeptin, 5 µg/mL pepstatin upon thawing. Extraction involved mechanical lysis via French press at 3 x 16000 PSI (Sim-Aminco Spectronic Instruments). Crude lysates were clarified at 40000 x g for 30 min at 4oC. Supernatants were removed and 1 mM PMSF and 1 mM benzamidine added prior to incubation with NiNTA agarose matrix (Qiagen) end-over-end mixing at 4oC for 1 h. Matrix was poured into a column and washed with 500 column volumes (cv) of wash buffer A (1 x extraction buffer with 1 M NaCl, 1% (v/v) Tween-20 and 20 mM imidazole, minus protease inhibitors) by gravity followed by 100 cv of wash B (1x extraction buffer minus protease inhibitors). NiNTA-bound proteins were eluted using 1 x extraction buffer containing 500 mM imidazole pH 7.5. Protein eluates were concentrated using a 30000 MWCO Amicon concentrator (Millipore). 65 Purification of AtSLP1 to homogeneity involved four steps: 1) NiNTA purification, 2) size exclusion chromatography, 3) cleavage of the affinity tag by HRV3c protease (Novagen), and 4) anion exchange chromatography. Heterologous HIS6-AtSLP1 expression and subsequent purification by NiNTA was conducted as outlined above from 4 x 1 L of bacterial cell culture. The NiNTA HIS6-AtSLP1 eluate was concentrated to 500 µL and subjected to size exclusion chromatography using a HiLoad 16/60 Superdex 200 (Amersham) column. The sample was chromatographed at 0.5 mL/min, collecting 1 mL fractions in buffer A (50 mM Hepes-NaOH pH 7.5, 50 mM NaCl, and 5% (v/v) glycerol). Column fraction proteins were resolved by SDSPAGE (12%) and visualized by Colloidal blue. Peak fractions were pooled and concentrated to 500 µL in a 30000 MWCO Amicon concentrator. Cleavage of the HIS6-tag was performed on 500-700 µg of total protein from the concentrated size exclusion chromatography eluate using 10 U of Novagen HRV 3C protease (EMD Chemicals) in the presence of 1 mM dithiothreitol (DTT) overnight at 4oC rotating gently end-over-end. Anion exchange chromatography was performed on a Mono-Q 5/50 GL (Amersham) column employing Buffer A (as above) and Buffer B consisting of 50 mM Hepes-NaOH pH7.5, 1 M NaCl, 5% (v/v) glycerol and 1 mM DTT. The concentrated Superdex 200 eluate was loaded at 0.5 mL/min, the column washed at 1 mL/min for 10 cv and protein eluted using a gradient of 50 - 400 mM NaCl over 35 cv collecting 1 mL fractions. Peak fractions were run on SDS-PAGE, pooled, concentrated as above, aliquoted, snap frozen in liquid nitrogen and stored at -80oC for subsequent use. AtSLP1- and AtSLP2-pET47b were expressed in BL21 (DE3) Codon(+)-RIL E. coli at 37oC and induced with 0.5 mM IPTG for 4 h and purified from inclusion bodies using NiNTA Agarose as per the manufacturer's instructions (Qiagen). Purified protein was dialyzed against water, freeze dried and used for polyclonal antibody production in a New Zealand White rabbit 66 as described (Tran et al., 2004). Both the AtSLP1 and AtSLP2 antibodies were affinity purified using nitrocellulose membrane as previously described in the absence of BSA (Plaxton, 1989). 3.2.7 Enzymatic analysis Enzymatic assessment of AtSLP1 and AtSLP2 was conducted using the small molecule phosphatase substrate para-nitrophenyl phosphate (pNPP) (Sigma). Enzymatic assays comparing AtSLP1 and AtSLP2 were conducted using day-of purified and concentrated NiNTA eluates. All assays were done in parallel with NiNTA eluates generated from uninduced BL21 (DE3) Codon(+)-RIL E. coli to account for potential background pNPP cleavage from non-specific copurifying proteins (NSCP). HIS6-AtSLP1 and HIS6-AtSLP2 percent activities were calculated relative to assays conducted with 0.5 mM Mn2+ or 5 mM EDTA, respectively. Due to its stability, pure, untagged AtSLP1 was used to perform more thorough enzymatic analyses in conjunction with completely purified AtPP1/TOPP2 (At5g59160) and partially purified (~50%) human PP1 gamma (HsPP1γ; NP_002701) as assay benchmarks. Both TOPP2 and HsPP1γ were cloned, expressed and purified from E. coli as previously described (Douglas et al., 2001; Templeton et al., 2011). Enzymatic assay strategy was adapted from the SensoLyte pNPP Protein Phosphatase Assay Kit (http://www.anaspec.com). Base Buffer consisted of 100 mM HEPES pH 7.5 and 150 mM NaCl. Dilution buffer(s) consisted of base buffer plus 4 mM DTT and 0.2 mM EDTA with either 0.5 mM metal cation (metal-dependent activity) or 5 mM EDTA (metal independent activity). Assay buffer consisted of dilution buffer plus a final concentration of 5 mM pNPP. All protein samples were pre-mixed under their experimental condition, brought to 20 µL with the respective dilution buffer and pre-incubated at 30oC for 10 min prior to enzymatic analysis. Enzyme assays were initiated by the addition of 180 µL of assay buffer and 67 incubation at 30oC for 30 min (NiNTA AtSLP eluates and controls) or 20 min (pure AtSLP1 and controls). For the NiNTA AtSLP and control eluates 750 ng of total protein was used per assay. Assays involving pure AtSLP1 employed 100 ng AtSLP1, 200 ng TOPP2 and 400 ng total Human (Hs) PP1γ protein (200 ng pure HsPP1γ). Assays were quenched with 200 µL of 0.5 M EDTA and pNPP cleavage assessed using an Ultrospec 2000 spectrophotometer set to 405 nm. 3.3 Results 3.3.1 AtSLP1 and AtSLP2 localize to different cellular compartments Full-length AtSLP1 and AtSLP2 were fused to a C-terminal RFP and transfected into A. thaliana cell culture to create protoplasts constitutively expressing either AtSLP1-RFP or AtSLP2-RFP for live-cell imaging. Stably transfected cell culture was extracted, resolved by SDS-PAGE and probed with anti-RFP to verify expression of the fusion protein (Figure 3.1). Positive AtSLP1-RFP cell culture demonstrated chloroplastic localization (Figure 3.2A), while positive AtSLP2-RFP cell culture maintained a non-chloroplastic localization (Figure 3.2B). Transient co-expression of RFP-tagged AtSLPs in Vicia faba epidermal leaf cells with various fluorescence marker proteins was also performed to further resolve the specific subcellular localization of AtSLP1 and AtSLP2. AtSLP1-RFP, co-expressed with GFP specifically targeted to the endoplasmic reticulum (ER) or peroxisome, confirmed the exclusive chloroplast subcellular localization of AtSLP1 by the co-localization of AtSLP1-RFP with chlorophyll autofluorescence and not with either GFP construct (Figure 3.3). The localization of AtSLP2 was resolved through transient co-expression of AtSLP2-RFP with non-targeted GFP (GFP alone resides in both the cytosol and nucleus; Figure 3.4) and GFP targeted specifically to either the ER or mitochondria (Figure 3.4). AtSLP2-RFP demonstrated specific 68 Figure 3.1: Development of AtSLP1 and 2 stably-transfected A. thaliana cell culture. (A) Schematic of fluorescent constructs used in AtSLP1 and AtSLP2 subcellular localization. (B) Schematic depicting the use of Agrobacterium to transfect A. thaliana cell culture. Cell culture was grown under 24 h light on selective 1 x MS-Agar as a primary screen for positive transfectants. Modified from http://www.accessscience.com. (C) Western blot verification of stably-transfected cell culture constitutively expressing AtSLP1-RFP or AtSLP2-RFP. Each lane contains 30 µg of crude cell culture lysate probed with 1:100 Anti-RFP IgG (Chromotek). 69 Figure 3.2: In vivo subcellular localization of AtSLP1 and 2 using stably-transfected A. thaliana cell culture. (A) and (C) Fluorescence images of protoplasts derived from stably transfected A. thaliana cell culture expressing AtSLP1-RFP or AtSLP2-RFP (red), respectively. Chlorophyll autofluorescence is also shown (green) and images merged to reveal localization. All images are single slices obtained using a confocal laser scanning microscope (Leica). Scale bars = 10 µm. (B) and (D) depict merged stacks of protoplast image slices derived from stably transfected A. thaliana cell culture stably expressing AtSLP1-RFP and AtSLP2-RFP (red) with chlorophyll fluorescence (green). Overlapping RFP and GFP signals are yellow. The X and Y axis define the 2D sides of the merged image (i.e. top view), while the Z-axes depict a view through the stacked slices (i.e. side view) giving a 3D rendering of fluorescence overlap through the X and Y axes. 70 Figure 3.3: Co-localization of AtSLP1 with fluorescent marker constructs in Vicia faba epidermal leaf cells. (A) and (B) are representative images of guard cells co-bombarded with AtSLP1-RFP (red) and either endoplasmic reticulum (ER)- or peroxisomal-targeted GFP (green), respectively. Chlorophyll fluorescence (blue) is shown along with the image merge (right). The specific organelle targeting of GFP was achieved though fusion with conserved targeting motifs (Nelson et al., 2007). All images are single slices obtained using a confocal laser scanning microscope (Leica). Scale bars = 10 µm. 71 Figure 3.4: Co-localization of AtSLP2 with fluorescent marker constructs in Vicia faba epidermal leaf cells. (A) and (B) represent images of pavement cells co-bombarded with AtSLP2-RFP and either mitochondrial- or endoplasmic reticulum (ER)-targeted GFP, respectively. The specific organelle targeting of GFP was achieved as described in Figure 3.3. (C) AtSLP2-RFP cobombarded with GFP-tagged nucleoporin protein 50a (Nup50a-GFP) (Tamura et al., 2010). (D) AtSLP2 co-localized with GFP-only (no targeting motif). GFP-only localizes to both the nuclear and cytosolic compartments (Seibel et al., 2007) but only co-localizes with AtSLP2RFP in the cytosolic fraction. All images are single slices obtained using a confocal laser scanning microscope (Leica). Scale bars = 10 µm. 72 co-localization with only the non-targeted (cytosolic) GFP indicating a cytosolic subcellular localization (Figure 3.4). Oddly, the cytosolic AtSLP2-RFP signal was punctate in nature. AtSLP2-RFP was also co-expressed with GFP-tagged nucleoporin protein 50a (Nup50a), a known nuclear protein (Tamura et al., 2010), to account for the dual subcellular localization of non-targeted GFP to the cytosol and nucleus (Seibel et al., 2007). AtSLP2-RFP did not colocalize with Nup50a-GFP (Figure 3.4). Overall, these studies suggest that SLP1 phosphatases reside in the chloroplast and SLP2 phosphatases are present in the cytosol. 3.3.2 Temporal and spatial expression of AtSLP1 and AtSLP2 Initial characterization of endogenous AtSLP phosphatases consisted of transcriptional expression analysis using Genevestigator (www.genevestigator.com). The Genevestigator search engine houses a number of A. thaliana microarray experiments based on the commercially available A. thaliana ATH1: 22k array (http://www.affymetrix.com) to explore gene expression under various experimental conditions. This screen indicated AtSLP1 and AtSLP2 phosphatases are differentially expressed among A. thaliana plant tissues and throughout plant development (Figure 3.5A). AtSLP1 had highest expression in cauline leaves, sepals and in photosynthetic tissues but was absent in roots and other non-photosynthetic tissues (i.e. seeds), while AtSLP2 demonstrated elevated transcriptional expression in protoplasts, roots and the endosperm of seeds (Figure 3.5A). Both AtSLP1 and AtSLP2 also exhibited different transcriptional expression patterns across plant development (Figure 3.5B) and in response to diurnal cycling. AtSLP1 transcript expression demonstrated a sharp increase upon germination followed by a subsequent decrease towards plant maturity (Figure 3.5B) as well as a clear diurnal fluctuation in expression pattern (Figure 3.6A). AtSLP2 however, did not demonstrate any major transcriptional 73 Figure 3.5: AtSLP transcript expression data. AtSLP transcript expression from (A) different tissue types and (B) across plant development. Transcript expression is indicated by a gradient of white (low expression) to blue (high expression) as a percentage of the top 1% genes expressed in each tissue. The numbers beside (A) and above (B) describe the number of total arrays used in determining the displayed level of relative gene expression for AtSLP1 and AtSLP2. Data obtained from Genevestigator (www.genevestigator.com). 74 fluctuations over the course of plant development or during the diurnal cycle (Figure 3.6B) To examine if AtSLP gene expression was paralleled at the protein level, antibodies were raised using heterologously expressed HIS6-AtSLP1 and HIS6-AtSLP2 protein purified by NiNTA (Figure 3.7). Affinity purification (AP) of each antibody was performed to enhance the specific detection of each phosphatase (Figure 3.8). The AP anti-AtSLP1 IgG provided monospecific detection of AtSLP1, while the AP of anti-AtSLP2 IgG resulted in an antibody that detects both AtSLP1 and AtSLP2, but with greater affinity for AtSLP2 (Figure 3.8). The documented transcript expression profile of AtSLP1 and AtSLP2 uncovered using Genevestigator was largely paralleled at the protein level (Figure 3.9A) with AtSLP1 protein observed in all photosynthetic tissues with the exception of siliques. Conversely, AtSLP2 was only definitively detectable in roots and intact siliques (containing seeds). Immunoblotting was less conclusive for other tissues due to the cross-reactivity of the AP anti-AtSLP2 IgG with AtSLP1 and a small 2 kDa mass difference between endogenous AtSLP1 (~41 kDa) and AtSLP2 (~43 kDa). Interestingly, AtSLP2 was barely detectable in intact siliques despite comparatively high levels of transcript in seeds relative to other plant tissues. AtSLP1 protein expression was also examined over a light / dark photoperiod consisting of 12 h light and 12 h dark. Initial examination of online accessible transcript expression data from NCBI Gene Expression Omnibus (GEO) Profiles (www.ncbi.nlm.nih.gov/geoprofiles) demonstrated a diurnal cycling of AtSLP1 transcript, but no such pattern for AtSLP2 expression (Figure 3.6). Despite the noted changes in AtSLP1 transcript levels, AtSLP1 protein was basally detectable over the entire diurnal cycle, with protein expression beginning to increase at 4 h light and peaking at the 8 h light time point before decreasing back to basal expression levels (Figure 3.9B). 75 Figure 3.6: NCBI Gene Expression Omnibus (GEO) profile of diurnal AtSLP1 and AtSLP2 transcript cycling (www.ncbi.nlm.nih.gov/geoprofiles). (A) AtSLP1 and (B) AtSLP2 transcript expression over the course of a 12 h light and 12 h dark photoperiod. The red columns each represent the expression measurement from an original GEO profile submitter-supplied sample record. The grey boxes located below each red column represent the original sample accession. The blue squares on top of each column represent where the expression of that gene falls with respect to all other genes measured on that array. 76 Figure 3.7: Colloidal blue stained 12 % SDS-PAGE of the HIS6-AtSLP1 and HIS6AtSLP2 NiNTA eluates employed in producing anti-AtSLP1 and anti-AtSLP2 polyclonal antibodies. NiNTA purifications shown in panels (A) and (B) were performed as per the manufacturer's instructions (Qiagen) using purified inclusion bodies. Each lane contains 5 µg of total protein. W/C and S/N represent the whole cell lysate and supernatant, respectively. 77 Figure 3.8: Western blot assessment of affinity-purified (AP) anti-AtSLP1 and antiAtSLP2 IgG detection limits and specificity. Both NiNTA-purified antigen (HIS6-AtSLP1 & HIS6-AtSLP2) as well as AtSLP-RFP fusion proteins stably expressed in light-grown A. thaliana cell culture were used to test the detection limits and cross reactivity of each set of antibodies. In (A) 0.01 to 10 ng of recombinant AtSLP1 and AtSLP2 was blotted with either 1.5 or 1.2 µg/mL AP anti-AtSLP1 or antiAtSLP2 IgG, respectively. Anti-AtSLP1 IgG demonstrated mono-specific detection of AtSLP1, while AP anti-AtSLP2 IgG demonstrated detection of AtSLP2 in addition to cross reacting with AtSLP1. In (B) 0.1 to 30 µg of extracted AtSLP1-RFP and AtSLP2-RFP expressing cell culture was blotted as above. Anti-AtSLP1 IgG again demonstrated monospecific detection of AtSLP1 (AtSLP1-RFP), while Anti-AtSLP2 IgG exhibited the same non-specific detection of AtSLP1 (AtSLP1-RFP) as in (A) in addition to clear detection of AtSLP2 (AtSLP2-RFP). 78 Figure 3.9: Spatial and temporal western blot analysis of AtSLP1 and AtSLP2 protein expression. (A) Tissue-specific expression of AtSLP1 and AtSLP2 from different A. thaliana Col-0 tissues. Each lane contains 30 µg of clarified cell lysate. (B) Time-course analysis of AtSLP1 protein expression over 12 h light / 12 h dark photoperiod. Each lane contains 30 µg of clarified lysate from rosette leaf tissue. All tissues were harvested from 21 d old plants. Lower panels of both (A) and (B) represent Ponceau S (0.1 % w/v)-stained membranes demonstrating equal protein loading prior to incubation with either affinity-purified (AP) anti-AtSLP1 or -2 IgG. 79 3.3.3 AtSLP phosphatase primary sequence and enzymatic properties Alignment of the AtSLPs with the other A. thaliana PPP protein phosphatases and representative human PPP protein phosphatases uncovered many amino acid substitutions and unique regions within each full-length AtSLP amino acid sequence. An alignment containing representative PPP protein phosphatases is shown along with AtSLP1 and AtSLP2 (Figure 3.10). Human PP1γ (HsPP1γ) and an A. thaliana PP1 (TOPP2) were found to be 76 % identical to each other, but only possessed ~10 % identity to either of the AtSLP phosphatases. Regions of conserved identity between HsPP1γ, TOPP2 and the AtSLP phosphatases were found to largely reside in the key amino acid clusters comprising the active site of PPP protein phosphatases (Figure 3.10; Shi 2009). AtSLP1 and AtSLP2 however, share only 31% identity to each other and possess many overlapping regions of identity beyond those comprising the PPP protein phosphatases active site (Figure 3.10). SLP phosphatases were also found to lack 7 of the 8 key amino acids involved in the docking of mammalian and yeast PP1 regulatory subunits through the RVXF sequence motif found in essentially all PP1 interactors (Egloff et al., 1997; Moorhead et al., 2007; Dancheck et al., 2011). Key amino acid stretches shown to be involved in the formation of PP2A trimeric complex were also absent (Xu et al., 2006). Of the many differences revealed at the amino acid level between the SLPs and the eukaryotic PPP protein phosphatases, the absence of the canonical SAPNYC motif was of particular interest (Figure 3.10). Although not directly involved in catalysis or metal ion coordination, the cysteine within this motif forms a covalent bond with the potent PPP phosphatase inhibitor microcystin-LR (MCLR) (MacKintosh et al., 1995). SLP phosphatases were also found to lack other conserved amino acids within the 80 Figure 3.10: Alignment of AtSLP1 and AtSLP2 full-length protein sequences with representative PP1 sequences from Arabidopsis thaliana (TOPP2; At5g59160), AtPP2A-1 (At1g59830) and Homo sapiens (HsPP1γ; NP_002701). Alignment was performed using ClustalX followed by import into GeneDoc for image assembly. Solid black line represents highly conserved motifs comprising the active site of canonical Ser/Thr phosphatases. Dashed line represents a highly conserved region in PPP phosphatases which conveys sensitivity to microcystin-LR (MCLR) through its covalent attachment to Cys273 (HsPP1γ) and equivalent residue in a plant PP1, Cys279 (TOPP2) (§). Black circles denote amino acids specifically important to MCLR binding, while black squares denote amino acids specific to OA binding, but are also key in coordinating MCLR. The arrows denote amino acids found to play a role in metal ion coordination, while black triangles denote amino acids found to dock the PP1 inhibitor I2 protein. Amino acids marked with an open square have been implicated in the binding of MCLR, OA and I2. 81 SAPNYC motif and throughout their full-length sequence which coordinate okadaic acid (OA) (Maynes et al., 2001) and the PP1 specific inhibitor I2 (Figure 3.10; (Hurley et al., 2007)). This would suggest that the SLP phosphatases are resistant to the toxins and proteins that have evolved to inhibit the PPP protein phosphatases by docking these regions. To explore this idea, enzyme assays were performed using a selection of phosphatase inhibitors. To assess the enzymatic properties of the AtSLP phosphatases each protein was expressed as a HIS6-fusion protein in E. coli and purified to near homogeneity by NiNTA (Figure 3.11). Both HIS6-AtSLP1 and HIS6-AtSLP2 NiNTA eluates had additional non-specific co-purifying (NSCP) proteins, rendering a control NiNTA eluate from the same E. coli expression strain necessary to control for potential NSCP phosphatase activity during enzymatic analyses of each AtSLP (Figure 3.11). Enzymatic assays were conducted using the artificial phosphatase substrate pNPP to assess the activity of each AtSLP phosphatase. HIS6-AtSLP1 was most active in the presence of Mn2+, but could also use Fe3+ to generate 65% of its observed Mn2+ activity (Figure 3.12A). Contrary to HIS6-AtSLP1, HIS6-AtSLP2 was seemingly most active in the absence of additional metal ions (i.e. 5 mM EDTA). Application of a Student's Ttest found no significant difference between the activity achieved with 5 mM EDTA, and that achieved with either 0.5 mM Mg2+ or 0.5 mM Zn2+. A notable inhibitory effect was observed in the presence of Fe3+, which reduced HIS6-AtSLP2 activity by 70% (Figure 3.12A). Along with examining metal cation dependency the AtSLP phosphatases were tested for sensitivity to classic protein phosphatase inhibitors (Figure 3.12B). Both HIS6-AtSLP1 and HIS6-AtSLP2 demonstrated a complete lack of inhibition in the presence of the PPP phosphatase inhibitors OA (150 nM) and MCLR (10 nM). Surprisingly, HIS6-AtSLP1 phosphatase activity 82 Figure 3.11: 12 % SDS-PAGE of NiNTA-purified HIS6-AtSLP1, HIS6-AtSLP2 and uninduced bacteria cell line control eluates stained with Colloidal blue. HIS6-AtSLP1 and HIS6-AtSLP2 were partially purified by NiNTA along with non-specific copurifying (NSCP) NiNTA-binding proteins isolated in parallel from untransformed, uninduced BL21 (DE3) CP-RIL grown under identical conditions. The BL21 (DE3) CP-RIL NiNTA eluate controlled for non-specific cleavage of the phosphatase substrate pNPP by NSCP proteins during enzymatic analysis of NiNTA-purified HIS6-AtSLP1 and HIS6-AtSLP2. The NSCP protein glucosamine-fructose-6-phosphate aminotransferase (GFAT, *) was identified by MALDI-TOF peptide mass fingerprinting, while (**) represents another NSCP protein. Each lane contains 5 µg of concentrated NiNTA eluate. 83 Figure 3.12: Enzymatic analysis of HIS6-AtSLP1 and HIS6-AtSLP2. (A) Metal dependency assessment of each SLP phosphatase was performed using a variety of metal cations as well as 5 mM EDTA. (B) Analysis of known Ser/Thr phosphatase inhibitors employed at concentrations known to fully inhibit bacterially-expressed and purified Homo sapiens and Arabidopsis thaliana PP1 protein phosphatases. Dark grey bars represent HIS6AtSLP1 while the light grey bars represent HIS6-AtSLP2. Inhibition assays were conducted as outlined in Materials and Methods and were performed using OA, MCLR, AtI2 (At5g52200), pyrophosphate (PPi), phosphate (Pi) and sodium fluoride (NaF). 100% enzyme activity in (A) and (B) is defined by the presence of 0.5 mM Mn2+ (AtSLP1 assays) or 5 mM EDTA (AtSLP2 assays). Error bars denote ± standard error; n = 3. 84 was activated in the presence of OA (~145%) and MCLR (~140%), while these exhibited no overall effect on HIS6-AtSLP2 activity (Figure 3.12B). HIS6-AtSLP1 also showed enhanced inhibition by 5 mM pyrophosphate (PPi) and 50 mM phosphate (Pi) relative to HIS6-AtSLP2, while both HIS6-AtSLP1 and HIS6-AtSLP2 showed similar inhibition by 100 mM sodium fluoride (NaF) (Figure 3.12B). Both AtSLPs were also tested for their sensitivity to the wellcharacterized and specific PP1 protein inhibitor I2 from A. thaliana (AtI2) (Templeton et al., 2011). Both AtSLP1 and AtSLP2 exhibited minimal sensitivity to AtI2 being inhibited ~40 % and ~20 %, respectively, at a 1 µM concentration (Figure 3.12B). 3.3.4 Characterization of purified untagged AtSLP1 Due to the inability to identify conditions permitting the stable purification and storage of AtSLP2 past initial NiNTA elution, only AtSLP1 was purified to homogeneity. Initial HIS6AtSLP1 NiNTA eluate was concentrated and subjected to Superdex 200 size exclusion chromatography (Figure 3.13A) followed by enzymatic affinity tag removal and subsequent Mono-Q anion exchange separation as described in the Materials and Methods (Figure 3.13B). Pooled and concentrated Mono-Q peak fractions were used for enzymatic analyses of untagged AtSLP1 (Figure 3.14). Some degradation of full-length AtSLP1 was observed as a 20 kDa fragment was confirmed as AtSLP1 by mass spectrometry (Table 3.1) and immunoblot analysis (Figure 3.14). 85 Figure 3.13: Size exclusion and anion exchange chromatography steps of AtSLP1 purification. (A) AtSLP1 was purified on NiNTA followed by concentration for further purification on HiLoad 16/60 Superdex 200. Inset, aliquots (25 µL each) from various fractions were subjected to SDS-PAGE and Colloidal blue staining. (B) Peak fractions from Superdex 200 chromatography were pooled, digested with HRV3c protease (Novagen) to release the HIS6tag (affinity tag), then run on Mono-Q 5/50 GL. Inset, aliquots (25 µL each) from various fractions were also subjected to SDS-PAGE followed by protein staining with Colloidal blue. The right hand inset represents the Mono-Q pool fraction used for enzymatic analysis of AtSLP1. AtSLP1 polypeptide identifications were confirmed by immunoblot analysis (Figure 3.14). 86 Figure 3.14: Analysis of the major end-point purification steps employed to completely purify AtSLP1. (A) Represents the NiNTA (panel 1), Superdex 200 / proteolytic HIS6-tag cleavage (panel 2) and Mono-Q (panel 3) purification steps resolved by 12 % SDS-PAGE and visualized by Colloidal blue. (*) & (**) represent the NSCP proteins previously described (Figure 3.11). Each lane contains 5 µg of total protein with the exception of the Mono-Q pool which contained 2 µg of total protein. (B) Western blot of the Mono-Q purified AtSLP1 (25 ng) shown in (A) performed using anti-AtSLP1 IgG. This purified AtSLP1 was subsequently used in enzymatic analysis. S/N and F/T represent the supernatant and flow through, respectively. 87 Table 3.1: MALDI-TOF mass spectrometry identification of lower molecular weight polypeptides in the Superdex 200 pool post-digest. Peptides were searched against the NCBInr database (http://www.ncbi.nlm.nih.gov) using MASCOT (http://www.matrixscience.com). Enzymatic characterization of pure, HIS6-tag free AtSLP1 was conducted to elucidate the impact of the HIS6 affinity tag on both small molecule and protein inhibitor sensitivity as well as its affinity for a protein substrate. For comparison, control assays containing the same inhibitors at the same experimental concentrations were performed in parallel using purified HIS6-TOPP2 and HsPP1γ (Figure 3.15). As observed with HIS6-AtSLP1, untagged AtSLP1 was again activated by both OA and MCLR (Figure 3.16). Activation by OA was moderately reduced in the absence of the HIS6-tag, while MCLR elicited the same 40 % increase in AtSLP1 phosphatase activity. PPi, Pi and NaF all had inhibitory effects identical to those observed with HIS6-AtSLP1 (Figure 3.16). 3.4 Discussion 3.4.1 Subcellular targeting of the AtSLP phosphatases Previous bioinformatic and proteomic studies indicated SLP phosphatases occupy a number of subcellular locations (Chapter 2). In particular, AtSLP1 was speculated to be peroxisome- (Fukao et al., 2002) and ER-targeted, while AtSLP2 was suggested to reside solely in 88 Figure 3.15: Colloidal blue stained 12 % SDS-PAGE of purified HsPP1γ and HIS6-TOPP2. Each protein was purified as previously described: TOPP2 (Templeton et al., 2011) and HsPP1γ (Douglas et al., 2001). Each lane contains 5 µg of total protein. 89 Figure 3.16: Assessment of purified, untagged AtSLP1 sensitivity to small molecule inhibitors. (A) and (B) Inhibition curves comparing the phosphatase inhibitor sensitivity of AtSLP1 to TOPP2 and HsPP1γ using the PPP protein phosphatase inhibitors OA and MCLR. (C - E) Inhibition curves comparing the PPi, Pi and NaF sensitivity of AtSLP1 to TOPP2. All phosphatase assays were conducted using the small molecule substrate pNPP as outlined in the Materials and Methods. AtSLP1 (Black inverted triangle), TOPP2 (open circle) and HsPP1γ (solid circle) are represented, respectively. 90 the ER (Kutuzov and Andreeva, 2008). Using a combination of fluorescent fusion constructs stably transfected into cell culture, and transiently co-expressed Vicia faba leaves, a subcellular localization was elucidated for AtSLP1 (chloroplast) and AtSLP2 (cytosol). These AtSLP findings substantiated the in silico subcellular localization prediction consensus for SLP1 and SLP2 phosphatases (Chapter 2), emphasizing the likelihood of a conserved chloroplastic and cytosolic location for the respective enzymes in all plants. Previous application of prediction algorithm-assisted inspection of AtSLP1 and other SLP1 phosphatase amino acid sequences using the peroxisomal targeting sequence (PTS1) predictor (http://mendel.imp.ac.at /mendeljsp/sat/pts1/PTS1predictor.jsp), as well as Predotar and Wolf pSORT, found no evidence of a canonical C-terminal PTS1 signaling motif (Chapter 2). The possibility of a N-terminal PTS2 signal or a non-canonical PTS signaling motif was also considered (Lazarow, 2006; Girzalsky et al., 2009), but with the C-terminal AtSLP1-RFP fusion protein lacking observable peroxisome localization in either the stably-transfected cell culture or transiently-expressed condition, it is unlikely that AtSLP1 resides in the peroxisome. Previous accounts of AtSLP1 peroxisome targeting were based on the proteomic analysis of isolated plant peroxisomes (Fukao et al., 2002), rendering chloroplast contamination during their purification a possibility. Previous bioinformatic analysis of the SLP phosphatases suggested AtSLP1 was also ER-targeted (Kutuzov and Andreeva, 2008). These predictions were based on the identification of a putative C-terminal ER retention motif (KDEL or RDEL) and an in silico subcellular prediction result from the online resource Predotar (Kutuzov and Andreeva, 2008). Despite these findings, the consensus of 10 subcellular localization prediction algorithms employed previously (Chapter 2), combined with experimentally verified subcellular localization results of AtSLP1 demonstrated here, do not support ER localization for SLP1 phosphatases. One of the 10 in silico 91 targeting programs employed in the subcellular localization prediction analysis was Predotar, and as in the previous study, it predicted several SLP1 phosphatases to be ER targeted (Chapter 2). ER localization, however, was not suggested by any other program, with the majority of programs predicting the same SLP phosphatases to be chloroplast targeted (Chapter 2). Given the overall predicted chloroplast localization consensus of SLP1 phosphatases, the experimentally verified chloroplast localization of RFP-tagged AtSLP1 and high level of conserved identity (~50-70%) across SLP1 phosphatase homologs, SLP1 phosphatases likely reside solely in the chloroplast. Previous bioinformatic work also predicted AtSLP2 to be an ER-localized protein (Kutuzov and Andreeva, 2008). More comprehensive efforts alternatively identified a cytosolic localization consensus for SLP2 phosphatases, with none of the in silico targeting programs predicting the SLP2 phosphatases to possess any transit peptides (Chapter 2). Moreover, AtSLP2 expressed as an RFP-fusion protein in both stably-transfected cell culture and transiently expressed in Vicia faba leaves lacked detectable ER localization, but maintains what looks to be a consistent punctate cytosolic localization under both experimental conditions. Compared to SLP1 phosphatases, SLP2 phosphatases exhibit an even higher level of sequence identity (~5080%) making it unlikely that an alternative localization in other plants would be observed. 3.4.2 Temporal and spatial differences in AtSLP phosphatase expression Both AtSLP1 and AtSLP2 were found to be expressed in different A. thaliana plant tissues indicating they may differ in biological function. Combined with differing subcellular locations, AtSLP1 protein expression was only detected in photosynthetic tissues with the exception of intact developing siliques, while maintaining complete absence from roots. 92 Conversely, AtSLP2 demonstrated clear expression in roots, with marginally detectable expression in intact siliques. When compared, AtSLP1 and AtSLP2 tissue-specific protein expression largely correlated with transcript expression. However, presence of AtSLP2 protein in photosynthetic tissues cannot be discounted, as data suggest at least some AtSLP2 transcriptional expression is occurring in rosettes, shoots and flowers. Unfortunately, the clear detection of AtSLP2 protein in these tissues via immunoblotting was hindered by the cross-reactivity of AP anti-AtSLP2 IgG with AtSLP1, coupled with the small size differences between AtSLP1 and AtSLP2. Interestingly, parallel temporal expression of AtSLP1 phosphatase transcripts and proteins did not hold true within A. thaliana leaves. AtSLP1 presented a clear diurnal cycle in transcript abundance, which did not completely translate to parallel AtSLP1 protein expression. AtSLP1 transcript abundance was shown to increase in the dark, peaking at the 24 h dark to 0 h light interface, followed by a reduction in transcriptional abundance as the light period progressed. Comparatively, AtSLP1 protein abundance seemed to peak at 8 h light followed by a reduction back to a basal lower level of abundance for the remainder of the 24 h light / dark cycle. This finding may indicate that AtSLP1 is not only transcriptionally regulated, but also regulated posttranslationally and is reminiscent of other chloroplast proteins involved in starch metabolism, which demonstrate diurnal gene expression while maintaining constant protein levels (Lu et al., 2005). Collectively, AtSLP1 expression findings indicate two things: 1) AtSLP1 transcript levels seem to peak prior to AtSLP1 protein production, analogous to other chloroplast-targeted proteins which exhibit a pre-emptive transcript accumulation prior to translation as part of a diurnal cycle anticipating the metabolic and cellular changes accompanying illumination 93 transition (Smith et al., 2004; Lu et al., 2005). 2) AtSLP1 transcripts and AtSLP1 protein lack complete correlative expression, indicating AtSLP1 may possess more than just a light-regulated role in the chloroplast and/or a complex cellular regulation mechanism involving a combination of transcriptional and post-translational control factors. AtSLP2, on the other hand, was not investigated for a diurnal fluctuation in protein expression since it was shown to be likely cytosolic, expressed mainly in non-photosynthetic root tissue and exhibited no pre-existing indication of diurnally regulated transcript fluctuations. 3.4.3 Conservation of essential PPP protein phosphatase motifs in AtSLP1 and AtSLP2 All SLP phosphatases possess the core signature motifs constituting the active site of PPP protein phosphatases (Chapter 1 and 2; (Egloff et al., 1995; Xu et al., 2006)) and lack any of the defining motifs for other protein phosphatase families (Kerk et al., 2008). As PPP protein phosphatases, maintenance of these motifs also served as an indicator that the SLP phosphatases are Ser/Thr protein phosphatases. As shown in Figure 3.9, the SLP phosphatases have all the canonical amino acids involved in coordinating active site metal ions required for catalysis in PPP protein phosphatases (Egloff et al., 1995; Zhang et al., 1996; Xu et al., 2006). These six amino acids include D64 (D54), H66 (H56), D98 (D88), N124 (N114), H173 (H164) and H248 (H238) of the mammalian HsPP1γ and PP2A (in brackets), respectively. Also important are the regulatory protein interactor coordination motifs of PP1 and PP2A catalytic subunits. Our alignments revealed AtSLP phosphatases possess almost none of the amino acids involved in coordinating the canonical RVxF motif of PP1 regulatory subunits (Egloff et al., 1997). Nonetheless, AtI2, an ancient PP1 interactor with an RVxF motif, did weakly inhibit the SLP phosphatases. With human I2 possessing multiple PP1 contact points 94 outside of the RVxF motif (Hurley et al., 2007), contribution from several interaction sites likely allowed a weak association in vitro. This is consistent with data showing that human PP2A can be inhibited by I2 at high concentrations (Brautigan et al., 1986). Since both AtSLP phosphatases possess a slightly higher sequence identity to PP2Ac (10-14%) relative to PP1 (9-13%), the presence of amino acid stretches involved in mediating PP2A-regulatory subunit interactions were also examined. Neither AtSLP phosphatase was found to possess regions of amino acids resembling these stretches (Xu et al., 2006). Together, this suggests that SLP phosphatases either have unique targeting subunits or no targeting subunits at all. 3.4.4 Metal cation preferences of AtSLP phosphatases Atomic structures of PP1 and PP2A revealed the presence of Mn2+/Fe2+ and Mn2+/Mn2+ di-metal cation arrangements in their active sites, respectively (Egloff et al., 1995; Cho and Xu, 2007). These metal cations function to coordinate negatively charged phosphate in the binding pocket, while water initiates nucleophilic attack on a protein-coupled phosphate group and mediates its release (Egloff et al., 1995; Xu et al., 2006; Cho and Xu, 2007). Our results revealed PP1-like metal-dependent phosphatase activity for AtSLP1 using Mn2+ and also Fe3+. Zn2+ and Mg2+ were also examined, as both these cations have documented importance in the catalytic mechanisms of non-protein phosphatases (Kim and Wyckoff, 1991) and PP2C protein phosphatases, respectively (Ingebritsen and Cohen, 1983; Cohen, 1997; Klumpp et al., 2006). Neither Zn2+ nor Mg2+ could recover the activity of AtSLP1 beyond the basal activity observed in the presence of 5 mM EDTA chelating agent. Conversely, AtSLP2 displayed phosphatase activity whether additional metal ions were present (Mg2+ or Zn2+) or not (EDTA), and was found to be inhibited by Fe3+. The lack of 95 discernible metal-dependent activity suggests AtSLP2 may maintain a conformation that renders its bound metal ions unable to be quenched by 5 mM EDTA; however, AtSLP2 activity was easily quenched by 200 mM EDTA indicating that active site metal ions are catalytically important but not easily displaced in vitro. 3.4.5 Inhibition by classic PPP protein phosphatase inhibitors As AtSLPs have the greatest identity to eukaryotic PPP protein phosphatases PP1 and PP2A, each AtSLP phosphatase was examined for sensitivity to known small molecule and protein inhibitors of PPP protein phosphatases. Both PP1 and PP2A type phosphatases have well documented sensitivity to MCLR and OA, with PP1 possessing greater sensitivity to MCLR (0.3 - 1 nM) and PP2A to OA (0.1 - 0.3 nM) (Bialojan and Takai, 1988; MacKintosh et al., 1995). These compounds have also been shown to inhibit the closely-related PPP protein phosphatases PP4, PP5 and PP6 (Heidari et al., 2011). Neither AtSLP1 nor AtSLP2 were sensitive to MCLR or OA. In fact, AtSLP1 demonstrated an unexpected activation by both MCLR and OA. Other Ser/Thr protein phosphatases have been documented to lack sensitivity to these compounds including AtPP7 (Kutuzov et al., 1998), TOPP6 (Templeton et al 2011) and PP2C protein phosphatases (Bialojan and Takai, 1988), while no PPP protein phosphatase has demonstrated activation by either MCLR or OA. This enzymatic activation phenomenon was exhibited by both tagged and untagged AtSLP1 protein, eliminating the possibility of HIS6-tag influence over AtSLP1 conformation and sensitivity to inhibitors. Lack of MCLR and OA inhibition was likely attributed to the absence of a canonical inhibitor binding motif (SAPNYC) which comprises a hydrophobic binding pocket that both covalently (MCLR) and non-covalently (OA) accommodates these inhibitors (Zhang et al., 1996; Maynes et al., 2001). Furthermore, site96 directed mutagenesis of amino acid residues R221 and F276 of HsPP1γ, found outside the SAPNYC motif, conveyed substantive decreases in OA inhibition (Maynes et al., 2001). AtSLP phosphatases also lack these residues (R221 and F276) at equivalent positions, supporting the lack of observable SLP phosphatase inhibition. Additional PPP phosphatase inhibitors tested here included pyrophosphate (PPi), phosphate (Pi) and sodium fluoride (NaF), which have all been shown to inhibit Arabidopsis thaliana PP1 phosphatases (Stubbs et al., 2001). Previously, 6 of the 9 AtPP1 (TOPP) phosphatase isoforms (TOPP1 - 6) were affinity purified from A. thaliana cell culture using a MCLR conjugated Sepharose matrix (Stubbs et al., 2001). TOPP2 was amongst these six phosphatases and was used here in its bacterially-expressed and purified form as an enzyme assay control. Stubbs and colleagues (2001) found that the affinity-purified TOPPs were almost completely inhibited by 100, 1 and 10 mM Pi, PPi and NaF, respectively. Comparatively, the AtSLP phosphatases demonstrated a different pattern of inhibition by these compounds, possibly emphasizing differences in their cellular roles relative to the TOPP phosphatases (Takemiya et al., 2009). Both AtSLP1 and AtSLP2 were relatively insensitive to PPi inhibition, while AtSLP1 exhibited enhanced sensitivity to Pi with complete inhibition at a 50 mM concentration. An inability to completely inhibit either AtSLP phosphatase at 100 mM NaF signifies a notable enzymatic difference to the affinity-purified TOPPs, which were almost entirely inhibited at 20 mM NaF (Stubbs et al., 2001). The sensitivity of AtSLP1 to Pi inhibition (IC50 ~1.5 mM) coupled to its chloroplast localization may indicate an energy sensing function as the chloroplast stroma has been documented to maintain Pi levels of ≥ 12 mM to support production of ATP upon the onset of light (Bligny et al., 1990). Intriguingly, a light-induced decrease in stromal Pi levels to produce 97 ATP would parallel the corresponding increase in detectable AtSLP1 in A. thaliana rosette leaves, suggesting that decreased stromal Pi levels could activate AtSLP1 to dephosphorylate its substrates. Conversely, the relative insensitivity of AtSLP2 to Pi, coupled with a 50 % lower Pi concentration in the cytosol (6 mM), would seem to indicate a function unrelated to cellular energy levels (Bligny et al., 1990). 3.5 Conclusion This work has characterized for the first time two novel, highly conserved plant PPP protein phosphatases, the SLP phosphatases, from A. thaliana. Unique subcellular localizations for each AtSLP were resolved, with AtSLP1 and AtSLP2 found to be chloroplastic and cytosolic, respectively. Enzymatic analyses presented here coincide with bioinformatics, which suggested that SLP phosphatases may be insensitive to MCLR and OA, while also highlighting AtSLP1 biochemistry as indicative of a potential role in regulating chloroplast energy- related processes. Future endeavors will need to employ additional cell biological and biochemical approaches to identify possible regulatory subunits as well as the substrates of these unique enzymes. 98 Chapter Four: Protein interactome analysis of SLP phosphatases 1 and 2 from A. thaliana 4.1 Introduction In A. thaliana, and all other plants, there exists a unique subclass of bacterial-like PPP protein phosphatases called the Shewanella-like protein (SLP) phosphatases (Chapter 2, (Andreeva and Kutuzov, 2004)). Biochemical characterization of this PPP phosphatase subclass has found it further divided into two distinct groups called 'SLP1' and 'SLP2' phosphatases, which differ in both subcellular localization and spatial expression across the plant (Chapter 3). Specific characterization of SLP phosphatases from A. thaliana (At) found that AtSLP1 is chloroplast targeted and expressed exclusively in photosynthetic tissues, while AtSLP2 displays a punctate cytosolic localization and is expressed in non-photosynthetic tissues (Chapter 3). AtSLP phosphatases were also found to maintain a unique insensitivity to classic PPP protein phosphatase inhibitors MCLR and OA, which relates to their lack of key inhibitor binding residues including the C-terminal SAPNYC motif (Chapter 3, (Andreeva and Kutuzov, 2004)). Furthermore, SLP phosphatases are absent in humans, but found in a number of parasitic protozoa in addition to plants (Chapter 2, (Andreeva and Kutuzov, 2004; Patzewitz et al., 2013)), and have recently been proposed as a target of drug development against malaria (Patzewitz et al., 2013). The chloroplast is a major resource generator for photosynthetic Eukaryotes, assimilating carbon into starch during the day to produce carbon skeletons for amino acid and fatty acid biosynthesis during the night (Rolland et al., 2012). This ability of chloroplasts centers around their energy generating capabilities. Light harvesting machinery embedded in the thylakoid membrane establishes an electrochemical proton gradient across the thylakoid membrane, to 99 drive ATP production during the day through chloroplast (C) ATP synthase (Bunney et al., 2001). Chloroplast ATP synthase structure includes 2 major protein complexes designated F0 and F1 (McCarty, 1992). The F0 domain is embedded in the chloroplast thylakoid membrane and consists of a, b, c and b'-ring subunits (x14) (McCarty, 1992; Krah et al., 2010). The CF1 protein complex; however, is membrane associated on the stromal side of the thylakoid membrane and is comprised of alpha (α), beta (β), gamma (γ), delta (δ) and epsilon (ε) subunits (McCarty, 1992; Groth and Pohl, 2001). Interestingly, all CF1 ATP synthase subunits are chloroplast encoded, with the exception of the γ and δ subunits. Phosphoproteomics has revealed a number of phosphorylation sites present on the CF1 ATP synthase β subunits from barley and A. thaliana (del Riego et al., 2006; Reiland et al., 2009), with these β subunit phosphorylation sites being implicated in the regulation of ATP synthase activity through the phosphorylation-dependent interaction of 14-3-3 proteins (Moorhead et al., 1999; Bunney et al., 2001). Interestingly, casein kinase II has been shown to phosphorylate CF1 ATP synthase β subunits; however, no opposing protein phosphatases have yet been identified (Bunney et al., 2001; Reiland et al., 2009). With 95% of all mitochondrial proteins being nuclear encoded and in need of import from the cytosol, there has evolved a diversity of protein import mechanisms across Eukaryotes (Carrie et al., 2010; Fraga and Ventura, 2013). Of recent interest is an unique mitochondrial targeting peptide (mTP)-independent protein import mechanism responsible for the import and assembly of mitochondrial intermembrane space (IMS) proteins. This import machinery consists of two proteins, Essential for Respiration and Vegetative Growth 1 (ERV1; ALR) and Mitochondrial Import and Assembly 40 (MIA40; TIM40) (Chatzi and Tokatlidis, 2012; Herrmann and Riemer, 2012). These two proteins form a redox relay with cytochrome C through their oxidoreductase activity (Chatzi and Tokatlidis, 2012; Herrmann and Riemer, 2012), with 100 MIA40 being responsible for catalyzing disulphide bridge formation on target proteins, trapping them in the IMS (Banci et al., 2009; Sideris et al., 2009; Kawamata and Manfredi, 2010). In nonphotosynthetic Eukaryotes, such as yeast and humans, loss of this system has been found to be lethal (Chacinska et al., 2008; Hell, 2008; Banci et al., 2009); however, in A. thaliana MIA40 was shown to be non-essential under standard growth conditions (Carrie et al., 2010). In all photosynthetic and non-photosynthetic Eukaryotes alike, MIA40 was found to be necessary for the import of the chaperone superoxide dismutase (Ccs1) and copper/zinc superoxide dismutase (CSD1/SOD1) into the mitochondrial IMS (Carrie et al., 2010; Kawamata and Manfredi, 2010). In addition to a mitochondrial IMS subcellular location, plant MIA40s are also targeted to the peroxisome through a conserved PTS1 domain. In A. thaliana, MIA40 was found to import CSD3 to the peroxisome (Carrie et al., 2010), indicating that plant MIA40 oxidoreductase may be involved in regulating diverse cellular processes. In addition to the cytosol, phosphorylated proteins have been annotated in organelles, including the mitochondria (Ito et al., 2009) and the chloroplast (Reiland et al., 2009), indicating protein kinases and phosphatases are operating within these compartments (Schliebner et al., 2008; Bayer et al., 2012). Utilizing a proven tandem affinity purification (TAP) approach, the protein interactome of AtSLP1 and AtSLP2 was characterized from both A. thaliana cell culture and plant tissues. Findings presented here reveal that AtSLP1 and AtSLP2 are involved in regulating different cellular processes located in different organelles, further emphasizing their potential utility in agricultural biotechnology. 101 4.2 Materials and Methods 4.2.1 Plant growth conditions Arabidopsis seeds were surface-sterilized in 0.525 % (v/v) NaOCl (Clorox® Bleach) and 0.2 % (w/v) sodium dodecyl sulfate (SDS), rinsed in deionized water, and spotted onto 0.5 x Murashige-Skoog (MS) plates [0.5 x MS (2.1 g/L) pH 5.7 (PhytoTechnology Laboratories) and 0.8 % (w/v) agar]. Seeds were imbibed at 4oC in the dark for 2 d, followed by germination under 24 h light at 22oC at a light level of 125 µmol m-2s-1. Developed seedlings were transplanted to soil pre-soaked with 1 g/L 20:20:20 fertilizer and covered with translucent plastic lids (i.e. humidity control). Plants were grown in an Econoaire GC-50-BH growth chamber (Enconoaire Systems Ltd) with a 12 h light / 12 h dark photoperiod at a temperature of 22°C and a light level of 125 µmol m-2s-1, fertilized weekly with 0.5 g/L 20:20:20 fertilizer. Plastic lids were removed after 1 week once rooting was established. Plant roots were isolated by germinating A. thaliana seedlings grown as above in Magenta boxes for 21 d. Magenta box growth media consisted of 0.5 x MS media supplemented with 1 mM potassium phosphate. Media was changed every 10 d. A. thaliana rosette tissue was germinated and grown as above under a 12 h light / 12 h dark photoperiod for 21 d prior to harvesting. All tissues were harvested, snap frozen in liquid N2 and stored at -80oC prior to extraction. 4.2.2 Molecular cloning and expression Fluorescent and tandem affinity tag (TAP) constructs were created using full-length At1g18480 (AtSLP2), At1g07010 (AtSLP1) and At5g23395 (AtMIA40) clones obtained from The Arabidopsis Information Resource (TAIR; http://www.arabidopsis.org/). PCR amplification of each respective cDNA was performed using gateway compatible primers and inserted into 102 pDONR221 (Invitrogen). Fluorescent constructs were subsequently created using the binary vectors pB7RWG2 (cRFP) and pK7RWG2 (cGFP), respectively (http://gateway.psb.ugent.be/), while TAP clones were created using pYL436 (Rubio et al., 2005). Transformation of each construct into Agrobacterium tumefaciens strain GV3101 (Agrobacterium) was performed to facilitate stable integration of each construct into both A. thaliana plants and cell culture. At5g23395 was also cloned into pDEST15 (N-terminal GST-tag) and pDEST17 (N-terminal HIS6-tag) for heterologous expression in and purification from E. coli, while AtSLP1 and 2 were cloned into pET-48b(+) (Novagen) and purified via NiNTA as previously described (Chapter 3). GST-only (pGEX4T-1) (Qiagen) was also expressed and purified for use as a control in enzyme and in vitro interaction analyses. 4.2.3 A. thaliana cell culture transfection and TAP pull-downs Wild-type, light-grown A. thaliana cell culture and plants were individually transfected by Agrobacterium carrying AtSLP2-, AtSLP1-, AtMIA40- and GFP-cTAP constructs (Figure 4.1A) as previously described (Chapter 3). Positive transfectants were screened via immunoblotting with anti-Myc IgG. A. thaliana cell culture was subsequently transitioned from light to dark over a two week period, with subculturing performed every 7 d. TAP pull-downs from either light or dark cell culture were performed as previously described (Figure 4.1B; (Templeton et al., 2011)) using extracts derived from 50 g of cells per pull-down, while pulldowns from rosette and roots were performed using 10 and 5 g of tissue extract, respectively. TAP expressing cell culture was mechanically separated and thawed in TAP buffer consisting of 50 mM Hepes-NaOH (pH 7.5), 150 mM NaCl, 10 % (v/v glycerol) and 0.1 % Nonidet P-40 plus protease inhibitors added day of use: 1 mM PMSF, 1 % PVP and 1 µg/mL 103 ProteCEASE-100. Cells were lysed by a single pass through a French Pressure Cell (CimAminco) at 16000 psi, followed centrifugation for 35 mins at 40000 x g. Root and rosette tissue was ground in a pre-cooled mortor and pestle in the aforementioned buffer prior to centrifugation as above. Supernatants were filtered through one layer of mirracloth prior to incubation with 100 µL of IgG-Sepharose (GE Healthcare) end-over-end for 1.5 h at 4oC. TAP-protein coupled IgGSepahrose matrix was collected in a 10 mL gravity column and washed with 30 mL of TAP buffer containing 1 mM DTT without protease inhibitors. Coupled matrix was removed and digested with Precision Protease (GE Healthcare) in a 2 mL eppendorf tube end-over-end at 4oC for 1 h. Eluate was collected by gravity and incubated with 300 µL of settled NiNTA agarose end-over-end for 30 min at 4oC. Coupled NiNTA agarose matrix was washed with 30 mL of TAP buffer containing 5 mM imidazole. TAP pull-downs subjected to SDS-PAGE and / or immunoblot analysis were eluted from NiNTA using a minimal volume of 1x SDS-PAGE sample buffer and boiling for 5 min, while samples subjected to whole eluate analysis were subsequently washed while bound to NiNTA agarose with 10 mL of triethylammonium bicarbonate (pH 8.5) prior to on-bead, overnight (18 h) proteolysis using 750 ng of trypsin (Promega) at 30oC in a water bath. Peptides were eluted using a 50 % acetonitrile : 2.5 % formic acid solution and dried prior to analysis by mass spectrometry. 4.2.4 Mass spectrometry Digested samples were re-suspended in 5 % (v/v) formic acid. After centrifugation at 4oC (10 min, 12000 x g), the supernatant was directly loaded onto capillary columns packed in-house with Magic 5 μM, 100 Å (1 Å=0.1 nm), C18AQ. The data acquisition protocol was modified 104 Figure 4.1: Schematic depiction of tandem affinity purification (TAP) protein isolation methodology. (A) Cartoon of protein construct stably expressed in, and TAP isolated from, A. thaliana plant tissues. Stable plant transfection was performed as outlined in the Materials and Methods and Chapter 3. (B) Workflow depiction of the TAP isolation protocol employed to isolate specific AtSLP interactors. Alternatively, stages 3 & 4 would be combined by employing on-bead trypsinization for subsequent whole eluate mass spectrometric analysis. Dashed line in (A) and '3C' in (B) depict the human rhinovirus 3c (HRV3c) protease cleavage site. 105 from (Breitkreutz et al., 2010). MS/MS results were acquired in data-dependent mode (over a 2 h period in an acetonitrile 2–40 % gradient) on a ThermoFinnigan LTQ equipped with a Proxeon NanoSource and an Agilent 1100 capillary pump. Acquired RAW files were converted to mgf (Mascot generic format) format, and searched with the Mascot search engine (Matrix Sciences) against the A. thaliana complement of the RefSeq database (http://www.ncbi.nlm.nih.gov/RefSeq/; release 38) with a precursor-ion-mass tolerance window of 3.0 Da and a fragment-ion-mass tolerance of 0.6 Da. Methionine oxidation was allowed as a variable modification, and trypsin specificity (with one missed cleavage allowed) was selected. The search results were parsed into a relational database developed in-house (ProHits; (Liu et al., 2010)). The initial filtering of the results was performed by removing the hits with a Mascot score < 60 and only one unique peptide. In addition to GFP-cTAP pull-downs controlling for experimentation involving cell culture, AtSLP1 offered a control for AtSLP2 pull-downs from root tissue, as AtSLP1 is not expressed in 'non-green' tissues and is the closest ortholog of AtSLP2 (Chapter 3). Bonafide binding partners were only considered if they were not found in the majority of negative controls: 6 × GFP–cTAP and 7 × AtSLP1-cTAP for SLP2, 6 × GFP– cTAP and 7 × AtSLP2-cTAP for SLP1 (Table 4.1). 4.2.5 Transient tobacco BY2 cell expression and imaging Transient expression of AtSLP2-cRFP and AtMIA40-cGFP was performed using darkgrown tobacco BY2 suspension cell culture. Prior to bombardment, 3 mL of settled BY2 cell culture was rinsed with wash solution (1.0 x MS, 3 % (w/v) sucrose, 0.25 % (w/v) KH2PO4, 4.5 % (w/v) sorbitol, 4.5 % (w/v) mannitol pH 5.0). BY2 cells were pelleted at 500 x g for 5 min 106 prior to plating on wash solution-moistened Whatman paper placed inside a Petri dish. 5 µg of AtSLP2-cRFP and AtMIA40cGFP plasmid DNA was coupled to 1 mg of gold micro-carriers, washed, spotted on a macro-carrier and accelerated at BY2 cell culture as previously described (Chapter 3). Mitochondrial-targeted GFP plasmid DNA (10 µg) was separately accelerated at BY-2 cells to assess the specificity of Far-Red MitoTracker staining (Invitrogen). Microscopy was performed 18 h post-ballistics treatment. Prior to imaging, bombarded cell culture was treated with 2 µM Far-Red MitoTracker (Invitrogen) for 10 min to stain for mitochondria. Imaging was conducted using a Leica DMIRE2 spectral confocal and multiphoton microscope with a Leica TCS SP2 acoustic optimal beam splitter (Leica Microsystems). Excitation / emission spectra (nm) respectively employed included: GFP 488/500-510, RFP 543/600-625 and MitoTracker 633/680-750. All image processing was performed using freely accessible MacBiophotonics ImageJ (http://www.macbiophotonics.ca). 4.2.6 Isolation of A. thaliana cell culture mitochondria Seven day old, dark-grown A. thaliana suspension culture (120 g) was filtered through 1 layer of miracloth to remove growth media and homogenized batchwise 30 g at a time in a precooled 4oC mortar and pestle. Grinding of each 30 g was performed in 100 mL homogenization buffer consisting of 0.3 M mannitol, 50 mM Mops-KOH pH 7.8, 5 mM EDTA, 0.5 % (w/v) bovine serum albumin (BSA), 1.0 % (w/v) polyvinlypyrrolidone (PVP) and 10 mM DTT added just before use. Isolation of purified mitochondria and the corresponding cytosolic fraction was obtained by Percoll (Sigma P-1644) density gradient centrifugation as previously described (Eubel et al., 2007). Isolated mitochondria were lysed with 0.2 % (v/v) Triton X-100 in 50 mM Mops-KOH pH 7.5, 150 mM NaCl. 107 4.2.7 Enzymatic analysis Analysis of AtSLP2 phosphatase activity was examined using either the small molecule phosphatase substrate pNPP (Sigma) or malachite green reagent (Sigma) to detect the catalyzed release of phosphate. Both pNPP and malachite green assays were assessed using a Spectromax spectrophotometer set to 405 nm and 630 nm, respectively. Enzymatic assays utilizing pNPP were conducted with NiNTA-purified HIS6-AtSLP2 and glutathione Sepharose-purified GST-AtMIA40. All assays were done in parallel with NiNTA-purified eluates generated from uninduced BL21 (DE3) Codon(+)-RIL E. coli to account for potential background pNPP cleavage from NSCPs as in Chapter 3. Phosphatase activities were calculated relative to assays conducted with AtSLP2 only under previously determined conditions rendering highest activity (Chapter 3). All pNPP assays were conducted as previously described (Chapter 3) using either 5 mM DTT, Tris(2-carboxyethyl)phosphine (TCEP) or reduced glutathione (GSH) as a reductant. All assays were quenched with 2.0 M NaOH. Expressed and purified GST only was used as an assay control. Malachite Green assays were performed as previously described (Baykov et al., 1988) alternatively employing a 10 % ammonium molybdate solution. Each assay employed 1 µg of recombinant phosphatase protein incubated for 1 h at 30oC in 160 µL of 1 x dilution buffer containing each respective experimental phospho-peptide substrate. Addition of 40 µL malachite green solution quenched the reaction to reveal phosphatase-catalyzed increase in free phosphate relative to control assays lacking protein phosphatase. Quenched assays were left at room temperature for 10 min prior to spectrophotometric assessment at 630 nm. Amount of free phosphate was calculated using a standard curve derived from known amounts of potassium 108 phosphate. Several of the phosphorylated peptide substrates were generously provided by Dr. D. Alessi of the Protein Phosphorylation Unit (Dundee, Scotland). Assays performed using TAP-purified AtSLP2 employed 65 µL of protein phosphatase bound to NiNTA that was isolated from 20 g of A. thaliana rosette tissue as described above. Each aliquot of coupled NiNTA matrix was re-suspended in 235 µL of 1 x dilution buffer containing 500 µM of the respective phosphorylated peptide and 1 µM HIS6-AtMIA40. Each 1x assay mixture was incubated for 1 h in a 30oC water bath and NiNTA matrix was pelleted. 160 µL of reaction mix was removed and quenched with 40 µL of malachite green solution. Quenched assays were evaluated spectrophotometrically as outlined above. 4.2.8 Antibody production HIS6-AtMIA40 was expressed in BL21 (DE3) Codon(+)-RIL E. coli at 37oC and induced with 0.5 mM isopropyl β-D-1-thiogalactopyranoside (IPTG) for 4 h and purified from inclusion bodies using NiNTA according to the manufacturer’s instructions (Qiagen). Purified protein was dialyzed against water and used for polyclonal antibody production in a New Zealand White rabbit as described (Tran et al., 2004). 4.2.9 Western immunoblotting Affinity-purified polyclonal rabbit anti-AtSLP1 and -2 antibodies previously generated (Chapter 3) were used at 1.5 and 1.2 µg/mL, respectively. Rabbit anti-AtMIA40 crude immune serum was used at 1:1000 dilution. Rabbit anti-plant-type phosphoenolpyruvate carboxylase (PTPC) and sucrose synthase (SuSy) antibodies were used at 1:1000 and 1:10000 dilutions, respectively (Uhrig et al., 2008). Mouse anti-mitochondrial pyruvate dehydrogenase complex 109 E1α (PDCmt-E1α) antibodies were used at 1:100 dilution (Szurmak et al., 2003). Commercially available rabbit anti-Myc antibodies were used at 0.2 µg/mL (ICL Inc). A. thaliana CF1 ATPase antibodies were used at 1:10000 dilution (Roy and Barkan, 1998). 4.2.10 Native polyacrylamide gel electrophoresis (PAGE) Native-PAGE (10 %) was performed as outlined previously (Gennidakis et al., 2007). Size exclusion gel filtration standards (GE Healthcare) run in parallel to immunoblot analysis facilitated an estimation of protein complex sizes. All gels were run at 4oC at 100V prior to transfer to nitrocellulose membrane for immunoblot analysis. 4.3 Results 4.3.1 AtSLP1 specifically interacts with chloroplast F1 ATP synthase subunits β and γ AtSLP1 was found to be exclusively expressed in photosynthetic tissues, while its paralog AtSLP2 was found to be expressed in dark-grown A. thaliana cell culture, roots and seeds (Chapter 3). Given this, tandem affinity purification (TAP) tagged AtSLP1 was used to isolate the AtSLP1 protein interactome from light-grown A. thaliana cell culture and rosettes (Figure 4.2 and 4.3). AtSLP1-TAP was stably expressed in both wild-type A. thaliana cell culture and plants alongside controls; GFP-TAP (A. thaliana cell culture) and AtSLP2-TAP (A. thaliana rosettes and cell culture) (Figure 4.3). AtSLP1-TAP and control TAP pull-downs were performed in parallel, with each isolated TAP-tagged protein immunodetected using anti-Myc IgG to verify their specific isolation from clarified cell extracts (Figure 4.3). No obviously enriched bands were resolved via one dimensional SDS-PAGE, therefore a subtractive mass 110 Figure 4.2: Western blot analysis of dark-grown, wild-type A. thaliana cell culture. 1:1 (w/v) 1 x SDS-PAGE sample buffer-extracted cell culture was loaded in increasing volumes of 5, 15 and 30 µg. Proteins were resolved on 12 % SDS-PAGE, transferred to nitrocellullose and probed with AP anti-AtSLP1 and anti-AtSLP2 IgG used at 1.5 and 1.2 µg/mL, respectively. 111 Figure 4.3: AtSLP-TAP pull-downs isolating AtSLP1- and AtSLP2-specific protein interactors. Representative silver stained SDS-PAGE gel (A) and anti-Myc immunoblot (B) of TAP pulldowns from stably-transfected, light (AtSLP1)- and dark (AtSLP2)-grown, A. thaliana cell culture. GFP-cTAP pull-downs offered a control for NSCPs resulting from protein overexpression, the TAP tag and applied biochemical purification process. TAP pull-downs were also performed from root and rosette tissue (Table 4.1). All TAP pull-down experimentation was performed in at least biological duplicate (Table 4.1) as described in the Materials and Methods. Indicated bands in (A) are enriched compared to other lanes and correspond to the immunoreactive bands in (B). Band marked AtMIA40 was determined by mass spectrometry of the excised band. Lanes contain 30 µL (SDS-PAGE) and 5 µL (Immunoblot) of TAP eluate isolated from 50 g of A. thaliana cell culture. 112 spectrometry approach involving the direct comparison of on-bead digested AtSLP1-TAP and control TAP pull-downs was performed to identify lower abundance AtSLP1 protein interactors. Here, a number of AtSLP1-specific protein interactors common to both light-grown A. thaliana cell culture and rosette AtSLP1-TAP pull-downs were uncovered, in addition to rosette-specific protein interactors. AtSLP1 protein interactors found in both light grown A. thaliana cell culture and rosettes included chloroplast-specific chaperonin proteins At2g28000, At1g55490, At3g13470, At5g56500, while rosette-specific AtSLP1 protein interactors included CF1 ATP synthase subunits β and γ (Table 4.1). Interestingly, examination of compiled online available microarray data (www.genevestigator.com) revealed that neither AtSLP1 nor CF1 ATP synthase γ subunit were abundantly expressed in callus / cell culture tissue (Figure 4.4A). This was corroborated by immunoblotting light and dark cell culture for both AtSLP1 and CF1 ATP synthase β. Here, extremely low levels of AtSLP1 and CF1 ATP synthase β subunit protein were detected in lightgrown A. thaliana cell culture when compared to rosette tissue (Figure 4.4B). 4.3.2 AtSLP2 interacts with mitochondrial redox relay protein AtMIA40 As AtSLP2 was found to be specifically expressed in tissues opposite to AtSLP1 (darkgrown A. thaliana cell culture, roots and seeds), AtSLP1-TAP offered a control for AtSLP2-TAP pull-downs from these tissues (Chapter 3). This facilitated the isolation of the AtSLP2 protein interactome from dark-grown A. thaliana cell culture and roots (Figure 4.3A and Table 4.1). Identical to AtSLP1-TAP, AtSLP2-TAP was stably expressed in both dark grown, wild-type A. 113 Table 4.1: AtSLP1- and AtSLP2-specific protein interactors. Each interactor was uncovered via TAP pull-downs. Specificity of the interaction with either AtSLP1 or 2 was determined relative to control pull-downs involving GFP-TAP or the other SLP-TAP construct. Numbers represent total number of identified peptides obtained for each experiment. 114 Figure 4.4: Transcript and protein expression of AtSLP1 and AtCF1 ATP synthase subunits. (A) Transcript expression of AtSLP1 (At1g07010) and AtCF1 ATP synthase γ subunit (At4g04640) across A. thaliana tissue types. In silico expression data presented here were derived from compiled microarray data (www.genevestigator.com). A scale of white (no expression) to red (high expression) is shown, along with the number of microarrays each dataset was derived from (right). (B) AtSLP1 and AtCF1 ATP synthase β subunit protein expression in cell culture and rosette leaf tissue. Increasing amounts of clarified cell lysates are shown depicting 5, 15 and 30 µg of total protein per lane. AP anti-AtSLP1 IgG (Chapter 3) and AntiCF1 ATP synthase β crude immune serum (Roy and Barkan, 1998) were used for immunoblot analysis. 115 thaliana cell culture and plants alongside controls GFP-TAP (A. thaliana cell culture) (Figure 4.3A) and AtSLP1-TAP (A. thaliana roots and cell culture). All TAP pull-downs were performed in parallel, with each isolated TAP-tagged protein detected by immunoblot using anti-Myc IgG to verify their specific isolation from clarified cell extracts (Figure 4.3B). Unlike AtSLP1-TAP pull-downs, AtSLP2-TAP isolated a highly enriched ~25 kDa polypeptide, which was excised from an SDS-PAGE gel and identified by mass spectrometry as Cox19-like CHCH family mitochondrial import and assembly 40 or AtMIA40. In parallel, a subtractive mass spectrometry approach involving the direct comparison of on-bead digested AtSLP2-TAP and control TAP pull-downs was performed to identify additional AtSLP2 protein interactors (Table 4.1). This approach revealed a number of specific AtSLP2 protein interactors (Table 4.1). AtSLP2 protein interactors found in both dark-grown A. thaliana cell culture and roots included Cox19-like CHCH family mitochondrial import and assembly 40 (MIA40) protein and heat shock protein 60-3a, while root-specific AtSLP2 protein interactors included Initiation factor 4A (eIF-4A) DEAD box RNA helicase and glutamine synthetase (Table 4.1). 4.3.3 Reciprocal interaction analysis supports a specific AtSLP2-AtMIA40 interaction With MIA40 having been shown to function in Eukaryote mitochondrial intermembrane space protein import and assembly (Herrmann and Riemer, 2012), verification of a specific AtSLP2-AtMIA40 protein complex was pursued and subsequently supported through reciprocal TAP pull-downs using AtMIA40-TAP stably expressed in dark-grown A. thaliana cell culture (Figure 4.5). Immunoblot of AtMIA40- and GFP-TAP eluates from dark-grown A. thaliana cell culture demonstrated that endogenous AtSLP2 interacts with AtMIA40-TAP just as AtSLP2TAP interacted with endogenous AtMIA40 (Figure 4.5). Furthermore, AtSLP1-TAP pull-downs 116 from the same tissue failed to pull-down AtMIA40, corroborating the specificity of the observed AtSLP2-AtMIA40 interaction (Table 4.1). Further support for a specific AtSLP2-AtMIA40 protein complex was observed using non-denaturing gel electrophoresis (Native-PAGE) (Figure 4.6). Bacterially-expressed and purified HIS6-AtSLP2 and GST-AtMIA40 (Figure 4.6A and B) were incubated together and resolved via Native-PAGE (Figure 4.6). Immunoblots employing affinity-purified (AP) antiAtSLP2 IgG resolved a higher molecular weight complex specific only to AtSLP2-AtMIA40 (Figure 4.6C). No higher molecular weight complexes were observed between GST-MIA40 and HIS6-AtSLP1, while GST-only failed to form a higher molecular weight complex with either HIS6-AtSLP1 or HIS6-AtSLP2 (Figure 4.6C). Both these approaches provide validation of AtSLP2-AtMIA40 protein complex formation. 4.3.4 AtSLP2 and AtMIA40 co-localize to the mitochondria MIA40 is a mitochondrial-targeted intermembrane space (IMS) oxidoreductase (Carrie et al., 2010). Utilizing fluorescent AtSLP2-RFP and AtMIA40-GFP fusion protein constructs, transient co-localization analysis was performed via particle bombardment of BY2 cell culture in conjunction with FarRed MitoTracker fluorescent staining to specifically visualize mitochondria. This demonstrated AtSLP2-RFP to co-localize with AtMIA40-GFP in the mitochondria (Figure 4.7A). The specific mitochondria staining capabilities of FarRed MitoTracker was assessed using BY2 cell culture transiently expressing mitochondrial-targeted GFP (Mito-GFP) as outlined above (Figure 4.7B). 117 Figure 4.5: Reciprocal TAP pull-downs verify specific interaction between AtSLP2 and AtMIA40. AtMIA40-TAP was stably transfected into dark grown A. thaliana cell culture and used as bait to pull-down native AtSLP2. AtSLP2 was absent from in-parallel GFP-TAP pull-downs. Proteins were resolved on 12 % SDS-PAGE, transferred to nitrocellulose and immunoblotted using 0.5 µg/mL and 1.2 µg/mL AP anti-Myc (ICL Inc.) and anti-AtSLP2 IgG, respectively. 118 Figure 4.6: Reciprocal non-denaturing PAGE verifies specific interaction between AtSLP2 and AtMIA40. (A) Purified fractions of HIS6-tagged AtSLP1, AtSLP2 and non-specific co-purifying proteins (NCSP) as well as GST-tagged AtMIA40 and GST-only. (B) Purified HIS6-tagged AtMIA40 along with clarified HIS6-AtMIA40 expressing bacterial cell supernatant (S/N) and NiNTA flow-through (F/T). Each lane contains 5 µg of total protein. (C) Purified HIS6-AtSLP2, HIS6AtSLP1 and GST-AtMIA40 were incubated for 1 h at 22oC prior to resolution by non-denaturing (NATIVE) PAGE, followed by transfer to nitrocellulose for immunoblot analysis. A specific high molecular weight complex was only observed between HIS6-AtSLP2 and GST-AtMIA40. Neither AtSLP2 nor AtSLP1 interacted with GST-only. Immunoblotting was performed using 1.5 and 1.2 µg/mL affinity-purified anti-AtSLP1 and anti-AtSLP2 IgG as described previously (Chapter 3). 119 Subsequent biochemical isolation of mitochondria from dark-grown wild-type A. thaliana cell culture was performed to confirm transient co-localization data (Figure 4.7C). Immunoblot analysis of purified mitochondria and the cytosolic fraction using AtSLP2- (Chapter 3) and AtMIA40-specific antibodies (Figure 4.8) demonstrated that the endogenous AtSLP2 and AtMIA40 reside in the mitochondria (Figure 4.7C). Additional immunoblotting of both the purified mitochondria and cytosolic fractions with antibodies to known mitochondrial and cytosolic proteins confirmed the successful isolation of intact mitochondria and the localizations of AtSLP2 and AtMIA40 (Figure 4.7C). Immunodetection of mitochondrial pyruvate dehydrogenase complex E1α subunit (Mito-PDC E1α) provided a marker for the isolation of intact mitochondria (Szurmak et al., 2003), while immunodetection of plant-type phosphoenolpyruvate carboxylase (PTPC) and sucrose synthase (SuSy) offered markers for the cytosolic fraction (Gennidakis et al., 2007; Uhrig et al., 2008) and also showed that no crosscontamination occurred during the preparation of each fraction. These localization findings further support the isolation of a specific AtSLP2-AtMIA40 protein complex by demonstrating that AtSLP2 and AtMIA40 are both mitochondrial-targeted proteins. 4.3.5 AtMIA40 activates the Ser/Thr phosphatase activity of AtSLP2 Phosphatase assays employing the substrate pNPP and HIS6-AtSLP2 exhibited a GSTAtMIA40 concentration-dependent increase in phosphatase activity (Figure 4.9A). HIS6-AtSLP2 activation by GST-AtMIA40 was also found to be dependent on the presence of reductant (Figure 4.9A); however, neither reductant alone (Figure 4.10), nor GST-only plus reductant could activate HIS6-AtSLP2 (Figure 4.9A). As well, GST-AtMIA40 did not activate the AtSLP2 120 Figure 4.7: AtSLP2 and AtMIA40 are mitochondrial-targeted proteins. (A) Transient co-expression of RFP-tagged AtSLP2 (red) with GFP-tagged AtMIA40 (green) in tobacco BY2 cells displayed co-localization with the FarRed MitoTracker illuminated mitochondria (blue). (B) Transient expression of mitochondrial-targeted GFP (green) exhibited co-localization with the FarRed MitoTracker illuminated mitochondria (blue). Excitation and emission wavelengths are outlined in the Materials and Methods. (C) Wild-type mitochondria were isolated from dark-grown A. thaliana cell culture. Immunoblotting of the cytosolic (cyto) and purified mitochondrial (mito) fractions was performed using 1:100 anti-mitochondrial pyruvate dehydrogenase complex E1α (anti-PDCmt E1α), 1:1000 anti-plant-type phosphoenolpyruvate carboxylase (anti-PTPC) and 1:10000 anti-sucrose synthase (anti-SuSy) antibodies in conjunction with affinity-purified anti-AtSLP2 and anti-AtMIA40 antibodies used at 1.2 µg/mL and a 1:1000 dilution of crude immune serum respectively. Immunoblotting corroborated the observed mitochondrial targeting of AtSLP2 and AtMIA40. Each lane contains 15 µg of total protein. 121 Figure 4.8: Anti-AtMIA40 IgG production and testing. (A) Purification scheme for 6 M urea-extracted HIS6-tagged AtMIA40. Whole cell (W/C), clarified supernatant (S/N), 6 M urea-extracted protein pellet loaded on NiNTA (I), NiNTA flow-through (F/T) and concentrated NiNTA eluate ([E]) are depicted. (B) Dilution series of purified, 6 M urea-extracted HIS6-AtMIA40 [E] subjected to immunoblot analysis using antiHIS6 IgG. Each lane of the Colloidal stained gel contains 5 µg of total protein. (C) Wild-type A. thaliana Col-O (WT Col-O) and AtMIA40 insertional mutant (atmia40) roots were employed to assess the detection of endogenous AtMIA40 by anti-AtMIA40 crude immune serum. The total protein of each lane is shown. 122 paralog AtSLP1, indicating that the activating effect of AtMIA40 is specific to AtSLP2 (Figure 4.9A). Furthermore, phosphatase assays performed using HIS6-AtSLP2 purified in the continuous presence of reductant (reduced) but lacking exogenous reductant in the assay, exhibited some activity increase when assayed in the presence of oxidized HIS6-AtMIA40. HIS6-AtSLP2 purified in the absence of reductant (oxidized), was highly activated by oxidized HIS6-AtMIA40, but only in assays conducted with 5 mM GSH (Figure 4.9B), while assay of oxidized AtSLP2 and AtMIA40 in the absence of reductant resulted in no increase in activity (Figure 4.9B). As MIA40 proteins maintain oxidoreductase activity towards mitochondrial IMS proteins containing dual Cx9C, Cx3C and other dual CxnC motifs to catalyze the formation of disulphide bridges (Banci et al., 2009; Sideris et al., 2009), AtSLP2 orthologs from completely sequenced plant genomes were aligned to identify equivalent motifs. Two potential CxnC motifs were identified within the C-terminus of SLP2 phosphatases consisting of a CxC and Cx14C sequence arrangement (Figure 4.11). Bacterially-expressed and purified HIS6-AtSLP2 was also found to possess an AtMIA40dependent increase in activity towards pSer, pThr, pTyr and pThr / pTyr phosphorylated peptides (Table 4.2; Figure 4.12), which was consistent with previous phosphatase assays involving pNPP (Figure 4.9). Despite an almost 20-fold activation of HIS6-AtSLP2 activity, AtMIA40 did not alter the specificity of the AtSLP2 toward any one substrate peptide (Figure 4.12A). Phosphopeptide substrate preferences obtained using recombinant HIS6-AtSLP2 also suggested that AtSLP2 was a potential dual specificity protein phosphatase; however, assays conducted using AtSLP2-TAP isolated from dark-grown A. thaliana cell culture found that AtSLP2 maintained a clear preference for pThr phosphopeptide substrates over pTyr-containing peptides 123 Figure 4.9: AtMIA40 activates AtSLP2 under reducing conditions. (A) HIS6-AtSLP2 (250 ng) was incubated with increasing amounts of GST-AtMIA40 in the presence of either 5 mM DTT or TCEP to evaluate the effect of reductant on AtSLP2 activation. Control assays involving HIS6-AtSLP2 + GST-only as well as HIS6-AtSLP1 + GST-AtMIA40 and AtSLP1 + GST-only in the presence of 5 mM DTT demonstrated a specific activation effect of AtMIA40 on AtSLP2. (B) AtMIA40 has been shown to function as an oxidoreductase in humans and yeast. Phosphatase assays involving pre-oxidized (black) or reduced (red) AtSLP2 or oxidized AtMIA40 ± reductant. All phosphatase activities were assessed using the artificial phosphorylated substrate pNPP at 30oC for 1 h. All error bars represent ± standard error; n = 3. 124 Figure 4.10: Effect of reductant only on AtSLP1 and AtSLP2 activity. Phosphatase activity was assessed as outlined in the Materials and Methods using the artificial phosphorylated substrate pNPP with either DTT or GSH at 30oC for 1 h. All error bars represent ± standard error; n = 3. 125 126 Figure 4.11: Amino acid sequence alignment of photosynthetic Eukaryote SLP2 phosphatases against select SLP1 phosphatases. Alignment was performed using ClustalX followed by import into GeneDoc for image assembly. Amino acid sequence gene identifiers are located in Appendix A.1. Highlighted in blue are representative green algal, moss and land plant SLP1 paralogous phosphatases. All cysteines are highlighted in yellow. The CxC and Cx14C motifs are underlined in yellow. (Figure 4.12B). Collectively, these results indicate that AtMIA40 functions to enhance AtSLP2 activity, not alter its phosphoprotein specificity, through the likely formation of disulphide bridges. 4.4 Discussion 4.4.1 AtSLP1 phosphatase interacts with chloroplast F1 ATP synthase subunits β and γ. With the finding that AtSLP1 protein phosphatase interacts with CF1 ATP synthase subunits β and γ, another component of a reversible phosphorylation regulatory mechanism discovered almost 15 years ago may have been uncovered (Kanekatsu et al., 1998; Moorhead et al., 1999; Bunney et al., 2001). Previous phosphoproteomic research has found that both CF1 ATP synthase subunits β and γ are phosphorylated proteins (del Riego et al., 2006; Reiland et al., 2009), and that β subunit phosphorylation is mediated by chloroplast caesin kinase II (Kanekatsu et al., 1998). As well, it was shown that 14-3-3 proteins associate with CF1 ATP synthase β subunit in cauliflower (Moorhead et al., 1999) and that this association was phosphorylation dependent when also examined in barley (Bunney et al., 2001). It was also demonstrated that the association of 14-3-3 proteins inhibited CF1 ATP synthase activity, indicating this interaction is regulatory in nature (Bunney et al., 2001). The association of AtSLP1 protein phosphatase with CF1 ATP synthase subunits suggests that AtSLP1 may function to dephosphorylate and activate the β subunit, by alleviating 14-3-3 protein inhibition. CF1 ATP synthase is activated in the light and inhibited in the dark through a combination of reversible phosphorylation (Moorhead et al., 1999; Bunney et al., 2001), redox mechanisms (Kohzuma et al., 2013) and ADP/ATP binding (Malyan, 2010). Progression from 127 Figure 4.12: Substrate specificity of AtSLP2-AtMIA40 complex. (A) A series of phosphorylated peptides were screened using 250 ng of purified HIS6-AtSLP2 with (grey bars) and without (black bars) the addition of 1000 nM HIS6-AtMIA40 to assess the influence of AtMIA40 on the substrate specificity of AtSLP2. Phosphorylated peptides depicted here represent pSer (Ki67, B56), pThr (RRP1B, BRCA1), pTyr (Rat SAPK3), pThr/pTyr (p38, p38beta)-containing peptides. (B) TAP-purified AtSLP2-cTAP was assessed using BRCA1 (pThr) and rat SAPK3 (pTyr) peptides to assess differences in bacterially-produced and in planta assembled AtSLP2. HIS6-AtMIA40 (1000 nM) was added to assays to maximize AtSLP2 activity. All assays were conducted at 30oC for 1 h, with enzymatic activity deduced by malachite green as outlined in the Materials and Methods. Error bars represent ± standard error; n=3. 128 Table 4.2: Phosphorylated peptides used in AtSLP2 enzyme assays. Listed are the phosphorylated peptide sequences and the corresponding protein frrom which they were derived. 129 night to day would see a decrease in the stromal concentration of free Pi and ADP as ATP begins to be manufactured. Previous examination of AtSLP1 enzyme properties found that it maintained an approximate IC50 in line with physiologically relevant chloroplast Pi concentrations (Chapter 3). As well, despite diurnal fluctuations in AtSLP1 transcripts, AtSLP1 protein levels were largely unchanging, except during the day when a small increase in AtSLP1 protein levels was observed at around 8 h post continuous light exposure (Chapter 3). The interaction between CF1 ATP synthase β subunit and AtSLP1, as well as its phosphorylation-dependent inhibition by 143-3 proteins, coupled with AtSLP1 phosphatases in vitro biochemical characteristics, suggest AtSLP1 could be involved in modulating CF1 ATP synthase activity. Additional experimentation is required to verify this hypothesis (Figure 4.13). 4.4.2 AtSLP2 phosphatase is a novel client of redox relay protein AtMIA40 AtSLP2 represents a novel interaction partner of MIA40 in photosynthetic Eukaryotes, as SLP2 phosphatases are absent in metazoans (Chapter 2). Previous examination of AtMIA40 insertional mutant plants found AtMIA40 responsible for importing the chaperone for superoxide dismutase (Ccs1) and copper/zinc superoxide dismutase (CSD1) into the mitochondrial IMS as well as CSD3 into the peroxisome (Carrie et al., 2010). Import of Ccs1 and CSD1 into the mitochondrial IMS indicates a conserved MIA40 function in plants, as has been observed in other eukaryotic systems such as humans and yeast (Kawamata and Manfredi, 2010). In addition to these proteins, research in non-photosynthetic Eukaryotes has identified a number of mitochondrial IMS proteins that are directly regulated by MIA40, in particular, cytochrome oxidase c protein 17 (COX17) (Sideris et al., 2009) and translocase of inner membrane (TIM) 130 Figure 4.13: Model depicting how AtSLP1 may operate in vivo. Given the activation of AtSLP1 by reductant in vitro, it may be possible that AtSLP1 is redox regulated in vivo. Currently AtSLP1 has demonstrated inhibition by sodium pyrophosphate (PPi) and phosphate (Pi) in vitro at physiologically relevant concentrations as well as an interaction with the γ and β subunits of CF1 ATP synthase. Circled ‘P’ (yellow) represents phosphorylation events which AtSLP1 may regulate. CF1 ATP synthase has been shown to dock 14-3-3 proteins (Moorhead et al., 1999), with the A. thaliana γ and β subunits having been previously shown to be phosphorylated on conserved serine and threonine residues (del Riego et al., 2006; Reiland et al., 2009). 131 proteins, specifically Tim22, which is responsible for the integration of mitochondrial inner membrane proteins (Fraga and Ventura, 2013), and others (Banci et al., 2009; Reddehase et al., 2009; Kawamata and Manfredi, 2010; Darshi et al., 2012; Weckbecker et al., 2012). AtSLP2, however, represents the first protein phosphatase identified to interact with MIA40. MIA40 proteins have been found to catalyze the formation of disulphide bridges on target protein substrates through their oxidoreductase activity, trapping the MIA40 substrate in the mitochondrial IMS (Figure 4.14; (Banci et al., 2009; Sideris et al., 2009; Kawamata and Manfredi, 2010)). COX and TIM proteins represent the first MIA40 substrates identified (Banci et al., 2009; Sideris et al., 2009). Careful analysis found they maintain conserved dual Cx9C or Cx3C motifs named the mitochondrial IMS sorting signal / IMS-targeting signal (MISS/ITS). These dual cysteine-containing sequence stretches form anti-parallel alpha helices through MIA40-catalyzed disulphide bridges (Banci et al., 2009; Sideris et al., 2009). This is achieved via the cysteines of MIA40's active site (the CPC motif), which first forms disulphide bridge intermediates with the target protein substrate (Banci et al., 2009), followed by the formation of disulphide bridges on the target substrate and the release of reduced MIA40 all in the presence of reduced glutathione (GSH) (Banci et al., 2009; Sideris et al., 2009). GSH has been shown to enhance MIA40 activity in vitro (Bien et al., 2010), in addition to exerting an overall reducing pressure on yeast MIA40 redox state in vivo (Kojer et al., 2012). GSH exerts a similar effect on the oxidoreductases of the mammalian endoplasmic reticulum (Jessop and Bulleid, 2004), where lack of GSH production results in compromised yeast protein folding under oxidizing conditions (Cuozzo and Kaiser, 1999). In line with this, the activating effect of AtMIA40 on AtSLP2 was 132 Figure 4.14: Model depicting how AtSLP2 and AtMIA40 may operate in vivo. Prior evidence from human and yeast MIA40 indicates that client proteins undergo oxidative folding which traps them in the mitochondrial intermembrane space (IMS). This reaction was also shown to be dependent on the presence of reductant in the form of reduced glutathione (GSH). 133 most pronounced in the presence of 5 mM GSH; however, a measureable increase in AtSLP2 activity above baseline was also detected in assays conducted using reduced AtSLP2 in the absence of GSH. These findings primarily support AtMIA40 activation of AtSLP2 through the catalyzed formation of disulphide bridges to trap AtSLP2 in the IMS, but cannot discount the alternative possibility that AtMIA40-GSH facilitates AtSLP2 activation solely through protein isomerization of oxygen-induced disulphide bridges (Jessop and Bulleid, 2004; Kojer et al., 2012). MIA40s have also been found to govern the import of non-canonical CxnC motif containing proteins such as Ccs1 (Reddehase et al., 2009; Kawamata and Manfredi, 2010). SLP2 phosphatases represent a non-canonical target of MIA40 oxidoreductase activity, as they do not maintain either dual Cx9C or Cx3C motifs, but rather two regions of cysteines which form CxC and Cx14C motifs based on linear sequence alignments. Since no protein structures are available for a closely related SLP phosphatase ortholog of AtSLP2, structural alignments and/or molecular modeling could not be performed to reveal the positioning of these motifs relative to each other. Both cysteine containing motifs are located in the C-terminal half of the protein beyond key AtSLP2 catalytic motifs. This likely means disulphide bridge formation induces global conformational changes in protein structure resulting in enhanced AtSLP2 activity rather than directly affecting enzymatic catalysis, as is found with PTP phosphatases like starch excess 4 (SEX4) (Silver et al., 2013). Interestingly, these CxnC motifs bracket two previously annotated and highly conserved SLP protein phosphatase motifs whose functions remain unknown (Chapter 2; (Andreeva and Kutuzov, 2004)). The use of paralogous AtSLP1 as a control stems from its phylogenetic relatedness to AtSLP2 and its analogous divergence from classic eukaryotic PPP protein phosphatases (Chapter 134 3; (Moorhead et al., 2009)). Phylogenetically, SLP1 and SLP2 protein phosphatases originated in plants as a result of horizontal gene transfer from Myxobacteria early in photosynthetic Eukaryote evolution (Chapter 2). Likely gene duplication of the incorporated progenitor SLP phosphatase in early photosynthetic Eukaryote evolution then resulted in the proliferation of distinct SLP1 and SLP2 protein phosphatase lineages (Chapter 2). Sequence alignment analysis revealed SLP1 phosphatases maintain half of each of the conserved cysteine-containing CxnC motifs identified in SLP2 phosphatases in similar sequence positions. These points rendered AtSLP1 a logical control for both in vitro assays and TAP preparations (Chapter 3, (Andreeva and Kutuzov, 2004)). In both in vitro assays and TAP pull-downs, AtSLP1 failed to associate with AtMIA40, validating the observation of a specific AtSLP2-AtMIA40 complex. More broadly, this highlights the likely diversity in SLP1 and SLP2 protein phosphatase function within the plant cell. The mitochondrial localization of AtSLP2, combined with its 4- (pNPP) to 20(phosphorylated peptides) fold increase in activity upon interaction with AtMIA40, and a phosphorylated substrate preference indicative of Ser/Thr PPP protein phosphatases, suggests reversible protein phosphorylation maintains a regulatory role within the mitochondrial IMS. No studies to date have specifically examined the role of reversible protein phosphorylation in the mitochondrial IMS of any organism despite targeted proteomics studies revealing the presence of a number of protein kinases and protein phosphatases (Salvi et al., 2005; Pagliarini and Dixon, 2006; Vogtle et al., 2012; Duncan et al., 2013). In particular, Src, Lyn, Fyn, Csk and Fgr kinases (Salvi et al., 2002) as well as SHP-2 phosphatase (Salvi et al., 2004) have been found in the mitochondrial IMS of rat brain mitochondria (Salvi et al., 2005; Pagliarini and Dixon, 2006), along with yeast PP2C phosphatase PTC5 (Vogtle et al., 2012) and Trypanosoma brunei dual 135 specificity and Asp-based phosphatase Tim50, which represents a key mitochondrial protein import regulator (Duncan et al., 2013). Furthermore, recent elucidation of the IMS proteome of yeast identified a number of IMS proteins that have been previously documented in whole mitochondria phosphoproteomics studies to be phosphorylated (Vogtle et al., 2012). These include: peptidyl-prolyl cis-trans isomerase (YDR155C; CPR1), thioredoxin reductase (YDR353W; TRR1) and subunit VIB cytochrome c oxidase (YLR038C; COX12) (PhosphoPep Ver 2.0). Given the prevalence of regulatory protein phosphorylation across Eukaryotes, it is reasonable to predict that mitochondrial IMS reversible protein phosphorylation also represents a conserved phenomenon. 4.4.3 Co-localization of both AtSLP2 and AtMIA40 to the mitochondria Previous in silico analyses indicated that SLP2 phosphatases were cytosolic proteins (Chapter 2). Despite the employment of several mitochondrial prediction algorithms, none accounted for a potential mitochondrial inner membrane space (IMS) localization, likely because proteins targeted to the IMS lack canonical mitochondrial targeting peptides (Chacinska et al., 2008). AtMIA40 and AtSLP2 both lack a predicted mitochondrial targeting peptide (Chapter 3; (Chacinska et al., 2008)). Although we previously noted some punctate staining for AtSLP2RFP, a cytosolic localization was concluded by its co-localization with the cytosolic GFP-only marker construct in Vicia faba leaf cells (Chapter 3). As leaf cells are photosynthetic, it represents a location where endogenous SLP2 phosphatases are not expressed, which may have contributed to the deduced cytosolic localization. Here, through the combined isolation of mitochondria and transient co-expression of fluorescent AtSLP2 and AtMIA40 in a nonphotosynthetic tissue known to express SLP2 phosphatases, a clear mitochondrial localization 136 was uncovered for both AtSLP2 and AtMIA40. This corroborates previous findings that AtMIA40 is a mitochondrial-targeted protein (Carrie et al., 2010). With AtSLP2 and AtMIA40 both lacking a mitochondrial targeting sequence and AtSLP2 representing a client of AtMIA40, which is exclusively found in the mitochondrial IMS of other Eukaryotes (Herrmann and Riemer, 2012) and A. thaliana (Carrie et al., 2010), it is likely that AtSLP2 also resides within the mitochondrial IMS. Characterization of AtMIA40 and other plant MIA40s found an additional peroxisomal subcellular localization through a highly conserved C-terminal PTS1 motif (Carrie et al., 2010). AtSLP2 does not possess a C-terminal PTS1 motif and has not exhibited peroxisomal localization under any circumstance (Chapter 3), indicating that AtMIA40 likely maintains a role in regulating diverse protein targets alternative to SLP2 phosphatases. 4.5 Conclusion The employment of tandem affinity purification to investigate the AtSLP phosphatase protein interactome has yielded a number of interesting findings. In addition to isolating the protein interactome, the TAP protein expression and isolation methodology applied here provided a means of comparing the phosphorylated peptide substrate preferences of bacteriallyexpressed and purified HIS6-AtSLP2 to in planta constructed AtSLP2. The AtSLP phosphatase protein interactome differed between the AtSLP1 and AtSLP2 paralogs and was consistent with their documented subcellular localizations. AtSLP1 was found to interact with CF1 ATP synthase β and γ subunits of the chloroplast stroma, while AtSLP2 was found to interact with AtMIA40 of the IMS. Reciprocal interaction analysis was only performed using AtMIA40 as it was the most consistent and specific interactor identified through TAP pull-downs. Successful reciprocal isolation of the AtMIA40-AtSLP2 protein complex substantiated this specific 137 interaction, and lends credence to the legitimacy of the remaining AtSLP interactors. Elucidation of protein-protein interactors is central to characterizing the cellular role(s) of proteins, and therefore represents a key intermediate step in fully understanding the function of a protein from a broader biological perspective. 138 Chapter Five: Exploring the biological significance of AtSLP1 and AtSLP2: A reverse genetic approach 5.1 Introduction Careful examination of insertional gene-knockout mutant and/or over-expression plants is essential to elucidating the biological functions of proteins (Furbank and Tester, 2011). This provides both fundamental science and targeted agricultural biotechnology with a framework of how a particular protein operates within the cellular environment by examining the effect it has on a plant's phenotype (Furbank and Tester, 2011; Sozzani and Benfey, 2011). Identification of a plant phenotype in a gene knockout or over-expression plant line can require conditional circumstances, such as an abiotic stress (e.g. drought), while other phenotypes, such as developmental / morphological perturbations (e.g. stunted growth), can be identified under control conditions. Sometimes, even the application of experimental stresses may still miss subtle phenotypes which are occurring at the cellular level. In particular, alterations in metabolic flux and/or cell signaling may result in changes which lack visibly manifested organismal phenotypes (Lu et al., 2008). However, continued development of genome sequencing and mass spectrometry techniques / technologies has advanced the field of functional genomics (Hamilton and Buell, 2012) and metabolomics (Schauer et al., 2008), creating new opportunities to identify subtle plant phenotypes via targeted screening of gene knockout and/or over-expression plant lines. Despite the need to investigate biological processes in agriculturally relevant plants, A. thaliana remains the foremost model organism for the elucidation of new biological information on plant processes (Schauer et al., 2008). This is due to its short generation time (~6 weeks), extensively annotated genome (first draft published in 2000) and ease of genetic manipulation 139 (Agrobacterium-mediated transfection), as well as the public availability of online resources (Koornneef and Meinke, 2010). Here, A. thaliana was used as the organism of choice to investigate the roles of SLP phosphatases in plants. Investigation of homozygous atslp1 knockout and 35S::AtSLP1 over-expression plant lines yielded a lack of notable results under control conditions, and was not examined beyond standard growth parameters for conditional phenotypes. Conversely, homozygous atslp2 knockout and 35S::AtSLP2 over-expression plant lines presented dry seed and seed germination phenotypes under standard growth conditions, which were subsequently explored in homozygous atmia40 plants, given the annotated interaction between AtSLP2 and AtMIA40 (Chapter 4). Research surrounding AtSLP2 and AtMIA40 function at the organismal level represents the focus of Chapter 5. Seed germination is a process largely governed by the opposing activities of two key phytohormones, abscisic acid (ABA) and gibberellic acid (GA) (Kucera et al., 2005; Muller et al., 2006; Piskurewicz et al., 2008). ABA functions to establish dormancy in developing seeds (Kucera et al., 2005; Holdsworth et al., 2008), while also maintaining a negative regulatory role during seed germination by preserving seed dormancy (Kucera et al., 2005; Holdsworth et al., 2008). Conversely, GA positively regulates seed germination by stimulating seed swelling, endosperm rupture and testa cracking to facilitate radicle emergence and seedling establishment (Holdsworth et al., 2008). Knockout plant lines deficient in either ABA or GA biosynthesis and/or cell signaling components have been characterized, establishing the core proteins involved in these processes (Kucera et al., 2005; Holdsworth et al., 2008). For instance, mutation in ABA biosynthetic (aba1-1, aba2) and signaling (abi1 - abi5) proteins were found to exhibit a lack of seed dormancy, while ABA over-producing mutants (cyp707a2) maintained enhanced seed dormancy (Kucera et al., 2005). Similarly, GA biosynthetic mutant ga1 exhibited increased 140 dormancy and required exogenous GA to germinate, while DELLA transcription factor signaling mutant rgl2 displayed reduced seed dormancy resulting from enhanced GA-induced gene expression (Lee et al., 2002; Cao et al., 2005; Kucera et al., 2005). A loss of both GA biosynthesis (ga1-3) and signaling through crosses with higher order rga1/rgl2/gai mutants resulted in GA-independent germination (Cao et al., 2005). Interestingly, at the center of ABA-related cell processes are Ser/Thr PPM/PP2C-family phosphatases, which function as negative regulators of ABA signaling (Joshi-Saha et al., 2011). Cellular ABA binds ABA receptor pyrabactin resistance (PYR/PYL/RCAR) proteins, relieving PP2C phosphatase regulation of downstream Snf1-like related kinase 2 (SnRK2) and calciumdependent protein kinase (CDPK) targets. This frees these kinases to activate targets, including ABF transcription factors and ion channels through protein phosphorylation events (Joshi-Saha et al., 2011). To date no protein phosphatases have been implicated in regulating either GA biosynthesis and/or signaling. With seed development and germination essential to the daily lives of people world-wide as a source of food, elucidating the cellular mechanisms which govern these processes maintains both fundamental and biotechnological significance (Martinez-Andujar et al., 2012). As a key driver of seed germination, understanding GA biosynthesis and signaling regulation is of immense importance. Through a combination of biochemistry, cell biology and pharmacological data presented here, the involvement of a Ser/Thr PPP protein phosphatase in regulating GArelated processes during seed germination has been shown for the first time. AtSLP2 was found to be localized to the mitochondrial IMS through mTP-independent targeting, where it specifically interacts with, and is activated by, mitochondrial IMS import and assembly protein 40 (AtMIA40) to negatively regulate seed germination through GA-related processes. Reciprocal 141 interaction analysis and in vitro protein phosphatase assays verified a specific interaction between AtSLP2 and AtMIA40 (Chapter 4). Pharmacological screening of loss-of-function atslp2-2 and atmia40 mutant plant lines provided evidence consistent with GA over-production and/or signaling deficiencies, which was verified by quantitative PCR analysis of known GA biosynthetic and signaling protein-encoding genes. The localization of both AtSLP2 and AtMIA40 to the mitochondria, combined with their influence over seed GA-related processes, represents a novel connection between plant cell metabolism and signaling. 5.2 Methods 5.2.1 Plant growth conditions and seed weight Arabidopsis seeds used in non-pharmacological experimentation were surface-sterilized as described in Chapter 4. Transgenic 35S::AtSLP over-expressors were selected on plates containing 100 μg/mL kanamycin. Seeds were stratified for 2 d at 4°C in the dark for uniform germination, and then transferred to continuous light for 5 - 7 d at 22°C until seedlings formed true leaves. Developed seedlings were transplanted to soil and grown also as described in Chapter 4. Conditions for seeds used in pharmacological studies involved dry seed sterilization and stratification at 4oC for 2 d in the dark on MS agar plates containing 0.3, 1.0 µM ABA and 10 µM Uniconazole. Seeds were then germinated under 24 h light at 22oC at a light level of 125 µmol m-2s-1. Dry seed weight was measured for 50 seeds (n = 5) on a Sartorius MC-5 microbalance 5.1g (www.sartorius.dataweigh.com). 142 5.2.2 Molecular cloning and expression At1g18480 (AtSLP2) and At1g07010 (AtSLP1) cGFP over-expression constructs were created as previously mentioned (Chapter 4). Transformation of each construct into Agrobacterium tumefaciens strain GV3101 (Agrobacterium) was performed to facilitate stable integration of each construct into A. thaliana plants via the floral dip method previously described by (Zhang et al., 2006). 5.2.3 Seed extraction Seeds (50 mg) were extracted by micro-pestle on ice in a solution consisting of 50 mM HEPES-NaOH pH 7.5, 150 mM NaCl, 5 % (v/v) glycerol in addition to 0.2 % (v/v) Triton X100, 1 mM PMSF, 1 mM benzamidine and sand added fresh the day of use. Seed lysates were clarified by centrifugation at 4oC and 14000 x g. Protein estimations for each clarified seed lysate were obtained using a Bradford protein assay read spectrophotometrically at 595 nm. 5.2.4 Plant DNA isolation and PCR genotyping Four atslp1, two atslp2 and one atmia40 insertional mutant plant line(s) were obtained from either the Arabidopsis Information Resource (TAIR; www.arabidopsis.org; SALK), the Nottingham Arabidopsis Stock Center (NASC; www.arabidopsis.info; GABI-Kat) or the Riken Bioresource Center Experimental Plant Division (RIKEN; www.brc.rike.jp/lab/epd/Eng/; RATM). Respective plant lines included atslp1-1 (SALK_134367), atslp1-2 (SALK_129247), atslp1-3 (RATM15-3180-1), atslp1-4 (GABI_313CO4), atslp2-1 (SALK_005557), atslp2-2 (RATM13-2055-1) and atmia40 (SALK_044358) (Figure 5.1). PCR screening of atslp1, atslp2 and atmia40 insertional mutants for homozygosity was conducted using primers outlined in 143 Appendix C.1. DNA was extracted from 14 d old rosette tissue and screened via PCR. PCR conditions included 95oC for 5 min, [95oC 45 s, 55oC 45 s, 72oC 1.5 min] x 30 cycles, 72oC 10 min. 5.2.5 Immunoblot analysis Immunoblotting for AtSLP1 and AtSLP2 expression was performed as previously described (Chapters 3 and 4) to confirm PCR genotype data (Figure 5.2A and B). Affinity purified polyclonal rabbit anti-AtSLP1 and -2 antibodies previously generated were used at 1.5 and 1.2 µg/mL, respectively (Chapter 3). Immunoblotting for AtMIA40 expression was performed as previously described (Chapter 4) to confirm genotype data (Figure 5.2A and B; Appendix C.2.) using a 1:1000 dilution of anti-AtMIA40 crude immune serum. 5.2.6 Quantitative PCR analysis Total RNA from the various A. thaliana lines was isolated using a modified TRIzol (Invitrogen) protocol. First-strand cDNA was synthesized from 5 μg total RNA using oligo (dT)12-18 primer and SuperScript II Reverse Transcriptase (Invitrogen, USA) following manufacturer’s instructions. The qPCR was performed using StepOnePlus Real-Time PCR System (Applied Biosystems). Primer pairs are listed in Appendix C.3. Each PCR reaction contained 1× Fast SYBR Green Master Mix (Applied Biosystems, USA), 200 nM of each primer, and 0.5 μL cDNA in a final volume of 20 μL. PCR amplification was performed for 40 cycles at 95ºC, 3 s and 60ºC, 30 s with a preceding initial enzyme activation of 20 s at 95ºC. 144 Figure 5.1: AtSLP and AtMIA40 gene models depicting insertional mutant plant lines. SALK lines were obtained from the Arabidopsis Information Resource (TAIR; www.arabidopsis.org), GABI-kat line (GABI) was obtained from the Nottingham Arabidopsis Stock Center (NASC; www.arabidopsis.info) and RATM lines were obtained from the Riken Bioresource Center Experimental Plant Division (RIKEN; www.brc.rike.jp/lab/epd/Eng/). Dashed line represents 5’-UTR region upstream of exon 1. Black triangles denote the point of TDNA insertion. 145 Figure 5.2: PCR and immunoblot analysis of AtSLP1 and AtSLP2 knockout and overexpression plant lines. Genotyping of atslp1-3 (A) and atslp2-2 (B) mutant lines by PCR. Primers used are listed in Appendix C.1. A number of positive insertional mutant plants were obtained as shown. Star denotes homozygous insertional mutant line. Corroborative immunoblot analysis of various AtSLP1 (C) and AtSLP2 (D) plant lines confirmed the elimination (knockout) or elevated expression (over-expression) of each respective AtSLP. Italicized atslp denotes insertional knockout plant line, WT NÖ denotes wild-type Nossen A. thaliana (NÖ) and italicized 35S::AtSLP denotes over-expression plant line. 35S::AtSLP constructs are of an approximately 26 kDa due to fusion with GFP. Star denotes homozygous insertional mutant line. All primers used for genotyping atslp insertional mutant plant lines are listed in Appendix C.1. N.S. denotes non-specific. 146 Relative expression levels were calculated and all quantifications were normalized using ubiquitin 10 mRNA as an internal control. For each target gene, the reactions were carried out in duplicate from two biological replicates. 5.2.7 Metabolite analysis A standard set of metabolites was assessed for dry, mature atslp2-2, wild-type NÖ and 35S::AtSLP2 seeds by gas chromatography mass spectrometry through a collaborator (Dr. A. Fernie) as previously described (Lisec et al., 2006). Presented here are results for the seed fatty acid and amino acid content of each. 5.3 Results 5.3.1 Generation of homozygous insertional mutant and protein over-expression plant lines Homozygous atslp1 and atslp2 plant lines were made in conjunction with 35S::AtSLP1 and 35S::AtSLP2 protein over-expression lines to elucidate a spectrum of A. thaliana phenotypic effects. Homozygous atmia40 mutants were created in response to the identification of AtMIA40 as a bonafide interactor of AtSLP2 (Chapter 4). The atmia40 insertional plant line described here functioned to independently corroborate phenotypic findings observed in atslp2 plants. Originally atslp1 and 35S::AtSLP1 plant lines were created to identify plant phenotypes resulting from the complete loss and/or over-expression of AtSLP1. No obvious phenotypes were observed under standard conditions so further characterization was not pursued. All homozygous insertional mutant and protein over-expressing plant lines were analyzed by immunoblot analysis to confirm the loss or enhanced protein expression of AtSLP1, AtSLP2 or AtMIA40, respectively (Figure 5.1 and 5.2; Chapter 4). 147 5.3.2 AtSLP2 and AtMIA40 function to negatively regulate seed germination To elucidate the biological function of AtSLP2, insertional knockout (atslp2-2) and constitutive AtSLP2 over-expression (35S::AtSLP2) plants were examined and found to exhibit a number of unique seed-related phenotypes. Seed phenotype documentation was initiated through the observation of a spectrum in silique size between atslp2-2 and 35S::AtSLP2 plants, which qualitatively maintained larger and smaller siliques, respectively, relative to their wild-type Nossen (NÖ) background (Figure 5.3A). Subsequent examination of mature seeds found that atslp2-2 seeds possessed an elevated seed weight relative to NÖ and 35S::AtSLP2 seed populations (Figure 5.3B). This increased seed weight phenotype of atslp2-2 was mirrored by insertional atmia40 knockout plant seeds (Figure 5.3C). No effect on seed viability was noted in either case, with each seed set exhibiting an equal ability to sow subsequent plant generations. Upon germination, both atslp2-2 and atmia40 seeds exhibited an early seed swelling / cracking (testa cracking) phenotype relative to their respective wild-type backgrounds (Figure 5.4 - 5.7), with atslp2-2 seeds maintaining the most pronounced phenotype (Figure 5.4 and 5.6). Enhanced early testa cracking was accompanied by a spectrum of accelerated to delayed seed germination rates observed in atslp2-2 and 35S::AtSLP2 seeds, respectively (Figure 5.4 and 5.6). AtSLP2 and AtMIA40 protein levels were also examined during imbibition and germination (Figure 5.8). Here 'seed imbibition' refers to seed water uptake and swelling at 4oC in the dark, while 'seed germination' refers to seeds exposed to 24 h light at room temperature. 148 Figure 5.3: Silique size and seed weight. (A) Representative depiction of maturing atslp2-2, wild-type NÖ and 35S:AtSLP2 siliques. Qualitative visual comparison identified a consistent difference in silique size between atslp2, WT NÖ and 35S::AtSLP2 plants. Siliques depicted were obtained from A. thaliana plants grown under a 12 h light : 12 h dark photoperiod. The weight of 50 fully mature, dry atslp2-2 (B) and atmia40 (C) seeds was compared to their respective wild-type backgrounds NÖ and Col-0. 35S::AtSLP2 over-expressing seeds were also measured against wild-type NÖ. The error bars represent standard error (n = 5). Stars denote p < 0.05 when compared to seeds from each respective wild-type background. 149 These two processes were explicitly separated to fully resolve when AtSLP2 and AtMIA40 protein levels were maximal (Figure 5.8). AtSLP2 and AtMIA40 protein expression was found to largely parallel each other, with AtSLP2 expression peaking early in imbibition (3 6 h) and steadily decreasing over the course of germination (Figure 5.8). AtMIA40 protein expression was maintained in the dry seed through imbibition, followed by a steady decrease over the germination time course (Figure 5.8). 5.3.3 An AtSLP2-AtMIA40 protein complex negatively regulates GA biosynthesis The observed testa cracking and early germination phenotypes in atslp2-2 were suggestive of a likely alteration in hormonal metabolism and/or signaling events. Seed germination is governed by complex hormone metabolism and signaling mechanisms centering around the antagonistic interaction displayed by ABA and GA, with either enhanced GA biosynthesis/signaling or ABA under-production/insensitivity leading to rapid germination. To elucidate which process AtSLP2 functions in, a pharmacological approach was undertaken to test the rapid testa cracking and germination phenotypes of atslp2-2 in the presence of either Uniconazole (GA biosynthetic inhibitor) or ABA (to ascertain ABA sensitivity). Uniconazole (10 µM) completely inhibited the early testa cracking and germination phenotypes of both atslp2-2 and atmia40 testa cracking, while 0.3 and 1.0 µM ABA delayed, but did not abolish, atslp2-2 and atmia40 testa cracking (Figure 5.4 and 5.6). These observations indicate that endogenous GA is required for the observed germination phenotypes, suggesting that GA levels are increased in the absence of these proteins. In silico analysis of publically available 150 Figure 5.4: Depiction of atslp2-2, wild-type NÖ and 35S::AtSLP2 seeds germinated on 0.5 x MS-Agar plates containing ABA or Uniconazole. The atlsp2-2 seeds germinated earlier, exhibiting an accelerated rate of testa cracking, radicle emergence and cotyledon emergence compared to wild-type NÖ and 35S::AtSLP2. Application of 0.3 and 1.0 µM ABA slowed germination, while the application of 10 µM Uniconazole completely inhibited germination. Testa cracking is depicted by red arrows, while enlarged images of each panel show a close up of testa cracking. Control plates consisted of 0.5 x MS only. All seeds were imbibed at 4oC for 48 h on each plate in the dark prior to germination under 24 h light at 23oC for the denoted number of days. 151 Figure 5.5: Depiction of atmia40 and wild-type Col-O seeds germinated on 0.5 x MS-Agar plates containing ABA or Uniconazole. The atmia40 seeds germinated slightly earlier, exhibiting an accelerated rate of testa cracking, radicle emergence and cotyledon emergence compared to wild-type Col-O. The observable phenotype was moderate relative to atslp2-2 seeds. Application of 0.3 and 1.0 µM ABA slowed germination, while the application of 10 µM Uniconazole completely inhibited germination. Testa cracking is depicted by red arrows, while enlarged images of each panel show a close-up of testa cracking. Control plates consisted of 0.5 x MS only. All seeds were imbibed at 4oC for 48 h on each plate in the dark prior to germination under 24 h light at 23oC. 152 Figure 5.6: Quantitative analysis of seed testa cracking on 0.5 x MS-Agar plates containing ABA or Uniconazole. (A) Testa cracking exhibited by atslp2-2 (light grey; left), WT NÖ (black; middle) and 35S::AtSLP2 seeds (dark grey; right). Application of up to 1 µM ABA slowed, but did not prevent day 1 testa cracking in the atslp2-2. (B) Testa cracking in atmia40 (light grey; left) and WT Col-0 (black; right) seeds. Similar to atslp2-2 seeds, application of 1 µM ABA reduced but did not prevent day 1 testa cracking in atmia40 seeds. Seed testa cracking was equally inhibited by the application of 10 µM Uniconazole. Control plates consisted of 0.5 x MS only. Seeds were imbibed at 4oC for 48 h on each plate type prior to germination under 24 h light at 23oC. Error bars represent ± standard error (n = 3 plates of 100-150 seeds each). Stars denote 2-tailed Student t-test (p < 0.05) when atslp2-2, atmia40 and 35S::AtSLP2 seeds were examined against their respective WT NÖ (AtSLP2 related) and Col-0 (AtMIA40 related) backgrounds. 153 Figure 5.7: Quantitative analysis of seed radicle emergence on 0.5 x MS-Agar plates containing ABA or Uniconazole. (A) Radicle emergence by atslp2-2 (light grey), WT NÖ (dark grey) and 35S::AtSLP2 seeds (medium grey) is shown. (B) Radicle emergence in atmia40 (light grey) and WT Col-0 (dark grey) seeds. Similar to testa cracking, seed radicle emergence was equally inhibited across all seed types. Control plates consisted of 0.5 x MS only. Seeds were imbibed at 4oC for 48 h on each plate type prior to germination under 24 h light at 23oC. Error bars represent standard error (n = 3 plates of 100-150 seeds each). 154 Figure 5.8: Immunoblot analysis of AtSLP2 and AtMIA40 protein expression in imbibed and germinating seeds. Wild-type NÖ seeds were imbibed at 4oC for the given time prior to extraction. Both AtSLP2 and AtMIA40 are most abundant early in seed germination. Antibody dilutions included 1.2 µg/mL AP anti-AtSLP2 and 1:1000 anti-AtMIA40 crude immune serum. Ponceau stained transfer probed with the aforementioned antibodies depicts the loading in each lane. Each lane contains 30 µg of total protein. 155 microarray data indicated that AtSLP2 transcripts were up-regulated in seeds following treatment with 5 µM GA (Figure 5.9). Furthermore, GA-induced expression of AtSLP2 negatively correlated with A. thaliana GA3 oxidase (GA3OX; At5g25900) and GID1A (At3g05120), while positively correlating with DELLA transcription factors RGA1 (At1g14920) and RGL2 (At3g03450), indicating a role in the negative regulation of GA-related cellular processes (Figure 5.9). Subsequent assessment of A. thaliana MIA40, GA3OX, GID1, RGA1 and RGL2 expression by quantitative RT-PCR over a 12 h imbibition time course corroborated previously observed in silico data obtained through microarray analysis of A. thaliana seeds imbibed in the presence of GA, while also revealing new information about AtMIA40 expression (Figure 5.10). Over the imbibition time course, atslp2-2 seeds possessed significantly up-regulated expression of RGL2, a key regulator of seed germination and a transcriptional repressor degraded in the presence of GA (Figure 5.10; (Lee et al., 2002)). Accompanying up-regulation of RGL2 expression in atslp2-2 seeds was a significant decrease in RGL2 expression in the 35S::AtSLP2 seeds at 6 h and 12 h time points (Figure 5.10). Furthermore, over the imbibition time course, GA3OX and GID1A expression was significantly down-regulated in 35S::AtSLP2 seeds at 6 h and 12 h time points, which is consistent with repression of GA production (Figure 5.10). Expression of MIA40 was also significantly induced following wild-type seed imbibition and interestingly, in the absence of SLP2, MIA40 expression was constitutively up-regulated. 156 Figure 5.9: Relative transcript expression of GA-related biosynthetic and signaling proteins from Biological Arabidopsis Resource (BAR). (A) Changes in GA-related gene expression in response to seed imbibition in the presence of ABA. (B) Changes in GA-related gene expression in response to seed imbibition in the presence of GA. Gene identifiers (i.e. AtSLP2; At1g18480) are shown. Yellow represents no change, while red and blue represent a relative increase and decrease in transcript expression respectively. All data were obtained from microarray data compiled at BAR (http://bar.utoronto.ca/). 157 Figure 5.10: Quantitative PCR analysis of GA biosynthetic and signaling genes from 0 h, 6 h and 12 h imbibed atslp2-2, wild-type and 35S::AtSLP2 seeds. Seeds were imibibed for 0 h, 6 h and 12 h in water at 4oC in the dark followed by snap freezing. (A-E) Transcriptional expression of A. thaliana GA3 oxidase (GA3OX), GA insensitive dwarf 1a (GID1A), regulator of GA 1 (RGA1) and regulator of GA-like 2 (RGL2) as well as mitochondrial intermembrane space import and assembly 40 (MIA40) was assessed. Error bars represent standard error (n = 3). Individual stars denote a 2-tailed Student t-test (p < 0.05) when examined against the respective WT time-point. Dual stars denote 2-tailed Student t-test (p < 0.05) when examined against respective WT 0 h time point. 158 5.3.4 AtSLP2 influences seed fatty acid and amino acid content With dry, mature atslp2-2 and 35S::AtSLP2 seeds found to weigh more and less than wild-type NÖ seeds, respectively, metabolic profiling was undertaken to investigate how AtSLP2 may be influencing seed processes which could result in the observed weight differences (Figure 5.3). A range of metabolites were examined (sugars, polyols, amino acids, fatty acids) with the most significant changes being noted for fatty acids (FA) and amino acids (AA). Measurement of total seed FA (µg / mg seed tissue extracted) found it was reduced in atslp2-2 seeds and increased in 35S::AtSLP2 seeds relative to wild-type NÖ (Figure 5.11A). Further examination of seed FA composition revealed a significant decrease in atslp2-2 seed 16:0, 18:0, 18:2, 18:3 and 20:1 FA content (µg / mg seed tissue extracted), while 35S::AtSLP2 seeds increased their 16:0, 18:0, 18:2, 18:3, 20:0 and 20:1 FA content (Figure 5.11B). When examined based on a molecule percent per mg seed tissue extracted, significant changes in only 16:0, 18:1, 18:2, 18:3 and 20:0 FAs were observed (Figure 5.11C). In addition, examination of the AA composition of dry, mature atslp2-2 and 35S::AtSLP2 seeds relative to WT NÖ seeds uncovered a number of notable differences. In particular, a significant increase in valine and isoleucine content was observed in atslp2-2 seeds, which correspondingly decreased in 35S::AtSLP2 seeds (Figure 5.12). Additional AAs found to increase in atslp2-2 seeds were the nitrogen-rich AAs arginine, lysine, asparagine and ornithine as well as the aromatic AA tyrosine (Figure 5.12). Serine, aspartate and methionine also exhibited an increase in atslp2-2 seeds, while phenylalanine and proline were found to decrease (Figure 5.12). No notable changes were observed for pyroglutamic acid (glutamic acid & glutamine) nor alanine and glycine (Figure 5.12). 159 Figure 5.11: Comparative analysis of atslp2-2 and 35S::AtSLP2 seed FA content and composition. (A) Total FA content (µg/mg seed) of atslp2-2, 35S::AtSLP2 and WT NÖ seeds. Single stars denote p < 0.05 when examined relative to WT NÖ. (B) Breakdown of FA-type abundance (µg) and (C) FA-type percent per mg atslp2-2, 35S::AtSLP2 and WT NÖ seed. The error bars represent standard error (n = 3). Single stars denote p < 0.05 and double stars denote p < 0.01 when examined relative to WT NÖ. Significance was determined using a 2-tailed Student T-test. 160 Figure 5.12: Relative levels of AAs in atslp2-2, WT NÖ and 35S::AtSLP2 dry seeds. Highlighted are branched AAs (blue), nitrogen rich AAs (red) and aromatic AAs (green). The error bars represent standard error (n = 3). Stars denote p < 0.05 when compared to WT NÖ using a 2-tailed Student T-test. 161 5.4 Discussion 5.4.1 Negative regulation of seed germination through GA metabolism In exploring the biological function of AtSLP2, insertional atslp2-2 mutant seeds were found to maintain a heavier dry seed weight relative to NÖ seeds. When germinated on control 0.5 x MS-Agar plates, atslp2-2 seeds exhibited accelerated testa cracking compared to NÖ seeds, while 35S::AtSLP2 over-expressing seeds displayed delayed testa cracking. This indicated that AtSLP2 was a regulator of seed germination. With seed germinative processes predominantly governed by ABA and GA, this suggests that atslp2-2 seeds were either ABA insensitive / underproducing or GA-sensitive / over-producing. The lack of any dramatic ABA insensitivity and maintenance of testa cracking in atslp2-2 and atmia40 seeds in the presence of ABA indicated that the observed seed germination phenotype is likely due to enhanced GA production or altered GA signaling. If atslp2-2 seeds were ABA insensitive, application of 0.3 µM and 1.0 µM ABA should not have induced the concentration-dependent effect on seed germination that was documented here (Koornneef et al., 1984; Koornneef et al., 1989), and if they were ABA underproducing they should have germinated on Uniconazole plates. Furthermore, Uniconazole completely abrogated testa cracking and early germination of atslp2-2 and atmia40 seeds indicating that GA is required for the observed germination phenotypes. Thus, the AtSLP2AtMIA40 complex likely functions to negatively regulate GA biosynthesis since GA signaling mutants such as rgl2 and other DELLA protein (negative regulators) loss-of-function mutants can germinate in the presence of GA inhibitors and confer GA phenotypes in the absence of bioactive GA (Figure 5.13; (Sun and Kamiya, 1994; Lee et al., 2002; Cao et al., 2005)). 162 Figure 5.13: Model of AtSLP2-AtMIA40 protein complex function during seed germination. (A) Under wild-type conditions (Wild-type), AtMIA40 activates AtSLP2 to negatively regulate GA biosynthesis and in doing so positively reinforces ABA inhibition of seed germination. (B) Absence of AtSLP2 (atslp2-2) results in unimpeded GA biosynthetic processes leading to an accelerated seed germination phenotype. (C) Over-expression of AtSLP2 (35S::AtSLP2) leads to enhanced inhibition of GA-related cellular processes, which concomitantly enhances ABA inhibition of seed germination. This leads to a delayed seed germination phenotype. (D) Absence of AtMIA40 (atmia40) reduces the ability of AtSLP2 to negatively regulate GA biosynthesis. This leads to the moderately accelerated seed germination phenotype observed with atmia40 seeds as AtSLP2 alone has been shown to maintain phosphatase activity in vitro (Chapter 3). 163 If GA biosynthesis was up regulated in atslp2-2 seeds, balancing of GA-induced signaling would require a corresponding increase in the negative regulators to counter the increased GA action. In accordance with this hypothesis, expression of GA-inducible RGL2, the major determinant of seed germination (Lee et al., 2002; Stamm et al., 2012), was found to be constitutively up-regulated in atslp2-2, indicating a system attempting to balance GA biosynthetic and signaling outputs (Figure 5.10; (Lee et al., 2002)). In 35S::AtSLP2 seeds this pattern was reversed with these seeds having significantly lower RGL2 expression than wild-type following imbibition (Figure 5.10). 5.4.2 The AtSLP2-AtMIA40 protein complex functions to negatively regulate GA biosynthesis When compared to atslp2-2 seeds, atmia40 seeds exhibited more moderate testa cracking and early germination phenotype. This suggests that AtSLP2 likely represents the functional driver of the AtSLP2-AtMIA40 complex in negatively regulating GA-related cellular processes. Reinforcing this hypothesis, bacterially-expressed and purified AtSLP2 from E. coli maintained detectable protein phosphatase activity, even in the absence AtMIA40 (Chapter 3), indicating that atmia40 seeds likely possess some residual AtSLP2 protein phosphatase activity which mitigates, to a certain extent, the absence of AtMIA40 and its requirement to fully activate AtSLP2. Furthermore, with both atslp2-2 and atmia40 seed phenotype data supporting a role for mitochondrial-targeted AtSLP2 in regulating cytosolic GA-related processes, there remains a need to resolve the intermediate mechanisms connecting the mitochondria to cytosolic GA. Given the metabolic significance of mitochondria across all Eukaryotes, and that almost all mitochondrial proteins are nuclear encoded and imported (Fraga and Ventura, 2013), these 164 intermediate mechanisms may include the regulation of mitochondrial protein import and/or mitochondrial metabolism. Independently connecting the mitochondria to cytosolic GA-related processes is the Nicotiana sylvestris cytoplasmic male sterile II (CMSII) mutant (Gutierres et al., 1997). CmsII possesses a novel insertional mutation in the mitochondrial gene encoding NADH dehydrogenase 7 (NAD7), accompanied by an absence of NAD9, which results in a faulty mitochondrial electron transport chain (mETC) complex I (Gutierres et al., 1997). Similarly, atmia40 plants were also shown to possess a defective mETC complex I resulting from reduced mitochondrial NAD9, which exhibited a 40% decrease in complex I activity (Carrie et al., 2010). Furthermore, metabolite profiling of leaves from cmsII plants uncovered a nitrogen-rich phenotype, consisting of an elevated abundance of nitrogen-rich amino acids and a depletion of starch stores and 2-oxoglutarate (Dutilleul et al., 2005). Subsequent characterization revealed that topical application of bioactive GA could recover a wild-type growth phenotype in cmsII shoots (Pellny et al., 2008). Furthermore, GA biosynthesis requires 2-oxoglutarate (Hedden and Thomas, 2012), which is also a key mitochondrial metabolite required for nitrogen assimilation (Bunik and Fernie, 2009). With both nitrogen assimilation and GA biosynthesis linked through their requirement for 2-oxoglutarate and AtSLP2 potentially linked to electron transport chain complex I through AtMIA40, it may be possible that mitochondrial-targeted AtSLP2 negatively regulates cytosolic GA-related processes through the modulation of mitochondrial 2-oxoglutarate production. Exploration of potential AtSLP2 integration into this network represents an interesting future direction. 165 5.4.3 Influence of AtSLP on seed FA and AA composition Both the deletion and over-expression of AtSLP2 resulted in abnormal seed FA content and composition. Interestingly, when exogenous GA (500 µM) was continuously applied to whole A. thaliana plants after bolting throughout plant development to silique maturity (Chen et al., 2012), a decrease in the total seed FAs (µg / mg seed) was observed, which paralleled the total FA decrease observed in atslp2-2 seeds (Figure 5.11A). GC-MS analysis of the corresponding FA composition of seeds from GA-treated A. thaliana Col-0 plants revealed a significant increase in 18:1 FAs in conjunction with a decrease in 18:2 and 18:3 FAs (Chen et al., 2012). This exact FA fingerprint was observed here with atslp2-2 seeds (Figure 5.11C), reinforcing the prospect that atslp2-2 seeds maintain excess GA resulting from a lack of AtSLP2 negative regulation. Observed changes in total seed FA content and seed FA composition were complemented by a GA-stimulated increase in seed weight, where untreated A. thaliana Col-0 seeds were found to weigh approximately 1 mg / 50 seeds and seeds from GA-treated A. thaliana Col-0 plants weighed approximately 1.55 mg / 50 seeds (Chen et al., 2012). This was paralleled in atslp2-2 seeds from the NÖ ecotype which weighed approximately 1.4 mg / 50 seeds. Examining the effect of GA on A. thaliana seed FA composition stemmed from the identification of DELLA-regulated seed fatty acid reducer (SFAR) genes which function down-stream of GA signaling to reduce seed FA content (Chen et al., 2012). It was found that over-expression of SFAR genes lead to the same GA-stimulated seed FA fingerprint, while here, it was the absence of AtSLP2 which presented the GA-stimulated phenotype. Overall, these observed effects on seed FA content and composition indicate that AtSLP2 also plays a role during seed maturation. Examination of atslp2-2 and 35S::AtSLP2 seed AA composition relative to wild-type NÖ seeds was also performed (Figure 5.12). Of particular note was the dramatic increase in valine 166 and isoleucine in atslp2-2 seeds. Increase in valine and isoleucine concentrations has also been observed in dark-treated A. thaliana seedlings lacking the mitochondrial-targeted enzymes 2hydroxyglutarate dehydrogenase (D2HGDH) and isovaleryl-CoA dehydrogenase (IVDH) (Araujo et al., 2010). These proteins are generally characterized as part of a dark-induced senescence system which responds to carbon starvation through the electron-transfer flavoprotein / electron-transfer flavoprotein:ubiquinone oxidoreductase (ETF/ETFQO) complex of the mitochondrial inner membrane (Ishizaki et al., 2005). This complex represents an extension of the electron transport chain, where in humans and plants alike it is responsible for the breakdown of AAs as an alternative source of energy (Ishizaki et al., 2005; Araujo et al., 2010). Under prolonged dark conditions, both d2hgdh and ivdh insertional mutant seedlings accumulated significant amounts of branched (valine, isoleucine) and aromatic (tyrosine, tryptophan) AAs relative to wild-type seedlings (Araujo et al., 2010). This was paralleled in dry, mature atslp2-2 seeds, indicating an inability under either circumstance to utilize these metabolites as an energy source. It is possible that the significant accumulation of valine and isoleucine in atslp2-2 seeds functions in conjunction with enhanced seed GA biosynthesis to accelerate atslp2-2 seed germination. Also of note, was the significant increase in nitrogen-rich AAs arginine, lysine and asparagine (Figure 5.12). Nitrogen-rich AAs arginine and asparagine were also notably upregulated in the leaves of the Nicotiana sylvestris cytoplasmic male sterile II (CMSII) insertional mutant mentioned above (Dutilleul et al., 2005), which maintains a dysfunctional mETC complex I that was recoverable by the application of bioactive GA (Pellny et al., 2008). In addition to being a resource that may enhance the germination of atslp2-2 seeds, the observed elevation in nitrogen-rich AAs further relates AtSLP2 to both mETC function and cytosolic GA167 related processes. How exactly AtSLP2, a protein phosphatase, fits into these processes remains to be resolved; however, possessing a mitochondrial IMS subcellular localization could suggest AtSLP2 regulates either protein and/or metabolite import or mitochondrial matrix metabolic processes through integral inner membrane proteins. 5.5 Conclusion Unfortunately, the reverse genetics approach applied here did not uncover any phenotypes relating to the biological function of AtSLP1. Conversely, a number of interesting AtSLP2-related seed phenotypes were revealed. In particular, atslp2-2 seeds exhibited accelerated germination indicative of GA over-production, which was supported by elevated levels of RGL2 expression, a GA negative regulator. Further examination of atslp2-2 seed FA and AA composition noted significant content and compositional changes indicative of perturbed mitochondrial function. In addition to seeds, a potential role exists for AtSLP2 in roots, which was not explored here. Future research should look to examine roots from the atslp2-2 and 35S::AtSLP2 plant lines created. This may reveal additional biological roles for AtSLP2. . 168 Chapter Six: Rhizobiales-like phosphatase 2 from Arabidopsis thaliana is a novel phosphotyrosine-specific PPP protein phosphatase 6.1 Introduction Reversible protein phosphorylation, mediated by protein kinases and phosphatases, is a prolific regulatory mechanism key to the proper functioning of eukaryotic cells. Given the extensive catalog of proteins controlled by this mechanism (Olsen et al., 2010), connecting the biochemical characteristics of protein kinases and phosphatases to their cellular function is of central importance. The PPP, PPM/PP2C, PTP and Asp-based protein phosphatase families of A. thaliana collectively encode ~150 protein phosphatases (Kerk et al., 2008), with PPP and PPM/PP2C protein phosphatases maintaining a catalytic mechanism comprised of a tertiary structural fold that coordinates metal cations to assist in protein dephosphorylation (Egloff et al., 1995; Shi, 2009). Both PPP and PPM/PP2C protein phosphatases from Eukaryotes have been shown to specifically target pSer and pThr residues on protein substrates (Kerk et al., 2008). The remaining protein phosphatase families are the PTP and Asp-based protein phosphatases. Both the PTP and Asp-based protein phosphatases employ catalytic motifs generated from a secondary fold in protein structure. These include HCx5R and DxDxT motifs, respectively (Denu et al., 1996; Pannifer et al., 1998; Shi, 2009; Zhang et al., 2010). The PTP phosphatases operate independent of metal cation co-factors, while Asp-based protein phosphatases require Mg2+ (Shi, 2009; Zhang et al., 2010). PTP phosphatases employ a reduced cysteine to catalyze the removal of phosphate from pTyr residues of target protein substrates (Denu et al., 1996; Pannifer et al., 1998; Kerk et al., 2008). Some PTP phosphatases (DSP phosphatases) maintain the ability to dephosphorylate pSer/pThr residues as well as pTyr or other non-protein substrates via the same metal-free PTP phosphatase catalytic mechanism 169 (Vilela et al., 2010; Silver et al., 2013; Tonks, 2013). To date, A. thaliana genome analysis has only identified a single classic PTP protein phosphatase (SSU72), compared to humans, who possess 38 (Kerk et al., 2008). Despite the low abundance of classic PTP protein phosphatases, phosphoproteomic studies have consistently found levels of protein tyrosine phosphorylation (Sugiyama et al., 2008; Mithoe et al., 2012) in A. thaliana and Oryza sativa (O. sativa) which parallel in abundance to those found in humans (Sugiyama et al., 2008). Interestingly, many of the prokaryotic Ser/Thr protein phosphatases characterized to date have been found to maintain dual substrate specificity (Lai and Le Moual, 2005; Pereira et al., 2011; Arora et al., 2012). Given the prokaryotic heritage of SLP and RLPH phosphatases, research presented within this chapter looks to define the substrate specificities of A. thaliana SLP and RLPH PPP protein phosphatases. The Shewanella-like (SLP) phosphatases, were found to be evolutionarily related to PPP protein phosphatases (Chapter 2; (Andreeva and Kutuzov, 2004)). Bioinformatic analysis revealed that they form two highly conserved and phylogenetically distinct sub-groups (Group I, SLP1 and Group II, SLP2 phosphatases) across all plants (Chapter 2). Specific biochemical characterization of the A. thaliana (At) SLP1 and SLP2 phosphatases further uncovered unique biochemical properties and subcellular localizations distinct from other eukaryotic PPP protein phosphatases (Chapters 2 and 3). This included a complete insensitivity to classic PPP protein phosphatase inhibitors, in addition to chloroplastic (AtSLP1) and mitochondrial (AtSLP2) subcellular locations, respectively (Chapters 3 and 4). The second group of bacterial-like protein phosphatases is the Rhizobiales-like (RLPH) phosphatases, which were named due to their phylogenetic relatedness to phosphatases encoded by these bacterial groups (Andreeva and Kutuzov, 2004). Here, I present a biochemical 170 characterization of A. thaliana RLPH phosphatase 2 (AtRLPH2). Analyses uncover a complete insensitivity to classic PPP protein phosphatase inhibitors, in addition to a catalytic reaction mechanism highly resistant to inhibition by EDTA or EGTA. As well, through systematic screening of a set of phosphorylated peptides, AtRLPH2 was found to uniquely maintain a phosphatase activity solely directed towards pTyr peptides. This is the first example of a Eukaryotic PPP protein phosphatase displaying specific pTyr activity, and may offer to help bridge the gap between the prevalence of pTyr in plants and lack of classic PTP protein phosphatases. 6.2 Materials and Methods 6.2.1 Bioinformatics Genevestigator (www.genevestigator.com) was used to obtain Affimatrix 22k microarray data for AtRLPH1 and AtRLPH2 transcriptional expression across different plant tissues. AtRLPH2 orthologs from other plants were identified, cataloged in Chapter 2 and aligned here against representative PPP protein phosphatases using ClustalX2 (http://www.clustal.org/). The presented alignment was assembled using Genedoc as previous described (Chapter 2). 6.2.2 Molecular cloning of AtRLPH2 Fluorescent, tandem affinity purification (TAP) and recombinant protein expression construct creation utilized full-length At3g09970 (AtRLPH2) obtained from The Arabidopsis Information Resource (http://www.arabidopsis.org/). AtRLPH2 was cloned with Gatewaycompatible primers for insertion into pDONR221 (Invitrogen). Subsequently, AtRLPH2 was subcloned into the plant expression vectors pB7RWG2 (http://gateway.psb.ugent.be/) to create a 171 C-terminal RFP (cRFP) fusion construct, and pYL436 to create a C-terminal TAP construct (Rubio et al 2005). AtRLPH2 was also homologously recombined into pET-DEST42 to create a C-terminal V5-HIS6 fusion construct (AtRLPH2-HIS6). All cloning steps were performed using DH5α E. coli. Isolated plasmid DNA was then transformed into the bacterial strain BL21 (DE3) CodonPlus-RIL E. coli (Agilent Technologies) for expression and subsequent purification of AtRLPH2-HIS6 protein. AtSLP1-TAP (Chapter 4) and HIS6-AtSLP1 (Chapter 3) utilized here, were created previously. 6.2.3 Transient expression of fluorescent AtRLPH2 constructs and microscopy AtRLPH2-RFP was transiently co-expressed with either GFP-only (p2FGW7) to illuminate the nucleus and cytosol, or YFP-tagged Histone 2B (H2B), to illuminate the nucleus only (Boisnard-Lorig et al., 2001). All co-expression experimentation was performed using Vicia faba epidermal leaf cells as previously described (Chapter 3). Microscopic imaging was conducted using a Leica DMIRE2 spectral confocal and multiphoton microscope with a Leica TCS SP2 acoustic optical beam splitter (Leica Microsystems), with all cells and leaves visualized using a 63 x water immersion lens. The excitation/emission wavelengths (nm) employed were as follows: GFP, 488/505-515; YFP, 488/515-525; RFP, 594/610-650; chlorophyll autofluorescence, 488 and 594/685 to 715. Subsequent image processing was performed using the freely accessible MacBiophotonics ImageJ software (http://www.macbiophotonics. ca/downloads.htm). 172 6.2.4 Heterologous protein expression and purification AtRLPH2-HIS6 was expressed in BL21 (DE3) CodonPlus-RIL E. coli (Agilent Technologies) grown in LB supplemented with 1 % glucosamine at 22oC, 200 rpm and induced with 0.1 mM IPTG for 18 h. Bacteria were pelleted at 4,000 x g for 15 min and re-suspended in 1 x extraction buffer (50 mM HEPES-NaOH, pH 7.5, 150 mM NaCl, 5 % [v/v] glycerol, and 10 mM imidazole) followed by snap freezing and storage at -80oC prior to use. The day of purification saw the addition of 1 % (v/v) Tween-20, 20 mM imidazole, 1 mM PMSF, 1 mM benzamidine, 2 µg/mL leupeptin, and 5 µg/mL pepstatin upon thawing. Extraction involved mechanical lysis via French press at 3 x 16,000 p.s.i. (Sim-Aminco Spectronic Instruments). Crude lysates were clarified at 40,000 x g for 30 min at 4oC. Supernatants were removed, and an additional 1 mM PMSF and 1 mM benzamidine were added prior to incubation with NiNTAagarose matrix (Qiagen) end-over-end at 4oC for 1.5 h. Matrix was captured in a column via gravity sedimentation and washed with 500 cv of Buffer A (1 x extraction buffer containing 1 M NaCl, 1 % [v/v] Tween-20, and 20 mM imidazole) followed by 100 cv of Buffer B (1 x extraction buffer). NiNTA-bound proteins were eluted in 1x extraction buffer containing 500 mM imidazole, pH 7.5. Protein eluates were concentrated using a 30,000 Da molecular weight cut-off Amicon concentrator (Millipore). NiNTA-pure AtRLPH2 eluate was concentrated to 300 µL and subjected to two sequential concentration / re-suspension steps involving Mono-Q (MQ) Buffer A (50 mM HEPES-NaOH pH 7.5, 50 mM NaCl and 5 % [v/v] glycerol). Concentrated and MQ Buffer A exchanged AtRLPH2-HIS6 eluate was loaded onto a Mono-Q 5/50 anion exchange column using a fast liquid protein chromatography (FPLC) system at 0.5 mL/min. The unbound MQ fraction contained pure AtRLPH2-HIS6. This fraction was concentrated, proteins resolved by SDS-PAGE, aliquoted, snap frozen in liquid nitrogen, and stored at -80oC for 173 subsequent use. HIS6-AtSLP1 was purified using NiNTA as described previously described (Chapter 3). 6.2.5 Tandem affinity purification (TAP) isolation of AtSLP1 and AtRLPH2 AtSLP1 and AtRLPH2 TAP-tagged constructs were transformed into Agrobacterium GV3101 and subsequently used to transfect A. thaliana via the floral dip method as previously described (Zhang et al., 2006) and performed in Chapter 4. AtSLP1- and AtRLPH2-TAP expressing A. thaliana plants were grown as described in Chapter 4. AtSLP1- and AtRLPH2TAP protein was isolated in parallel, each from 20 g of rosette tissue, as previously described using 400 µL of settled NiNTA (Templeton et al., 2011). Each TAP-purified protein was employed in enzyme assays immediately after the final NiNTA purification step, where it was washed, but not eluted as previously described (Chapter 4). 6.2.6 Enzymatic analysis Enzymatic assessment of AtRLPH2 was conducted using two phosphatase assays. The first involved use of the small molecule phosphatase substrate pNPP (Sigma) while the second measured phosphatase-catalyzed phosphate release from phosphorylated peptide substrates derived from known phospho-proteins using malachite green reagent (Baykov et al., 1988). Assays employing pNPP as a phosphatase substrate were used to assess: AtRLPH2 pH activity optima, metal cation dependency (EDTA, EGTA, Mn2+, Fe3+, Zn2+, Ca2+ and Mg2+) and inhibitor sensitivity (MCLR, OA, NaF, sodium orthovanadate (NaOV), ATP, ADP, AMP, Pi, PPi, glucose 6-phosphate (Glu-6P) and glycerol 3-phosphate (Gly-3P)). Standard pNPP assay conditions included pre-incubation of 150 ng AtRLPH2 with an experimental variable in 1x 174 dilution buffer consisting of 100 mM HEPES-NaOH pH7.5, 150 mM NaCl (totaling a volume of 20 µL) for 10 min at 30oC. Each assay was then supplemented with 130 µL of 1x dilution buffer containing 5 mM pNPP followed by an additional 10 min incubation, shaking at 30oC. All pNPP assays were then quenched with 150 µL of 2 M NaOH. Cleavage of pNPP was measured spectrophotometrically at 405 nm. Malachite Green assays were performed as previously described (Baykov et al., 1988) alternatively using a 10 % ammonium molybdate solution. Each assay employed 1 µg of recombinant phosphatase protein incubated for 1 h at 30oC in 160 µL of 1 x dilution buffer containing each respective experimental phosphorylated peptide substrate (as described in Chapter 4). Addition of 40 µL malachite green solution quenched the reaction to reveal phosphatase catalyzed increase in free phosphate relative to control assays lacking protein phosphatase. Quenched assays were left at room temperature (23oC) for 10 min prior to spectrophotometric assessment at 630 nm. Amount of free phosphate was calculated using a standard curve derived from known amounts of potassium phosphate. Assays performed using TAP purified AtSLP1 and AtRLPH2 employed 65 µL of protein phosphatase coupled NiNTA isolated from 20 g of rosette tissue as described above. Each aliquot of coupled NiNTA matrix was re-suspended in 235 µL of 1 x dilution buffer containing the respective phosphorylated peptide and totaling 300 µL. Each assay mixture was incubated for 1 h in a 30oC water bath. NiNTA matrix was pelleted and 160 µL of reaction mix was removed and quenched with 40 µL of malachite green solution. Quenched assays were evaluated spectrophotometrically as outlined above. 175 6.3 Results Pursuit of AtRLPH2, and not AtRLPH1, for biochemical characterization was driven by the observed transcriptional expression profile of AtRLPH2 versus AtRLPH1 across A. thaliana tissue types (Figure 6.1). Interestingly, AtRLPH1 exhibited almost no transcript expression, while AtRLPH2 exhibited an almost ubiquitous tissue expression profile (Figure 6.1), suggesting RLPH1 may in fact be an inactive gene. 6.3.1 AtRLPH2 is a dual-localized nuclear and cytosolic protein Transient co-expression of AtRLPH2-cRFP with GFP- and YFP-tagged marker constructs in Vicia faba leaf epidermal pavement cells demonstrated that AtRLPH2 is a duallocalized nuclear / cytosolic protein (Figure 6.2). The observed dual nuclear / cytosolic subcellular localization confirmed in silico subcellular localization findings presented in Chapter 2. 6.3.2 Purification of recombinant HIS6-tagged AtRLPH2 Expression and purification of AtRLPH2-HIS6 from E. coli using sequential NiNTA affinity purification and Mono-Q (MQ) anion exchange provided pure protein for biochemical characterization (Figure 6.3). Anion exchange chromatography eliminated a number of nonspecific co-purifying bacterial proteins, yielding a significantly more pure AtRLPH2 protein fraction (Figure 6.3A). Immunoblotting of purified AtRLPH2 with V5-epitope antibodies identified the purified protein as AtRLPH2 (Figure 6.3B), which was supported by MALDITOF peptide mass fingerprinting. Subsequent calibrated size exclusion chromatography using a 176 Figure 6.1: AtRLPH1 and 2 transcriptional tissue expression profile. Compiled microarray transcript expression data for AtRLPH1 (At3g09960) and AtRLPH2 (At3g09970) was obtained from Genevestigator (www.genevestigator.com). Expression is indicated by a gradient of white (low expression) to red (high expression) as a percentage of the top 1% genes expressed in each tissue. The numbers denote the number of total arrays used in determining the displayed level of relative gene expression for AtRLPH1 and AtRLPH2. 177 Figure 6.2: Subcellular localization of AtRLPH2. Transient co-expression of AtRLPH2-RFP with (A) GFP-only and (B) histone 2B-YFP (H2BYFP) fluorescent marker constructs in Vicia faba leaf epidermal pavement cells. AtRLPH2-RFP consistently exhibited dual subcellular targeting to the cytosol and nucleus. Scale bar = 20 µm. 178 Figure 6.3: Purification of AtRLPH2-HIS6 from E. coli. (A) Colloidal blue stained 12 % SDS-PAGE depicting 5 µg of NiNTA and Mono-Q HIS6-V5tagged AtRLPH2 containing eluates. (B) Anti-V5 tag western immunoblot (1:500) demonstrating mono-specific detection of HIS6-V5-tagged AtRLPH2 (100 ng). The anti-V5 immunoreactive band was also confirmed as AtRLPH2 by mass spectrometry. 179 Superdex 200 gel filtration column, resolved an in-solution molecular mass of 49.1 kDa, despite a denatured SDS-PAGE mass of approximately 40 kDa (Figure 6.3 and Figure 6.4). 6.3.3 Enzymatic characterization of AtRLPH2 Bacterially-expressed and purified AtRLPH2 was found to maintain a pH optimum of 7.5 - 7.8 (Figure 6.5A) with a specific activity of approximately 20 nmol Pi min-1mg-1 towards the artificial phosphatase substrate pNPP (Figure 6.5B). Screening of known metal cation co-factors of various protein and non-proteinaceous phosphatases revealed a lack of metal cation-dependent phosphatase activity, with highest activity observed in the control condition containing 5 mM EDTA (Figure 6.5B). AtRLPH2 activity assays in the presence of 0.5 mM Mn2+, Fe3+, Ca2+ and Mg2+ all exhibited slightly lower activity, while Zn2+ demonstrated an inhibitory effect (Figure 6.5C). Lack of AtRLPH2 metal-dependent activity was further supported by activity assays conducted with up to 50 mM EDTA or EGTA (Figure 6.5D). Alignment of AtRLPH2 orthologs from across the kingdom Plantae against classic PPP protein phosphatases PP1 (TOPP1) and PP2A (PP2A-1) from A. thaliana and the related Shewanella-like protein phosphatases, also from A. thaliana (AtSLP1 and AtSLP2), identified two highly conserved cysteine residues in the active site of RLPH phosphatases (Figure 6.6), indicating a potential metal-independent PTP phosphatase catalytic mechanism (Figure 6.7), even though the PPP phosphatase metal-binding residues are present. Inhibition assays conducted using hydrogen peroxide (H2O2) and Nethylmaleimide (NEM) failed to inhibit AtRLPH2 at concentrations specific to a PTP phosphatase catalytic mechanism (Figure 6.8). 180 Figure 6.4: Calibrated Superdex 200 size exclusion chromatography of AtRLPH2-HIS6. The observed molecular mass of 49.1 kDa is consistent with a monomeric protein. Vo = void volume. Inset represents a Colloidal blue stained gel of the peak fractions. Each lane contains 30 µL of a 1 mL fraction. 181 Figure 6.5: AtRLPH2 pH activity profile, metal cation dependency, and sensitivity to metal chelators EDTA and EGTA. (A) AtRLPH2 pH activity profile was assessed from pH 5.5 to 9. Activity assays were conducted in the absence of metal cations. (B) Substrate saturation of AtRLPH2 using the artificial phosphatase substrate pNPP. (C) AtRLPH2 was assayed in the presence of (Mn2+, Fe3+, Ca2+, Zn2+ or Mg2+) and absence of metal cations (control). Highest phosphatase activity was detected in the control condition containing 5 mM EDTA. (D) Assays containing increasing amounts of EDTA and EGTA. All assays were performed using 150 ng of purified AtRLPH2 and the phosphatase substrate pNPP. Phosphatase activity was assessed through pNPP cleavage. Error bars depict ± standard error (n = 3). 182 Figure 6.6: ClustalX alignment of representative RLPH phosphatases from across photosynthetic Eukaryotes. Highlighted are cysteines conserved across RLPH phosphatases near or in active site motifs (labeled motif 1 and 2). These cysteine residues may influence AtRLPH2 protein phosphatase activity. Gene identifiers corresponding to the list of RLPH phosphatase depicted here can be found in Appendix A.2. 183 Figure 6.7: Cartoon depiction of PPP, PPM and PTP protein phosphatase motifs relative to AtRLPH2. Based on the maintenance of highly conserved PPP Ser/Thr protein phosphatase catalytic motifs GDxHG, GDxVDRG, GNHE and HGG, AtRLPH2 is a PPP protein phosphatase. Amino acids labeled in green and blue are important in metal ion coordination and enzyme catalysis. Cysteine (C) labeled in red depicts the catalytic cysteine of SEX4. Additional points of interest for AtRLPH2 include the lack of a C-terminal SAPNYC motif responsible for coordinating OA and MCLR inhibition. PP2Cs represents PPM Ser/Thr protein phosphatases (Shi, 2009) and starch excess 4 (SEX4) represents a PTP/DSP phosphatase maintaining the canonical PTP protein phosphatase catalytic motif (Silver et al., 2013; Tonks, 2013). 184 Figure 6.8: Effect of hydrogen peroxide (H2O2) and N-ethylmaleimide (NEM) on AtRLPH2 phosphatase activity. Both H2O2 and NEM failed to inhibit AtRLPH2 at the micromolar concentrations indicative of a PTP and/or DSP phosphatase (Silver et al., 2013; Tonks, 2013). 185 Sodium pyrophosphate (PPi) is a traditional phosphatase inhibitor and displays potent inhibition of AtRLPH2 with an IC50 of approximately 100 nM (Figure 6.9A). Additional phosphate-containing compounds ATP, ADP, AMP, Pi, Glu-6P and Gly-3P were also evaluated for their inhibitory effect on AtRLPH2 (Figure 6.9B). Using pNPP as a substrate, AtRLPH2 exhibited micromolar (µM) to millimolar (mM) sensitivity to ATP, ADP, Pi and AMP (Figure 6.9B). Glu-6P and Gly-3P had no notable inhibitory effect on AtRLPH2 activity up to a 2 mM concentration (Figure 6.9B). As well, phosphate analogs NaF and NaOV were examined for their inhibitory effect on AtRLPH2 (Figure 6.10). NaF and NaOV inhibit PPP and PTP protein phosphatases, respectively. Both NaF and NaOV substantially reduced AtRLPH2 activity to 5 and 20 percent of maximum when using up to 100 mM (Figure 6.10A) and 10 mM (Figure 6.10B) concentrations, respectively (Figure 6.5A and B). The naturally occurring PPP protein phosphatase-specific small molecule inhibitors OA and MCLR were also examined for their inhibitory effect on AtRLPH2. Up to 150 nM OA failed to inhibit AtRLPH2 (Figure 6.10C), while increasing amounts of MCLR exerted an activating effect on AtRLPH2 (Figure 6.10D). Coinciding with this lack of sensitivity to OA and MCLR was a noted lack of the key C-terminal SAPNYC motif responsible for coordinating OA and MCLR in other PPP protein phosphatases (Figure 6.6). 6.3.4 AtRLPH2 is a phosphotyrosine-specific PPP protein phosphatase Utilization of pSer, pThr and pTyr phosphorylated peptides to decipher the substrate preferences of PP1, PP2A, PP2B and PP2C has previously been employed (Donella Deana et al., 1990; Donella-Deana et al., 1991; Donella-Deana et al., 1994) . Therefore, using our own 186 Figure 6.9: AtRLPH2 sensitivity to inhibition by phosphate-containing small molecules. AtRLPH2-HIS6 was incubated with increasing amounts of (A) sodium pyrophosphate and (B) ATP (black circles), ADP (grey squares), AMP (grey triangles), Pi (black diamonds), Glu-6P (black squares; dashed black line), Gly-3P (grey circles; dashed grey line). All assays were performed using 150 ng of purified AtRLPH2 and the phosphatase substrate pNPP. Phosphatase activity was assessed through pNPP cleavage. Error bars represent standard error (n = 3). 187 Figure 6.10: AtRLPH2 sensitivity to classic PPP and PTP protein phosphatase small molecule inhibitors. AtRLPH2-HIS6 incubated with increasing amounts of (A) NaF, (B) NaOV, (C) OA and (D) MCLR in conjunction with the phosphatase substrate pNPP. Inhibition by the phosphate analogs NaF and NaOV is indicative of a PPP and PTP protein phosphatase, respectively. Phosphatase activity was assessed through pNPP cleavage. All assays were performed using 150 ng of purified AtRLPH2. Error bars represent standard error (n = 3). 188 phosphorylated peptides consisting of pSer, pThr, pTyr and pThr/pTyr peptides, the substrate specificity of AtRLPH2 was evaluated (Table 6.1 and Figure 6.11). The activity of bacteriallyexpressed and purified AtRLPH2 towards both phosphorylated peptide and non-peptide substrates was measured by the detection of enzyme-catalyzed release of phosphate from each phospho-substrate (Table 6.1 and Figure 6.11). AtRLPH2 exhibited exceptional preference for pTyr- and pThr/pTyr-containing peptides, with almost no activity toward pSer and pThr peptides (Table 6.1 and Figure 6.11). Highest activity was towards the dually phosphorylated pThr/pTyr mitogen-activated protein (MAP) kinase peptides p38beta and p38 with specific activities of 144278.0 ± 1459.9 and 168610.4 ± 641.3 pmol Pi min-1mg-1 respectively (Table 6.1 and Figure 6.11). Lowest AtRLPH2 activity was towards pSer- and pThr-containing peptides RRP1B and Ki67 with 996.8 ± 8.8 and 1478.6 ± 3.6 pmol Pi min-1mg-1 specific activities, respectively (Table 6.1 and Figure 6.11). No measurable activity was detected when employing non-phospho-peptide substrates (Table 6.1). Phylogenetically related Shewanella-like PPP protein phosphatase AtSLP1 was expressed and purified from bacteria and examined in conjunction with AtRLPH2 to assess its preference for pSer, pThr, pTyr and pThr/pTyr phosphorylated peptides (Figure 6.11). Unlike AtRLPH2, AtSLP1 exhibited a broader specificity profile for phosphorylated peptide substrates, effectively dephosphorylating pSer, pThr, pTyr and pThr/pTyr phosphorylated peptides (Figure 6.11). AtSLP1 also demonstrated an enhanced preference for the pThr/pTyr phosphorylated peptides p38 and p38beta over any singly phosphorylated peptide, maintaining a specific activity of 208287.8 ± 3276.2 and 205614.1 ± 4814.7 pmol Pi min-1mg-1, respectively (Figure 6.11). 189 Table 6.1: Malachite green dephosphorylation assay examining AtRLPH2 phosphatase activity towards phosphorylated peptide substrates. Assays were performed using either 500 µM phosphorylated peptide or 5 mM metabolite and 1 µg purified AtRLPH2. Grey highlighted area represents dual pTyr (pY) and pThr (pT) phosphorylated peptides. The phosphorylated protein from which each corresponding phosphorylated peptide was derived is listed. Values are followed by ± standard error (n = 3). 190 Figure 6.11: Phosphorylated substrate preferences of purified A. thaliana bacterial-like PPP protein phosphatases. A. thaliana SLP1 (AtSLP1 – black bars) and RLPH2 (AtRLPH2 – grey bars) were expressed and purified as HIS6-tagged fusion proteins in E. coli. Purified phosphatases were incubated with 500 µM of the designated phosphorylated peptide at 30oC for 1 h to determine their phospho-substrate preference. Phosphatase activity was determined using a stop time malachite green assay to detect phosphatase catalyzed phosphate release. Error bars represent standard error (n = 3). 191 Isolation of over-expressed, in planta constructed AtSLP1 and AtRLPH2 via tandem affinity purification (TAP) was performed to verify the phosphorylated peptide substrate preferences observed when employing recombinant, bacterially-produced protein (Figure 6.12). The deduced activities for each TAP isolated protein further substantiated the robust preference of AtRLPH2 for pTyr-containing peptides, while also demonstrating that AtSLP1 has a less pronounced specificity for pTyr-phosphorylated peptide substrates relative to its recombinant, bacterially-produced counterpart. Despite this, AtSLP1 still maintained measurable phosphatase activity towards pTyr phosphorylated peptides, but with a marked preference for pThr-containing phosphorylated peptides (Figure 6.12). 6.4 Discussion 6.4.1 Bridging the gap: AtRLPH2 is a phosphotyrosine-specific PPP protein phosphatase Phosphoproteomic research efforts in plants have revealed a similar ratio of tyrosine phosphorylation events as found in mammals with 82.7% (84.8%) pSer, 13.1% (12.3%) pThr and 4.2% (2.9%) pTyr in A. thaliana (O. sativa), respectively, compared to 86.4% pSer, 11.8% pThr and 1.8% pTyr in humans (Sugiyama et al., 2008; Nakagami et al., 2010; Olsen et al., 2010). Genomic studies indicate A. thaliana maintains only a single classic PTP protein phosphatase relative to the 38 encoded by humans and therefore should not be able to maintain an equivalent level of protein tyrosine phosphorylation (Kerk et al., 2008), suggesting that reversible protein tyrosine phosphorylation may alternatively be directed by either PPP protein phosphatases, such as the bacterial-like PPP protein phosphatases AtSLP1 and AtRLPH2, or DSP phosphatases that fall under the PTP phosphatase class. 192 Figure 6.12: Phosphorylated substrate preferences of TAP-purified A. thaliana bacteriallike PPP-family protein phosphatases. Constitutively-expressed AtSLP1-TAP and AtRLPH2-TAP were isolated from 20 g of A. thaliana rosette leaf tissue as per Templeton et al., 2011. Incubation of each TAP purified protein with either 500 µM BRCA1 (pThr – black bars) or rat SAPK3 (pTyr – grey bars) phospho-peptides revealed each protein phosphatase's substrate preference. Phosphatase activity was determined using a stop time malachite green assay to detect phosphatase-catalyzed phosphate release. Although assayed, AtRLPH2-TAP displayed no measurable activity against the pThr BRCA1 peptide. Error bars represent standard error (n = 3). 193 With RLPH phosphatases suggested to have entered plants through a horizontal gene transfer event involving Planctomycetes bacteria it is possible they may have maintained many of their ancestral biochemical properties (Chapter 2; (Andreeva and Kutuzov, 2004)). Studies examining bacterial protein phosphorylation in Salmonella enterica and Bacillus anthracis using phosphorylated peptide substrates have revealed two PP2C phosphatases which seem to possess a dual specificity for pSer/pThr and pTyr (Lai and Le Moual, 2005; Arora et al., 2012). Furthermore, characterization of bacterial PPP protein phosphatases has also found dual specificity towards pSer/pThr and pTyr despite maintaining a PPP protein phosphatase catalytic fold (Tsuruta and Aizono, 1999, 2000; Pereira et al., 2011). These combined findings may represent the tip of a widespread phenomenon across prokaryotes and render it possible that either dual specificity or sole pTyr-specific protein phosphatase capabilities may also extend to uncharacterized phosphatases (e.g. PP2C phosphatases) in plants in addition to the bacterial PPP protein phosphatases discussed here. Interestingly, plants encode considerably more PP2C protein phosphatases when compared to humans (76 PP2Cs in A. thaliana versus 18 PP2Cs in humans; (Kerk et al., 2008; Xue et al., 2008; Fuchs et al., 2013)). Further validation of substrate specificity results obtained using bacterially-expressed and purified AtSLP1 and AtRLPH2 was pursued using in planta constructed versions of each phosphatase TAP-isolated from A. thaliana rosette leaf tissue. Only a slightly reduced preference for pTyr-containing peptides was observed for TAP isolated AtSLP1, with the initial dualspecificity towards pSer/pThr and pTyr peptides remaining intact. TAP-isolated AtRLPH2 preference for pTyr remained unchanged. These corroborative phosphatase assays involving in planta constructed versions of both AtSLP1 and AtRLPH2 proteins were performed, as it was previously observed that protein phosphatase substrate specificity can differ between endogenous 194 and recombinant versions of the same phosphatase (MacKintosh et al., 1996). In this study, bacterially-produced and purified PP1 phosphatase exhibited activity towards pTyr, in addition to pSer/pThr, while the endogenous PP1 purified from rabbit skeletal muscle was pSer/pThr specific. This was not observed with either AtSLP1 nor AtRLPH2, which exhibited no major difference in phosphorylated substrate preference between their recombinant and in planta protein forms. However, previous examination of the AtSLP1 paralog AtSLP2 via identical enzymatic analysis exhibited a marked substrate specificity difference between its recombinant and in planta forms (Chapter 4). Recombinant AtSLP2 showed a broad dual specificity activity profile, while endogenous AtSLP2 was pSer/pThr specific (Chapter 4). 6.4.2 Function of AtRLPH2 protein phosphatase With the largest proportion of pTyr-containing proteins found in the A. thaliana nucleus and/or cytosol (Sugiyama et al., 2008; Mithoe et al., 2012) and AtRLPH2 localized to these compartments, it may be possible that AtRLPH2 regulates signal transduction events that result in its movement between the nucleus and cytosol. In particular, MAP kinases are key proteins responsible for translating developmental and extracellular environmental cues (e.g. physiological stress) into intracellular signaling cascades which modify gene expression (Bartels et al., 2010). They are also proteins which maintain pTyr as part of their activation loop (TxY) (Bartels et al., 2010). Involvement in either of these roles would fit with AtRLPH2's dual localization to the nucleus and cytosol. Current evidence indicates that MAPK phosphatases in plants are comprised of both PP2C and DSP phosphatases (Vilela et al., 2010). Coupled with a pH optimum consistent with the cytosolic environment and the diversity of phosphatases regulating MAP kinase signaling, AtRLPH2 may play a role in regulating the phosphorylation of 195 these protein targets. However, given the prevalence of pTyr-containing proteins outside of MAP kinases, this hypothesis remains one of many (Sugiyama et al., 2008; Nakagami et al., 2010; Olsen et al., 2010). AtRLPH2 also exhibited an enhanced sensitivity to pyrophosphate (PPi) compared to AtSLP1, AtSLP2 and A. thaliana TOPP (PP1) phosphatases (Chapter 3; (Stubbs et al., 2001)). Findings originally indicated that AtRLPH2 may not be a protein phosphatase based on its high affinity for PPi, but rather, a novel pyrophosphatase (May et al., 2011). This was reminiscent of DSP phosphatases which maintain a PTP phosphatase catalytic HCx5R motif, but exclusively dephosphorylate non-proteinaceous substrates such as starch and phosphoinositides (Kerk et al., 2008; Silver et al., 2013; Tonks, 2013). Further examination using phosphorylated metabolites with a demonstrated inhibitory effect, including PPi, failed to elicit any phosphatase activity as a substrate (Table 6.1). This indicated that the observed affinity of AtRLPH2 for these nonproteinaceous substrates was likely due to competition for active site occupancy in enzyme assays conducted with pNPP. Given its high affinity for PPi, AtRLPH2 may possess a cellular role in regulating PPirelated metabolic and/or signaling processes, in particular, enhanced sensitivity to PPi may tie AtRLPH2 to stress adaptation or metabolism (Plaxton, 1996; Mustroph et al., 2005). In addition to signaling, a large proportion of pTyr-containing proteins in A. thaliana were found to regulate protein metabolism and other metabolic processes, providing precedence for a connection between energy sensing and protein tyrosine phosphorylation (Sugiyama et al., 2008). AtRLPH2 also exhibited an increasing sensitivity to phosphate-containing compounds AMP, Pi, ADP and ATP, with no effect observed with Glu-6P or Gly-3P, further emphasizing a potential link between AtRLPH2 and the cellular energy status of the cell. Additional experimentation is still 196 required to identify the in vivo substrates of AtRLPH2 and to verify the significance of its sensitivity to PPi. The prokaryotic heritage of plant RLPH phosphatases suggests a selection pressure to remain unaltered in photosynthetic Eukaryotes, which could be indicative of a conserved role for AtRLPH2 in regulating core and/or ancient cellular processes. Like other photosynthetic Eukaryote PPP protein phosphatases, AtRLPH2 was sensitive to NaF inhibition (Pallen et al., 1985; Stubbs et al., 2001), in addition to inhibition by the PTP protein phosphatase inhibitor NaOV (Swarup et al., 1982; Pallen et al., 1985). With each of these compounds possessing specific inhibitory capabilities against their respective protein phosphatase families, it was unique to find that each compound was able to inhibit AtRLPH2 in a concentration-dependent manner. This supports the notion that AtRLPH2 is a pTyr specific protein phosphatase which maintains a PPP protein phosphatase active site fold capable of accommodating bulkier pTyr-containing proteins. 6.4.3 AtRLPH2 maintains an ancient insensitivity to naturally occurring protein phosphatase inhibitors Given the ancient prokaryotic lineage of RLPH phosphatases, along with previous biochemical findings involving related bacterial-like PPP phosphatases AtSLP1, AtSLP2 (Chapter 3) and PbSHLP1 (Patzewitz et al., 2013), it has become apparent that the bacterial phosphatases of Eukaryotes maintain a conserved insensitivity to OA and MCLR. A possible explanation for this observation may be that these phosphatases were under heavy selection pressure in Prokaryotes to maintain OA and MCLR insensitivity. This may have conveyed a survival advantage in environments containing competing organisms such as OA-producing dinoflagellates (Bialojan and Takai, 1988) and MCLR-producing cyanobacteria (MacKintosh et 197 al., 1990). The lack of OA and MCLR inhibition can be directly related to the lack of a key Cterminal SAPNYC motif which has been shown to be central in coordinating both inhibitors in PP1 and PP2A (Goldberg et al., 1995; Maynes et al., 2001). Whether the lack of OA and MCLR sensitivity in these bacterial-like protein phosphatases is of specific biological significance to photosynthetic Eukaryotes or an evolutionary by-product of their prokaryotic origin has yet to be resolved. 6.4.4 AtRLPH2 catalysis: Cysteine-mediated or metal-dependent catalytic mechanism Lack of detectable metal cation dependency and enzymatic assay quenching by EDTA or EGTA in vitro suggested that AtRLPH2 may be metal cation dependent despite possessing the entire complement of PPP protein phosphatase catalytic motifs required to coordinate metal cations. Sensitivity to Zn2+ inhibition was observed however, indicating a capacity for metal coordination similar to other classic PPP protein phosphatases (Shi, 2009) and suggesting that in solution metal cations can access the active site in vitro. Interestingly, an alignment of plant RLPH2 phosphatases uncovered a conserved cysteine present in active site motif 2 where other PPP phosphatases would possess hydrophobic residues. Subsequent biochemical exploration of this cysteines involvement in a PTP-like phosphatase catalytic mechanism produced results inconsistent with a metal-independent PTP phosphatase. Both classic PTP protein phosphatases and DSP phosphatases utilize a conserved HCx5R redox responsive catalytic motif which maintains a low micromolar (µM) IC50 response to oxidizing agents H2O2 and NEM (Tanner et al., 2011; Silver et al., 2013). In addition to being purified in the absence of reductant, incorporation of these compounds into AtRLPH2 assays failed to achieve complete inhibition, even at millimolar (mM) concentrations. This result, combined with the observed inhibition of 198 AtRLPH2 by Zn2+ cations and conservation of metal cation-binding residues, collectively indicates that AtRLPH2 likely utilizes a metal cation-mediated catalytic mechanism consistent with other PPP protein phosphatases. 6.5 Conclusion Presented here for the first time is a pTyr-specific PPP protein phosphatase. This finding helps to bridge the current disconnect between the high abundance of protein tyrosine phosphorylation in plants and the lack of pTyr-specific protein phosphatases. Furthermore, findings presented here open the door to the larger possibility that reversible pTyr specific protein phosphorylation in plants is regulated by a number of unconventional protein phosphatases such as AtSLP1 and AtRLPH2, as well as PPM/PP2C protein phosphatases and DSP phosphatases. Given the massive family expansion of PPM/PP2C protein phosphatases in photosynthetic Eukaryotes relative to other eukaryotic organisms, future investigations may reveal an unique bacterial origin of some PPM/PP2C protein phosphatases which result in dual pSer/pThr- and pTyr-specific protein phosphatase activity. As well, the dual cytosolic/nuclear localization of AtRLPH2 may indicate a role in signal transduction between these two compartments. Further research is required to identify its interaction partners and endogenous phosphorylated protein substrates. This exciting new finding of a pTyr-specific PPP protein phosphatase brings a fresh perspective to regulatory protein phosphorylation in plants, and should serve as a model for the further identification and characterization of unconventional pTyr protein phosphatases. 199 Chapter Seven: Perspectives and Future Directions 7.1 Summary With reversible protein phosphorylation representing one of the most well characterized post-translational modifications known, it is amazing that new protein kinases and phosphatases are being indentified in extensively examined organisms like A. thaliana. The goal of this Ph.D. thesis was to characterize two recently resolved members of the PPP protein phosphatase family, the SLP and RLPH phosphatases (Andreeva and Kutuzov, 2004; Kerk et al., 2008), to offer insight into their evolutionary heritage, subcellular localizations and biochemical properties, as well as protein interactors and biological function. Careful dissection of their evolution trajectory uncovered a mixed mechanism of acquisition by Eukaryotes involving a combination of classic mitochondrial endosymbiosis and horizontal gene transfer mechanisms (Chapter 2). Initial in silico subcellular prediction algorithms were largely corroborated by transient expression of fluorescent fusion proteins in various tissues (Chapters 2 and 3). This found SLP1 and SLP2 phosphatases to be chloroplast and cytosol targeted, respectively (Chapters 2 and 3), while RLPH2 phosphatases were found to be nuclear / cytosolic targeted (Chapters 2 and 6). By means of mitochondrial isolation from A. thaliana cell culture, AtSLP2 presented a notable exception in that it was found to be exclusively mitochondrial (Chapter 4). This was later corroborated by transient expression in tissues known to contain endogenous AtSLP2, demonstrating the SLP2 phosphatases are mitochondrial rather than cytosolic. As well, all A. thaliana bacterial-like PPP protein phosphatases exhibited a common insensitivity to inhibitors OA and MCLR, which may represent a remnant of their prokaryotic origin (Chapter 3). Employment of phosphorylated peptide substrates uncovered additional layers of complexity by identifying the phosphorylated substrate preferences of AtSLP1, AtSLP2 and AtRLPH2. Here, I found that AtSLP1, AtSLP2 200 and AtRLPH2 represent dual-specificity, Ser/Thr-specific and Tyr-specific protein phosphatases respectively (Chapters 4 and 6). In the case of SLP phosphatases, I was also able to elucidate a protein interactome using TAP pull-downs, revealing AtSLP1 and AtSLP2 to maintain completely unique sets of protein interaction partners, indicative of independent cellular and biological functions (Chapter 4). AtSLP1 was found to interact with CF1 ATP synthase β and γ subunits of the chloroplast stroma, while AtSLP2 interacted specifically with AtMIA40 protein of the mitochondrial intermembane space. Unfortunately, reciprocal validation of these interactions was only achieved between AtMIA40 and AtSLP2, while the verification of an AtSLP1 - CF1 ATP synthase interaction remains incomplete. Verification of a specific AtSLP2 - AtMIA40 interaction was followed by careful plant phenotype analysis of atslp2-2 and 35S::AtSLP2 plants which resulted in the identification of a pronounced early germination phenotype in atslp2-2 seeds. Corroborating evidence for this phenotype was found using atmia40 seeds, which exhibited an identical, but more moderate, early germination phenotype (Chapter 5). Additional pharmacological characterization involving the topical application of ABA and Uniconazole indicated that this phenotype was a result of enhanced GA production in the atslp2-2 plant, suggesting AtSLP2 to be a negative regulator of GA biosynthesis. This discovery was reminiscent of the key role of PP2C phosphatases in ABA signaling, rendering AtSLP2 an exciting new candidate for the regulation of plant phytohormone-related processes. Overall, the research I present throughout this thesis functions as a solid foundation from which future studies of the bacterial-like protein phosphatases from A. thaliana and beyond can be based. Given this, I outline below what I believe are some of the logical next steps to take in further exploring the bacterial-like PPP protein phosphatases. 201 7.2 Novel PPP phosphatase of the chloroplast: SLP1 phosphatases The documented interaction between AtSLP1 and CF1 ATP synthase β and γ subunits requires validation through either independent or reciprocal protein-protein interaction analysis. This would likely prove easiest through the application of immunoprecipitation. Here, antibodies to the soluble CF1 ATP synthase of A. thaliana could be employed to obtain an AtSLP1 chloroplast ATP synthase complex from either rosette tissue or isolated chloroplast extracts harvested from plants during the day and extracted in the presence of phosphatase inhibitors. Subsequent immunoblotting for AtSLP1 with specific antibodies developed in this thesis would circumvent the need for mass spectrometry. Based on available literature, immunoprecipitation should be able to be applied under more stringent conditions and in the presence of phosphatase inhibitors to isolate intact and phosphorylated A. thaliana CF1 ATP synthase (del Riego et al., 2006; Reiland et al., 2009). This could then also be used in conjunction with bacteriallyexpressed and purified AtSLP1 to assess CF1 ATP synthase β and γ subunits as a potential phosphorylated substrate of AtSLP1. A number of in vitro assay readouts could be employed to validate this possibility. These include malachite green enzyme assays, Pro-Q diamond phosphoprotein stain (Invitrogen) or quantitative mass spectrometry. Furthermore, with standard growth conditions failing to identify an obvious atslp1 / 35S::AtSLP1 plant phenotype, application of experimental growth conditions should be explored. Given the putative interaction identified between AtSLP1 and chloroplast ATP synthase, photoperiod could be envisioned as a logical starting point. With A. thaliana growth examined here under a 12 h light / 12 h dark photoperiod, 'long' day and 'short' day experimentation could be conducted using 16 h light / 8 h dark and 8 h light / 16 h dark photoperiods. With AtSLP1 202 representing the first chloroplast targeted PPP protein phosphatase identified to date, and the chloroplast possessing a number of phosphorylated proteins, elucidating the cellular function of AtSLP1 could be of fundamental importance. 7.3 Connecting AtSLP2 to GA biosynthesis The fortuitous connection uncovered here between AtSLP2 and GA biosynthesis requires further characterization. First and foremost, understanding this connection requires identifying the phosphorylated substrate of AtSLP2. Given its unique location in the mitochondrial IMS along with AtMIA40, a direct regulatory connection between AtSLP2 and cytosolic GA metabolism seems unlikely, but given that mitochondria are of central importance to plant cell function, other indirect connections remain viable possibilities worthy of exploration (Araujo et al., 2012). One of the primary functions of mitochondria in plants is to provide carbon skeletons, through the TCA cycle, to the cytosol for anabolic construction of a wide array of metabolites. This is powered largely through pyruvate which feeds directly into the TCA cycle. Analogous to auxotrophic systems (e.g. humans), plant mitochondria can also utilize amino acids as an alternative carbon source when carbon is in short supply (Ishizaki et al., 2005; Araujo et al., 2012), in particular, carbon rich amino acids such as the branched amino acids valine, leucine and isoleucine (Ishizaki et al., 2005; Araujo et al., 2010; Araujo et al., 2012). With atslp2-2 seeds exhibiting massively elevated valine and isoleucine levels, it has become evident that loss of AtSLP2 perturbs homeostatic levels of branched amino acids possibly by preventing their import into the mitochondria for degradation. Although FA are not broken down in the mitochondria of plants, they too exhibited a significant change in atslp2-2 seeds, which may also have resulted from disfunctional mitochondrial processes. With AtSLP2's subcellular compartmentation 203 relegated to the mitochondrial IMS, there are only a few mechanistic possibilities for how it exerts control over GA-related processes, amino acid degradation and FA biosynthesis. These options include: direct regulation of an inner membrane transporter(s), involvement in regulating mitochondrial matrix protein import or direct regulation of an integral mitochondrial inner membrane protein, which exerts influence over matrix metabolic processes. Identifying AtSLP2's substrate will be key to resolving which of these hypotheses are true. 7.4 The SLP phosphatase of Schizosaccharomyces pombe Also called fission yeast, Schizosaccharomyces pombe (S. pombe) represents a model, unicellular eukaryotic organism, that is ideal for examining DNA damage, DNA replication and cell division (Simanis, 1995; Gould and Simanis, 1997). It has a sequenced genome, a comprehensive and freely accessible online resources such as PomBase (www.pombase.org) and a library of available knockout yeast strains for reverse genetics. In particular, the SpSLP phosphatase SPCC1840.0c, noted in Chapter 2, is more similar to the SLP3 phosphatases of Green Algae than it is to the SLP1 and SLP2 phosphatases of plants, representing a more ancient derivation of the overall SLP protein phosphatase ancestry. With all the tools already in place for the S. pombe inclined researcher, and SLP1 and 2 phosphatase characterization already underway from A. thaliana, the characterization of SpSLP may prove interesting and complementary. Contributing to the potentially unique nature of SpSLP is its ER subcellular localization (Matsuyama et al., 2006). To date, independent functions seem to exist for both AtSLP1 and AtSLP2, rendering the characterization of ER-targeted SpSLP likely to yield yet another independent SLP phosphatase function. 204 7.5 Bridging the gap: the AtRLPH2 is a phosphotyrosine-specific PPP protein Phosphoproteomics has revealed that both A. thaliana and O. sativa maintain amounts of protein tyrosine phosphorylation comparable to that found in humans (Sugiyama et al., 2008; Nakagami et al., 2010), but lack the corresponding abundance of protein tyrosine kinases and phosphatases (Kerk et al., 2008). This thesis presented results involving A. thaliana RLPH2 which demonstrated it to be a novel, pTyr-specific, bacterial-like PPP protein phosphatase. Given this, I believe it imperative to characterize the protein interactome of AtRLPH2. This will provide an understanding of AtRLPH2's cellular function through the proteins it interacts with. As well, I would accompany these experiments with the reverse genetics approach previously applied to AtSLP1 and AtSLP2 involving both insertional mutants and over-expressing plant lines. This will assist in uncovering the broader biological role of AtRLPH2. Finding that AtRLPH2 was a pTyr-specific PPP protein phosphatase suggests that tyrosine phosphorylation in plants may be facilitated by unconventional protein tyrosine kinases and phosphatases. From a phosphatase perspective this may involve DSP and/or PP2C phosphatases. DSP phosphatases possess an identical catalytic motif to the classic PTP phosphatases, rendering them capable of dephosphorylating pTyr, while PP2C phosphatases, which are a group classically comprised of Ser/Thr phosphatases, have extensively proliferated across the genomes of plants. 205 Literature Cited: Abascal F, Zardoya R, Posada D (2005) ProtTest: selection of best-fit models of protein evolution. Bioinformatics 21: 2104-2105 Acuna R, Padilla BE, Florez-Ramos CP, Rubio JD, Herrera JC, Benavides P, Lee SJ, Yeats TH, Egan AN, Doyle JJ, Rose JK (2012) Adaptive horizontal transfer of a bacterial gene to an invasive insect pest of coffee. Proc Natl Acad Sci U S A 109: 4197-4202 Ahn CS, Han JA, Lee HS, Lee S, Pai HS (2011) The PP2A regulatory subunit Tap46, a component of the TOR signaling pathway, modulates growth and metabolism in plants. Plant Cell 23: 185-209 Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389-3402 Andreeva AV, Kutuzov MA (1999) RdgC/PP5-related phosphatases: novel components in signal transduction. Cellular Signalling 11: 555-562 Andreeva AV, Kutuzov MA (2001) Nuclear localization of the plant protein Ser/Thr phosphatase PP7. Molecular cell biology research communications : MCBRC 4: 345-352 Andreeva AV, Kutuzov MA (2001) PPP family of protein Ser/Thr phosphatases: two distinct branches? Molecular Biology and Evolution 18: 448-452 Andreeva AV, Kutuzov MA (2004) Widespread presence of "bacterial-like" PPP phosphatases in eukaryotes. BMC Evolutionary Biology 4: 47 Andreeva AV, Kutuzov MA (2008) Protozoan protein tyrosine phosphatases. International Journal of Parasitology 38: 1279-1295 Andreeva AV, Kutuzov MA (2009) PPEF/PP7 protein Ser/Thr phosphatases. Cellular and Molecular Life Sciences : CMLS 66: 3103-3110 Anisimova M, Gil M, Dufayard JF, Dessimoz C, Gascuel O (2011) Survey of branch support methods demonstrates accuracy, power, and robustness of fast likelihood-based approximation schemes. Syst Biol 60: 685-699 Araujo WL, Ishizaki K, Nunes-Nesi A, Larson TR, Tohge T, Krahnert I, Witt S, Obata T, Schauer N, Graham IA, Leaver CJ, Fernie AR (2010) Identification of the 2hydroxyglutarate and isovaleryl-CoA dehydrogenases as alternative electron donors linking lysine catabolism to the electron transport chain of Arabidopsis mitochondria. Plant Cell 22: 1549-1563 Araujo WL, Nunes-Nesi A, Nikoloski Z, Sweetlove LJ, Fernie AR (2012) Metabolic control and regulation of the tricarboxylic acid cycle in photosynthetic and heterotrophic plant tissues. Plant Cell Environment 35: 1-21 Arora G, Sajid A, Arulanandh MD, Singhal A, Mattoo AR, Pomerantsev AP, Leppla SH, Maiti S, Singh Y (2012) Unveiling the novel dual specificity protein kinases in Bacillus anthracis: identification of the first prokaryotic dual specificity tyrosine phosphorylationregulated kinase (DYRK)-like kinase. Journal of Biological Chemistry 287: 26749-26763 Ballesteros I, Dominguez T, Sauer M, Paredes P, Duprat A, Rojo E, Sanmartin M, Sanchez-Serrano JJ (2012) Specialized functions of the PP2A subfamily II catalytic subunits PP2A-C3 and PP2A-C4 in the distribution of auxin fluxes and development in Arabidopsis. The Plant Journal 206 Banci L, Bertini I, Cefaro C, Ciofi-Baffoni S, Gallo A, Martinelli M, Sideris DP, Katrakili N, Tokatlidis K (2009) MIA40 is an oxidoreductase that catalyzes oxidative protein folding in mitochondria. Nature Structural & Molecular Biology 16: 198-206 Bannai H, Tamada Y, Maruyama O, Nakai K, Miyano S (2002) Extensive feature detection of N-terminal protein sorting signals. Bioinformatics 18: 298-305 Barajas-Lopez Jde D, Kremnev D, Shaikhali J, Pinas-Fernandez A, Strand A (2013) PAPP5 Is Involved in the Tetrapyrrole Mediated Plastid Signalling during Chloroplast Development. PloS One 8: e60305 Bartels S, Gonzalez Besteiro MA, Lang D, Ulm R (2010) Emerging functions for plant MAP kinase phosphatases. Trends in Plant Science 15: 322-329 Bayer RG, Stael S, Rocha AG, Mair A, Vothknecht UC, Teige M (2012) Chloroplastlocalized protein kinases: a step forward towards a complete inventory. Journal of Experimental Botany 63: 1713-1723 Baykov AA, Evtushenko OA, Avaeva SM (1988) A malachite green procedure for orthophosphate determination and its use in alkaline phosphatase-based enzyme immunoassay. Analytical Biochemistry 171: 266-270 Belbahri L, Calmin G, Mauch F, Andersson JO (2008) Evolution of the cutinase gene family: evidence for lateral gene transfer of a candidate Phytophthora virulence factor. Gene 408: 1-8 Bialojan C, Takai A (1988) Inhibitory effect of a marine-sponge toxin, okadaic acid, on protein phosphatases. Specificity and kinetics. Biochemical Journal 256: 283-290 Bien M, Longen S, Wagener N, Chwalla I, Herrmann JM, Riemer J (2010) Mitochondrial disulfide bond formation is driven by intersubunit electron transfer in Erv1 and proofread by glutathione. Molecular Cell 37: 516-528 Blakeslee JJ, Zhou HW, Heath JT, Skottke KR, Barrios JA, Liu SY, DeLong A (2008) Specificity of RCN1-mediated protein phosphatase 2A regulation in meristem organization and stress response in roots. Plant Physiology 146: 539-553 Bligny R, Gardestrom P, Roby C, Douce R (1990) 31P NMR studies of spinach leaves and their chloroplasts. J Biol Chem 265: 1319-1326 Boden M, Hawkins J (2005) Prediction of subcellular localization using sequence-biased recurrent networks. Bioinformatics 21: 2279-2286 Boisnard-Lorig C, Colon-Carmona A, Bauch M, Hodge S, Doerner P, Bancharel E, Dumas C, Haseloff J, Berger F (2001) Dynamic analyses of the expression of the HISTONE::YFP fusion protein in arabidopsis show that syncytial endosperm is divided in mitotic domains. Plant Cell 13: 495-509 Bollen M, Peti W, Ragusa MJ, Beullens M (2010) The extended PP1 toolkit: designed to create specificity. Trends in Biochemical Sciences 35: 450-458 Boto L (2010) Horizontal gene transfer in evolution: facts and challenges. Proc Biol Sci 277: 819-827 Bozzo GG, Raghothama KG, Plaxton WC (2002) Purification and characterization of two secreted purple acid phosphatase isozymes from phosphate-starved tomato (Lycopersicon esculentum) cell cultures. Eur J Biochem 269: 6278-6286 Brautigan DL (2013) Protein Ser/ Thr phosphatases - the ugly ducklings of cell signalling. The FEBS Journal 280: 324-325 207 Brautigan DL, Gruppuso PA, Mumby M (1986) Protein phosphatase type-1 and type-2 catalytic subunits both bind inhibitor-2 and monoclonal immunoglobulins. J Biol Chem 261: 14924-14928 Breitkreutz A, Choi H, Sharom JR, Boucher L, Neduva V, Larsen B, Lin ZY, Breitkreutz BJ, Stark C, Liu G, Ahn J, Dewar-Darch D, Reguly T, Tang X, Almeida R, Qin ZS, Pawson T, Gingras AC, Nesvizhskii AI, Tyers M (2010) A global protein kinase and phosphatase interaction network in yeast. Science 328: 1043-1046 Bunik VI, Fernie AR (2009) Metabolic control exerted by the 2-oxoglutarate dehydrogenase reaction: a cross-kingdom comparison of the crossroad between energy production and nitrogen assimilation. The Biochemical Journal 422: 405-421 Bunney TD, van Walraven HS, de Boer AH (2001) 14-3-3 protein is a regulator of the mitochondrial and chloroplast ATP synthase. Proc Natl Acad Sci U S A 98: 4249-4254 Cao D, Hussain A, Cheng H, Peng J (2005) Loss of function of four DELLA genes leads to light- and gibberellin-independent seed germination in Arabidopsis. Planta 223: 105-113 Carrie C, Giraud E, Duncan O, Xu L, Wang Y, Huang S, Clifton R, Murcha M, Filipovska A, Rackham O, Vrielink A, Whelan J (2010) Conserved and novel functions for Arabidopsis thaliana MIA40 in assembly of proteins in mitochondria and peroxisomes. The Journal of Biological Chemistry 285: 36138-36148 Caunt CJ, Keyse SM (2013) Dual-specificity MAP kinase phosphatases (MKPs): Shaping the outcome of MAP kinase signalling. The FEBS Journal 280: 489-504 Ceulemans H, Stalmans W, Bollen M (2002) Regulator-driven functional diversification of protein phosphatase-1 in eukaryotic evolution. BioEssays 24: 371-381 Chacinska A, Guiard B, Muller JM, Schulze-Specking A, Gabriel K, Kutik S, Pfanner N (2008) Mitochondrial biogenesis, switching the sorting pathway of the intermembrane space receptor Mia40. The Journal of Biological Chemistry 283: 29723-29729 Chatzi A, Tokatlidis K (2012) The Mitochondrial Intermembrane Space: A Hub for Oxidative Folding Linked to Protein Biogenesis. Antioxidants & Redox Signaling Chen GI, Gingras AC (2007) Affinity-purification mass spectrometry (AP-MS) of serine/threonine phosphatases. Methods 42: 298-305 Chen M, Du X, Zhu Y, Wang Z, Hua S, Li Z, Guo W, Zhang G, Peng J, Jiang L (2012) Seed Fatty Acid Reducer acts downstream of gibberellin signalling pathway to lower seed fatty acid storage in Arabidopsis. Plant Cell Environment 35: 2155-2169 Chen MS, Silverstein AM, Pratt WB, Chinkers M (1996) The tetratricopeptide repeat domain of protein phosphatase 5 mediates binding to glucocorticoid receptor heterocomplexes and acts as a dominant negative mutant. Journal of Biological Chemistry 271: 3231532320 Chen MX, McPartlin AE, Brown L, Chen YH, Barker HM, Cohen PT (1994) A novel human protein serine/threonine phosphatase, which possesses four tetratricopeptide repeat motifs and localizes to the nucleus. EMBO J 13: 4278-4290 Cho US, Xu W (2007) Crystal structure of a protein phosphatase 2A heterotrimeric holoenzyme. Nature 445: 53-57 Clarke M, Lohan AJ, Liu B, Lagkouvardos I, Roy S, Zafar N, Bertelli C, Schilde C, Kianianmomeni A, Burglin TR, Frech C, Turcotte B, Kopec KO, Synnott JM, Choo C, Paponov I, Finkler A, Soon Heng Tan C, Hutchins AP, Weinmeier T, Rattei T, Chu JS, Gimenez G, Irimia M, Rigden DJ, Fitzpatrick DA, Lorenzo-Morales J, 208 Bateman A, Chiu CH, Tang P, Hegemann P, Fromm H, Raoult D, Greub G, Miranda-Saavedra D, Chen N, Nash P, Ginger ML, Horn M, Schaap P, Caler L, Loftus B (2013) Genome of Acanthamoeba castellanii highlights extensive lateral gene transfer and early evolution of tyrosine kinase signaling. Genome Biol 14: R11 Claros MG, Vincens P (1996) Computational method to predict mitochondrially imported proteins and their targeting sequences. Eur J Biochem 241: 779-786 Cohen P (2000) The regulation of protein function by multisite phosphorylation--a 25 year update. Trends Biochem Sci 25: 596-601 Cohen P (2002) The origins of protein phosphorylation. Nature Cell Biology 4: E127-130 Cohen P, Holmes CF, Tsukitani Y (1990) Okadaic acid: a new probe for the study of cellular regulation. Trends in Biochemical Science 15: 98-102 Cohen PT (1997) Novel protein serine/threonine phosphatases: variety is the spice of life. Trends Biochem Sci 22: 245-251 Cohen PT, Philp A, Vazquez-Martin C (2005) Protein phosphatase 4--from obscurity to vital functions. FEBS Letter 579: 3278-3286 Crooks GE, Hon G, Chandonia JM, Brenner SE (2004) WebLogo: a sequence logo generator. Genome Res 14: 1188-1190 Cuozzo JW, Kaiser CA (1999) Competition between glutathione and protein thiols for disulphide-bond formation. Nature Cell Biology 1: 130-135 Dai M, Xue Q, McCray T, Margavage K, Chen F, Lee JH, Nezames CD, Guo L, Terzaghi W, Wan J, Deng XW, Wang H (2013) The PP6 Phosphatase Regulates ABI5 Phosphorylation and Abscisic Acid Signaling in Arabidopsis. The Plant Cell Dai M, Zhang C, Kania U, Chen F, Xue Q, McCray T, Li G, Qin G, Wakeley M, Terzaghi W, Wan J, Zhao Y, Xu J, Friml J, Deng XW, Wang H (2012) A PP6-type phosphatase holoenzyme directly regulates PIN phosphorylation and auxin efflux in Arabidopsis. The Plant Cell 24: 2497-2514 Dancheck B, Ragusa MJ, Allaire M, Nairn AC, Page R, Peti W (2011) Molecular investigations of the structure and function of the protein phosphatase 1-spinophilininhibitor 2 heterotrimeric complex. Biochemistry 50: 1238-1246 Darshi M, Trinh KN, Murphy AN, Taylor SS (2012) Targeting and import mechanism of coiled-coil helix coiled-coil helix domain-containing protein 3 (ChChd3) into the mitochondrial intermembrane space. The Journal of Biological Chemistry 287: 3948039491 Dawson RM (1998) The toxicology of microcystins. Toxicology 36: 953-962 de la Fuente van Bentem S, Vossen JH, de Vries KJ, van Wees S, Tameling WI, Dekker HL, de Koster CG, Haring MA, Takken FL, Cornelissen BJ (2005) Heat shock protein 90 and its co-chaperone protein phosphatase 5 interact with distinct regions of the tomato I-2 disease resistance protein. The Plant Journal 43: 284-298 de la Fuente van Bentem S, Vossen JH, Vermeer JE, de Vroomen MJ, Gadella TW, Jr., Haring MA, Cornelissen BJ (2003) The subcellular localization of plant protein phosphatase 5 isoforms is determined by alternative splicing. Plant Physiology 133: 702712 De Munter S, Kohn M, Bollen M (2013) Challenges and opportunities in the development of protein phosphatase-directed therapeutics. ACS Chemical Biology 8: 36-45 209 del Riego G, Casano LM, Martin M, Sabater B (2006) Multiple phosphorylation sites in the beta subunit of thylakoid ATP synthase. Photosynthetic Research 89: 11-18 DeLong A (2006) Switching the flip: protein phosphatase roles in signaling pathways. Current Opinion in Plant Biology 9: 470-477 Denu JM, Lohse DL, Vijayalakshmi J, Saper MA, Dixon JE (1996) Visualization of intermediate and transition-state structures in protein-tyrosine phosphatase catalysis. Proc Natl Acad Sci U S A 93: 2493-2498 Denu JM, Stuckey JA, Saper MA, Dixon JE (1996) Form and function in protein dephosphorylation. Cell 87: 361-364 Derelle E, Ferraz C, Rombauts S, Rouze P, Worden AZ, Robbens S, Partensky F, Degroeve S, Echeynie S, Cooke R, Saeys Y, Wuyts J, Jabbari K, Bowler C, Panaud O, Piegu B, Ball SG, Ral JP, Bouget FY, Piganeau G, De Baets B, Picard A, Delseny M, Demaille J, Van de Peer Y, Moreau H (2006) Genome analysis of the smallest freeliving eukaryote Ostreococcus tauri unveils many unique features. Proc Natl Acad Sci U S A 103: 11647-11652 Di Rubbo S, Irani NG, Russinova E (2011) PP2A phosphatases: the "on-off" regulatory switches of brassinosteroid signaling. Science Signaling 4: pe25 Donella-Deana A, Krinks MH, Ruzzene M, Klee C, Pinna LA (1994) Dephosphorylation of phosphopeptides by calcineurin (protein phosphatase 2B). European Journal of Biochemistry 219: 109-117 Donella-Deana A, Meyer HE, Pinna LA (1991) The use of phosphopeptides to distinguish between protein phosphatase and acid/alkaline phosphatase activities: opposite specificity toward phosphoseryl/phosphothreonyl substrates. Biochim Biophys Acta 1094: 130-133 Donella Deana A, Mac Gowan CH, Cohen P, Marchiori F, Meyer HE, Pinna LA (1990) An investigation of the substrate specificity of protein phosphatase 2C using synthetic peptide substrates; comparison with protein phosphatase 2A. Biochim Biophys Acta 1051: 199-202 Dong L, Ermolova NV, Chollet R (2001) Partial purification and biochemical characterization of a heteromeric protein phosphatase 2A holoenzyme from maize (Zea mays L.) leaves that dephosphorylates C4 phosophoenolpyruvate carboxylase. Planta 213: 379-389 Dorrell RG, Smith AG (2011) Do red and green make brown?: perspectives on plastid acquisitions within chromalveolates. Eukaryot Cell 10: 856-868 Douglas P, Moorhead GB, Ye R, Lees-Miller SP (2001) Protein phosphatases regulate DNAdependent protein kinase activity. Journal of Biological Chemistry 276: 18992-18998 Douglas P, Zhong J, Ye R, Moorhead GB, Xu X, Lees-Miller SP (2010) Protein phosphatase 6 interacts with the DNA-dependent protein kinase catalytic subunit and dephosphorylates gamma-H2AX. Molecular and Cellular Biology 30: 1368-1381 Duncan MR, Fullerton M, Chaudhuri M (2013) Tim50 in Trypanosoma brucei possesses a dual specificity phosphatase activity and is critical for mitochondrial protein import. The Journal of Biological Chemistry 288: 3184-3197 Dutilleul C, Lelarge C, Prioul JL, De Paepe R, Foyer CH, Noctor G (2005) Mitochondriadriven changes in leaf NAD status exert a crucial influence on the control of nitrate assimilation and the integration of carbon and nitrogen metabolism. Plant Physiology 139: 64-78 Eddy SR (1998) Profile hidden Markov models. Bioinformatics 14: 755-763 210 Egloff MP, Cohen PT, Reinemer P, Barford D (1995) Crystal structure of the catalytic subunit of human protein phosphatase 1 and its complex with tungstate. Journal of Molecular Biology 254: 942-959 Egloff MP, Johnson DF, Moorhead G, Cohen PT, Cohen P, Barford D (1997) Structural basis for the recognition of regulatory subunits by the catalytic subunit of protein phosphatase 1. EMBO J 16: 1876-1887 Emanuelsson O, Nielsen H, Brunak S, von Heijne G (2000) Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. J Mol Biol 300: 1005-1016 Emanuelsson O, Nielsen H, von Heijne G (1999) ChloroP, a neural network-based method for predicting chloroplast transit peptides and their cleavage sites. Protein Sci 8: 978-984 Embley TM, Martin W (2006) Eukaryotic evolution, changes and challenges. Nature 440: 623630 Eubel H, Heazlewood JL, Millar AH (2007) Isolation and subfractionation of plant mitochondria for proteomic analysis. Methods in Molecular Biology 355: 49-62 Farkas I, Dombradi V, Miskei M, Szabados L, Koncz C (2007) Arabidopsis PPP family of serine/threonine phosphatases. Trends in Plant Science 12: 169-176 Forreiter C, Kirschner M, Nover L (1997) Stable transformation of an Arabidopsis cell suspension culture with firefly luciferase providing a cellular system for analysis of chaperone activity in vivo. Plant Cell 9: 2171-2181 Fraga H, Ventura S (2013) Oxidative folding in the mitochondrial intermembrane space in human health and disease. International Journal of Molecular Sciences 14: 2916-2927 Fuchs S, Grill E, Meskiene I, Schweighofer A (2013) Type 2C protein phosphatases in plants. The FEBS Journal 280: 681-693 Fuerst JA, Sagulenko E (2011) Beyond the bacterium: planctomycetes challenge our concepts of microbial structure and function. Nat Rev Microbiol 9: 403-413 Fukao Y, Hayashi M, Nishimura M (2002) Proteomic analysis of leaf peroxisomal proteins in greening cotyledons of Arabidopsis thaliana. Plant Cell Physiol 43: 689-696 Furbank RT, Tester M (2011) Phenomics--technologies to relieve the phenotyping bottleneck. Trends in Plant Science 16: 635-644 Futami R, Llorens C, Vicente-Ripolles M, Moya A (2008) The Alignment Format Converter Server 1.0. Biotechvana Bioinformatics Gennidakis S, Rao S, Greenham K, Uhrig RG, O'Leary B, Snedden WA, Lu C, Plaxton WC (2007) Bacterial- and plant-type phosphoenolpyruvate carboxylase polypeptides interact in the hetero-oligomeric Class-2 PEPC complex of developing castor oil seeds. The Plant Journal 52: 839-849 Genoud T, Santa Cruz MT, Kulisic T, Sparla F, Fankhauser C, Metraux JP (2008) The protein phosphatase 7 regulates phytochrome signaling in Arabidopsis. PloS One 3: e2699 Gentry MS, Roma-Mateo C, Sanz P (2013) Laforin, a protein with many faces: glucan phosphatase, adapter protein, et alii. FEBS J 280: 525-537 Girzalsky W, Platta HW, Erdmann R (2009) Protein transport across the peroxisomal membrane. Biol Chem 390: 745-751 Goldberg J, Huang HB, Kwon YG, Greengard P, Nairn AC, Kuriyan J (1995) Threedimensional structure of the catalytic subunit of protein serine/threonine phosphatase-1. Nature 376: 745-753 211 Goldman BS, Nierman WC, Kaiser D, Slater SC, Durkin AS, Eisen JA, Ronning CM, Barbazuk WB, Blanchard M, Field C, Halling C, Hinkle G, Iartchuk O, Kim HS, Mackenzie C, Madupu R, Miller N, Shvartsbeyn A, Sullivan SA, Vaudin M, Wiegand R, Kaplan HB (2006) Evolution of sensory complexity recorded in a myxobacterial genome. Proc Natl Acad Sci U S A 103: 15200-15205 Gould KL, Simanis V (1997) The control of septum formation in fission yeast. Genes Development 11: 2939-2951 Gray MW (1998) Rickettsia, typhus and the mitochondrial connection. Nature 396: 109-110 Gray MW (2012) Mitochondrial evolution. Cold Spring Harb Perspect Biol 4: a011403 Groth G, Pohl E (2001) The structure of the chloroplast F1-ATPase at 3.2 A resolution. Journal of Biological Chemistry 276: 1345-1352 Gutierres S, Sabar M, Lelandais C, Chetrit P, Diolez P, Degand H, Boutry M, Vedel F, de Kouchkovsky Y, De Paepe R (1997) Lack of mitochondrial and nuclear-encoded subunits of complex I and alteration of the respiratory chain in Nicotiana sylvestris mitochondrial deletion mutants. Proc Natl Acad Sci U S A 94: 3436-3441 Hamilton JP, Buell CR (2012) Advances in plant genome sequencing. The Plant Journal 70: 177-190 Hammet A, Pike BL, McNees CJ, Conlan LA, Tenis N, Heierhorst J (2003) FHA domains as phospho-threonine binding modules in cell signaling. IUBMB Life 55: 23-27 Hastie CJ, Borthwick EB, Morrison LF, Codd GA, Cohen PT (2005) Inhibition of several protein phosphatases by a non-covalently interacting microcystin and a novel cyanobacterial peptide, nostocyclin. Biochim Biophys Acta 1726: 187-193 Hedden P, Thomas SG (2012) Gibberellin biosynthesis and its regulation. The Biochemical Journal 444: 11-25 Heidari B, Matre P, Nemie-Feyissa D, Meyer C, Rognli OA, Moller SG, Lillo C (2011) Protein phosphatase 2A B55 and A regulatory subunits interact with nitrate reductase and are essential for nitrate reductase activation. Plant Physiology 156: 165-172 Hell K (2008) The Erv1-Mia40 disulfide relay system in the intermembrane space of mitochondria. Biochimica et Biophysica Acta 1783: 601-609 Heroes E, Lesage B, Gornemann J, Beullens M, Van Meervelt L, Bollen M (2013) The PP1 binding code: a molecular-lego strategy that governs specificity. The FEBS journal 280: 584-595 Herrmann JM, Riemer J (2012) Mitochondrial disulfide relay: redox-regulated protein import into the intermembrane space. Journal of Biological Chemistry 287: 4426-4433 Herzog F, Kahraman A, Boehringer D, Mak R, Bracher A, Walzthoeni T, Leitner A, Beck M, Hartl FU, Ban N, Malmstrom L, Aebersold R (2012) Structural probing of a protein phosphatase 2A network by chemical cross-linking and mass spectrometry. Science 337: 1348-1352 Hoch JA (2000) Two-component and phosphorelay signal transduction. Current Opinion in Microbiology 3: 165-170 Holdsworth MJ, Bentsink L, Soppe WJ (2008) Molecular networks regulating Arabidopsis seed maturation, after-ripening, dormancy and germination. The New Phytologist 179: 33-54 Horton P, Park KJ, Obayashi T, Fujita N, Harada H, Adams-Collier CJ, Nakai K (2007) WoLF PSORT: protein localization predictor. Nucleic Acids Res 35: W585-587 212 Hunter T, Pawson T (2012) The evolution of protein phosphorylation. Preface. Philosophical transactions of the Royal Society of London 367: 2512 Hurley TD, Yang J, Zhang L, Goodwin KD, Zou Q, Cortese M, Dunker AK, DePaoliRoach AA (2007) Structural basis for regulation of protein phosphatase 1 by inhibitor-2. Journal of Biological Chemistry 282: 28874-28883 Iki T, Yoshikawa M, Meshi T, Ishikawa M (2012) Cyclophilin 40 facilitates HSP90-mediated RISC assembly in plants. EMBO J 31: 267-278 Ingebritsen TS, Cohen P (1983) Protein phosphatases: properties and role in cellular regulation. Science 221: 331-338 Ishizaki K, Larson TR, Schauer N, Fernie AR, Graham IA, Leaver CJ (2005) The critical role of Arabidopsis electron-transfer flavoprotein:ubiquinone oxidoreductase during dark-induced starvation. Plant Cell 17: 2587-2600 Ito J, Taylor NL, Castleden I, Weckwerth W, Millar AH, Heazlewood JL (2009) A survey of the Arabidopsis thaliana mitochondrial phosphoproteome. Proteomics 9: 4229-4240 Janouskovec J, Horak A, Obornik M, Lukes J, Keeling PJ (2010) A common red algal origin of the apicomplexan, dinoflagellate, and heterokont plastids. Proc Natl Acad Sci U S A 107: 10949-10954 Jeffares DC, Penkett CJ, Bahler J (2008) Rapidly regulated genes are intron poor. Trends in Genetics 24: 375-378 Jessop CE, Bulleid NJ (2004) Glutathione directly reduces an oxidoreductase in the endoplasmic reticulum of mammalian cells. Journal of Biological Chemistry 279: 5534155347 Jia Z, Barford D, Flint AJ, Tonks NK (1995) Structural basis for phosphotyrosine peptide recognition by protein tyrosine phosphatase 1B. Science 268: 1754-1758 Jonassen EM, Heidari B, Nemie-Feyissa D, Matre P, Lillo C (2011) Protein phosphatase 2A regulatory subunits are starting to reveal their functions in plant metabolism and development. Plant Signaling & Behavior 6: 1216-1218 Joshi-Saha A, Valon C, Leung J (2011) A brand new START: abscisic acid perception and transduction in the guard cell. Science Signaling 4: re4 Kalanon M, McFadden GI (2010) Malaria, Plasmodium falciparum and its apicoplast. Biochem Soc Trans 38: 775-782 Kanekatsu M, Saito H, Motohashi K, Hisabori T (1998) The beta subunit of chloroplast ATP synthase (CF0CF1-ATPase) is phosphorylated by casein kinase II. Biochemistry and Molecular Biology International 46: 99-105 Katoh K, Misawa K, Kuma K, Miyata T (2002) MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res 30: 3059-3066 Kawaguchi M, Hiroi J, Miya M, Nishida M, Iuchi I, Yasumasu S (2010) Intron-loss evolution of hatching enzyme genes in Teleostei. BMC Evol Biol 10: 260 Kawamata H, Manfredi G (2010) Import, maturation, and function of SOD1 and its copper chaperone CCS in the mitochondrial intermembrane space. Antioxidants & Redox Signaling 13: 1375-1384 Keeling PJ (2009) Role of horizontal gene transfer in the evolution of photosynthetic eukaryotes and their plastids. Methods Mol Biol 532: 501-515 Keeling PJ, Palmer JD (2008) Horizontal gene transfer in eukaryotic evolution. Nat Rev Genet 9: 605-618 213 Kennelly PJ (2002) Protein kinases and protein phosphatases in prokaryotes: a genomic perspective. FEMS Microbiol Lett 206: 1-8 Kerk D, Templeton G, Moorhead GB (2008) Evolutionary radiation pattern of novel protein phosphatases revealed by analysis of protein data from the completely sequenced genomes of humans, green algae, and higher plants. Plant Physiology 146: 351-367 Kim EE, Wyckoff HW (1991) Reaction mechanism of alkaline phosphatase based on crystal structures. Two-metal ion catalysis. J Mol Biol 218: 449-464 Kim TW, Guan S, Burlingame AL, Wang ZY (2011) The CDG1 kinase mediates brassinosteroid signal transduction from BRI1 receptor kinase to BSU1 phosphatase and GSK3-like kinase BIN2. Molecular Cell 43: 561-571 Kim TW, Guan S, Sun Y, Deng Z, Tang W, Shang JX, Burlingame AL, Wang ZY (2009) Brassinosteroid signal transduction from cell-surface receptor kinases to nuclear transcription factors. Nature Cell Biology 11: 1254-1260 Kloeker S, Reed R, McConnell JL, Chang D, Tran K, Westphal RS, Law BK, Colbran RJ, Kamoun M, Campbell KS, Wadzinski BE (2003) Parallel purification of three catalytic subunits of the protein serine/threonine phosphatase 2A family (PP2A(C), PP4(C), and PP6(C)) and analysis of the interaction of PP2A(C) with alpha4 protein. Protein Expression and Purification 31: 19-33 Klumpp S, Thissen MC, Krieglstein J (2006) Protein phosphatases types 2Calpha and 2Cbeta in apoptosis. Biochem Soc Trans 34: 1370-1375 Kohzuma K, Dal Bosco C, Meurer J, Kramer DM (2013) Light- and Metabolism-related Regulation of the Chloroplast ATP Synthase Has Distinct Mechanisms and Functions. Journal of Biological Chemistry 288: 13156-13163 Kojer K, Bien M, Gangel H, Morgan B, Dick TP, Riemer J (2012) Glutathione redox potential in the mitochondrial intermembrane space is linked to the cytosol and impacts the Mia40 redox state. EMBO J 31: 3169-3182 Koonin EV (2010) The origin and early evolution of eukaryotes in the light of phylogenomics. Genome Biol 11: 209 Koornneef M, Hanhart CJ, Hilhorst HW, Karssen CM (1989) In Vivo Inhibition of Seed Development and Reserve Protein Accumulation in Recombinants of Abscisic Acid Biosynthesis and Responsiveness Mutants in Arabidopsis thaliana. Plant Physiology 90: 463-469 Koornneef M, Meinke D (2010) The development of Arabidopsis as a model plant. The Plant Journal 61: 909-921 Koornneef M, Reuling G, Karssen CM (1984) The isolation and characterization of abscisic acid-insensitive mutants of Arabidopsis thaliana. Physiological Plant 61: 377-383 Kotting O, Kossmann J, Zeeman SC, Lloyd JR (2010) Regulation of starch metabolism: the age of enlightenment? Current Opinion in Plant Biology 13: 321-329 Krah A, Pogoryelov D, Meier T, Faraldo-Gomez JD (2010) On the structure of the protonbinding site in the F(o) rotor of chloroplast ATP synthases. J Mol Biol 395: 20-27 Kucera B, Cohn MA, Leubner-Metzger G (2005) Plant hormone interactions during seed dormancy release and germination. Seed Science Research 15: 281–307 Kutuzov MA, Andreeva AV (2002) Protein Ser/Thr phosphatases with kelch-like repeat domains. Cellular Signalling 14: 745-750 214 Kutuzov MA, Andreeva AV (2008) Protein Ser/Thr phosphatases of parasitic protozoa. Mol Biochem Parasitol 161: 81-90 Kutuzov MA, Andreeva AV (2012) Prediction of biological functions of Shewanella-like protein phosphatases (Shelphs) across different domains of life. Funct Integr Genomics 12: 11-23 Kutuzov MA, Bennett N, Andreeva AV (2001) Interaction of plant protein Ser/Thr phosphatase PP7 with calmodulin. Biochemical and Biophysical Research Communications 289: 634-640 Kutuzov MA, Evans DE, Andreeva AV (1998) Expression and characterization of PP7, a novel plant protein Ser/Thr phosphatase distantly related to RdgC/PPEF and PP5. FEBS Letter 440: 147-152 Lai SM, Le Moual H (2005) PrpZ, a Salmonella enterica serovar Typhi serine/threonine protein phosphatase 2C with dual substrate specificity. Microbiology 151: 1159-1167 Lang BF, Gray MW, Burger G (1999) Mitochondrial genome evolution and the origin of eukaryotes. Annu Rev Genet 33: 351-397 Lartillot N, Lepage T, Blanquart S (2009) PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating. Bioinformatics 25: 2286-2288 Lazarow PB (2006) The import receptor Pex7p and the PTS2 targeting sequence. Biochim Biophys Acta 1763: 1599-1604 Le Corguille G, Pearson G, Valente M, Viegas C, Gschloessl B, Corre E, Bailly X, Peters AF, Jubin C, Vacherie B, Cock JM, Leblanc C (2009) Plastid genomes of two brown algae, Ectocarpus siliculosus and Fucus vesiculosus: further insights on the evolution of red-algal derived plastids. BMC Evol Biol 9: 253 Lee S, Cheng H, King KE, Wang W, He Y, Hussain A, Lo J, Harberd NP, Peng J (2002) Gibberellin regulates Arabidopsis seed germination via RGL2, a GAI/RGA-like gene whose expression is up-regulated following imbibition. Genes & Development 16: 646658 Lehti-Shiu MD, Shiu SH (2012) Diversity, classification and function of the plant protein kinase superfamily. Philosophical transactions of the Royal Society of London. 367: 2619-2639 Leivar P, Antolin-Llovera M, Ferrero S, Closa M, Arro M, Ferrer A, Boronat A, Campos N (2011) Multilevel control of Arabidopsis 3-hydroxy-3-methylglutaryl coenzyme A reductase by protein phosphatase 2A. Plant Cell 23: 1494-1511 Li H, Lin D, Dhonukshe P, Nagawa S, Chen D, Friml J, Scheres B, Guo H, Yang Z (2011) Phosphorylation switch modulates the interdigitated pattern of PIN1 localization and cell expansion in Arabidopsis leaf epidermis. Cell Research 21: 970-978 Lima T, Auchincloss AH, Coudert E, Keller G, Michoud K, Rivoire C, Bulliard V, de Castro E, Lachaize C, Baratin D, Phan I, Bougueleret L, Bairoch A (2009) HAMAP: a database of completely sequenced microbial proteome sets and manually curated microbial protein families in UniProtKB/Swiss-Prot. Nucleic Acids Res 37: D471-478 Lingard MJ, Gidda SK, Bingham S, Rothstein SJ, Mullen RT, Trelease RN (2008) Arabidopsis PEROXIN11c-e, FISSION1b, and DYNAMIN-RELATED PROTEIN3A cooperate in cell cycle-associated replication of peroxisomes. Plant Cell 20: 1567-1585 Lisec J, Schauer N, Kopka J, Willmitzer L, Fernie AR (2006) Gas chromatography mass spectrometry-based metabolite profiling in plants. Nature Protocols 1: 387-396 215 Liu G, Zhang J, Larsen B, Stark C, Breitkreutz A, Lin ZY, Breitkreutz BJ, Ding Y, Colwill K, Pasculescu A, Pawson T, Wrana JL, Nesvizhskii AI, Raught B, Tyers M, Gingras AC (2010) ProHits: integrated software for mass spectrometry-based interaction proteomics. Nature Biotechnology 28: 1015-1017 Lu Y, Gehan JP, Sharkey TD (2005) Daylength and circadian effects on starch degradation and maltose metabolism. Plant Physiol 138: 2280-2291 Lu Y, Savage LJ, Ajjawi I, Imre KM, Yoder DW, Benning C, Dellapenna D, Ohlrogge JB, Osteryoung KW, Weber AP, Wilkerson CG, Last RL (2008) New connections across pathways and cellular processes: industrialized mutant screening reveals novel associations between diverse phenotypes in Arabidopsis. Plant Physiology 146: 14821500 MacKintosh C (1992) Regulation of spinach-leaf nitrate reductase by reversible phosphorylation. Biochim Biophys Acta 1137: 121-126 MacKintosh C, Beattie KA, Klumpp S, Cohen P, Codd GA (1990) Cyanobacterial microcystin-LR is a potent and specific inhibitor of protein phosphatases 1 and 2A from both mammals and higher plants. FEBS Letter 264: 187-192 MacKintosh C, Coggins J, Cohen P (1991) Plant protein phosphatases. Subcellular distribution, detection of protein phosphatase 2C and identification of protein phosphatase 2A as the major quinate dehydrogenase phosphatase. Biochemical Journal 273 ( Pt 3): 733-738 MacKintosh C, Garton AJ, McDonnell A, Barford D, Cohen PT, Tonks NK, Cohen P (1996) Further evidence that inhibitor-2 acts like a chaperone to fold PP1 into its native conformation. FEBS Letter 397: 235-238 MacKintosh C, MacKintosh RW (1994) Inhibitors of protein kinases and phosphatases. Trends in Biochemical Sciences 19: 444-448 MacKintosh RW, Dalby KN, Campbell DG, Cohen PT, Cohen P, MacKintosh C (1995) The cyanobacterial toxin microcystin binds covalently to cysteine-273 on protein phosphatase 1. FEBS Lett 371: 236-240 Malyan A (2010) Nucleotide binding to noncatalytic sites is essential for ATP-dependent stimulation and ADP-dependent inactivation of the chloroplast ATP synthase. Photosynthic Research: 243-248 Manning G, Plowman GD, Hunter T, Sudarsanam S (2002) Evolution of protein kinase signaling from yeast to man. Trends in Biochemical Science 27: 514-520 Martinez-Andujar C, Martin RC, Nonogaki H (2012) Seed traits and genes important for translational biology--highlights from recent discoveries. Plant Cell Physiology 53: 5-15 Matsuda S, Vert JP, Saigo H, Ueda N, Toh H, Akutsu T (2005) A novel representation of protein sequences for prediction of subcellular location using support vector machines. Protein Sci 14: 2804-2813 Matsuyama A, Arai R, Yashiroda Y, Shirai A, Kamata A, Sekido S, Kobayashi Y, Hashimoto A, Hamamoto M, Hiraoka Y, Horinouchi S, Yoshida M (2006) ORFeome cloning and global analysis of protein localization in the fission yeast Schizosaccharomyces pombe. Nature Biotechnology 24: 841-847 May A, Berger S, Hertel T, Kock M (2011) The Arabidopsis thaliana phosphate starvation responsive gene AtPPsPase1 encodes a novel type of inorganic pyrophosphatase. Biochim Biophys Acta 1810: 178-185 216 Mayer WE, Schuster LN, Bartelmes G, Dieterich C, Sommer RJ (2011) Horizontal gene transfer of microbial cellulases into nematode genomes is associated with functional assimilation and gene turnover. BMC Evol Biol 11: 13 Maynes JT, Bateman KS, Cherney MM, Das AK, Luu HA, Holmes CF, James MN (2001) Crystal structure of the tumor-promoter okadaic acid bound to protein phosphatase-1. Journal of Biological Chemistry 276: 44078-44082 McCarty RE (1992) A PLANT BIOCHEMIST'S VIEW OF H+-ATPases AND ATP SYNTHASES. Journal of Experimental Biology 172: 431-441 Meek S, Morrice N, MacKintosh C (1999) Microcystin affinity purification of plant protein phosphatases: PP1C, PP5 and a regulatory A-subunit of PP2A. FEBS Letter 457: 494498 Michniewicz M, Zago MK, Abas L, Weijers D, Schweighofer A, Meskiene I, Heisler MG, Ohno C, Zhang J, Huang F, Schwab R, Weigel D, Meyerowitz EM, Luschnig C, Offringa R, Friml J (2007) Antagonistic regulation of PIN phosphorylation by PP2A and PINOID directs auxin flux. Cell 130: 1044-1056 Miller MA, Pfeiffer W, Schwartz T (2010) Creating the CIPRES Science Gateway for inference of large phylogenetic trees. In Proceedings of the Gateway Computing Environments Workshop (GCE). IEEE, New Orleans, LA, pp 1-8 Mithoe SC, Boersema PJ, Berke L, Snel B, Heck AJ, Menke FL (2012) Targeted quantitative phosphoproteomics approach for the detection of phospho-tyrosine signaling in plants. Journal of Proteome Research 11: 438-448 Moller SG, Kim YS, Kunkel T, Chua NH (2003) PP7 is a positive regulator of blue light signaling in Arabidopsis. Plant cell 15: 1111-1119 Moorhead G, Douglas P, Cotelle V, Harthill J, Morrice N, Meek S, Deiting U, Stitt M, Scarabel M, Aitken A, MacKintosh C (1999) Phosphorylation-dependent interactions between enzymes of plant metabolism and 14-3-3 proteins. The Plant Journal 18: 1-12 Moorhead GB, De Wever V, Templeton G, Kerk D (2009) Evolution of protein phosphatases in plants and animals. . Biochemical Journal 417 Moorhead GB, Trinkle-Mulcahy L, Ulke-Lemee A (2007) Emerging roles of nuclear protein phosphatases. Nature reviews. Molecular cell biology 8: 234-244 Mora-Garcia S, Vert G, Yin Y, Cano-Delgado A, Cheong H, Chory J (2004) Nuclear protein phosphatases with Kelch-repeat domains modulate the response to brassinosteroids in Arabidopsis. Genes Development 18: 448-460 Muller K, Tintelnot S, Leubner-Metzger G (2006) Endosperm-limited Brassicaceae seed germination: abscisic acid inhibits embryo-induced endosperm weakening of Lepidium sativum (cress) and endosperm rupture of cress and Arabidopsis thaliana. Plant Cell Physiology 47: 864-877 Mustroph A, Albrecht G, Hajirezaei M, Grimm B, Biemelt S (2005) Low levels of pyrophosphate in transgenic potato plants expressing E. coli pyrophosphatase lead to decreased vitality under oxygen deficiency. Ann Bot 96: 717-726 Nakagami H, Sugiyama N, Mochida K, Daudi A, Yoshida Y, Toyoda T, Tomita M, Ishihama Y, Shirasu K (2010) Large-scale comparative phosphoproteomics identifies conserved phosphorylation sites in plants. Plant Physiology 153: 1161-1174 Nelson BK, Cai X, Nebenfuhr A (2007) A multicolored set of in vivo organelle markers for colocalization studies in Arabidopsis and other plants. Plant Journal 51: 1126-1136 217 Nguyen TH, Brechenmacher L, Aldrich JT, Clauss TR, Gritsenko MA, Hixson KK, Libault M, Tanaka K, Yang F, Yao Q, Pasa-Tolic L, Xu D, Nguyen HT, Stacey G (2012) Quantitative Phosphoproteomic Analysis of Soybean Root Hairs Inoculated with Bradyrhizobium japonicum. Molecular & Cellular Proteomics : MCP 11: 1140-1155 Nicholas KB, Nicholas HBJ, Deerfield DWI (1997) GeneDoc: analysis and visualization of genetic variation. EMBNEW.news 4: 1-4 Ogawa D, Abe K, Miyao A, Kojima M, Sakakibara H, Mizutani M, Morita H, Toda Y, Hobo T, Sato Y, Hattori T, Hirochika H, Takeda S (2011) RSS1 regulates the cell cycle and maintains meristematic activity under stress conditions in rice. Nature Communications 2: 278 Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, Mann M (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks. Cell 127: 635648 Olsen JV, Vermeulen M, Santamaria A, Kumar C, Miller ML, Jensen LJ, Gnad F, Cox J, Jensen TS, Nigg EA, Brunak S, Mann M (2010) Quantitative phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis. Science signaling 3: ra3 Pagliarini DJ, Dixon JE (2006) Mitochondrial modulation: reversible phosphorylation takes center stage? Trends in Biochemical Sciences 31: 26-34 Pallen CJ, Valentine KA, Wang JH, Hollenberg MD (1985) Calcineurin-mediated dephosphorylation of the human placental membrane receptor for epidermal growth factor urogastrone. Biochemistry 24: 4727-4730 Pannifer AD, Flint AJ, Tonks NK, Barford D (1998) Visualization of the cysteinyl-phosphate intermediate of a protein-tyrosine phosphatase by x-ray crystallography. Journal of Biological Chemistry 273: 10454-10462 Park JH, Lee SY, Kim WY, Jung YJ, Chae HB, Jung HS, Kang CH, Shin MR, Kim SY, Su'udi M, Yun DJ, Lee KO, Kim MG (2011) Heat-induced chaperone activity of serine/threonine protein phosphatase 5 enhances thermotolerance in Arabidopsis thaliana. The New Phytologist 191: 692-705 Patzewitz EM, Guttery DS, Poulin B, Ramakrishnan C, Ferguson DJ, Wall RJ, Brady D, Holder AA, Szoor B, Tewari R (2013) An ancient protein phosphatase, SHLP1, is critical to microneme development in Plasmodium ookinetes and parasite transmission. Cell Reports 3: 622-629 Patzewitz EM, Guttery DS, Poulin B, Ramakrishnan C, Ferguson DJ, Wall RJ, Brady D, Holder AA, Szoor B, Tewari R (2013) An Ancient Protein Phosphatase, SHLP1, Is Critical to Microneme Development in Plasmodium Ookinetes and Parasite Transmission. Cell Rep 3: 622-629 Pellny TK, Van Aken O, Dutilleul C, Wolff T, Groten K, Bor M, De Paepe R, Reyss A, Van Breusegem F, Noctor G, Foyer CH (2008) Mitochondrial respiratory pathways modulate nitrate sensing and nitrogen-dependent regulation of plant architecture in Nicotiana sylvestris. The Plant Journal 54: 976-992 Pereira SF, Goss L, Dworkin J (2011) Eukaryote-like serine/threonine kinases and phosphatases in bacteria. Microbiol Mol Biol Rev 75: 192-212 218 Perez J, Castaneda-Garcia A, Jenke-Kodama H, Muller R, Munoz-Dorado J (2008) Eukaryotic-like protein kinases in the prokaryotes and the myxobacterial kinome. Proc Natl Acad Sci U S A 105: 15950-15955 Peti W, Nairn AC, Page R (2013) Structural basis for protein phosphatase 1 regulation and specificity. The FEBS Journal 280: 596-611 Petsalaki EI, Bagos PG, Litou ZI, Hamodrakas SJ (2006) PredSL: a tool for the N-terminal sequence-based prediction of protein subcellular localization. Genomics Proteomics Bioinformatics 4: 48-55 Piskurewicz U, Jikumaru Y, Kinoshita N, Nambara E, Kamiya Y, Lopez-Molina L (2008) The gibberellic acid signaling repressor RGL2 inhibits Arabidopsis seed germination by stimulating abscisic acid synthesis and ABI5 activity. Plant cell 20: 2729-2745 Plaxton WC (1989) Molecular and immunological characterization of plastid and cytosolic pyruvate kinase isozymes from castor-oil-plant endosperm and leaf. Eur J Biochem 181: 443-451 Plaxton WC (1996) The Organization and Regulation of Plant Glycolysis. Annu Rev Plant Physiol Plant Mol Biol 47: 185-214 Poole AM, Penny D (2007) Evaluating hypotheses for the origin of eukaryotes. Bioessays 29: 74-84 Raymond JA, Kim HJ (2012) Possible role of horizontal gene transfer in the colonization of sea ice by algae. PLoS One 7: e35968 Reddehase S, Grumbt B, Neupert W, Hell K (2009) The disulfide relay system of mitochondria is required for the biogenesis of mitochondrial Ccs1 and Sod1. Journal of Molecular Biology 385: 331-338 Reiland S, Messerli G, Baerenfaller K, Gerrits B, Endler A, Grossmann J, Gruissem W, Baginsky S (2009) Large-scale Arabidopsis phosphoproteome profiling reveals novel chloroplast kinase substrates and phosphorylation networks. Plant Physiology 150: 889903 Remmert M, Biegert A, Hauser A, Soding J (2012) HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat Methods 9: 173-175 Rolland N, Curien G, Finazzi G, Kuntz M, Marechal E, Matringe M, Ravanel S, Seigneurin-Berny D (2012) The biosynthetic capacities of the plastids and integration between cytoplasmic and chloroplast processes. Annu Rev Genet 46: 233-264 Ronquist F, Teslenko M, van der Mark P, Ayres DL, Darling A, Hohna S, Larget B, Liu L, Suchard MA, Huelsenbeck JP (2012) MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst Biol 61: 539-542 Roy J, Cyert MS (2009) Cracking the phosphatase code: docking interactions determine substrate specificity. Science signaling 2: re9 Roy LM, Barkan A (1998) A SecY homologue is required for the elaboration of the chloroplast thylakoid membrane and for normal chloroplast gene expression. Journal of Cell Biology 141: 385-395 Rubio V, Shen Y, Saijo Y, Liu Y, Gusmaroli G, Dinesh-Kumar SP, Deng XW (2005) An alternative tandem affinity purification strategy applied to Arabidopsis protein complex isolation. The Plant Journal 41: 767-778 Russell JA, Roy MK, Sanford JC (1992) Physical trauma and tungsten toxicity reduce the efficiency of biolistic transformation. Plant Physiol 98: 1050-1056 219 Ryu JS, Kim JI, Kunkel T, Kim BC, Cho DS, Hong SH, Kim SH, Fernandez AP, Kim Y, Alonso JM, Ecker JR, Nagy F, Lim PO, Song PS, Schafer E, Nam HG (2005) Phytochrome-specific type 5 phosphatase controls light signal flux by enhancing phytochrome stability and affinity for a signal transducer. Cell 120: 395-406 Salvi M, Brunati AM, Bordin L, La Rocca N, Clari G, Toninello A (2002) Characterization and location of Src-dependent tyrosine phosphorylation in rat brain mitochondria. Biochim Biophys Acta 1589: 181-195 Salvi M, Brunati AM, Toninello A (2005) Tyrosine phosphorylation in mitochondria: a new frontier in mitochondrial signaling. Free Radical Biology & Medicine 38: 1267-1277 Salvi M, Stringaro A, Brunati AM, Agostinelli E, Arancia G, Clari G, Toninello A (2004) Tyrosine phosphatase activity in mitochondria: presence of Shp-2 phosphatase in mitochondria. Cellular and Molecular Life Sciences : CMLS 61: 2393-2404 Schauer N, Semel Y, Balbo I, Steinfath M, Repsilber D, Selbig J, Pleban T, Zamir D, Fernie AR (2008) Mode of inheritance of primary metabolic traits in tomato. Plant Cell 20: 509523 Schein AI, Kissinger JC, Ungar LH (2001) Chloroplast transit peptide prediction: a peek inside the black box. Nucleic Acids Res 29: E82 Schliebner I, Pribil M, Zuhlke J, Dietzmann A, Leister D (2008) A Survey of Chloroplast Protein Kinases and Phosphatases in Arabidopsis thaliana. Current Genomics 9: 184-190 Schonknecht G, Chen WH, Ternes CM, Barbier GG, Shrestha RP, Stanke M, Brautigam A, Baker BJ, Banfield JF, Garavito RM, Carr K, Wilkerson C, Rensing SA, Gagneul D, Dickenson NE, Oesterhelt C, Lercher MJ, Weber AP (2013) Gene transfer from bacteria and archaea facilitated evolution of an extremophilic eukaryote. Science 339: 1207-1210 Seibel NM, Eljouni J, Nalaskowski MM, Hampe W (2007) Nuclear localization of enhanced green fluorescent protein homomultimers. Anal Biochem 368: 95-99 Seifried A, Schultz J, Gohla A (2013) Human HAD phosphatases: structure, mechanism, and roles in health and disease. The FEBS Journal 280: 549-571 Sents W, Ivanova E, Lambrecht C, Haesen D, Janssens V (2013) The biogenesis of active protein phosphatase 2A holoenzymes: a tightly regulated process creating phosphatase specificity. The FEBS Journal 280: 644-661 Sheppeck JE, 2nd, Gauss CM, Chamberlin AR (1997) Inhibition of the Ser-Thr phosphatases PP1 and PP2A by naturally occurring toxins. Bioorg Med Chem 5: 1739-1750 Shi Y (2009) Serine/threonine phosphatases: mechanism through structure. Cell 139: 468-484 Sideris DP, Petrakis N, Katrakili N, Mikropoulou D, Gallo A, Ciofi-Baffoni S, Banci L, Bertini I, Tokatlidis K (2009) A novel intermembrane space-targeting signal docks cysteines onto Mia40 during mitochondrial oxidative folding. The Journal of Cell Biology 187: 1007-1022 Siegl G, MacKintosh C, Stitt M (1990) Sucrose-phosphate synthase is dephosphorylated by protein phosphatase 2A in spinach leaves. Evidence from the effects of okadaic acid and microcystin. FEBS Letter 270: 198-202 Silver DM, Silva LP, Issakidis-Bourguet E, Glaring MA, Schriemer DC, Moorhead GB (2013) Insight into the redox regulation of the phosphoglucan phosphatase SEX4 involved in starch degradation. The FEBS Journal 280: 538-548 220 Simanis V (1995) The control of septum formation and cytokinesis in fission yeast. Semin Cell Biol 6: 79-87 Small I, Peeters N, Legeai F, Lurin C (2004) Predotar: A tool for rapidly screening proteomes for N-terminal targeting sequences. Proteomics 4: 1581-1590 Smith SM, Fulton DC, Chia T, Thorneycroft D, Chapple A, Dunstan H, Hylton C, Zeeman SC, Smith AM (2004) Diurnal changes in the transcriptome encoding enzymes of starch metabolism provide evidence for both transcriptional and posttranscriptional regulation of starch metabolism in Arabidopsis leaves. Plant Physiol 136: 2687-2699 Snel B, Bork P, Huynen MA (2002) Genomes in flux: the evolution of archaeal and proteobacterial gene content. Genome Res 12: 17-25 Sozzani R, Benfey PN (2011) High-throughput phenotyping of multicellular organisms: finding the link between genotype and phenotype. Genome Biology 12: 219 Stamatakis A, Hoover P, Rougemont J (2008) A rapid bootstrap algorithm for the RAxML Web servers. Syst Biol 57: 758-771 Stamm P, Ravindran P, Mohanty B, Tan EL, Yu H, Kumar PP (2012) Insights into the molecular mechanism of RGL2-mediated inhibition of seed germination in Arabidopsis thaliana. BMC Plant Biology 12: 179 Stefansson B, Ohama T, Daugherty AE, Brautigan DL (2008) Protein phosphatase 6 regulatory subunits composed of ankyrin repeat domains. Biochemistry 47: 1442-1451 Stock JB, Stock AM, Mottonen JM (1990) Signal transduction in bacteria. Nature 344: 395400 Stubbs MD, Tran HT, Atwell AJ, Smith CS, Olson D, Moorhead GB (2001) Purification and properties of Arabidopsis thaliana type 1 protein phosphatase (PP1). Biochim Biophys Acta 1550: 52-63 Sugiyama N, Nakagami H, Mochida K, Daudi A, Tomita M, Shirasu K, Ishihama Y (2008) Large-scale phosphorylation mapping reveals the extent of tyrosine phosphorylation in Arabidopsis. Mol Syst Biol 4: 193 Sun TP, Kamiya Y (1994) The Arabidopsis GA1 locus encodes the cyclase ent-kaurene synthetase A of gibberellin biosynthesis. Plant cell 6: 1509-1518 Sun X, Kang X, Ni M (2012) Hypersensitive to red and blue 1 and its modification by protein phosphatase 7 are implicated in the control of Arabidopsis stomatal aperture. PLoS Genetics 8: e1002674 Swarup G, Cohen S, Garbers DL (1982) Inhibition of membrane phosphotyrosyl-protein phosphatase activity by vanadate. Biochemical and Biophysical Research Communications 107: 1104-1109 Szurmak B, Strokovskaya L, Mooney BP, Randall DD, Miernyk JA (2003) Expression and assembly of Arabidopsis thaliana pyruvate dehydrogenase in insect cell cytoplasm. Protein Expression and Purification 28: 357-361 Tagliabracci VS, Engel JL, Wen J, Wiley SE, Worby CA, Kinch LN, Xiao J, Grishin NV, Dixon JE (2012) Secreted kinase phosphorylates extracellular proteins that regulate biomineralization. Science 336: 1150-1153 Takemiya A, Ariyoshi C, Shimazaki K (2009) Identification and functional characterization of inhibitor-3, a regulatory subunit of protein phosphatase 1 in plants. Plant Physiology 150: 144-156 221 Takemiya A, Kinoshita T, Asanuma M, Shimazaki K (2006) Protein phosphatase 1 positively regulates stomatal opening in response to blue light in Vicia faba. Proc Natl Acad Sci U S A 103: 13549-13554 Takemiya A, Yamauchi S, Yano T, Ariyoshi C, Shimazaki K (2013) Identification of a regulatory subunit of protein phosphatase 1 which mediates blue light signaling for stomatal opening. Plant & Cell Physiology 54: 24-35 Tamura K, Fukao Y, Iwamoto M, Haraguchi T, Hara-Nishimura I (2010) Identification and characterization of nuclear pore complex components in Arabidopsis thaliana. Plant Cell 22: 4084-4097 Tang W, Yuan M, Wang R, Yang Y, Wang C, Oses-Prieto JA, Kim TW, Zhou HW, Deng Z, Gampala SS, Gendron JM, Jonassen EM, Lillo C, DeLong A, Burlingame AL, Sun Y, Wang ZY (2011) PP2A activates brassinosteroid-responsive gene expression and plant growth by dephosphorylating BZR1. Nature Cell Biology 13: 124-131 Tanner JJ, Parsons ZD, Cummings AH, Zhou H, Gates KS (2011) Redox regulation of protein tyrosine phosphatases: structural and chemical aspects. Antioxidants & Redox Signaling 15: 77-97 Templeton GW, Nimick M, Morrice N, Campbell D, Goudreault M, Gingras AC, Takemiya A, Shimazaki K, Moorhead GB (2011) Identification and characterization of AtI-2, an Arabidopsis homologue of an ancient protein phosphatase 1 (PP1) regulatory subunit. Biochemical Journal 435: 73-83 Tirichine L, Bowler C (2011) Decoding algal genomes: tracing back the history of photosynthetic life on Earth. The Plant Journal 66: 45-57 Tonks NK (2006) Protein tyrosine phosphatases: from genes, to function, to disease. Nature Reviews. Molecular Cell Biology 7: 833-846 Tonks NK (2013) Protein tyrosine phosphatases - from housekeeping enzymes to master regulators of signal transduction. The FEBS Journal 280: 346-378 Tran HT, Uhrig RG, Nimick M, Moorhead GB (2012) Interfacing protein lysine acetylation and protein phosphorylation: Ancient modifications meet on ancient proteins. Plant Signaling & Behavior 7: 901-903 Tran HT, Ulke A, Morrice N, Johannes CJ, Moorhead GB (2004) Proteomic characterization of protein phosphatase complexes of the mammalian nucleus. Molecular & Cellular Proteomics : MCP 3: 257-265 Trotta A, Wrzaczek M, Scharte J, Tikkanen M, Konert G, Rahikainen M, Holmstrom M, Hiltunen HM, Rips S, Sipari N, Mulo P, Weis E, von Schaewen A, Aro EM, Kangasjarvi S (2011) Regulatory subunit B'gamma of protein phosphatase 2A prevents unnecessary defense reactions under low light in Arabidopsis. Plant Physiology 156: 1464-1480 Tseng TS, Briggs WR (2010) The Arabidopsis rcn1-1 mutation impairs dephosphorylation of Phot2, resulting in enhanced blue light responses. Plant Cell 22: 392-402 Tsuruta H, Aizono Y (1999) Enzymatical properties of psychrophilic phosphatase I. J Biochem 125: 690-695 Tsuruta H, Aizono Y (2000) Cloning of phosphatase I gene from a psychrophile, Shewanella sp., and some properties of the recombinant enzyme. J Biochem 127: 143-149 Ubersax JA, Ferrell JE, Jr. (2007) Mechanisms of specificity in protein phosphorylation. Nat Rev Mol Cell Biol 8: 530-541 222 Uhrig RG, O'Leary B, Spang HE, MacDonald JA, She YM, Plaxton WC (2008) Coimmunopurification of phosphorylated bacterial- and plant-type phosphoenolpyruvate carboxylases with the plastidial pyruvate dehydrogenase complex from developing castor oil seeds. Plant Physiology 146: 1346-1357 Vilela B, Pages M, Lumbreras V (2010) Regulation of MAPK signaling and cell death by MAPK phosphatase MKP2. Plant Signaling & Behavior 5: 1497-1500 Villarino A, Duran R, Wehenkel A, Fernandez P, England P, Brodin P, Cole ST, ZimnyArndt U, Jungblut PR, Cervenansky C, Alzari PM (2005) Proteomic identification of M. tuberculosis protein kinase substrates: PknB recruits GarA, a FHA domain-containing protein, through activation loop-mediated interactions. J Mol Biol 350: 953-963 Virshup DM, Shenolikar S (2009) From promiscuity to precision: protein phosphatases get a makeover. Mol Cell 33: 537-545 Vogtle FN, Burkhart JM, Rao S, Gerbeth C, Hinrichs J, Martinou JC, Chacinska A, Sickmann A, Zahedi RP, Meisinger C (2012) Intermembrane space proteome of yeast mitochondria. Molecular & Cellular Proteomics : MCP 11: 1840-1852 Walker G, Dorrell RG, Schlacht A, Dacks JB (2011) Eukaryotic systematics: a user's guide for cell biologists and parasitologists. Parasitology 138: 1638-1663 Weckbecker D, Longen S, Riemer J, Herrmann JM (2012) Atp23 biogenesis reveals a chaperone-like folding activity of Mia40 in the IMS of mitochondria. EMBO J 31: 43484358 Wen F, Wang J, Xing D (2012) A protein phosphatase 2A catalytic subunit modulates blue light-induced chloroplast avoidance movements through regulating actin cytoskeleton in Arabidopsis. Plant & Cell Physiology 53: 1366-1379 Wenzl P, Wong L, Kwang-won K, Jefferson RA (2005) A functional screen identifies lateral transfer of beta-glucuronidase (gus) from bacteria to fungi. Mol Biol Evol 22: 308-316 Wu G, Wang X, Li X, Kamiya Y, Otegui MS, Chory J (2011) Methylation of a phosphatase specifies dephosphorylation and degradation of activated brassinosteroid receptors. Science Signaling 4: ra29 Xu Y, Xing Y, Chen Y, Chao Y, Lin Z, Fan E, Yu JW, Strack S, Jeffrey PD, Shi Y (2006) Structure of the protein phosphatase 2A holoenzyme. Cell 127: 1239-1251 Xue T, Wang D, Zhang S, Ehlting J, Ni F, Jakab S, Zheng C, Zhong Y (2008) Genome-wide and expression analysis of protein phosphatase 2C in rice and Arabidopsis. BMC Genomics 9: 550 Yu H, Braun P, Yildirim MA, Lemmens I, Venkatesan K, Sahalie J, Hirozane-Kishikawa T, Gebreab F, Li N, Simonis N, Hao T, Rual JF, Dricot A, Vazquez A, Murray RR, Simon C, Tardivo L, Tam S, Svrzikapa N, Fan C, de Smet AS, Motyl A, Hudson ME, Park J, Xin X, Cusick ME, Moore T, Boone C, Snyder M, Roth FP, Barabasi AL, Tavernier J, Hill DE, Vidal M (2008) High-quality binary protein interaction map of the yeast interactome network. Science 322: 104-110 Zhang J, Zhang Z, Brew K, Lee EY (1996) Mutational analysis of the catalytic subunit of muscle protein phosphatase-1. Biochemistry 35: 6276-6282 Zhang M, Liu J, Kim Y, Dixon JE, Pfaff SL, Gill GN, Noel JP, Zhang Y (2010) Structural and functional analysis of the phosphoryl transfer reaction mediated by the human small C-terminal domain phosphatase, Scp1. Protein Science 19: 974-986 223 Zhang X, Henriques R, Lin SS, Niu QW, Chua NH (2006) Agrobacterium-mediated transformation of Arabidopsis thaliana using the floral dip method. Nature Protocols 1: 641-646 Zhang X, Ozawa Y, Lee H, Wen YD, Tan TH, Wadzinski BE, Seto E (2005) Histone deacetylase 3 (HDAC3) activity is regulated by interaction with protein serine/threonine phosphatase 4. Genes Development 19: 827-839 224 Appendix A.1. SLP Phylogenetic Tree and Alignment Sequence Information 225 226 227 228 229 Appendix A.2. RLPH Phylogenetic Tree and Alignment Sequence Information 230 231 232 Appendix B. Phylogenetic Tree Node Scores 233 Appendix C.1. Cloning and Genotyping Primers 234 Appendix C.2. Plant Genotyping Genotyping was performed as outlined in the Materials and Methods in Chapter 5. Primers used are outlined in Appendix C.1. 235 Appendix C.3. Primers for qPCR of GA Related Genes 236