Download Experimental Approaches to Protein–Protein Interactions

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Endomembrane system wikipedia , lookup

Hedgehog signaling pathway wikipedia , lookup

Multi-state modeling of biomolecules wikipedia , lookup

Phosphorylation wikipedia , lookup

SR protein wikipedia , lookup

Histone acetylation and deacetylation wikipedia , lookup

Proteasome wikipedia , lookup

Signal transduction wikipedia , lookup

G protein–coupled receptor wikipedia , lookup

Magnesium transporter wikipedia , lookup

Protein (nutrient) wikipedia , lookup

Homology modeling wikipedia , lookup

Protein folding wikipedia , lookup

Protein wikipedia , lookup

Protein domain wikipedia , lookup

Protein phosphorylation wikipedia , lookup

Cyclol wikipedia , lookup

Protein structure prediction wikipedia , lookup

Intrinsically disordered proteins wikipedia , lookup

Protein moonlighting wikipedia , lookup

List of types of proteins wikipedia , lookup

Nuclear magnetic resonance spectroscopy of proteins wikipedia , lookup

Protein purification wikipedia , lookup

Proteolysis wikipedia , lookup

Protein–protein interaction wikipedia , lookup

Transcript
A Biochemical Society Focused Meeting held at University of Sheffield, Sheffield, U.K., 11–12 January 2010. Organized and Edited by Michael Sutcliffe
Biochemical Society Transactions
www.biochemsoctrans.org
(Manchester, U.K.) and Mike Williamson (Sheffield, U.K.).
Protein–protein interactions
Mike P. Williamson*1 and Michael J. Sutcliffe†
*Department of Molecular Biology and Biotechnology, University of Sheffield, Firth Court, Western Bank, Sheffield S10 2TN, U.K., and †School of Chemical
Engineering and Analytical Science, The Mill, University of Manchester, Manchester M13 9PL, U.K.
Abstract
In the present article, we describe the two standard high-throughput methods for identification of protein
complexes: two-hybrid screens and TAP (tandem affinity purification) tagging. These methods have been
used to characterize the interactome of Saccharomyces cerevisiae, showing that the majority of proteins
are part of complexes, and that complexes typically consist of a core to which are bound ‘party’ and ‘dater’
proteins. Complexes typically are merely the sum of their parts. A particularly interesting type of complex
is the metabolon, containing enzymes within the same metabolic pathway. There is reasonably good
evidence that metabolons exist, but they have not been detected using high-thoughput assays, possibly
because of their fragility.
Introduction
Traditionally, protein–protein interactions have been identified and characterized using low-throughput biophysical
methods such as NMR, crystallography, and a range of
spectroscopic and calorimetric methods. There remains,
of course, a vital place for such methods, and indeed
many of the presentations at this meeting, Experimental
Approaches to Protein–Protein Interactions, demonstrate
the continued vitality of such measurements. However, more
recently, high-throughput methods have provided a new
insight into protein–protein interactions on a proteome-wide
scale, altering some of our ideas of the nature of interactions.
It is therefore appropriate in this introductory article to
consider the lessons learned from such studies.
High-throughput methods
There are two workhorses of high-throughput studies: twohybrid screens and TAP (tandem affinity purification) tagging. The two-hybrid screen (Figure 1) is a simple and rapid
method of finding the interaction partners of a target protein
Key words: high-throughput method, interactome, metabolon, tandem affinity purification tag
(TAP-tag), two-hybrid screen, yeast.
Abbreviations used: MIPS, Munich Information Center for Protein Sequences; TAP, tandem
affinity purification; TEV, tobacco etch virus; UAS, upstream activation sequence.
1
To whom correspondence should be addressed (email [email protected]).
Biochem. Soc. Trans. (2010) 38, 875–878; doi:10.1042/BST0380875
[1,2]. In particular, it finds proteins that interact as a binary
complex with the target protein. The screen uses a reporter
gene, whose activation is easily detected. A popular such gene
is the Escherichia coli lacZ gene, whose activation is detected
because, when grown in a medium containing X-Gal (5bromo-4-chloroindol-3-yl β-D-galactopyranoside), the cells
turn blue. The gene is activated only when a neighbouring
region of DNA known as the upstream activation sequence
(UAS) binds to a protein containing a transcription activation
domain. In unmodified cells, this protein consists of two domains: one whose function is to bind to DNA, and a
second which is the transcription activation domain. Protein–
protein interactions can then be identified by splitting
this protein into two parts, as shown in Figure 1. The DNAbinding domain is attached to a putative dimerization domain,
whereas the transcription activation domain is attached in
different clones to a library of putative dimerization partners.
Activation of the reporter gene is observed only when the
two dimerization domains interact and therefore position
the transcription activation domain in the correct position upstream of the reporter gene, giving rise to blue colonies.
This method has been very widely used, and has uncovered
a large number of interactions. It has, however, received a
good deal of criticism, mainly because it tends to identify
a high proportion of false positives: protein hits that are
unlikely to associate in vivo.
!
C The
Authors Journal compilation
!
C 2010
Biochemical Society
Experimental Approaches to Protein–Protein Interactions
Experimental Approaches to
Protein–Protein Interactions
875
876
Biochemical Society Transactions (2010) Volume 38, part 4
Figure 1 The two-hybrid screen
A DNA-binding domain that binds to a UAS is fused to the target
or bait protein. A library is constructed of possible binding partners
(prey) fused to a transcriptional activation domain (which is normally
attached to the DNA-binding domain). In this way, binding of bait to prey
leads to transcription of the reporter gene such as lacZ.
TAP tagging is a clever way of purifying low-abundance
proteins and pulling them out of the cell still attached to
any binding partners they may have [3], and is discussed in
more detail in this issue of Biochemical Society Transactions by
Völkel et al. [4] It is an extension of the standard protein purification tag such as the hexahistidine tag, in which a His6 sequence is added to one end of the protein: this sequence binds
to metals such as nickel, whereas normal proteins do not, and
therefore the protein may be purified in one step by use of a
nickel-affinity column. In TAP tagging, two tags are used sequentially (Figure 2). Typically, the C-terminus of the protein
is tagged with a sequence consisting of a calmodulin-binding
peptide, followed by a cleavage site recognized by TEV (tobacco etch virus) protease, followed by Protein A. Protein A
binds to IgG. The tagged protein is therefore purified first on
IgG beads, after which the Protein A sequence is cleaved off
by TEV protease. The protein is then purified on calmodulin
beads in the presence of calcium, which is required for correct
folding of calmodulin. It is then eluted from the column using
EGTA, which removes the calcium. The second affinity step
serves as a second independent purification and also removes
the TEV protease, thereby leaving the desired complex as
(ideally) the only protein present. The method therefore
gives a reproducibly very high purification factor, allowing
the preparation of essentially completely pure protein
from the cell without overexpression. The method is
commonly used to isolate complexes, because the purification
treatments are relatively mild and so leave many complexes
intact. The purified complex can then be analysed, typically
by SDS/PAGE, after which the component proteins can
be identified by Western blotting, or more commonly by
protease digestion followed by MS of the resultant peptide
fragments and comparison with a sequence database to
identify the parent protein. By comparison with two-hybrid
screens, it identifies groups of proteins that assemble into
complexes with the target protein, and can therefore identify
proteins that do not necessarily interact directly, but do
belong to the same complex.
The TAP-tag method is also prone to errors. It generates
some false positives, in that abundant proteins, particularly
housekeeping (e.g. glycolytic) and ribosomal proteins,
tend to be picked up more often than they should [5]. It
also under-represents several classes of interactions, such
as membrane proteins (as does the two-hybrid screen).
Most significantly, the purification stages, even though
mild, mean that weak interactions are lost. It has been
frequently noted that the intracellular environment is very
Figure 2 TAP tagging
A two-step purification is used: the first uses Protein A, which binds to IgG beads. The Protein A is subsequently removed
by TEV protease. The second purification uses the affinity of a calmodulin-binding peptide (CaM-bp) for calmodulin, which is
removed by EGTA.
!
C The
C 2010 Biochemical Society
Authors Journal compilation !
Experimental Approaches to Protein–Protein Interactions
crowded, with protein occupying up to 40% of the total
fluid volume in cells (implying that the protein concentration
is very similar to that within protein crystals!) [6]. One
consequence of this macromolecular crowding is that
protein–protein interactions are stronger in the cell than they
would be in dilute solution, often by very large factors [7].
Therefore purification out of the intracellular medium almost
necessarily implies that the weaker interactions will be lost.
The interactome
The interactome (i.e. the set of interactions between all
proteins in the cell) of the yeast Saccharomyces cerevisiae has
been described by two independent groups, in both cases
using TAP tagging [8,9]. The results are broadly consistent,
in that they show that approx. 70% of proteins in the
cell have at least one interacting partner: in other words,
interaction with other proteins is the norm rather than
the exception. (And, as can be seen from other articles in
this issue of Biochemical Society Transactions, homotropic
interactions such as homodimerization are also the norm
rather than the exception; the majority of proteins exist not
as monomers, but as multimers.) The network of interactions
has the property of being scale-free: although most proteins
interact with only one or two other partners, a small number
of proteins form a large number of interactions. And protein
complexes make up virtually all aspects of cellular function.
The majority of complexes, containing only two or
three proteins, often form stable complexes in which the
components have obvious functional roles: for example, they
may be sequential enzymes in a pathway, or an enzyme plus a
regulatory protein, or an enzyme plus a protein that interacts
with substrate, or an enzyme plus a protein that locates it in
a particular region of the cell. The larger complexes tend to
contain a ‘core’ or ‘hub’, playing much the same role: these are
the key proteins that carry out the catalytic activity, and tend
to be genes that are essential for survival of the cell [10]. For
example, in the RNA polymerase II complex, the core is the
polymerase itself plus TFIIB (transcription factor IIB) and
TFIIF (transcription factor IIF), which function to track the
polymerase along its DNA substrate. In addition, there are
other proteins, which have been described as either ‘party’ or
‘dater’ proteins [11,12]. The binding of these proteins to the
core is often variable, being controlled by phosphorylation,
cell cycle, growth conditions, local concentrations and
many other factors. The party proteins are so called because
they are found attached to a range of core complexes,
but the party proteins themselves are always in the same
group. These proteins have again a well-defined function,
such as recognition of the promoter site or termination
of transcription. In this example, these two functions are
mutually exclusive, and therefore in any given complex
one might find one or the other, but not both. (Similar
conclusions were reached on purely structural grounds;
some proteins apparently have very large numbers of possible
partners, which is not physically possible simultaneously
[13].) The dater proteins are so called because they are found
with different partners in each complex, and they often have
more general functions than the party proteins.
In each case where we understand the workings of the
complex, the function of the complex is reassuringly more
or less just the sum of its parts, with different functions
being provided by modular groups of proteins, which
carry out similar functions in different complexes [8]. This
straightforward logic is, however, obscured by frequent
examples of redundancy or non-orthologous replacement.
This is not unexpected, given that the components of
complexes arose as a result of evolutionary processes.
There are clearly still many problems to be ironed out
in high-throughput methods. The two TAP-tag studies
described here show little overlap in detail [11]. There is
a hand-curated database of protein interactions assembled
by the Munich Information Center for Protein Sequences
(MIPS, http://mips.gsf.de/genre/proj/corum), which is often
cited as the ‘gold standard’ of interacting proteins [14].
Between half and one third of the complexes in MIPS
were not identified in the TAP-tag screens. The results
from the two TAP-tag screens have been combined [15] to
produce a set that is suggested to be as accurate as traditional
low-throughput methods, and the paper concludes that
complexes probably exist in a continuum of components:
some components are always associated, whereas others
come and go at different rates and with different affinities.
The yeast interactome has also been probed using twohybrid technology, generating a set of interactions that is
different again. This may of course reflect real differences
in the types and durations of interactions, and it has been
suggested that two-hybrid screens are more suited for
identifying transient and intercomplex interactions [16],
implying again that many proteins interact weakly and
transiently, often with more than one complex.
The metabolon
A particularly interesting type of complex is provided by
the metabolon. This is a concept first proposed by Welch
in the 1970s [17] and since promoted heavily by Srere [18]
among others. It is argued that many groups of enzymes
within common metabolic pathways are physically associated
together into metabolons: for example, glycolytic enzymes
or TCA (tricarboxylic acid) cycle enzymes. This idea makes
good theoretical sense: it facilitates channelling and therefore
speeds up multienzyme pathways and improves control and
regulation. The main problem, however, is that is not clearly
supported by experimental observations. The TAP-tag studies described above found almost no examples of metabolons:
indeed, the complexes identified are strikingly deficient in
metabolic complexes. It has been argued that metabolic
complexes may be weakly associated in vivo, because their
assembly needs to be regulated; for the reasons described
above, this could explain why they have not been observed.
And there are many observations that support the metabolon
concept. For example, there are six enzymes (three of which
are multifunctional) that catalyse purine biosynthesis in
!
C The
C 2010 Biochemical Society
Authors Journal compilation !
877
878
Biochemical Society Transactions (2010) Volume 38, part 4
eukaryotes. In vitro, no co-localization is observed, and
there is no evidence that they associate. However, fluorescent
labelling has demonstrated that these enzymes co-localize
in vivo, and that this localization can be disrupted in response
to purine levels [19]. That study, in common with many
others, suggested further that the complexation occurs at a
membrane surface. An elegant study using isotopic labels
demonstrated channelling for most, if not all, of the glycolytic
enzymes in Arabidopsis and potato, the complexes being
formed at the mitochondrial outer membrane in response
to respiratory demand [20]. The evidence is still mixed, but it
seems likely that TAP tagging really does miss a large number
of weakly interacting, but important, complexes.
Where next?
Two clear conclusions can be reached from this analysis. The
first is that functionally important protein–protein complexes
can adopt a very wide range of affinities, ranging from very
stable homo- and hetero-dimers to transient and tightly
regulated complexes that fall apart as soon as the cell is
broken open, and therefore that have to be studied in
vivo. Such different complexes require very different experimental methods for their study. The second is that highthroughput methods are vital, but need validation and extension to weaker interactions. The participants at this meeting
illustrate these conclusions nicely, and provide an excellent introduction to the range of experimental techniques used in the
increasingly important study of protein–protein interactions.
References
1 Fields, S. and Sternglanz, R. (1994) The two-hybrid system: an assay for
protein–protein interactions. Trends Genet. 10, 286–292
2 Warbrick, E. (1997) Two’s company, three’s a crowd: the yeast two
hybrid system for mapping molecular interactions. Structure 5, 13–17
3 Dziembowski, A. and Séraphin, B. (2003) Recent developments in the
analysis of protein complexes. FEBS Lett. 556, 1–6
4 Völkel, P., Le Faou, P. and Angrand, P.-O. (2010) Interaction proteomics:
characterization of protein complexes using tandem affinity
purification–mass spectrometry. Biochem. Soc. Trans. 38, 883–887
5 Jansen, R., Yu, H.Y., Greenbaum, D., Kluger, Y., Krogan, N.J., Chung, S.B.,
Emili, A., Snyder, M., Greenblatt, J.F. and Gerstein, M. (2003) A Bayesian
networks approach for predicting protein-protein interactions from
genomic data. Science 302, 449–453
!
C The
C 2010 Biochemical Society
Authors Journal compilation !
6 Zimmerman, S.B. and Trach, S.O. (1991) Estimation of macromolecule
concentrations and excluded volume effects for the cytoplasm of
Escherichia coli. J. Mol. Biol. 222, 599–620
7 Hall, D. and Minton, A.P. (2003) Macromolecular crowding: qualitative
and semiquantitative successes, quantitative challenges. Biochim.
Biophys. Acta 1649, 127–139
8 Gavin, A.C., Aloy, P., Grandi, P., Krause, R., Boesche, M., Marzioch, M.,
Rau, C., Jensen, L.J., Bastuck, S., Dümpelfeld, B. et al. (2006) Proteome
survey reveals modularity of the yeast cell machinery. Nature 440,
631–636
9 Krogan, N.J., Cagney, G., Yu, H.Y., Zhong, G.Q., Guo, X.H., Ignatchenko, A.,
Li, J., Pu, S.Y., Datta, N., Tikuisis, A.P. et al. (2006) Global landscape of
protein complexes in the yeast Saccharomyces cerevisiae. Nature 440,
637–643
10 Batada, N.N., Hurst, L.D. and Tyers, M. (2006) Evolutionary and
physiological importance of hub proteins. PLoS Comput. Biol. 2,
748–756
11 Devos, D. and Russell, R.B. (2007) A more complete, complexed and
structured interactome. Curr. Opin. Struct. Biol. 17, 370–377
12 Robinson, C.V., Sali, A. and Baumeister, W. (2007) The molecular
sociology of the cell. Nature 450, 973–982
13 Kim, P.M., Lu, L.J., Xia, Y. and Gerstein, M.B. (2006) Relating threedimensional structures to protein networks provides evolutionary
insights. Science 314, 1938–1941
14 Mewes, H.W., Amid, C., Arnold, R., Frishman, D., Guldener, U.,
Mannhaupt, G., Munsterkotter, M., Pagel, P., Strack, N., Stumpflen, V.
et al. (2004) MIPS: analysis and annotation of proteins from whole
genomes. Nucleic Acids Res. 32, D41–D44
15 Collins, S.R., Kemmeren, P., Zhao, X.C., Greenblatt, J.F., Spencer, F.,
Holstege, F.C.P., Weissman, J.S. and Krogan, N.J. (2007) Toward a
comprehensive atlas of the physical interactome of Saccharomyces
cerevisiae. Mol. Cell. Proteomics 6, 439–450
16 Yu, H.Y., Braun, P., Yildirim, M.A., Lemmens, I., Venkatesan, K., Sahalie, J.,
Hirozane-Kishikawa, T., Gebreab, F., Li, N., Simonis, N. et al. (2008)
High-quality binary protein interaction map of the yeast interactome
network. Science 322, 104–110
17 Welch, G.R. (1977) On the role of organized multienzyme systems in
cellular metabolism: a general synthesis. Prog. Biophys. Mol. Biol. 32,
103–191
18 Srere, P.A. (1987) Complexes of sequential metabolic enzymes. Annu.
Rev. Biochem. 56, 89–124
19 An, S.G., Kumar, R., Sheets, E.D. and Benkovic, S.J. (2008) Reversible
compartmentalization of de novo purine biosynthetic complexes in living
cells. Science 320, 103–106
20 Graham, J.W.A., Williams, T.C.R., Morgan, M., Fernie, A.R., Ratcliffe, R.G.
and Sweetlove, L.J. (2007) Glycolytic enzymes associate dynamically
with mitochondria in response to respiratory demand and support
substrate channeling. Plant Cell 19, 3723–3738
Received 20 January 2010
doi:10.1042/BST0380875