Survey
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
Cell, Vol. 59, 553-562, November 3, 1969, Copyright 8 1969 by Cell Press A Single Amino Acid Can Determine the DNA Binding Specificity of Homeodomain Proteins Jessica Tteisman, Pier@ G&ncxy, Malini Vashishtha, Esther Harris, and Claude Desplan Howard Hughes Medical Institute The Rockefeller University New York, New York 10021-8399 Summary Many Drosophila developmental genes contain a DNA binding domain encoded by the homeobox. This homeodomain contains a region distantly homologous to the helix-turn-helix motif present in several pmlcsryotic DNA binding proteins. We investigated the nature of homeodomain-DNA interactions by making a series of mutations in the helix-turn-helix motif of the Drosophila homeodomain protein Paired (Prd). This protein does not recognize sequences bound by the homeodomain proteins Fushi taraxu (Ftz) or Bicoid @cd). We show that changing a single amino acid at the C-terminus of the recognition hei!x is both necessary and sufficient to confer the DNA blnding specificity of either Ftz or Bed on Prd. This simple rule indicates that the amino acids that determine the specificity of homeodomains are different fmm those mediating protein-DNA contacts in pmkaryotic proteins. We further show that Prd contains two DNA binding activities. The Prd homeodomain is responsible for one of them while the other is not dependent on the recognition helix. Introduction Embryonic development in Drosophila is controlled by a set of regulatory genes that are expressed in a very precise temporal and spatial pattern (for reviews see Scott and O ’Farreii, 1988; Scott and Carroll, 1987). The genetic identification (NOssiein-Voihard and Wieschaus, 1980) and the subsequent molecular analysis of most of these developmental genes have led to an understanding of their interactions. Maternal information establishes the polarity of the embryo (Nussiein-Voihard et al., 1987) while a hierarchy of gap, pair-rule, segment polarity, and homeotic genes interprets this information, dividing the embryo into repeating units and assigning identities to them (reviewed in ingham, 1988). A molecular comparison of the products of the developmental genes has ailowed the identification of protein domains shared by several of them, raising the hypothesis that these genes have arisen through duplication and subsequent divergence (McGinnis et al., 1984a; Scott and Weiner, 1984; Scott and O ’Farreii, 1988; Frigerio et al., 1988). The presence of similar motifs in a large variety of multicellular organisms argues for the utilization of similar developmental processes throughout evolution (McGinnis et al., 1984b; Chowdhury et al., 1987; Deutsch et al., 1988; Dressier and Gruss, 1988). One of the most prevalent protein motifs among the products of the developmental genes is the homeodomain (McGinnis et al., 1984a; Scott and Weiner, 1984). This 80 amino acid protein domain is present in the product of the Drosophila maternal effect gene bicoid as well as in those of several classes of zygotic genes involved in early deveiopment and in later specification of ceil fates (for references see Scott et al., 1989). The homeodomain is also encoded by genes expressed early in development in other species. its sequence is highly conserved in otherwise unrelated proteins. An important characteristic of the homeodomain is the presence in its C-terminal part of a region similar to the helix-turn-helix motif that mediates DNA binding by many prokaryotic proteins (Laughon and Scott, 1984; Pabo and Sauer, 1984) supporting the idea that many of the developmental genes encode transcriptional regulators (Garcia-Beiiido, 1975). indeed, several homeodomain-containing proteins have been shown to bind specifically to DNA (Despian et al., 1985, 1988; Hoey and Levine, 1988; Beachy et al., 1988; Driever and Niisslein-Volhard, 1989; Miiiier et al., 1988). Recently, nuclear magnetic resonance studies have demonstrated that the purified Antennapedia homeodomain contains three heiices separated by turns (Otting et al., 1988). The third of these heiices is thought to correspond to the recognition helix of the prokaryotic proteins, which forms specific hydrogen bonds and van der Waais contacts with the base pairs of DNA (see Ptashne, 1988). In the homeodomain, the third helix extends further than the 9 amino acids that constitute the prokaryotic recognition helix (Otting et al., 1988). Based on sequence homologies, the homeodomains fail into several groups that are conserved across species (Gehring et al., 19Sr; Scott et al., 1989). Interestingly, there is no direct correlation between the group a homeodomain belongs to and the developmental function of its gene. We proposed a classification of the various homeodomains based on the sequences of their recognition heiices (Desplan et al., 1988), expecting that the sequence differences in this motif would reflect differences in DNA binding specificity and therefore in function. in the lambda and 434 repressors, positions 1, 2, 5, and 8 form critical contacts with the base pairs of the operator sequence (Ptashne, 1988; Wharton and Ptashne, 1985, 1987; Agganvai et al., 1988; Anderson et al., 1987). Mutational analysis of the Trp repressor, another prokaryotic helix-turn-helix protein, has suggested that residue 3 of the recognition helix is the most crucial for specific recognition of the operator (Bass et al., 1988). Surprisingly, differences at these positions do not seem to affect the DNA binding specificity of the homeodomain. For exampie, homeodomain proteins with different amino acids at positions 2,3, and 8 (Fushi tarazu [Ftz] and Even-skipped [Eve]; Laughon and Scott, 1984; Macdonaid et al., 1988) nevertheless recognize the same consensus sequence with only minor differences in specificity (Despian et al., 1988; Hoey and Levine, 1988; C. D., unpublished data). Cdl 554 We therefore hypothesized that the protein-DNA contacts are different for the homeodomain proteins than for the prokaryotic proteins that have been crystallized and genetically characterized in great detail (Ptashne, 1966; Hochschild et al., 1966; Hochschild and Ptashne, 1966; Anderson et al., 1967; Aggarwal et al., 1966; Jordan and Pabo, 1966; Wolberger et al., 1966; Otwinowski et al., 1966; Bass et al., 1966). We carried out a systematic analysis of the roles of various amino acids of the helix-turn-helix motif in specifying the DNA sequences bound by homeodomain proteins. For this purpose, we changed specific amino acids in the Paired protein (Prd; Bopp et al., 1966; Frigerio et al., 1966) and analyzed the DNA binding specificities of the wildtype and mutant proteins produced in Escherichia coli. The unmodified Prd protein does not recognize sequences specifically bound by the homeodomain protein Ftz, nor those specifically bound by Bicoid (Bed; Driever and Nirsslein-Volhard, 1969). We tested the ability of the mutant proteins to gain binding to these two classes of sequences. From these studies we conclude that the amino acid at position 9 of the recognition helix (as defined for prokaryotic proteins) determines the specificity of the homeodomain. When a glutamine occupies this position, as in Ftz, the mutated Prd is able to bind to the Ftz sites. When a lysine occupies this position, as in Bed (Berleth et al., 1966) the Prd mutant recognizes the Bed sites present in the hunchback promoter (Driever and NOssleinVolhard, 1969). Prd itself has a serine at position 9 (Bopp et al., 1966) and its homeodomain recognizes a distinct sequence. This provides a very simple rule to explain the overall specificity of homeodomains, which distinguishes their mechanism of interaction with DNA from that of A Helix 2 REELAQRT -I-----L -I-----L -I-IG--L -IDI-NAL Turn Helix 3 NP6 Figure 1. Structure Results Changing a Single Amino Acid in the Prd Recognition Helix Switches Its Specificity to That of Ftz The homeodomain proteins Ftz, Zerkniillt (Zen; Doyle et al., 1966) and other proteins of class I and class II, as well as the Eve protein, bind to the consensus sequence TCAATTAAAT (NP) and to cognate sites in the engrhd gene (Desplan et al., 1966; Hoey and Levine, 1966), whereas Prd (class IV) does not recognize these DNA sequences (Hoey and Levine, 1966; see also Figure 2). This difference prompted us to investigate the nature of the specificity underlying homeodomain-DNA interactions. We used in vitro mutagenesis to create a series of altered Prd constructs that have redesigned helix-turn-helix motifs borrowing residues from the Ftz homeodomain (Figure lA, Ml through M7). We then overexpressed these constructs in E. coli (Figure 1B) under the control of the T7 promoter (Studier and Moffatt, 1966) and assayed the e6 Bed NLT EARIQVWFS + - s-s-------- -RQ-KI---RQ-KI--Q -RQ-KI---RQ-KI--Q - - --- -- -Q -R--KI--Q -RQ--I--Q -RQDKIEAQ + + + nd nd nd nd nd nd nd s-c s-s -RQVKI--Q -RQ-KI--Q + + + + + + + + + + + + (+) - - + + nd + --------K LAD-SAKL prokaryotic DNA binding proteins. Hanes and Brent (1969) have recently reached similar conclusions by measuring the rate of transcriptional activation by mutants in the bicoid gene expressed in yeast. We also report that the Prd protein possesses two DNA binding functions. One of them is dependent on the homeodomain and is masked by the C-terminal part of the Prd protein, which contains the Prd repeat (Bopp et al., 1966). Deletion of this part allows recognition of the consensus sequence TTTGACGT. Another distinct specificity is not affected by dramatic changes in the recognition helix and is therefore not mediated by this motif. A-G and Expression T-QVKI--K of Prd Constructs nd Containing Redesigned Helix-Turn-Helix B kd Motifs (A) Mutations in the helix-turn-helix motif of Prd. The sequences of Zen, Ftz, Bed, and the various mutants are indicated. A dash indicates that the amino acid is the same as in Prd. Binding to the class I consensus sequence (NPs). to the eve promoter site (ec), or to the hunchback promoter &es (Bed Al, A2. and A3) is indicated. Binding of Bed to hunchbeck has been reported by Driever and Ngsslein-\lblhard (lgeg). The amino acid changes in each mutant that alter its binding specifictty are indicated in boldface. nd, not determined. (B) Coomassie blue-stained SDS-polyacrylamide gel showing the crude bacterial extracts used for these experiments. Twenty micrograms of total protein was loaded in each lane. Specificity 555 G/A of Homeodomains ZSII Prd . ..-I Figure 2. Binding of the Prd Mutant Proteins to the Class I Consensus ..a ..- Sequence DNAase I protection of the NPs sequence (Desplan et al., 1999). Approximately 2 ng of DNA was incubated without pmtein (-) or with 2, 10, or 40 ug of bacterial extracts containing the indicated proteins (increasing protein concentration is shown by arrows). DNAase I footprinting was carried out as described in Experimental Procedures. The G/A lane shows a Maxam and Gilbert sequencing ladder identifying the purines. The six repeats of the NP sequence are indicated by arrows showing their orientation (5’-TCAATTAAATGA-39. DNAase l-hypersensitive sites (arrowheads) appear between protected regions for those proteins that bind to this sequence. mutant proteins by DNAase I footprinting for their ability to bind to a hexameric repeat of the class l/class II consensus sequence (NPrJ. Interestingly, changing amino acids 2, 3, 5, and 6 of the Prd recognition helix to those of the Ftz sequence, making Prd identical to Ftz over amino acids 1 through 6, gave only a very low affinity bind- A hb Figure 3. Binding Specificity of the Prd B9 Mutant B ing to the NP sequence (Figure 2, Ml), despite a normal level of expression (Figure 16) and binding activity to an alternate site (see below and Figure 4). When the whole recognition helix of Prd was changed to that of Ftz, the mutant protein now bound to NPs, suggesting a crucial role for residue 9 of the recognition helix (Figure 2, M2). N% C eve Protein concentrations and footprint procedures were as described in Figure 2. (A) DNAase I protection of a Bed-specific site from the hunchback (hb) promoter. The Bed Al site (CGTAATCCC; Driever and Niisslein-Volhard. is indicated. (B) DNAaae I protection of the NPs sequence described in Figure 2. The six repeats are indicated by arrows. (C) DNAase I protection of the Prd-specific site from the eve promoter. es (CACGATTAGCACCGTTCCGCTC) is indicated. 1999) Cdl 556 G/A Zen --.-----4---m--_-. Ftz Prd Ml M2 M3 M4 Figure 4. Binding of the Prd Mutant Proteins to the e5 Site from the eve Promoter Described Protein concentrations and footprint procedures were as described MS M6.--- M7 Control in Figure 3 in Figure 2. This was confirmed by generating a mutant protein (M5) that had only this single amino acid (serine) changed to the Ftz sequence (glutamine). This single change conferred binding to the NPe sequence at an affinity similar to that of Ftz (Figure 2, M5). Further single amino acid changes in the M2 mutant (M6 and M7) did not produce changes in specificity. Changing amino acids at nonconserved positions of the Prd helix 2 to those present in Ftz also had no effect on the specificity of the mutant proteins, as can be seen by comparing Ml with M3 or M2 with M4 (Figure 2). Similar binding patterns were observed when mutant proteins were tested for their ability to bind to the cognate sequence in the engrailed promoter (data not shown). Taken together, these results show that a glutamine as residue 9 of the recognition helix is both necessary and sufficient to confer F&specific DNA binding on the Prd homeodomain protein. The Same Residue Determines the Binding Specificity of Bicoid To see whether these observations could be expanded to other homeodomain proteins, we made use of the binding specificity of the Bed protein. Bed is a homeodomain protein that has recently been shown to bind target sequences in the hunchback promoter (Driever and Niisslein-\lollhard, 1969). These sites are not bound by Prd, by Ftz-type homeodomain proteins, or by the Prd mutant M5, which has a glutamine at position 9 (Figure 3A). The Bed homeodomain is quite divergent from that of Ftz and has a lysine as residue 9 of the recognition helix (Figure 1A; Frigerio et al., 1966). According to the data presented above, changing residue 9 of Prd into that present in Bed (serine to lysine) should be sufficient to allow binding of this mutant protein to the sites in the hunchback promoter. As shown in Figure 3A (mutant B9), this is indeed the case. The 89 protein does not bind to the NP sequence (Figure 36); thus two different substitutions at position 9 confer two different binding specificities on the Prd pro- tein. We can therefore generalize that this single amino acid is critical in determining the specificity of homeodomain proteins. Another DNA Binding Activity Resides outside the Prd Recognition Helix As shown in Figure 18, the mutant proteins were all produced in E. coli at similar levels. In addition, all proteins were active in DNA binding as judged by the footprint patterns shown in Figures 3C and 4. Prd binds specifically to a DNA sequence (CACGATTAGCACCGTTCCGCTC) in the eve promoter, overlapping a site named e5 that is bound by the Eve protein (Hoey and Levine, 1966). This sequence is distinct from the NP sequence, to which Eve also binds. We utilized the binding to the e5 site as a control for the DNA binding activity of our mutant proteins. Prd itself as well as all the mutant proteins retained their ability to bind to the e5 sequence (Figures 3C and 4). Ftz bound only weakly to the es sequence, while the footprint pattern of Zen differed from those of Prd and its derivatives (Figure 4). The observation that mutants with specificities switched to those of Ftz or Bed still retained their ability to bind to the e5 site was puzzling in the light of the above model. One would have expected only those mutant proteins retaining the Prd-type residue 9 (serine) in the recognition helix to bind specifically to this sequence. However, this is not the case, suggesting that the binding of Prd to the e5 site is not directed by residue 9 of the recognition helix. We therefore tried to assess whether the recognition helix of Prd was mediating binding to the eve promoter. For this purpose, we engineered a construct with a recognition helix dramatically changed at three positions of helix 3 (isoleucine to aspartic acid at position 4, tryptophan to glutamic acid at position 7, and phenylalanine to alanine at position 6) (Figure lA, MP-IWF). These three positions are absolutely conserved in all homeodomains, and changing them is expected to make this helix inactive as a DNA binding domain. In particular, isoleucine at posi- Specificity 557 of Homeodomains eve G 7 A -- M2 Figure 5. Destruction MP-IWF of the Recognition control G “A------- M2 MLNVF control Helix of M2 DNAase I protection of (A) the NPs -containing fragment described in Figure 2 and (6) the e5 -containing in Figure 3. Protein concentrations and footprint procedures were as described in Figure 2. tion 4 is homologous to a residue that contacts helix 2 in the prokaryotic proteins, providing structural stability (Ptashne, 1988). We made these changes in the Prd mutant M2, which recognizes both the NPs and the e5 sequences (Figures lA, 2, and 4). As shown in Figure 5A, binding to NPB is indeed destroyed, but binding to the e5 sequence is not affected by these changes (Figure 58). This implies that some part of Prd outside the helix-turnhelix is able to bind to e5. We are currently further refining the location of this novel DNA binding region. The Prd Homeodomain Contains a DNA Binding Specificity Masked by the GTerminal Region of the Molecule We could make a Prd protein that binds to Ftz and not to Bed sites by making one substitution at residue 9 (M5, Figure 2) and another protein that binds to Bed and not to Ftz sites by placing a different amino acid at this position (B9, Figure 3). VW initially were unable to define a specificity for Prd itself that was dependent on the presence of a serine at position 9. However, by using a truncated version of the Prd protein that lacks the C-terminal region (PrdA; see Experimental Procedures for details), we were able to define a sequence specifically recognized by the Prd homeodomain. We used PrdA to search a complex DNA fragment from the eve promoter described mixture (the 50 kb lambda genome) which should by chance contain sequences approaching those actually recognized by the Prd protein in vivo (Ross and Landy, 1982; Desplan et al., 1985; Biedenkapp et al., 1988). We also searched the engrai/ed promoter, which is expected to contain Prd binding sites. To identify fragments containing binding sites for the PrdA protein, we used a precipitation assay based on our observation that most proteins overproduced in E. coli are partially insoluble but are still able to bind to DNA (Treisman and Despfan, 1989). We then used the protein extract to footprint the binding sites present in the precipitated fragments. From these sequences, we derived the Prd consensus sequence TTTGACGT. A cloned synthetic oligonucleotide containing three Prd consensus sequences (Prd3) exhibited a strong footprint with PrdA and not with Ftz, the M5 mutant (Figure 8A), or the full-length Prd protein. We therefore concluded that the Prd homeodomain was able to specify binding to the Prd3 sequence but that this activity was masked by the C-terminal part of the molecule (Figure 8A). To demonstrate further that the Prd homeodomain was responsible for the binding to the Prd3 sequence, we swapped the homeodomains of Prd and M5 into the Ftz protein (Ftz/PrdHD and Ftz/M5HD, respectively). Ftz/ Cdl 558 Prd 3 A Ftr G /n-.-----t Prd -- M5 - Figure 6. Binding of Prd Derivatives - Prd A control -_c =/a - Ftz MSHD - Ftz PrdHD - - Ftz M5HD ------ Ftz PrdHD Prd II to the Prd 3 Sequence (A) DNAase I protection of three copies of the Prd-specific sequence. Three repeats of the Prd consensus sequence TTTGACGT are indicated by arrows. PrdA has a deletion of the C-terminal part of the Prd protein. The FWM5HD and Ftz/PrdHD proteins contain the M5 or Prd homeodomain, respectively, inserted in place of the Ftz homeodomain. (6) DNAase I protection of the NPs sequence described in Figure 2. The protein concentrations were 10 and 40 ug. Footprint procedures were as described in Figure 2 PrdHD was able to bind to the Prd3 sequence while Ftz/M5HD was not (Figure 8A). The failure of Ftz/M5HD to bind to the Prd3 sequence demonstrates that this specificity is dependent on the serine at postion 9 of the recognition helix. Furthermore, unlike the case of Prd, the context of the Ftz protein was not able to inhibit the function of the Prd homeodomain. We also tested the ability of these proteins to bind to NPG. FtzIM5HD bound to NPs with a much higher affinity than Ftz/PrdHD (Figure 66) indicating that the specificity was indeed due to the recognition helix. However, both PrdA and FtzlPrdHD were able to bind weakly to the NPB sequence (Figure 6B), showing that the Prd homeodomain has a low affinity for the class I sequence. Discussion We have analyzed the DNA binding specificity of the homeodomain as part of our effort to understand its function in Drosophila developmental genes. Our approach was guided by the large body of work with the analogous helix-turn-helix motif present in prokaryotic DNA binding proteins. According to the current model based on the crystal structures of the lambda and 434 repressors and of the 434 Cro protein, as well as on molecular and genetic data (Ptashne, 1986; Anderson et al., 1987; Wharton and Ptashne, 1985, 1987; Wolberger et al., 1988; Aggarwal et al., 1988), the N-terminal part of the recognition helix (helix 3) fits into the major groove of DNA and the amino acids on one face of this helix establish hydrogen bonds and van der Waals contacts with the base pairs. The specificity-determining residues occupy positions 1 and 2 in the first turn of the recognition helix and positions 5 and 6 in the second turn. In the lambda and 434 repressors, the amino acid at position 9 (asparagine) participates only nonspecifically in the binding by interacting with the DNA backbone at positions that also contact the glutamines at the beginning of both helices 2 and 3 (Aggarwal et al., 1988; S. C. Harrison, personal communication). The Amino Acid at Position 9 of the Recognition Helix Is Essential for Specificity The homeodomain was initially expected to bind to DNA in a conformation similar to that described for prokaryotic proteins. Indeed, homeodomains of class I, which have identical recognition helices, recognize the same sequence (Desplan et al., 1988; Hoey and Levine, 1988), consistent with the idea that the helix-turn-helix is important in generating their specificity. Homeodomain proteins from more divergent classes, such as Prd (class IV), do not bind to this sequence (Hoey and Levine, 1988; Figure 2). Alternative consensus sequences have been determined for Bed (TCXAATCCC; Driever and Niisslein-Volhard, 1989) and for the POU homeodomain proteins (e.g., Specificity of Homeodomains 559 octamer sequence ATGCAAAT for Ott-l/OTF-1) (Clerc et al., 1988; Herr et al., 1988; Ko et al., 1988; Scheidereit et al., 1988; Sturm et al., 1988). It has also been reported that class I proteins are able to bind to repeats of the trinucleotide TAA and to activate transcription of genes linked to this sequence (Beachy et al., 1988; Krasnow et al., 1989; Winslow et al., 1989). The results we have presented here demonstrate that the specificity of a homeodomain is critically dependent on the nature of the amino acid at position 9 of its recognition helix and that the protein-DNA contacts are therefore different from those of the prokaryotic proteins. If amino acid 9 is a glutamine (as in Ftz, Zen, and the M2, M4, M5, M8, and M7 mutants), the homeodomain will bind to the NP sequence (Figure 2). Another example of a protein with a glutamine at position 9 is the Xenopus protein Mix, which has a homeobox otherwise homologous to Prd (Rosa, 1989). This protein also binds to the NP sequence (J. T., unpublished data). When a lysine occupies position 9 (as in Bed or the B9 mutant), the protein will recognize the Bed sites (Figure 3A). In addition, the product of the cut gene (Blochlinger et al., 1988) which has a histidine at this position, does not bind to the NP sequence (J. T., unpublished data). Helix 2 does not play a significant role in the specificity and probably has only a structural role, as in most prokaryotic proteins. An extensive swap of the recognition helix or of the entire helix-turn-helix between Prd and Ftz (excluding position 9) results only in weak binding of the mutants Ml and M3 to NP6, with an affinity at least 2CMold lower than when position 9 is a glutamine. This shows that the other amino acids of the helix-turnhelix may also exert some weak influence on the specificity of the homeodomain. This is not surprising, since it is unlikely that a single amino acid interacts with all the base pairs of a 9-10 bp consensus sequence. The POU homeodomain proteins are very divergent and appear to require regions outside the homeodomain to bind to DNA (Herr et al., 1988). All POU homeodomains have a cysteine at position 9 (He et al., 1989), but replacement of the Prd recognition helix with that of OCT-2/OrF-2, including a cysteine at position 9, did not allow recognition of the octamer sequence from the histone H2b promoter (J. T, unpublished data). Therefore, the specificity determinant of the POU homeodomains does not reside entirely in residue 9 of the recognition helix. It seems that the recognition helix of the homeodomain does not interact with the major groove of DNA in the same way as the lambda or 434 repressor. The region just C-terminal to position 9, which is the most highly conserved part of the homeodomain, contains a stretch of 4 or 5 lysines or arginines. This region forms a helical C-terminal extension of the recognition helix (Otting et al., 1988). It is possible that this very basic region can interact closely with the phosphates on the DNA backbone, tilting the recognition helix to bring the amino acid at position 9 closer to the major groove than in the 434 repressor. The importance of amino acid 9 of the recognition helix has also recently been reported by Hanes and Brent (1989) from their analysis in yeast of transcriptional activation by the products of IexA-bed fusion genes carrying mutations in the recognition helix. Our independent observations with Prd, Ftz, and Bed generalize the rule and demonstrate that the results can be attributed directly to DNA binding and not to interactions with the transcriptional machinery. Together, the two sets of results support the model that amino acid 9 of the recognition helix is the major determinant of the specificity of the bona fide homeodomain proteins. Prd Has a Novel DNA Binding Function We have shown that two different specificities can be created in the Prd homeodomain by substituting two different amino acids at position 9 of its recognition helix. We have also shown that the Prd homeodomain itself, which contains a serine at this position, is capable of binding to a third sequence, TTTGACGT. However, its binding appears to be masked by the C-terminal part of the Prd protein, which contains the so-called Prd repeat (Bopp et al., 1988). A similar effect has been described for the glucocorticoid receptor, which becomes a constitutive DNA binding protein when the C-terminal steroid binding domain is deleted (Godowski et al., 1987; Picard et al., 1988). We should note that the C-terminal part of Prd does not affect the binding of the Prd mutants M2 and M5 to the NP sites or of B9 to the Bed sites. Deletion of the Prd repeat region also allows the Prd protein to bind with low affinity to the NP sites, indicating that there is some crossreactivity between the various classes of homeodomain proteins. Similarly, Han et al. (1989) have shown that Prd can activate transcription of a promoter containing sites closely related to the NP sequence. It has also been reported that the homeodomain proteins Ultrabithorax and Abdominal-B are able to weakly activate transcription from a promoter containing octamer sequences, which are normally recognized by the POU homeodomain protein OTF-2 (Thali et al., 1988). Our results suggest that a serine at position 9 makes Prd more susceptible to interactions with the rest of the protein than glutamine or lysine at this position. The data presented here also show that Prd binding to the e5 site is not mediated by the helix-turn-helix motif. The evidence is that substitutions in the Prd helix-turnhelix motif that created a new specificity did not affect the ability of the mutant proteins to bind to a sequence in the evB promoter (Figure 4). More dramatically, replacing the three most conserved amino acids in the recognition helix with nonconservative amino acids also did not diminish this binding (Figure 5B). We therefore suggest that the Prd protein contains two distinct DNA binding domains. Eve is another example of a homeodomain with dual specificity (Hoey and Levine, 1988). Its binding to the e4 and e5 sites is similarly influenced by interaction with regions of the protein outside the homeodomain (Hoey et al., 1988). It has also been reported that the yeast HAP-l protein has two different DNA binding specificities that can be distinguished by mutations. However, the mechanisms of its differential binding are not understood (Pfeifer et al., 1987; L. Guarente, personal communication). Thus in addition to the specificity directed by residue 9 of the recognition helix, individual homeodomain proteins may have other Cdl 580 complexities in their interaction with DNA, as we show here for the two DNA binding specificities of Prd, one of which can interact with the rest of the protein. This could provide an explanation for the conservation throughout evolution of the differences between the various classes of homeodomains outside the recognition helix. Experimental Procedures Plasmids The pARPrd and pARZen expression plasmids were the gift of Tim Hoey and Mike Levine (Hoey and Levine, 1988); the pGEMf1 Ftz expression plasmid was provided by Henry Krause (Krause et al., 1966). The origin of replication of fl was cloned into the EcoRl site of pARPrd as an EcoRl fragment obtained from the plasmid pAREn.oriCla (Heitman et al., 1989). This construct packages the sense strand of theprd DNA. The plasmid ~273, containing the e4 and es sites, was provided by Mike Levine, and the pEE.BlgOO plasmid, containing the Bed Al, A2, and A3 sites, was provided by Ulrike Gaul and Herbert Jackie. PrdA was constructed by removing a 923 bp BamHl fragment (starting 128 bp downstream of the 3’end of the homeobox and ending after the stop codon) in the pARPrd vector. The ends were filled in with Klenow poly merase and the plasmid was religated. The protein produced from this plasmid has a deletion of the last 298 amino acids and an addition of 26 amino acids from an open reading frame after the stop codon. To construct the Ftz/PrdHD and FtzlM5HD plasmids, the Ftz homeodomain was removed from the pGEMfl-ORI plasmid by cutting out a 267 bp Nael fragment. A 231 bp Ddel fragment containing the PAR-Prd or M5 homeodomain was blunt-ended and inserted in the resulting plasmid. The Prd3 plasmid was obtained by cloning the oligonucleotide GGGAATTCAGCTGGATCCTTTGACGIACTTTGACGTGATTTGACGTAATGAGTCGAC in the Smal site of the pBSM13+ plasmid (Stratagene). Mutagenesls Single-stranded uridine-containing pARPrd DNA was prepared from E. coli BW313 (dut- ung thi-I r&l spoT//F’/ysA) according to the method of Kunkel(l985) but omitting thymidine and deoxyadenosine from the medium and growing phage for only a single cycle. Kinased oligonucleotides were annealed to this template in 20 ul of 20 m M Tris (pH 7.5), 10 m M MgCIP. 56 m M NaCI, and these primers were extended for 90 min at 37% in 100 ul of 20 m M Tris (pH 7.5) 5 m M DTT, 10 m M MgCI,, 1 m M ATR 0.5 m M each of dATR dGTR dCTP and dTTR with 1 U of the Klenow fragment of DNA polymerase I and 200 U of T4 DNA ligase. After phenol extraction and ethanol precipitation, the DNA was methylated according to the procedure of Horton and Lord (1986) using 50 m M Tris (pH 7.5) 5 m M MgClp, 1 m M DTT as the reaction buffer. It was then used to transform E. coli DH5aF’ (Bethesda Research Laboratories). Colonies were screened by hybridization to the radioactively labeled mutagenic oligonucleotide (Maniatis et al., 1982). Positives were confirmed by dideoxy sequencing (Sanger et al., 1977). The sequences of the oligonucleotides used for mutagenesis were: Ml: CGCCGGTTGCTGAACCAIAICTTGATXGTECTCCGTGAGATTGGTG M2, M4: GCTGCTTGCGGAGACGGGCCCGCCGGTTTTGAACCATAT CTTG, creating an Apa I site M3: GTCTCTCCGTGAGAC~&GCGCTGGGCCAGCTC~ACGCGTGTAGATATCAGGGTATTGG, creating Mlul and Aflll sites MS: GCTGCTTGCGGAGACGGGCCCGCCGGTTT~GAACCACACCTGGATGCG, creating an Apa I site M6: CCATATCTTGAT~GTCTCTCCGTGAG M7: GGTTTTGGAACCATATCTGGATTTGTCTCTC MCTF: GGAGACGAGCACGCCGGTTGCAGAACCACACCCGGATG~~C~CGTGAGATTGGTGCGCTGGG B9: GAGCACGCCGGTT_CITGAACCACACCTGG MP-IWF: GCCGGTTTTGGGCCTCTATCTTG~TTGTCTCTCCGTGAGATTGG. Nucleotides altering the sequences are underlined. Protein Praparatlon Crude bacterial extracts were prepared from E. coli BL21(DE8) contain- ing each expression construct or the pAR3040 vector alone (Studier and Moffatt, 1966) as a control, according to the method of Hoey and Levine (1988). except that all extracts were initially resuspended in buffer 2 containing 4 M guanidine-HCI and 1 m M EOTA without MgCl2 and were renatured by dialysis against the same buffer, first containing 1 M guanidine-HCI and then without guanidine-HCI. DNAese I Footprint Reactlons DNA fragments were end-labeled with T4 polynucleotide kinase, and 1-5 x IO4 cpm was used per lane. The footprint procedure used is described in Desplan et al. (1988) except that the final DNAase I concentration was either 12.5 or 25 nglml. Binding to the class l-specific sites (NPs) was shown by footprinting approximately 2 ng of a 125 bp EcoRI-Hindlll fragment containing the NPs sequence, 5’ end-labeled at the EcoRl site of the M13mp18 polylinker (Desplan et al., 1988). For analysis of binding to Bed sites, a 222 bp Neil-Nsil fragment of the plasmid pE8.81000 (Tautr et al., 1987) containing the Bicoid Al site from the hunchback promoter, was end-labeled at the Neil site and footprinted. Binding to the ea site was tested by footprinting a 280 bp Xhol-Hindlll fragment of the eke promoter-containing plasmid ~273 (Hoey and Levine, 1986) end-labeled at the Xhol site. To footprint the Prd3 repeats, the Prd3 plasmid was end-labeled at the Asp718 site and recut with Hindlll. Acknowledgments We are very grateful to Tim Hoey and Mike Levine for the gift of the pARZen, pARPrd, and ~273 plasmids as well as for fruitful discussions and encouragement. We thank Henry Krause for the gift of the pGEMfl vector, Ulrike Gaul and Herbert Jackie for providing plasmid pEE.BlOOO (Tautz et al., 1987) and Wolfgang Driever and Christiane Nt’isslein-Volhard for the Bed plasmid and for useful discussions about the Bed binding sites. We also want to thank Ana Maria Bravo-Angel for her enthusiasm in starting the analysis of the Prd protein as a summer student. Cheryl Fleisher has provided excellent technical assistance. Steve DiNardo, Elettra Ronchi, and Scott Dougan made help ful comments on the manuscript. We also thank the DiNardo and Desplan laboratories for constant help, interactions, and encouragement. J. T. is supported in part by the Lucille Markey Charitable Trust. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Received July 12, 1969; revised September 8, 1989. References Aggarwal, A. K., Rodgers, D. W., Drottar, M., Ptashne, M., and Harrison, S. C. (1988). Recognition of a DNA operator by the repressor of phage 434: a view at high resolution. Science 242, 699-907. Anderson, J. E., Ptashne, M., and Harrison, S. C. (1987). Structure of the repressor-operator complex of bacteriophage 434. Nature 326, 846-852. Bass, S., Sorrells, V, and Youderian, P (1988). Mutant Trp repressors with new DNA-binding specificities. Science 242, 240-245. Beachy, F! A., Krasnow, M. A., Gavis, E. R., and Hogness, D. S. (1988). An U/frabifhomx protein binds sequences near its own and the Anrennapedia Pl promoters. Cell 55, 1069-1061. Berleth, T., Burri, M., Thoma. G., Bopp, D., Richstein, S., Frigerio, G., Nell, M., and Nusslein-Volhard, C. (1988). The role of localization of bicoid RNA in organizing the anterior pattern of the Drosophila embryo. EMBO J. 7; 1749-1756. Btedenkapp. H., Borgmeyer, U., Sippel, E. L.. and Klempnauer, N.-H. (1968). Viral rnyb oncogene encodes a sequence-specific DNAbinding activity. Nature 335, 635-837 Blochlinger, K., Bodmer, R.. Jack, J., Jan, L. Y., and Jan, Y. N. (1988). Primary structure and expression of a product from cur, a locus involved in specifying sensory organ identity in Drosophils. Nature 333, 629-635. Bopp, D., Burri, M., Baumgartner, S., Frigerio, G., and Nell. M. (1986). Specificity 561 of Homeodomains Conservation of a large protein domain in the segmentation gene related genes in Drosophila. Cell 47, 1033- paired and in functionally 1049. Chowdhury, K., Deutsch, U., and Gruss, P (1967). A multigene family encoding several “finger” structures is present and differentially active in mammalian genomes. Cell 48, 771-778. Clerc, R. G., Corcoran, L. M., LeBowitz, J. l-f., Baltimore, D., and Sharp, P A. (1968). The B-cell-specific Ott-2 protein contains POU box- and homeo box-type domains. Genes Dev. 2, 1570-1581. Desplan, C., Theis, J., and O’Farrell, P H. (1965). The Drosophile developmental gene engfailed encodes a sequence specific DNA binding activity. Nature 318, 630-635. Desplan, C., Theis, J., and D’Farrell, P H. (1966). The sequence specificity of homeodomain-DNA interaction. Cell 54, 1081-1090. Deutsch, U., Dressler, G. R., and Gruss, P (1968). Pax 1, a member of a paired box homologous murine gene family, is expressed in segmented structures during development. Cell 53, 617-625. Doyle, H. J., Harding, K., Hoey, T., and Levine, M. (1986). Transcripts encoded by a homeo box gene are restricted to dorsal tissues of Drosophile embryos. Nature 323, 76-79. at 2.5 A resolution: details of the repressor-operator Science 242, 693-899. interactions. Ko, H.-S., Fast, f?, McBride, W., and Staudt, L. M. (1988). A human protein specific for the immunoglobulin octamer DNA motif contains a functional homeobox domain. Cell 55, 135144. Krasnow, M. A., Saffman, E. E., Kornfeld, K., and Hogness, D. S. (1969). Transcriptional activation and repression by Ukrabithorax proteins in cultured Drosophila cells. Cell 57, 1031-1043. Krause, H. M., Klemenz, R., and Gehring, W. J. (1988). Expression modification and localization of the fushi tarazu protein in Dmsophile embryos. Genes Dev. 2, 1021-1036. Kunkel, T. A. (1965). Rapid and efficient site-specific mutagenesis without phenotypic selection. Proc. Natl. Acad. Sci. USA 82, 488492. Laughon, A., and Scott, M. P (1984). Sequence of a DrosophL segmentation gene: protein structure homology with DNA binding proteins Nature 310, 25-31. Macdonald, f? M., Ingham. t? W.. and Struhl, G. (1986). Isolation, structure, and expression of even-skipped: a second pair-rule gene of Drosophila containing a homeo box. Cell 47, 721-734. Dressler, G. R., and Gruss, P (1988). Do multigene families regulate vertebrate development? Trends Genet. 4, 214-219. Maniatis, T., Fritsch, E. F, and Sambrook, J. (1982). Molecular Cloning: A Laboratory Manual (Cold Spring Harbor, New York: Cold Spring Harbor Laboratory). Driever, W., and Niisslein-Volhard, C. (1969). The bicoid protein is a positive regulator of hunchback transcription in the early Drosophila embryo. Nature 337, 136143. McGinnis, W., Levine, M., Hafen, E., Kuroiwa, A., and Gehring, W. J. (1984a). A conserved DNA sequence in homeotic genes of the Dro sophila Antennapedia and bithorax complexes. Nature 308, 428-433. Frigerio, G., Burri, M., Bopp, D., Baumgartner, S., and Nell, M. (1986). Structure of the segmentation gene paired and the Drosophila PRD gene set as part of a gene network. Cell 47, 735-746. McGinnis, W., Garber, R. L., Wirz, J., Kuroiwa, A., and Gehring, W. J. (1964b). A homologous protein-coding sequence in Drosophila homeotic genes and its conservation in other metazoans. Cell 37, 403406. Garcia-Bellido, A. (1975). Genetic control of wing disc development in Drosophila. Ciba Foundation Symp. 29, 161-182. Gehring, W. J. (1967). Homeo Science 236, 1245-1252. boxes in the study of development. Godowski, P, Rusconi, S., Miesfield, R., and Yamamoto, K. R. (1967). Glucocorticoid mutants that are constitutive activators of transcriptional enhancement. Nature 325, 365-368. Han, K., Levine, M.. and Manley, J. L. (1969). Synergistic activation and repression of transcription by Drosophila homeobox proteins. Cell 56, 573-583. Muller, M., Affolter, M., Leupin, W., Otting, G., Wiithrich, K., and Gehring, W. J. (1968). Isolation and sequence-specific DNA binding of the Antennapedia homeodomain. EMBO J. 7,4299-4304. Niisslein-Volhard, C., and Wieschaus, E. (1980). Mutations affecting segment number and polarity in Drosophila. Nature 287, 795-801. Nusslein-Volhard, C., Frohnhbfer, H. G., and Lehmann, R. (1987). Determination of anteroposterior polarity in Drosophila. Science 238, 1675-1681. Hanes, S. D., and Brent, R. (1969). DNA specificity of the Bicoid activator protein is determined by homeodomain recognition helix residue 9. Cell 57, 1275-1283. Otting, G., Qian, Y.-Q., Miiller, M., Affolter, M., Gehring. W. J., and Wiithrich, K. (1988). Secondary structure determination for the Antennapedia homeodomain by nuclear magnetic resonance and evidence for a helix-turn-helix motif. EMBO J. 7, 4305-4309. He, X., Treaty, M. N., Simmons, D. M., Ingraham, H. A., Swanson, L. W., and Rosenfeld, M. G. (1989). Expression of a large family of POU-domain regulatory genes in mammalian brain development. Nature 340, 35-42. Otwinowski, Z.. Schevitz, R. W., Zhang, R.-G., Lawson, C. L., Joachimiak, A., Marmorstein, R. Q., Luisi, B. F., and Sigler, t? B. (1988). Crystal structure of frp repressor/operator complex at atomic resolution. Nature 335, 321-329. Heitman, J., Treisman, J., Davis, N. G., and Russel, M. (1989). Cassettes of the 11 intergenic region. Nucl. Acids Res. 7Z 4413. Pabo, C. O., and Sauer, R. T. (1984). Protein-DNA Rev. Biochem. 53, 293-321. Herr, W., Sturm, R. A., Clerc, R. G., Corcoran, L. M., Baltimore, D., Sharp, t? A., Ingraham, H. A., Rosenfeld, M. G., Finney, M.. Ruvkun. G., and Horvitz, H. R. (1968). The POU domain: a large conserved region in the mammalian pit-l, ocf-1, o&2 and Caenorhabditis ekgens uric-86 gene products. Genes Dev. 2, 1513-1516. Hochschild, A., and Ptashne, M. (1986). Homologous interactions repressor and 1 Cro with the 1 operator. Cell 44, 925-933. of 5 Hochschild, A., Douhan, J., III, and Ptashne, M. (1986). How )i repressor and 5 Cro distinguish between 0~1 and On3. Cell 47, 807-816. Hoey, T., and Levine, M. (1988). Divergent homeo box proteins recognize similar DNA sequences in Drosophile. Nature 332, 856-861. Hoey, T., Warrior, R., Manak, J., and Levine, M. (1968). DNA-binding activities of the Drosophila melanogasrer even-skipped protein are mediated by its homeo domain and influenced by protein context. Mol. Cell. Biol. 8, 4596-4607. Horton, R. D., and Lord, S. T. (1986). In vitro methylation doubles the efficiency of oligonucleotide-directed mutagenesis. Nucl. Acids Res. 14. 5112. Ingham, P W. (1986). The molecular geneticsof mation in Drosophila. Nature 335, 25-34. embryonic pattern for- Jordan, S. R., and Pabo, C. 0. (1988). Structure of the lambda complex recognition. Annu. Pfeifer, K., Prezant, T., and Guarente, L. (1967). Yeast HAP1 activator binds to two upstream activation sites of different sequence. Cell 49, 19-27. Picard, D., Salser, S. J., and Yamamoto, K. R. (1988). A movable and regulable inactivation function within the steroid binding domain of the glucocorticoid receptor. Cell 54, 1073-1080. Ptashne, M. (1966). A Genetic Switch (Cambridge, Massachusetts, and Palo Alto, California: Cell Press and Blackwell Scientific Publications). Rosa, F. M. (1969). Mix.7, a homeobox mRNA inducible by mesoderm inducers, is expressed mostly in the presumptive endodermal cells of Xenopus embryos. Cell 57, 965-974. Ross, W., and Landy, A. (1982). Bacteriophage )L Int protein recognizes two classes of sequences in the phage attsite: characterization of armtype sites. Proc. Natl. Acad. Sci. USA 79, 7724-7728. Sanger, F., Nicklen, S., and Coulson, A. R. (1977). DNA sequencing with chain-terminating inhibitors. Proc. Natl. Acad. Sci. USA 74, 5463-5467. Scheidereit, C., Cromlish, J. A., Gerster, T., Kawakami, K., Balmaceda, C.-G., Currie, R. A., and Roeder, R. G. (1988). A human lymphoid- specific transcription factor that activates immunoglobulin homeobox protein. Nature 336, 551-557. genes in a Scott, M. P, and Carroll, S. B. (1987). The segmentation and homeotic gene network in early Drosophila development. Cell 57, 689-698. Scott, M. P, and O’Farrell, P. H. (1986). Spatial programming of gene expression in early Drosopbile embryogenesis. Annu. Rev. Cell Biol. 2, 49-80. Scott, M. P, and Weiner, A. J. (1984). Structural relationships among genes that control development: sequence homology between the Antennapedia, Ultrabithorax, and fushi tarazu loci of Drosophila. Proc. Natl. Acad. Sci. USA 81, 4115-4119. Scott, M. f?, Tamkun, J. W., and HartzelI, G. W. (1989). The structure and function of the homeodomain. Biochim. Biophys. Acta, in press. Studier, F. W., and Moffatt, 8. A. (1986). Use of bacteriophage T7 RNA polymerase to direct selective high-level expression of cloned genes. J. Mol. Biol. 189, 113-130. Sturm, Ft. A., Das, G., and Herr, W. (1988). The ubiquitous octamerbinding protein Ott-1 contains a POU domain with a homeo box subdomain. Genes Dev. 2, 1582-1599. Tautz, D., Lehmann, R., Schnurch, H., Schuh, R., Seifert. E., Kienlin, A., Jones, K. and Jackie, H. (1987). Finger protein of novel structure encoded by hunchback, a second member of the gap class of Drosophila segmentation genes. Nature 327, 383-389. Thali, M., Miiller, M M., Delorenzi, M., Matthias, P, and Bienz, M. (1988). Drosophila homeotic genes encode transcriptional activators similar to mammalian DTF-2. Nature 336, 598-601. Treisman, J., and Desplan, C. (1989). The Drosophile ucts Kriippel and Hunchback bind to the hunchback ture, in press. gap gene prodpromoters. Na- Wharton, R. P, and Ptashne, M. (1985). Changing the binding specificity of a repressor by redesigning an alpha-helix. Nature 316, 601-605. Wharton, R. P.. and Ptashne, M. (1987). A new-specificity mutant of 434 repressor that defines an amino acid-base pair contact. Nature 326, 888-891. Winslow, G. M., Hayashi, S., Krasnow, M., Hogness, D. S., and Scott, M. P (1989). Transcriptional activation by the Antennapedia and fushi tarazu proteins in cultured Drosophila cells. Cell 57, 1017-1030. Wolberger, C., Dong, Y., Ptashne, M., and Harrison, S. C. (1988). Structure of a phage 434 Cro/DNA complex. Nature 335, 789-795