* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download The trimethoprim-resistant dihydrofolate reductase associated with
Endogenous retrovirus wikipedia , lookup
Non-coding DNA wikipedia , lookup
Real-time polymerase chain reaction wikipedia , lookup
Promoter (genetics) wikipedia , lookup
Metalloprotein wikipedia , lookup
Gene expression wikipedia , lookup
Two-hybrid screening wikipedia , lookup
Molecular cloning wikipedia , lookup
Genomic library wikipedia , lookup
Western blot wikipedia , lookup
Enzyme inhibitor wikipedia , lookup
Proteolysis wikipedia , lookup
Vectors in gene therapy wikipedia , lookup
Agarose gel electrophoresis wikipedia , lookup
Silencer (genetics) wikipedia , lookup
Gel electrophoresis of nucleic acids wikipedia , lookup
Restriction enzyme wikipedia , lookup
Protein structure prediction wikipedia , lookup
Gel electrophoresis wikipedia , lookup
Deoxyribozyme wikipedia , lookup
Genetic code wikipedia , lookup
Nucleic acid analogue wikipedia , lookup
Point mutation wikipedia , lookup
Community fingerprinting wikipedia , lookup
Amino acid synthesis wikipedia , lookup
Artificial gene synthesis wikipedia , lookup
volume 9 Number 31981 Nucleic A c i d s Research Characterization of a R plasmid-associated, trimethoprim-resistant dihydrofolate reductase and determination of the nucleotide sequence of the reductase gene J.Wemer Zolg1 and Urs J.Hanggi Institut fiir Physiologische Chemie, Physikalische Biochemie und Zellbiologie der Univeisitat, Goethestr. 33, 8000 Miinchen 2, GFR Received 17 November 1980 ABSTRACT The trimethoprim-resistant dihydrofolate reductase associated with the R plasmid R388 was isolated from strains that overproduce the enzyme. It was purified to apparent homogeneity by affinity chromatography and two consecutive gel filtration steps under native and denaturing conditions. The purified enzyme is composed of four identical subunits with molecular weights of 8300. A 11OO bp long DNA segment which confers resistance to trimethoprim was sequenced. The structural gene was identified on the plasmid DNA by comparing the amino acid composition of the deduced proteins with that of the purified enzyme. The gene is 234 bp long and codes for 78 amino acids. No homology can be found between the deduced amino acid sequence of the R388 dihydrofolate reductase and those of other prokaryotic or eukaryotic dihydrofolate reductases. However, it differs in only 17 positions from the enzyme associated with the trimethoprimresistance plasmid R67. INTRODUCTION Trimethoprim prevents the bacterial production of tetrahydrofolate coenzymes by inhibiting the enzyme dihydrofolate reductase (2). The resulting deficiency of tetrahydrofolates leads to impaired protein, RNA, and DNA synthesis and eventually to cell death (3). The production of tetrahydrofolates is not inhibited in bacteria which harbor trimethoprim-resistance plasmids. These cells contain additional dihydrofolate reductases that are insensitive to the drug (4-6). Gene dosage experiments and investigations on the production of the plasmid-associated enzymes in minicells have shown that they are encoded on the plasmid DNA (7-9) . Several trimethoprim-resistance plasmids have been mapped by in vitro recombinant DNA techniques (7-11). These studies have shown that the genes for the resistant dihydrofolate reductases V) IRL Press Limited. 1 Falconberg Court, London W 1 V 5FG. U.K. 697 Nucleic Acids Research of the plasmids R483 and R67 reside on DNA fragments of about 41OO bp and 24OO bp length (9) and that of the plasrnid R388 on a DNA segment of less than 1200 bp length (7). In the present study the nucleotide sequence of the latter DNA segment was determined. The resulting amino acid sequences were compared to the amino acid composition of the isolated enzyme. The comparison made it possible to identify the structural gene on the DNA segment, to analyze the organization of the regulatory elements, and to deduce the amino acid sequence of the enzyme. MATERIALS AND METHODS Chemicals and enzymes. Acrylamide, 2 times crystallized,was obtained from Serva, Heidelberg. Piperidine, dimethylsulfate, and guanidinium chloride were from E. Merck, Darmstadt, hydrazine from Roth, Karlsruhe, lyophilyzed alkaline phosphatase (calf intestine), T4 polynucleotide kinase, and NADPH from Boehringer, Mannheim, and 3-(4,5-dimethylthiazolyl-2-)-2,5-diphenyltetrazolium bromide from Sigma, Miinchen. Matrex Gel Red A was purchased from Amicon, Witten. All other chemicals used were as described (7) . Dihydrofolate was prepared according to (12) . Purification of dihydrofolate reductase. E.coli C carrying PWZ82O or pWZ7O3 (7) was grown in a fermenter in L-broth to a density of 2-5.10 cells/ml. About 6O g of cells were broken up in a French-press in 2OO-3OO ml of 1O mM Tris-HCl, pH 8.0 containing 1 mM mercaptoethanol and 1 mM EDTA (TME-8 buffer). After high speed centrifugation (45 min at 75OOO x g, O C) solid ammonium sulfate was added first to 2 M, then to 3.6 M. The 2 M precipitate was discarded. The 3.6 M ammonium sulfate precipitate was dissolved in 100-200 ml of TME-8 and applied to a Sephadex G-7 5 column (10x60 cm) equilibrated with TME-8. 20 ml fractions were collected. The fractions containing the trimethoprimresistant dihydrofolate reductase were pooled and precipitated by adding ammonium sulfate to 3.6 M, the pellet was dissolved in 50 ml of 10 mM Tris-HCl, pH 9.0 containing 1 mM mercaptoethanol, 1 mM EDTA, and 1O mM NaCl (TMENa-9 buffer), dialyzed against 3 times 1 1 of TMENa-9, and applied to a Matrex Gel Red A column (2.5x30 cm) equilibrated with TMENa-9. The column was eluted with the same buffer without using a gradient. The two trimetho- 698 Nucleic Acids Research prim resistant activity peaks (see Results) were pooled and concentrated to 10 ml. 7.6 g of guanidinium chloride and 1O ul of concentrated mercaptoethanol were added, and the mixture was held at 100 C for 5-10 min. After cooling it was applied to a Sephadex G-150 column (3.5x30 cm) which previously was equilibrated with 10 fold concentrated TMENa-9 containing 6 M urea. Elution was with the same buffer mixture. Fractions of 5 ml were collected. The fractions containing dihydrofolate reductase were freed of urea by gel filtration in Sephadex G-2 5 (3.5x30 cm) equilibrated with 1O mM Na-phosphate, pH 6.8 containing 1 mM mercaptoethanol. The enzyme pool was concentrated to 0.5 ml by flash evaporation and rechromatographed on a small column (1.5x15 cm) of Sephadex G-2 5 in the phosphate buffer mentioned above. The most active fractions were pooled and stored at -20 C. Dihydrofolate reductase activity was assayed according to Ref. 13. 2 nM trimethoprim was included in the assay mixture whenever the trimethoprim-resistant activity had to be distinguished from the trimethoprim-sensitive E.coli reductase activity. Molecular weight determinations. The subunit molecular weight of the purified R388 dihydrofolate reductase was determined by electrophoresis on 7% and 10% polyacrylamide gels containing 0.2% SDS (14). The gels were calibrated with insulin chain B (Mr 34OO), aprotinin (Mr 6500), cytochrorae C (Mr 12 5O0), and soybean trypsin inhibitor (M 215OO) (Combithek I, Boehringer). The molecular weight of the purified native enzyme was determined by gel filtration in Sephadex G-200 superfine. The column (1x100 cm) was equilibrated with 1O mM Tris-HCl, pH 8.O, containing 10 mM mercaptoethanol and 1 mM EDTA. It was calibrated with bovine serum albumin (M 68OOO), ovalbumin (M 4500O), chymotrypsinogen A (M 25000), and cytochrome C (M 12500) (Combithek II, Boehringer). Gel electrophoresis. Non-denaturing polyacrylamide gels were prepared and stained for proteins or dihydrofolate reductase activity according to Ref. 15. The conditions used for separating DNA fragments on agarose gels were as described (7). DNA fragments were isolated from agarose by the glass powder adsorption method (16) . The sizes of the fragments were determined by coelectrophoresis with pBR322 DNA cleaved with either Sau3A or Hpall• 699 Nucleic Acids Research Restriction enzyme analysis. DNA was isolated and cleaved with restriction enzymes as previously described (7) . The restriction nucleases BamHl, EcoRl, Sau3A and Haelll were kindly donated by T. Igo-Kemenes. Alul, Pstl, Hhal, Hinfl, Sau96l, and TagI were gifts of R.E. Streeck. Hinfl was prepared by chromatography on DEAE-cellulose and heparin-agarose, Sau96l and TagI according to Ref. 17. The latter enzyme was used in a buffer containing 6 mM Tris, pH 7.4, 6 mM MgCl2» and 0.6 mM dithiothreitol at 50 C. Hpall was prepared by H. Feldmann, and Hindu was obtained from Boehringer, Mannheim. Terminal labelling and seguence analysis. 2O-5O ug of DNA weBB cleaved with restriction nucleases and the resulting mixture of fragments was treated with phosphatase prior to the separation of the fragments by electrophoresis in polyacrylamide gels. The fragments were recovered from the gels by adding 2 volumes of 1 M NaCl to the gel slices and by passing the gelNaCl mixture through a syringe. The resulting paste was incubated overnight at 37 c and centrifuged. The supernatant was filtered through Whatman 3MM paper. The isolated fragments were denatured, labelled at their 5' ends with polynucleotide kinase, renatured, and cleaved according to Maxam and Gilbert (18) with minor modifications (19). The reactions specific for C, C+T, A > C, and G were performed according to Ref. 18, and that for A+G according to Ref. 20. Separation of the partial fragments was on 8% or 2O% polyacrylamide gels in 8.3 M urea. Autoradiography was at -70 C with and without intensifying screens (SE6, Cawo, Schrobenhausen). Methylated cytosines were detected according to Ref. 21. RESULTS Purification and characterization of the R388 dihydrofolate reductase. In a previous study we had found that dihydrofolate reductase is stable under denaturing conditions and that the level of the enzyme is considerably increased in bacteria which carry multiple copies of the reductase gene (7). These two observations were used as part of a novel purification scheme. Starting from cell extracts of bacteria with increased dihydrofolate levels the enzyme was partially purified by ultracentri- 700 Nucleic Acids Research fugation, ammonium sulfate precipitation and gel filtration on Sephadex G-7 5 as previously described (7) . Further purification was achieved by chromatography on triacyl dye agarose (Procion Red HE 3B, Amicon Matrex Gel Red A) which selectively adsorbs NADP -dependent enzymes (22). Preliminary experiments had shown that the R388 dihydrofolate reductase was retained by the gel. In order to purify the enzyme on a preparative scale, pH and buffer compositions were chosen which allowed the separation of the bulk of the non-dihydrofolate reductase proteins from the enzyme without using a salt gradient (Fig. 1A) . 1.5 U of enzyme were retained per ml of affinity gel. Chromosomal dihydrofolate reductase is strongly absorbed and is only eluted at high salt concentrationsThe R388 dihydrofolate reductase displayed a two-peak elution pattern on the Procion Red affinity column (Fig. 1A). However, since no difference between the two fractions could be detected on SDS gels, both were combined. To remove remaining 05 - SO * • _ • 400 1/ • 02 > 01 E i : 100 500 U/ml 1 \ / / " 11 1 / B E < < - 300 200 ACIiv • y a 10 t] - too ..•••••• * A "*— IS E 0 l»> 200 Elulion v o l u m e , ml 2S0 100 SO 100 150 200 250 300 Etulion volume,ml Figure 1. (A). Affinity chromatography of R388 dihydrofolate reductase. The concentrated and dialyzed Sephadex G-7 5 pool was chromatographed on Procion Red agarose as described in Materials and Methods. Aliquots of the eluate were assayed for absorbance at 280 nm ( A — A ) and for activity (•--•). The bar indicates the fractions which were pooled. (B).Gel filtration under denaturing conditions. The enzyme fractions of the affinity gel were denatured and chromatographed on Sephadex G-150 in the presence of 6 M urea as described in Materials and Methods. Aliquots of 1-1O nl of the eluate were assayed for activity without removing the urea. 701 Nucleic Acids Research non-reductase contaminants, which in their native conformation had copurified with the native R388 dihydrofolate reductase on the first Sephadex G-7 5 column, the pooled enzyme fractions were denatured by boiling in 6 M guanidinium chloride and chromatagraphed in Sephadex G-150 in the presence of 6 M urea (Fig. IB) . The R388 dihydrofolate reductase disaggregated under these conditions and was well separated from the larger, non-dihydrofolate reductase proteins. The fractions containing dihydrofolate reductase were freed of urea by gel filtration on Sephadex G-5O, concentrated, and rechromatographed on a small column of Sephadex G-50. The resulting enzyme was more than 95% pure as judged by electrophoresis on SDS gels and the overall yield was between 15-2o% in the different preparations. The specific activity was about 1.5 U/mg of protein (Table 1 ) . The molecular weight of the purified, catalytically active enzyme was estimated by gel filtration on Sephadex G-2OO. R388 dihydrofolate reductase eluted as a single protein and activity peak in a position corresponding to a molecular weight of about 3600O. A single protein band was also seen on SDS gels. However, its molecular weight was only about 8400. Since the band showed weak dihydrofolate reductase activity after prolonged staining for the enzyme, it was concluded that the native enzyme is composed of four identical subunits with molecular weights of 8400. A similar subunit structure has been found for the R67 dihydrofolate reductase (23). Table 1. Purification of R388 dihydrofolate reductase Fraction Volume Protein Acticoncentration vity Total activity Specific activity ml Crude extract 3OO (NH4)2SO4 precipitate 17O Sephadex G-7 5 pool 26O Procion Red agarose 1OO Sephadex G-5O 5 702 10 0.77 0.42 0.93 157 430 136O Nucleic Acids Research The amino acid composition of the purified enzyme is shown in Table 2. The calculated molecular weight (8O2O) is in close agreement with the value determined from SDS gels. Identification of the coding sequence of the dihydrofolate reductase gene. Our previous analysis of the trimethoprim-resistance gene of R388 has shown that the plasmid-induced dihydrofolate reductase is encoded on a DNA segment of less than 12OO bp length (7) . Further cloning experiments (not shown) suggested that the entire segment might be essential for proper expression of the gene. Therefore the DNA was cleaved with various restriction nucleases and sequenced by the Maxam and Gilbert technique. The restriction map and the sequence strategy Table 2. Amino acid composition of R388 dihydrofolate reductase. About O.6 nmoles of purified enzyme were hydrolyzed for 30 min at 160 in 5.7 N HCl as described by Wachter and Werhahn (24) . Amino acid Cys Asx Asp Asn Met Thr c) d) Ser Glx nmoles found/nmol of enzyme a ' suggested theoretical content °> content 1 O.98 4.O6 — O.32 2.98 6.55 10.02 - Glu Gin Pro Gly Ala Val He Leu Tyr Phe His Lys Arg e Trp ' 2 1 — 2 1 1 3 7 5 6 5 8 11 8 1 5 3 2 1 1 3 3 2 3 3 2 4 — 1 3 7 10 4 9 11 6 1 5 4.06 8.9O 1O.68 6.21 1.19 5.O7 2.38 1.93 l.OO 3.23 2.56 1.58 3 M : 8O2O M ; 8266 r a) mean of two determinations; b) deduced from the nucleotide sequence shown in Fig. 3 and 4; c) Cystein was determined as cysteic acid; d) Methionine was determined as methionine sulfone; e) Tryptophan was determined according to Beaven and Holiday (2 5) r 703 Nucleic Acids Research are shown in Fig. 2. The sequence, 112 5 nucleotides long, was determined, about 80% of which was confirmed by sequencing both DNA strands. The whole nucleotide sequence is shown in Fig. 3. The GC-content is 60% between bases 1 and 400, 50% between bases 400 and 500, and 63% in the remaining part of the DNA. The overall GC-content is 56%. Three methylated cytosines were found at positions 234, 333, and 959. Eight open reading frames could be detected by locating the positions of the start and stop codons in the nucleotide sequence. Three have no stop codons within the sequenced part. The lengths of the proteins which are encoded in the open reading frames vary between 43 and 177 amino acids or more. In order to find the coding sequence of the dihydrofolate reductase gene, the amino acid compositions of all proteins that could be derived from the nucleotide sequence were compared to the composition of the isolated enzyme. Only a single protein had the composition, the length, and the characteristic His/Ile/Met/Cys (1:1:1:1), Leu/Ile (5:1), and Tyr/Trp (3:2) ratios of the isolated enzyme (cf. Table 2). Therefore it was concluded that this protein is the dihydrofolate reductase and that the corresponding nucleo- I — i w ~ V If i f—I "WV\ >t i n n Figure 2. Restriction map of the DNA segment carrying the R388 dihydrofolate reductase gene and strategy used for determining the nucleotide sequence. The numbers refer to the cleavage sites on the DNA sequence shown in Fig. 3. The directions in which the DNA fragments were sequenced and the distances are indicated by arrows. The reductase structural gene is delineated by the bar. 704 Nucleic Acids Research GGTCCCGATC CTTGGAGCCC TTGCCCTCCC GCACGATGAT CGTGCCGTGA TCGAAATCCA GATCCTTGAC CCGCAGTTGC AAACCCTCAC TGATCCGCAT 100 GCCCGTTCCA TACAGAAGCT GGGCGAACAA ACGATGCTCG CCTTCCAGAA AACCGAGGAT GCGAACCACT TCATCCGGGG TCAGCACCAC CGGCAAGCGC 200 CGCGACGGCC GAGGTCTTCC GATCTCCTGA AGCCAGGGCA GATCCGTGCA CAGCACCTTG CCGTAGAAGA ACAGCAAGGC CGCCAATGCC TGACGATGCG 300 TGGAGACCGA AACCTTGCGC TCGTTCGCCA GCCAGGACAG AAATGCCTCG ACTTCGCTGC TGCCCAAGGT TGCCGGGTGA CGCACACCGT GGAAACGGAT 400 GAAGGCACGA ACCCAGTTGA CATAAGCCTG TTCGGTTCGT AAACTGTAAT GCAAGTAGCG TATGCGCTCA CGCAACTGGT CCAGAACCTT GACCGAACGC 500 AGCGGTGGTA ACGGCGCAGT GGCGGTTTTC ATGGCTTGTT ATGACTGTTT TTTTGTACAG TCTAGCCTCG GGCATCCAAG CTAGCTAAGC GCGTTACGCC 600 GTGGGTCGAT GTTTGATGTT ATGGAACAGC AACGATGTTA CGCAGCAGGG TAGTCGCCCT AAAACAAAGT TAGGCAGCCG TTGTGCTGGT GCTTTCTAGT 700 AGTTGTTGTG GGGTAGGCAG TCAGAGTTCG ATTTGCTTGT CGCCATAATA GATTCACAAG AAGGATTCGA CATGGGTCAA AGTAGCGATG AAGCCAACGC 800 TCCCGTTGCA GGGCAGTTTG CGCTTCCCCT GAGTGCCACC TTTGGCTTAG GGGATCGCGT ACGCAAGAAA TCTGGTGCCG CTTGGCAGGG TCAAGTC6TC 900 GGTTGGTATT GCACAAAACT CACTCCTGAA GGCTATGCGG TCGAGTCCGA ATCCCACCCA GGCTCAGTGC AAATTTATCC TGTGGCTGCA CTTGAACGTG 1 000 TGGCCTAACA ATTCGCTCAG GACGGTTCGC CGCCCGCTGA GCTTTGTCGT TAGGCGTCAT GGGCTGACGC TCACAATCGA ACCAAGCGGC GAGAGTCGCG 1100 GGCACTGCGA CTGCTGTGGG AATTC Figure 3. Nucleotide sequence of the R388 DNA segment carrying the dihydrofolate reductase gene. The coding sequence for the R388 dihydrofolate reductase is enclosed. tide sequence is the structural gene for the enzyme. This sequence is enclosed in Fig. 3. Comparison of the amino acid sequence of the R388 and R67 dihydrofolate reductases. The complete amino acid sequence deduced from the nucleotide sequence of the R388 dihydrofolate 705 Nucleic Acids Research reductase gene is shown in Fig. 4. Extensive homologies can be found to the amino acid sequence of the dihydrofolate reductase isolated from the trimethoprim-resistance plasmid R67 (26). Both enzymes have exactly the same number of amino acid residues and 61 of the 78 amino acids are identical. The remaining 17 amino acids, for the most part, are conservative replacements, eight of which can be explained by transitions of single bases (amino acid residues 3,6,8,9,17,18, and 21; Fig. 4) and two by transversions of single bases (residues 49 and 61) . At least two substitutions are needed to explain the remaining 7 exchanges (residues 2,10,15,20,26,77, and 78). Organization of the dihydrofolate reductase gene. The dihydrofolate reductase gene (cf. Fig. 3) is only 234 bp long. Its promoter contains the sequence CATAAT (pos. 744-749) and GTTCGATTT (pos. 726-734) which differ only minimally from the -10 and -3 5 consensus sequences described by Rosenberg and Court (27). In analogy to other mRNAs by which the dihydrofolate' reductase mRNA is expected to start at the A at position 7 56, seven nucleotides downstream of the ubiquitous T of the -10 hexamer (pos. 749). Between the start of the mRNA and the AUG initia- > S 10 >S 20 ATG GTT CAA AGT AGC GAT GAA GCC AAC GCT CCC GTT GCA GGG CAG TTT GCG CTT CCC CTG Gly Gin Asp Olu Aim Asn Aim Pro Vml Aim O.V Olu Arg Asn Glu Vml Smr Am Pro Vml Aim Gly 25 30 35 <0 AGT GCC ACC TTT GGC TAA GGG GAT CGC GTA CGC AAG AAA TCT GGG GCC GCT TGG CAG GGT Aim Thr Phm Oly Oly Asp A,3 Vml Arg Lys Lyi Smr Oly Aim Aim Tip Oln Gly Aim Thr Phm Oly Oly Asp Arg Vml Arg Lys Lym Smr Oly Aim Aim Tip Oln Oly 45 50 55 SO CAA GTC GTC GGT TGG TAT TGC ACA AAA CTC ACT CCT GAA GGC TAT GCG GTC GAG TCC GAA ft 388 ft 67 Gin Vml oty Trp Tyr Cys Thr Lmu Thr Pro Glu Oly Tyr Aim Vml Olu Smr Olu Gin Vml Oly Trp Tyr Cyt Thr Lmu Thr Pro Olu Oly Tyr Aim Vml Clu Smr Glu 05 70 75 79 TCC CAC CCA GGC TCA GTG CAA ATT TAT CCT GTG GCT GCA CTT GAA CGT GTG GCC TAA R 388 ft67 Smi His Pro Oly Smr Vml Gin Urn T,r Pro Vml Aim Aim Lmu Olu Arg Aim His Pro Oly Smr Vml Gin Urn Tyr Pro Vml Aim Aim Lmu Glu Aig Figure 4. Amino acid sequence of the R388 dihydrofolate reductase. The sequence is compared to that of the R67 enzyme (26). Identical amino acids are enclosed. The number above the nucleotide triplets refer to the amino acid residues. 706 Nucleic Acids Research tion codon there is an untranslated region of 16 nucleotides which contains the sequence AAGGA (pos. 761-765) that could serve as a ribosome binding site (28) . The mRNA can form a hairpin structure in this region by pairing GGAUUC (763-768) to GAAGCC (pos. 790-795). The initiation codon thus would be exposed in a non-paired loop. Two GC-rich sequences with dyad symmetries are found at the 31 side of the gene: the sequence GCGTCA (pos. 1O54-1O59) and TGACGC (pos. 1O6 5-1O7O) and the sequence AGTCG (pos. 1094-1099) and CGACT (pos. 11O8-1112). Both inverted repeat structures resemble the terminator signals (27) and thus might be involved in termination of mRNA transcription. There are two stop codons in phase between the end of the coding sequence and those structures (TAA 1OO6-1OO8 and TAG 1O51-1O53) . DISCUSSION The amino acid sequence of the R388 dihydrofolate reductase shown in Fig. 4 bears no apparent homology to the sequences of other prokaryotic or eukaryotic dihydrofolate reductases (for a recent compilation see Ref. 29). This lack of homology raises the intriguing question of whether the R388 and the similarly organized R67 dihydrofolate reductase represent a novel type of dihydrofolate reductases or if these enzymes are non-reductase proteins with fortuitous reductase activities. The latter possibility is suggested by the low turnover number which can be deduced from the specific activity of the purified R388 enzyme. Its value (50-100 moles/min/mole of enzyme) is one to two orders of magnitude lower than those of other dihydrofolate reductases (3) . However, Smith and Burchall (30), who have asked the same question for the R67 enzyme, could detect no other activity besides that of the dihydrofolate reducing one. A comparison of the amino acid sequences of the R388 and the R67 dihydrofolate reductase gives instructive insights into the organization of the two enzymes. Of the 17 amino acids which are different, all but four map at the amino- and carboxy termini of the enzymes (Fig. 4 ) . The sequences between amino acid residues 22 and 76 are virtually identical. This suggests that only the central part of the molecule is essential for the acti- 707 Nucleic Acids Research vity. This is supported by the conspicuous accumulation of functional groups therein (e.g. Arg-Val-Arg-Lys-Lys-Ser (29-34), Trp-Tyr-Cys-Thr (45-48), Thr-Pro-Glu-Gly-Tyr (51-55), Glu-SerGlu (58-60), or Ile-Tyr-Pro-Val (68-71)). In this respect it would be interesting to know if there are kinetic differences between the R67 and the R388 dihydrofolate reductase since glutamine residue 49 in the R67 dihydrofolate reductase is replaced by a lysine residue in the R388 enzyme. The amount of R388 dihydrofolate reductase activity is rather low in E.coli (4,7). This might be due to a low level of gene expression. However, the organization of the promoter region is as in other prokaryotic genes. The nucleotide sequences of the "Pribnow box" (31), of the RNA polymerase recognition site (32,33), and of the ribosome binding site (28) have no more deviations from the prototype sequences than other genes (27). This suggests that the low amount of enzyme in E.coli is mostly, if not exclusively due to the low specific activity/of the enzyme. However, it should be recalled that we never have obtained a trimethoprim-resistant recombinant plasmid that is smaller than about 12OO bp. This might well indicate that far more sequence information is needed to express the gene than the information stored in the putative promoter region and that the auxilary sequences modulate the rate of gene expression. The nucleotide sequence at the end of the gene contains two termination codons in phase. It is conceivable that the first stop codon is translated in suppressor positive strains. This would add the sequence X-Gln-Leu-Ala-Gln-Glu-Gly-Ser-Pro-Pro-AlaGlu-Leu-Cys-Arg to the carboxy end of the protein and increase the molecular weight by about 1600. The recent finding of Fling and Elwell that the R388 dihydrofolate reductase is somewhat larger than the R67 enzyme in minicell producing strains (9) might be explained by this mechanism. The R388 dihydrofolate reductase gene is unusually small in size. With a length of less than 24O bp it should be possible to insert the gene into viral genomes without grossly affecting the size of the viral DNA. Since the R388 dihydrofolate reductase gene also confers resistance to methotrexate, which is toxic for fungi (34,35) , plants (36) , and animal cells (2) , 708 Nucleic Acids Research recombinants between the dihydrofolate reductase gene and the viruses could possibly be used as selectable DNA vectors to transfer foreign genes into the cells of fungi, plants, and animals. ACKNOWLEDGEMENTS We would like to thank Dr. H.G. Zachau for helpful suggestions and discussions during this work. Dr. E. Wachter for performing the amino acid analysis, and Ms. Ch. Graf, Ms. I. Wagner, and Mr. Neumaier for their excellent technical assistance. This work was supported in parts by the Deutsche Forschungsgemeinschaft (Forschergruppe Genomorganisation) and the Fonds der Chemischen Industrie. J.W. Zolg was supported by Cusanuswerk, Bischofliche Studienforderung. REFERENCES 1. Present address: Institute of Animal Genetics, University of Edinburgh, West Mains Road, GB Edinburgh EH9 3JN 2. Burchall, J.J. and Hitchings, G.H. (1965) Mol. Pharm. 1, 126-136 3. Blakley, R.L. (1969) The biochemistry of folic acid and related pteridines, pp. 139-187, North-Hoiland, Amsterdam 4. Amyes, S.G.D. and Smith, J.T. (1974) Biochem. Biophys. Res. Comm. 58, 412-418 5. Skold, 0. and Widh, A. (1974) J. Biol. Chem. 249, 4324-4325 6. Pattishall, K.H., Acar, J., Burchall, J.J., Goldstein, F.W., and Harvey, R.J. (1977) j . Biol. Chem. 2 52, 2319-2323 7. Zolg, J.W., Hanggi, U.J., and Zachau, H.G. (1978) Molec. Gen. Genet. 164, 15-29 8. Fling, M.E. and Elwell, L.P. (1978) in Genetic Engineering, Boyer, H.W. and Nicosia, S.» Eds., pp. 173-180, Elsevier/ North-Ho Hand, Amsterdam 9. Fling, M.E. and Elwell, L.P. (198O) J.Bacteriol. 141, 779-785 10. Barth, P.T. and Grinter, N.J. (1977) J.Mol.Biol. 113, 455-474 11. Ward, J.M. and Grinsted, J. (1978) Gene 3, 87-95 12. Blakley, R.L. (I960) Nature 188, 231-232 13. Peter, G., Hanggi, U.J., and Zachau, H.G. (1979) Molec. Gen. Genet. 175, 333-341 14. Laemmli, U.K. (197O) Nature 227, 68O-685 15. Hanggi, U.J. and Littlefield, J.W. (1974) J. Biol. Chem. 249, 1390-1397 16. Vogelstein, B. and Gillespie, D. (1979) Proc. Natl. Acad. Sci. USA 76, 615-619 17. Green, P.J., Heyneker, H.L., Bolivar, F., Rodriguez, R.L., Betlach, M.C., Covarrubias, A.A., Backman, K., Russel, D.J., Tait, R., and Boyer, H.W. (1978) Nucl. Acids Res. 5, 2373-2380 18. Maxam, A.M. and Gilbert, W. (1980) Methods in Enzymol. 65, 499-56O 709 Nucleic Acids Research 19. Pech, M. . Streeck, R.E., and Zachau, H.G. (1979) Cell 18, 883-893 20. Gray, C.P., Sommer, R., Polke, C , Beck, E., and Schaller, H. (1978) Proc. Natl. Acad. Sci. USA 75, 5O-53 21. Ohmori, H., Tomizawa, J., and Maxam, A.M. (1978) Nucl. Acids Res. 5, 1479-1485 22. Watson, D.H., Harvey, M.J., and Dean, P.D.G. (1978) Biochem. J. 173, 591-596 23. Smith, S.L., Stone, D., Novak, P., Baccanari, D.P., and Burchall, J.J. (1979) J. Biol. Chem. 254, 6222-6225 24. Wachter, E. and Werhahn, R. (1979) Anal. Biochem. 97, 56-64 25. Beaven, G.H. and Holiday, E.R. (1952) Adv. Protein chem. 7, 319-386 26. Stone, D. and Smith, S.L. (1979) J. Biol. Chem. 254, 10857-10861 27. Rosenberg, M. and Court, D. (1979) Ann. Rev. Genet. 13, 319-353 28. Shine, J. and Dalgarno, L. (1974) Proc. Natl. Acad. Sci. USA 71, 1342-1346 29. Kumar, A.A., Blankenship, D.T., Kaufman, B.T., and Freisheim, J.H. (198O) Biochem. 19, 667-678 30. Smith, S.L. and Burchall. J.J. (198O) Fed. Proc. 39, 1771 31. Pribnow, D. (1979) Proc. Natl. Acad. Sci. USA 72, 784-788 32. Sugimoto, K., Sugisaki, H-, Okamoto, T., and Takanami, M. (197 5) Nucl. Acids Res. 2, 2O91-21OO 33. Seeburg, P.H., Nusslein, C , and Schaller, H. (1977) Eur. J. Biochem. 74, 1O7-113 34. Scholz, K. and Jaenicke, L.(1968) Eur. J. Biochem. 4,448-457 35. Reddy, V.A., Kalghatgi, K.K., Rao, P.V.S., and Rao, N.A. (1976) Indian J. Biochem. Biophys. 13, 1O1-1O5 36. Reddy, V.A. and Rao, N.A. (1975) Indian J. Biochem. Biophys. 12, 7 5-8O 710