Survey
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
PROTEINS: Structure, Function, and Genetics 30:155–167 (1998) Crystal Structures and Properties of De Novo Circularly Permuted 1,3-1,4-b-glucanases Jacqueline Aÿ,1 Michael Hahn,1 Klaas Decanniere,1 Kirill Piotukh,2 Rainer Borriss,2 and Udo Heinemann1,3* 1Forschungsgruppe Kristallographie, Max-Delbrück-Centrum für Molekulare Medizin, Berlin, Germany 2Institut für Biologie, Humboldt-Universität, Berlin, Germany 3Institut für Kristallographie, Freie Universität, Berlin, Germany ABSTRACT The 1,3-1,4-b-glucanases from Bacillus macerans and Bacillus licheniformis, as well as related hybrid enzymes, are stable proteins comprised of one compact jellyroll domain. Their structures are studied in an effort to reveal the degree of redundancy to which the three-dimensional structure of protein domains is encoded by the amino acid sequence. For the hybrid 1,3-1,4-b-glucanase H(A16-M), it could be shown recently that a circular permutation of the sequence giving rise to the variant cpA16M-59 is compatible with wildtype-like enzymatic activity and tertiary structure (Hahn et al., Proc. Natl. Acad. Sci. USA 91:10417-10421, 1994). Since the circular permutation yielding cpA16M-59 mimicks that found in the homologous enzyme from Fibrobacter succinogenes, the question arose whether de novo circular permutations, not guided by molecular evolution of the 1,3-1,4-bglucanases, could also produce proteins with native-like fold. The circularly permuted variants cpA16M-84, cpA16M-127, and cpA16M-154 were generated by PCR mutagenesis of the gene encoding H(A16-M), synthesized in Escherichia coli and shown to be active in b-glucan hydrolysis. CpA16M-84 and cpA16M-127 were crystallized in space groups P21 and P1, respectively, and their crystal structures were determined at 1.80 and 2.07 Å resolution. In both proteins the main parts of the b-sheet structure remain unaffected by the circular permutation as is evident from a root-mean-square deviation of main chain atoms from the reference structure within the experimental error. The only major structural perturbation occurs near the novel chain termini in a surface loop of cpA16M-84, which becomes destabilized and rearranged. The results of this study are interpreted to show that: (1) several circular permutations in the compact jellyroll domain of the 1,3-1,4-b-glucanases are tolerated without radical change of enzymatic activity or tertiary structure, (2) the three-dimensional structures of simple domains are encoded by the amino acid sequence with sufficient redundancy to r 1998 WILEY-LISS, INC. tolerate a change in the sequential order of secondary structure elements along the sequence, and (3) the native N-terminal region is not needed to guide the folding polypeptide chain toward its native conformation. Proteins 30:155–167, 1998. r 1998 Wiley-Liss, Inc. Key words: X-ray diffraction; protein folding; genetic engineering; circular permutation; 1,3-1,4-b-glucanase INTRODUCTION There is an apparent paradox in that the tertiary structures of small globular proteins are only marginally more stable than the corresponding denatured states, but at the same time they are encoded in their amino acid sequences in a highly redundant way. Under physiological conditions, the free energy stabilizing the native fold, DG(H2O), is usually in the range of 5-15 kcal mol-1 for a protein domain of 100-200 amino acid residues.1,2 This is equivalent to the energy contribution of only a small number of hydrogen bonds formed in the protein interior. Yet, it is well known that the protein sequence can be modified in various ways with conservation of the native fold as has been shown for bacteriophage T4 lysozyme3 and many other proteins. The question therefore is: How is sequence redundancy brought about in protein structures of very limited conformational stability? Bacterial 1,3-1,4-b-glucanases (1,3-1,4-b-D-glucan 4-glucanohydrolases, E.C. 3.2.1.73) have been studied for some time to investigate the degree of sequence redundancy in a compact single-domain protein. Enzymes of this class hydrolyse b-1,4 glycosidic linkages of 3-O substituted glucose units in b-glucan, Contract grant sponsor: Deutsche Forschungsgemeinschaft; Contract grant number: He 1318/9-3; Contract grant sponsor: Fonds der Chemischen Industrie. Michael Hahn is now at NRC Biotechnology Research Institute, Montreal, Canada. *Correspondence to: Udo Heinemann, Robert-Rössle-Str. 10, D-13122 Berlin, Germany. E-mail: [email protected] Received 13 March 1997; Accepted 18 August 1997 156 J. AŸ ET AL. a mostly linear polysaccharide with b-1,3 and b-1,4 glycosyl bonds in a ratio of 1:2.5. Crystal structure analyses showed the enzymes from Bacillus macerans (MAC) and Bacillus licheniformis (LIC) to be folded into closely similar jellyroll domains predominantly composed of two seven-stranded antiparallel b-sheets.4,5 The proteins both contain a cation binding site occupied by Ca21 and one disulfide bond, and their polypeptide chain termini are located in close spatial proximity on the protein surface. Attempts to generate b-glucanases with altered thermal stability and enzymatic pH optimum yielded hybrid enzymes composed of sequences derived from MAC and the homologous protein from Bacillus amyloliquefaciens (AMY).6,7 For example, the hybrid H(A16-M) contains 16 N-terminal residues from AMY and residues 17-214 from MAC. The crystal structures of H(A16-M) with either Ca21 or Na1 occupying the cation binding site8-10 were very similar to that of MAC demonstrating that the native fold is retained after redesigning the sequence in this way. Since the structure of H(A16-M) could be determined at 1.6 Å resolution,9 this protein was selected as reference for further sequence modifications. The directed double mutation of active site residues Glu-105 and Glu-109 to Gln gave rise to a variant, H(A16-M)/ E105Q/E109Q, which was enzymatically incompetent, but had a crystal structure unchanged from that of the parent protein, again demonstrating redundancy in the folding code11 (also Gaiser et al., unpub.). In a further experiment, the observed proximity of N- and C-terminus permitted the construction of a circularly permutated (cp) protein, cpA16M-59, in which the chain termini are linked by a peptide bond and novel termini were created at sequence positions 58 and 59 of H(A16-M). The cp variant was obtained by PCR mutagenesis of the gene, synthesized in the periplasm of E. coli and shown to be stable, enzymatically active, and folded to a three-dimensional structure indistinguishable from the reference structure throughout most of the molecule.12 The design of cpA16M-59 was inspired by the sequence of the 1,3-1,4-b-glucanase from Fibrobacter succinogenes in which an analogous circular permutation has been brought about by molecular evolution.13,14 The bacterial 1,3-1,4-b-glucanases are one of but a few documented examples of proteins that have undergone a circular permutation in their evolution.15-19 In contrast, there is a growing number of reports of engineered cp proteins comprising many of the most common tertiary folds.20-23 So far, cpA16M-59 and a corresponding protein based on the sequence of MAC12 and variants of the a-spectrin SH3 domain24 are the only examples where X-ray crystallography could prove native-like folding in a cp polypeptide. The aim of the present study was to demonstrate yet another level of sequence redundancy in H(A16-M) by producing de novo cp variants not guided by molecular evolution of the 1,3-1,4-b- glucanase jellyroll and characterizing their structure and function. In proteins cpA16M-84, cpA16M127, and cpA16M-154, the native chain termini are again linked by a peptide bond, but the novel termini have been placed in different surface loops. Here, we describe the generation of these cp proteins, their enzymatic activity and thermal stability, as well as the crystal structures of cpA16M-84 and cpA16M-127. EXPERIMENTAL PROCEDURES PCR Mutagenesis, Protein Purification, and Enzymatic Activities The genes encoding cpA16M-84, cpA16M-127, and cpA16M-154 were constructed using splicing by overlap extension25 in complete analogy to cpA16M-59.12 In each mutagenesis, the cloned gene for H(A16-M)6 served as the starting point. DNA and amino acid sequences around the fusion points and the PCR primers used (TIBMolbiol, Berlin) are shown in Figure 2. CpA16M-84, cpA16M-127, and cpA16M-154 were purified from the E. coli periplasm as described12 with the following modifications. Periplasm fractions were dialyzed against 20 mM Tris/HCl, pH 8.5, 2 mM Ca-chloride, 0.02% (by weight) Na-azide and loaded onto a Q-Sepharose HiLoad 16/100 (Pharmacia, Uppsala, Sweden) column. Cp proteins were eluted with a linear gradient of 0-300 mM Nachloride. Enzymatic activities and thermal stabilities were determined as described.12 Catalytic constants were determined in triplicate with lichenan as substrate at 50oC in 20 mM Ca-chloride, 50 mM Na-acetate, pH 6.0. The half life (t1/2) is the time of 50% enzyme inactivation in 10 mM Ca-chloride, 50 mM Naacetate, pH 5.0. Crystallization and Acquisition of X-ray Diffraction Data Crystals of cpA16M-84 and cpA16M-127 were grown at room temperature by standard vapor diffusion methods. Optimal crystal growth was achieved by single-parameter optimization of starting conditions found in a multidimensional initial screen.26 For cpA16M-84, a solution of 12 mg of protein per 1 ml of 10 mM HEPES, pH 7.0, 2 mM Ca-chloride, was mixed with an equal volume of 50 mM K-phosphate, pH 7.0, 20% (by weight) polyethylene glycol (PEG) 8000 and equilibrated against this precipitating solution in sitting drops. Crystals grew to a final size of 0.4 x 0.3 x 0.2 mm within 28 days. CpA16M-127 was used in a 18 mg/ml solution in 20 mM Tris/HCl, pH 9.0, 2 mM Ca-chloride and crystallized by hanging-drop vapor diffusion followed by macroseeding. Here, the precipitating solution contained 0.1 M imidazole, pH 7.5, 2 mM Ca-chloride, 10% (by weight) CIRCULARLY PERMUTED b-GLUCANASES 157 TABLE I. X-ray Diffraction Data Space group Cell axes (Å) Cell angles (°) Resolution limits (Å) Resolution limits, outer shell (Å) No. of observations No. of unique reflections Data statistics1 Average multiplicity Completeness (%) R(merge)2 (%) 7I/s(I)8 Reflections with I $ 3 s(I) (%) cpA16M-84 cpA16M-127 P21 45.54 65.82 69.61 90 106.87 90 33.33–1.76 1.83–1.76 132,154 38,698 P1 44.95 46.55 63.99 92.60 109.28 116.68 29.62–2.07 2.15–2.07 43,819 24,275 3.5 (3.1) 98.3 (94.4) 5.6 (18.8) 7.9 (3.9) 92.3 (70.0) 1.8 (1.6) 93.5 (90.0) 10.9 (19.6) 4.0 (3.3) 79.7 (57.9) 1Numbers in parentheses refer to data in outer resolution shell. 5 100 3 (Sh,i 0 Ih,i 2 Ih 0 /Sh,i Ih,i ), where the summation is over all observations Ih,i contributing to the reflection intensities Ih . 2R(merge) PEG 8000, 8% (by vol.) ethylene glycol, and crystals grew to 1.0 x 0.2 x 0.1 mm within 7 days. Diffraction data were collected on MarResearch (Hamburg, Germany) imaging plate systems using X-rays from direct-drive Rigaku Denki (Tokyo, Japan) rotating anode generators. One low and one high-resolution data set for cpA16M-84 were collected on a 180 mm imaging plate with graphite monochromatized Cu Ka radiation produced with 0.3 mm focal spot size. A single data set for cpA16M127 was acquired on a 300 mm imaging plate using X-rays produced with 0.2 mm focus and monochromatized by Supper (Natick, MA) double mirror focussing optics and filtering through 17.5 µm Ni. Diffraction data were processed with DENZO27 and scaled with SCALA from the CCP4 program system.28 Cell parameters and data statistics are summarized in Table I. Space groups were derived from Laue symmetries and systematic absence of reflection classes. For both proteins the presence of two molecules per asymmetric unit is required to yield a packing density in the expected range29, VM 5 2.1 Å3/Da for cpA16M-84 and VM 5 2.3 Å3/Da for cpA16M-127. Crystal Structure Determination and Refinement Neither cpA16M-84 nor cpA16M-127 crystallized isomorphously with any of the b-glucanases studied so far.11 The crystal structures were therefore determined by molecular replacement using the structure of the Ca21-bound form of H(A16-M)9 as search model in the program system AMORE.30 In both cpA16M-84 and cpA16M-127, the self-rotation function provided clear evidence for a noncrystallographic dyad axis. The two highest peaks in the cross-rotation functions at 14.4 s and 10.8 s for cpA16M-84 and at 18.1 s and 17.5 s for cpA16M-127 were in agreement with the noncrystallographic dyads and represented the correct solution of the rotation problem in both crystal forms. In the translation function, again, the correct solutions could be selected from the top 10 solutions of the list. They appeared at 12.8 s and 18.7 s for the two independent molecules of cpA16M-84 and at 10.4 s for the second molecule of cpA16M-127. After rigid body refinement, the molecular replacement solutions gave R values of 44.9% and 36.6% for cpA16M-84 and cpA16M-127, respectively. The H(A16-M) model was then manually rebuilt with O31 to accommodate the circular permutations and subjected to slow-cooling simulated annealing refinement with X-PLOR32 starting from kinetic energies adjusted to a temperature of 2,000 K. After further positional and B factor refinement with X-PLOR, the structures were completed with maximum likelihood refinement as implemented in REFMAC.33 The free R value34 (R(free)) calculated from a representative fraction of the reflections was used throughout to guide the refinement strategy. For both structures, refinement started with strict noncrystallographic restraints (constraints), which were partially relaxed as indicated by R(free). Solvent molecules were added as indicated by difference electron density and retained in the model if they maintained reasonable hydrogen bonding geometry, electron density, and B values. In both cp variants, the cation binding site was occupied with a Ca21 ion in octahedral coordination as observed earlier.10 In addition, both independent molecules of cpA16M-84 showed a portion of electron density in the active site channel which could be unambiguously assigned to bound HEPES buffer molecules (see Figure 3B). Lastly, a phosphate molecule from the crystallization buffer is bound between residues Lys-56 and Arg-66 of two symmetryrelated molecules of cpA16M-84. Refinement was judged complete when positional and thermal parameter shifts converged and no interpretable features 158 J. AŸ ET AL. Fig. 1. Design of the variants cpA16M-84, cpA16M127, and cpA16M-154. A. Schematic drawing of the polypeptide chain organizations. The N-terminus of cpA16M-84 corresponds to Asn-84 of H(A16-M), cpA16M-127 starts with Val-127 of H(A16-M), cpA16M154 starts with Ile-154 of H(A16-M). Residues Lys-83, Gly-126 and Tyr-153 of the parent protein are the C-termini of cpA16M-84, cpA16M-127 and cpA16M154, respectively. B. Secondary structure of H(A16-M). New termini of the cp variants are indicated by scissors. The circular permutations change the sequential, but not the spatial arrangement of secondary structure elements in the variants. The drawing was prepared with HERA35 based on an analysis of secondary structure with DSSP36. C. Schematic drawing of H(A16-M) with scissors indicating the location of circular permutations. This drawing as well as Figures 5 and 6 was prepared with MOLSCRIPT37. remained present in the difference electron density. Atomic coordinates and structure amplitudes were submitted with the Brookhaven Protein Databank (entry codes 1AJK for cpA16M-84 and 1AJO for cpA16M-127) from where they are available upon request. RESULTS Design and Generation of Cpa16m-84, Cpa16m-127, and Cpa16m-154 The design of the de novo cp variants cpA16M-84, cpA16M-127, and cpA16M-154 was based on the known crystal structures8,9,12 of the parent protein, CIRCULARLY PERMUTED b-GLUCANASES 159 the generation of new termini. In these proteins, therefore, N- and C-terminus are on neighboring strands of one sheet as in the parent protein H(A16-M). In contrast, cpA16M-84 is produced by opening an interstrand connection, and the new ends are located on different b-sheets. On the gene level the cp variants were generated by separating the coding sequence in two parts and exchanging their order by PCR mutagenesis as described above. Figure 2 displays DNA and peptide sequences around the fusion points. The cp proteins were directed into the E. coli periplasm by the signal peptide of wildtype MAC from where they could be easily purified in milligram quantities. Enzymatic Activity and Stability of cp Proteins Fig. 2. DNA and amino acid sequences around the polypeptide chain termini of H(A16-M), cpA16M-84, cpA16M-127, and cpA16M154. Portions of H(A16-M) derived from MAC and AMY are labelled with the sequence numbers of their parent protein. Amino acids in the signal peptide (sp) absent from the mature protein have negative sequence numbers. Asterisks denote the length of the PCR primers (sequence of lower strand) used in mutagenesis. H(A16-M), and of cpA16M-59. Figure 1 shows the constructs on the primary, secondary, and tertiary structure level of the protein. As with cpA16M-59, the parental polypeptide termini are simply linked with a peptide bond, and no attempt was made to design more complicated peptide linkers. This approach was encouraged by the elevated thermal mobility of the terminal residues observed in the crystal structure of H(A16-M)9 indicating the absence of important stabilizing intramolecular interactions involving these residues. New chain termini were placed in surface loops so as to preserve the core of the protein. Two kinds of b-strand connections are present in jellyroll domains, those that link strands within a b-sheet and those that link strands between sheets. In cpA16M-127 and cpA16M-154, as before in cpA16M-5912, a loop within a b-sheet is cleaved for The enzymatic activity of H(A16-M) and its cp variants was determined as described.12 Both cpA16M-84 and cpA16M-127 hydrolyse the natural substrate lichenan (Table II), but with reduced activity. At the temperature optimum for catalysis, the enzymatic activity of cpA16M-84 is reduced by a factor of roughly 4 with respect to H(A16-M) and cpA16M-59, whereas for cpA16M-127 the drop in activity is roughly 20-fold. The reduction in cpA16M-84 activity is brought about by small changes in both kcat and Km. Conversely, cpA16M-127 shows a more drastic drop in kcat with nearly unchanged Km. A fourth variant, cpA16M-154, generated in the same way as the other three but not subjected to crystallographic analysis, has very similar characteristics as the reference protein, H(A16-M). The conformational stability of the proteins is reflected in their activity versus temperature profiles. The temperature optimum (Topt) for catalysis is between 60oC and 65oC for H(A16-M) and all but one of the cp variants. This reflects the fact that the enzymes are derived from the 1,3-1,4-b-glucanase MAC, a protein from a thermophilic organism. The single exception is cpA16M-84 displaying a drop by 20oC in its Topt. At 80oC where all other variants show significant residual activity, cpA16M-84 can no longer hydrolyse lichenan. We note that the effects of the circular permutations on Topt and enzyme kinetics are uncorrelated. Although cpA16M-84 appears less stable than cpA16M-127, it is the better enzyme over a wide range of temperatures. A more direct measure of conformational stability is the half life of enzymatic activity at elevated temperature. Table II shows that all four cp variants are destabilized relative to H(A16-M), but to various degrees. Clearly, cpA16M-84 is by far the least stable of the cp proteins. Crystal Structure Analysis Structure refinement converged at R values of 16.8% and 17.6% at 1.80 and 2.07 Å resolution for cpA16M-84 and cpA16M-127, respectively (Table 160 J. AŸ ET AL. TABLE II. Enzymatic Activity and Stability of cp Variants Topt (°C) A (U mg21) at Topt A (U mg21) at 50°C pHopt t1/2 (min) at 70°C Km (mg ml21) at 50°C kcat (s21) at 50°C kcat/Km (ml mg21 s21) A (%) at 40°C A (%) at 80°C H(A16-M)1 cpA16M-591 cpA16M-84 cpA16M-127 cpA16M-154 (61–67)2 62 (58–66) 3930 6 100 2833 6 69 6.5–6.8 46 0.47 6 0.02 1450 6 90 3085 58 6 40 (37–45) 1197 6 30 1088 6 27 6.0–6.5 n.d.4 0.8 6 0.04 461 6 18 576 100 0 60 (57–70) 247 6 6 231 6 5 6.5–8.0 5 0.52 6 0.06 80 6 5 154 75 43 65 (65–70) 4131 6 98 3050 6 83 6.2–6.8 14 0.43 6 0.1 1363 6 28 3170 49 55 64 4890 6 120 3731 6 91 6.5–7.03 137 0.49 6 0.1 1860 6 50 3796 535 415 1Data in part taken from ref. 12. in parentheses are temperature intervals for 97–100% activity. 3pH intervals for 95–100% activity. 4Not determined. At 63, 64 and 65°C, t 1/2 is 64, 27 and 9 min, respectively. 5With respect to activity at T opt . 2Numbers III). Both protein models yield reasonable R and R(free) values, have proper stereochemistry as indicated by PROCHECK38 and the Ramachandran diagrams (not shown), and display the prespecified variance in geometric parameters with respect to the target values.39 As indicated by R(free), NCS restraints were removed for cpA16M-84 but kept for cpA16M-127 at the end of refinement. For a detailed comparison of the crystal structures of the cp variants with the reference protein, it is crucial to know the error in the atomic coordinates of the protein molecules. An upper limit for the mean coordinate error may be obtained from a Luzzati plot40 based on R(free). This procedure yields mean coordinate errors of 0.2-0.35 Å and 0.25-0.35 Å for cpA16M-84 and cpA16M-127, respectively. Both crystal structures are well defined throughout most of the polypeptide chains. In both crystal structures, the cation binding sites of the cp 1,3-1,4b-glucanases are occupied by Ca21 in much the same way as in H(A16-M). In general, the novel polypeptide chain termini and the newly formed peptide bond between N- and C-terminus of H(A16-M) tend to be less well defined than the remainder of the molecules. However, they are present in electron density as shown for a representative example in Figure 3. In the crystal structure of cpA16M-127, there is no electron density for the two aminoterminal residues of molecule 1 and for residue Thr-90 in molecule 2. In cpA16M-84, the long surface loop Thr-104 to Pro-118 is destabilized by the novel amino and carboxy termini, which are located nearby. This loop attains a completely new conformation in molecule 1 of the asymmetric unit (see Figs. 5, 6) where it is without electron density for residues Gly-105 and Val-106 and the side chain of Asp-107. In molecule 2, there is insufficient density in the entire region for modelling the polypeptide chain. In addition, the loop from Ser-185 to Lys-189 shows elevated thermal TABLE III. Refinement and Model Statistics Resolution range (Å) No. of protein non-hydrogen atoms No. of solvent molecules No. of reflections No. of reflections, working set No. of reflections, test set Degrees of freedom Data-to-parameter ratio R(cryst)1 (%) R(free)2 (%) F(obs)/F(calc) correlation coefficient R.m.s. deviation from target values3 Bond lengths (Å) Bond angle distances (Å) Planar 1–4 distances (Å) Mean B, all protein atoms (Å2) Mean B, solvent (Å2) Ramachandran outliers4 cpA16M-84 cpA16M-127 33.33–1.80 9.99–2.07 3,284 308 35,910 34,124 1,786 14,500 2.48 16.8 21.9 3,376 244 24,053 21,672 2,381 14,488 1.66 17.6 23.5 0.953 0.954 0.015 0.033 0.035 0.013 0.032 0.034 19.5 33.7 none 20.9 33.0 none 5 100 3 (Sh 0 F(obs,h) 2 F(calc,h) 0 /ShF(obs,h)), where F(obs) and F(calc) are observed and calculated structure amplitudes, respectively, and the summation is over all reflections h. 2R(free) 5 R(cryst) calculated for a test set of reflections not used in refinement. 3With respect to the parameters of Engh and Huber.39 4According to an analysis with PROCHECK.38 1R(cryst) parameters in this molecule, and the side chains are not well defined. In the crystal lattice, this loop is juxtaposed the disordered region 104-118 of another molecule in an area of apparently high flexibility. A contiguous portion of difference electron density in the active-site cleft of cpA16M-84 could be unambiguously assigned to a bound HEPES molecule (Fig. 3). The buffer molecule forms hydrogen bonds to Glu-26 and Trp-101 in the active site of cpA16M-84. CIRCULARLY PERMUTED b-GLUCANASES 161 Fig. 3. Electron density in cpA16M-84. A. The newly formed peptide bond linking the termini of H(A16-M) in molecule 2 of cpA16M-84 with 2Fo-Fc difference electron density contoured at 0.35s above the mean. B. 2Fo-Fc difference electron density contoured at 0.4s above the mean representing a HEPES buffer molecule bound to the active site. Hydrogen bonds are shown as dotted lines and their lengths are given in Å units. Stereo drawings prepared with SETOR.38 In spite of this active-site contact by the buffer molecule, however, cpA16M-84 is not inhibited by HEPES. Under standard assay conditions (see Experimental Procedures), kinetic parameters remain unchanged within the range of experimental error (5-8%) at HEPES concentrations up to 50 mM, i.e., up to five times the concentration present in the crystallization buffer. The lack of inhibition of cpA16M-84 by HEPES may be explained by the intermolecular mode of HEPES binding. Its sulfonic acid group is in hydrogen-bonding contact to a water molecule coordinated to the Ca21 ion of a symmetryrelated protein molecule. In one of the HEPES molecules, there is electron density suggesting two alternative orientations of its hydroxyl group as shown in the Figure 3. rial 1,3-1,4-b-glucanases.11 Crystal packing diagrams are shown in Figure 4. In both lattices, the pattern of intermolecular contacts experienced by a given protein molecule is complex due to the presence of two molecules in the asymmetric unit. Molecule 1 of cpA16M-84 forms 24 direct hydrogen bonds (donor-acceptor distance 3.5 Å or less) to neighboring molecules in the lattice, whereas molecule 2 engages in only six such interactions. For the two independent molecules of cpA16M-127, the corresponding numbers are 15 and 13. The somewhat higher packing density in the crystal of cpA16M-84 may explain the better diffracting quality in comparison with cpA16M-127. Crystal Packing Both cpA16M-84 and cpA16M-127 have adopted novel packing modes not observed before with bacte- Tertiary Structures The three-dimensional structures of cpA16M-84 and cpA16M-127 are shown schematically in Figure 5 in an orientation that permits a view into the active-site channel running across. The crystal 162 J. AŸ ET AL. Fig. 4. Stereo drawing of the packing of protein molecules in crystals of (A) cpA16M-84 and (B) cpA16M-127. The unit cells are oriented to have their longest axis oriented vertical and the shortest axis projecting out at the viewer. Fig. 5. Schematic drawing of cpA16M-84 (center) and cpA16M127 (right). For comparison, cpA16M-5912 (left) is also shown in the same orientation looking into the active site channel. b-Strands are shown as green or yellow curved arrows and a-helices as red wound ribbons. The N-terminal polypeptide portions, which have been relocated to the C-terminus of the protein by the circular permutation, are shown in yellow. The bound Ca21 and the disulfide bond are shown in all molecules as sphere and in ball-and-stick representation, respectively. structure of the other known cp variant, cpA16M5912, is shown for comparison. It is clear from a casual look that the tertiary structures of the three cp proteins and H(A16-M) (see Fig. 1C) are very similar. All display the jellyroll architecture typical for bacterial 1,3-1,4-b-glucanases, which is characterized by two major, completely antiparallel seven-stranded b-sheets and a minor, mixed three-stranded sheet. The twist and curvature of these sheets, along with a number of connecting loops, give rise to the prominent channel known to bind the polysaccharide substrate and to accommodate the catalytic site residues.8 Common to all four proteins, in addition, is the binding of a cation, Ca21, to a site consisting of main chain and side chain atoms of Pro-9, Gly-45, and Asp-207 (H(A16-M) numbering), which is remote from the active site. The number of water molecules completing the Ca21 coordination sphere varies between two and three. In all proteins the single disulfide bond between Cys-32 and Cys-61 (H(A16-M) numbering) is formed. The topology of this bond, however, is not uniform. Since in cpA16M-59 the novel termini are located within the 30-residue loop closed by the disulfide bond in H(A16-M), the resulting disulfide bond closes a large loop of 184 residues in this cp variant. In contrast, cpA16M-84, cpA16M-127, and cpA16M-154 all have the same disulfide bond topology as the reference protein. Structural Comparison of cp Variants With Parent Protein A systematic comparison of the three-dimensional structures of cpA16M-84 and cpA16M-127 with H(A16-M) and cpA16M-59 can be made by superim- CIRCULARLY PERMUTED b-GLUCANASES posing all pairs of protein molecules and determining the root-mean-square (rms) distances of equivalent atoms. In order to eliminate noise from different side chain conformations that are more influenced by crystal contacts than the main chain, these computations are done for Ca atoms only. Residues whose a-carbons are more than 1.5 Å apart are removed from the atom lists and no longer used in subsequent least-squares superpositions to avoid obscuring real differences between the structures. The structures are sufficiently similar so as not to lose many residues in this process (Table IV). In total, seven individual 1,3-1,4-b-glucanase molecules enter into the structural comparison. The reference protein, H(A16-M) has been characterized in two forms differing in cation binding (Na1 versus Ca21) and crystal packing. In addition, for both cpA16M-84 and cpA16M-127, there are two independent molecules in the crystallographic asymmetric unit. These are denoted cpA16M-84#1 and cpA16M84#2, and cpA16M-127#1 and cpA16M-127#2, respectively. Thus we are looking at two independent molecules of H(A16-M) and at five molecules representing the cp forms of the protein yielding N 5 [n 3 (n-1)]/2 5 21 pairs of superimposed molecules for n 5 7. As seen in Table IV, the molecules of cpA16M-127 are still subject to NCS restraints and thus not strictly independent. It is important to note that every pair of molecules differs in one to three parameters out of the set (1) chain organization, (2) crystal contacts, and (3) cation binding. Yet, the tertiary structures are very similar with rms D values below 0.4 Å in nearly all comparisons. The inclusion of all backbone and side chain atoms in the superposition of H(A16-M) with cpA16M-84#1 raises the rms D to 0.93 Å. Similar effects may be expected for the other all-atom superpositions. Considering the estimated Luzzati coordinate errors of up to 0.35 Å for cpA16M-84 and cpA16M-127, the tabulated rms distances between equivalent atoms are thus within twice the experimental coordinate error, and the rms D values expected for all-atom comparisons do not exceed this error limit by much. The portions of the protein molecules considered nonequivalent and excluded from the superpositions are mostly around the old and the newly created chain termini and in some surface loops. The result of the structural comparisons is shown in Figure 6. The sevenfold superpositions of H(A16M), cpA16M-59, cpA16M-84, and cpA16M-127 reveal that the central b-sheet core of the 1,3-1,4-bglucanase jellyroll is practically identical in all structures. Some of the observed differences appear to be due to crystal packing rather than the circular permutations as is evident for the loop between Pro-9 and Thr-17 in the Ca21-form of H(A16-M), which differs in its conformation from all other molecules, including the Na1-bound form of H(A16M). A closeup comparison of side chain geometries in 163 the catalytic site of the enzymes reveals very small structural changes from molecule to molecule consistent with the enzymatic activity of all molecular species. An explanation of the kinetic parameters of the different enzymes in quantitative terms, especially the significant drop in kcat of cpA16M-127 with respect to H(A16-M) (see Table II), however, is not afforded by the crystal structures. DISCUSSION The present study aims at defining the degree to which the folding of the 1,3-1,4-b-glucanase jellyroll domain is redundantly encoded by the amino acid sequence. Starting from the pioneering studies of the Kirschner group,20,43 it has become clear that the circular permutation of polypeptide chains is one of the more global ways by which a protein sequence can be altered while intramolecular interactions stabilizing the tertiary structure may remain intact. Earlier,12 we had shown that a circular permutation of the hybrid protein H(A16-M) designed after a permutation present in the family of bacterial 1,3-1,4b-glucanases13,14 was tolerated with conservation of tertiary structure and enzymatic activity. The question addressed here was whether de novo circular permutations of H(A16-M), not guided by the molecular evolution of 1,3-1,4-b-glucanases, would also yield polypeptides with native-like form and function. Three cp proteins with the simplest possible design were generated by genetic rearrangement and studied with respect to enzymatic activity, thermal stability, and three-dimensional structure. These variants, cpA16M-84, cpA16M-127, and cpA16M154, have amino and carboxy termini in surface loops of H(A16-M) and a peptide bond linking the termini of the parental protein. They are thus identical in sequence to H(A16-M) except for the circular permutation, which changes the sequential order of secondary structure elements but may be expected to leave tertiary interactions unaffected (see Fig.1). The three cp variants folded spontaneously in the E. coli periplasm and were conformationally stable and enzymatically active, albeit to different degrees. CpA16M-84 is the least thermostable and cpA16M127 is the least active of the three cp variants. The crystal structures of these proteins determined at 1.80 and 2.07 Å resolution, however, show nativelike folding whereby the core of the tertiary structures remains unchanged within the limits of accuracy of the experiments. No attempt has been made to create a comprehensive set of cp 1,3-1,4-bglucanases. We would predict that the chain termini can be moved to a fairly large number of locations on the protein surface without loss of tertiary structure and function, but that termini inside the hydrophobic core of the jellyroll might cause misfolding. The results of a recent random circular permutation 164 5–39, 48–106, 119– 184, 188–213 5–88, 93–141, 147–213 4–87, 93–99, 105–213 4–87, 93–141, 145–210 cpA16M-127#2 cpA16M-127#1 cpA16M-84#2 cpA16M-84#1 *H(A16-M) · Na1, H(A16-M) with Na1 in the cation binding site10; in all other structures Ca21 is in the cation binding site. Values above the diagonal are rms distances (Å) of equivalent a-carbon atoms, numbers below the diagonal denote those residues of H(A16-M) which were used in the superposition. Rms distances were calculated with LSQKAB.42 For details, see text. — 1–89, 92–214 0.094 — 5–39, 48–106, 119– 184, 188–213 5–39, 48–103, 105– 131, 138–183, 190–213 5–39, 48–103, 105– 183, 190–213 0.339 0.328 — 4–130, 138–184, 189–214 5–15, 18–103, 105– 131, 136–184, 191–212 5–88, 93–141, 147–213 0.276 0.271 — 5–106, 120–131, 136–184, 191–212 0.330 0.261 0.266 0.289 0.263 0.269 0.283 0.334 0.416 0.388 0.266 0.304 0.251 0.252 0.318 — 0.328 — 1–154, 161–167, 173–209 5–106, 119–129, 136–142, 147–184, 188–212 5–103, 119–129, 136–142, 147–184, 188–212 4–87, 93–99, 105–213 — 2–10, 17–42, 45–214 1–154, 161–169, 173–209 4–13, 18–103, 119– 127, 136–184, 189–212 4–13, 18–103, 105– 127, 136–185, 189–212 4–87, 93–141, 145–210 H(A16-M) · Na1 H(A16-M) cpA16M-59 cpA16M-84#2 this work cpA16M-84#1 this work cpA16M-59 ref. 12 H(A16-M) ref. 9 H(A16-M) · Na1 ref. 10 TABLE IV. Least-squares Superposition of 1,3-1,4-b-glucanases* cpA16M-127#1 this work cpA16M-127#2 this work J. AŸ ET AL. experiment with aspartate transcarbamoylase44 appear to be in line with this assumption. The properties of the cp variants are similar but not identical to those of the reference protein, H(A16M). Can the crystal structures of cpA16M-84 and cpA16M-127 explain the observed differences? The most extreme behavior with regard to enzymatic activity is displayed by cpA16M-127 where kcat is reduced by a factor of more than 20. Thus this enzyme is impaired in turnover, whereas substrate binding appears nearly unchanged. The reaction mechanism of the bacterial 1,3-1,4-b-glucanases is known to proceed with overall retention of configuration at the anomeric carbon45 and to involve the side chains of Glu-105 of H(A16-M) as nucleophile and of Glu-109 as general acid in the first reaction step.4 The crystal structure of cpA16M-127 shows the catalytic-site glutamic acid residues to be in very similar arrangement and environment as in the parent enzyme (see Fig. 6B). Whether the reaction intermediate after the first step remains covalently attached to the nucleophile or exists as an oxocarbonium ion stabilized by negative charges in the catalytic site is not known with certainty. Given this ambiguity, it is difficult to relate precisely the moderate 20-fold change in kcat to geometric features of the active sites. It appears possible, however, that the creation of new chain termini near the active-site channel subtly changes substrate binding and modifies the strain thought to drive the reaction.9 In the presence of 1 mM Ca21 and 1.5 M guanidine hydrochloride, H(A16-M) was shown to undergo two thermal unfolding transitions characterized by Tm values of 55.9oC and 57.3oC, whereas without Ca21 one transition with a Tm of 51.2 °C and a calculated free energy change at 25oC, DG25oC, of 10.8 kcal mol-1 was observed.46 All cp variants including cpA16M5912 crystallize with bound Ca21 and display reduced thermal stability by comparison with H(A16-M) in the order cpA16M-59 . cpA16M-154 . cpA16M127 . cpA16M-84 as measured by the half-life of enzymatic activity at 70oC. It must be emphasized that the reference temperature was chosen so as to maximize apparent differences in thermal stability and that thermodynamic stabilities from urea or guanidine hydrochloride unfolding experiments are expected to yield smaller deviations. Since the threedimensional structures and hence the intramolecular interactions of the protein variants are so similar, systematic differences between them in the numbers of hydrogen bonds, salt bridges, or van der Waals contacts are not discernible. Earlier we speculated that the changed properties of cpA16M-59 might be due to strain introduced into the molecule by the novel peptide bond linking the termini of the reference protein.12 With three more cp variants and two more crystal structures at hand, this interpretation is no longer tenable, since all variants share the same peptide link but differ CIRCULARLY PERMUTED b-GLUCANASES Fig. 6. Superposition of the crystal structures of H(A16-M)9 (pink), H(A16-M)·Na1 (ref. 10, red), cpA16M-5912 (yellow), cpA16M-84 (two molecules in dark and light blue) and cpA16M127 (two molecules in dark and light green). a-Carbon traces of the protein backbones are shown in (A) where the view is significantly in their activity and stability. Apparently, it is the local environment of the novel termini that decides about these properties. The changed disulfide topology of cpA16M-59 where a large loop of 184 residues is formed in place of the 30-residue loop in H(A16-M) (see above) may contribute to the comparatively high stability of this cp variant 165 onto the active site channel. Numerical values for the leastsquares fits are given in Table IV. A closeup view of the catalytic residues is displayed in (B) with side chains added to the structures after the global superposition based on a-carbons. The numbering of residues is according to the H(A16-M) sequence. through entropic effects on the protein’s unfolded form as discussed.2 The low thermal stability of cpA16M-84 is correlated with a loss of tertiary structure in a large loop next to the novel termini, which is flexible and partly ill defined in the crystal structure. Obviously, the positioning of the termini on different b-sheets serves to destabilize the adja- 166 J. AŸ ET AL. cent intersheet crossover loop. We may note that the protein yielded very well diffracting crystals despite this apparent loss of tertiary order. Nonwithstanding the small differences in structure and function between H(A16-M) and its cp variants discussed above, the main result of the present study is that the compact jellyroll fold of the 1,3-1,4-b-glucanase is defined by its amino acid sequence in a sufficiently redundant way to tolerate global changes such as circular permutations of the sequence, including those not prescribed by molecular evolution. The resultant proteins show nativelike fold and physical properties resembling those of H(A16-M). The biological significance of this observation lies in reinforcement of the idea of a highly redundant sequence-structure code that is expected to be valid as long as changes to a protein structure and topology are restricted to the molecular surface. In addition, the results may be interpreted to prove that the native amino terminus is not required for proper in vivo folding of the 1,3-1,4-b-glucanase jellyroll domain. At least for this domain, they argue against the vectorial protein folding hypothesis proposed by Chantrenne47 and reiterated often thereafter, which assigns to the amino terminus a role in guiding the folding process by acquiring structure before the polypeptide chain is completely synthesized. The amino terminus of the 1,3-1,4-b-glucanase jellyroll has obviously not evolved to assume this responsibility. Recently proposed gradients from N- to C-terminus of substructure stability,48 local compactness,49 or secondary structure predictability,50, if they exist in H(A16-M), do not influence the in vivo foldability and native-like tertiary structure of its cp variants. Circular permutations of polypeptide chains have been engineered for a number of proteins of different global folds.20-23 Although their topologies are changed by altering the sequential order of secondary structure elements, this change may appear superficial. More fundamental changes could be introduced by designing protein variants with several new strand connections or by implanting independent folding units into a compact domain. The bacterial 1,3-1,4-bglucanases may prove to be good targets for further work along these lines. ACKNOWLEDGMENTS Advice and support from M.K. Ponnuswamy (Madras, India) and J.J. Müller in structure analysis and refinement matters is gratefully acknowledged. REFERENCES 1. Creighton, T.E. Proteins: Structures and Molecular Properties, 2nd ed. New York: Freeman, 1993. 2. Pace, C.N., Heinemann, U., Hahn, U., Saenger, W. Ribonuclease T1: Structure, function, and stability. Angew. Chem. Int. Ed. Engl. 30:343-360, 1991. 3. Matthews, B.W. Studies on protein stability with T4 lysozyme. Adv. Protein Chem. 46:249-278, 1996. 4. Hahn, M., Olsen, O., Politz, O., Borriss, R., Heinemann, U. Crystal structure and site-directed mutagenesis of Bacillus macerans endo-1,3-1,4-b-glucanase. J. Biol. Chem. 270: 3081-3088, 1995. 5. Hahn, M., Pons, J., Planas, A., Querol, E., Heinemann, U. Crystal structure of Bacillus licheniformis 1,3-1,4-b-Dglucan 4-glucanohydrolase at 1.8 Å resolution. FEBS Letters 374:221-224, 1995. 6. Olsen, O., Borriss, R., Simon, O., Thomsen, K.K. Hybrid Bacillus (1-3,1-4)-b-glucanases: Engineering thermostable enzymes by construction of hybrid genes. Mol. Gen. Genet. 225:177-185, 1991. 7. Politz, O., Simon, O., Olsen, O., Borriss, R. Determinants for the enhanced thermostability of hybrid (1,3-1,4)-bglucanases. Eur. J. Biochem. 216:829-834, 1993. 8. Keitel, T., Simon, O., Borriss, R., Heinemann, U. Molecular and active-site structure of a Bacillus 1,3-1,4-b-glucanase. Proc. Natl. Acad. Sci. USA 90:5287-5291, 1993. 9. Hahn, M., Keitel, T., Heinemann, U. Crystal and molecular structure at 0.16 nm resolution of the hybrid Bacillus endo-1,3-1,4-b-D-glucan 4-glucanohydrolase H(A16-M). Eur. J. Biochem. 232:849-858, 1995. 10. Keitel, T., Meldgaard, M., Heinemann, U. Cation binding to a Bacillus (1,3-1,4)-b-glucanase. Geometry, affinity and effect on protein stability. Eur. J. Biochem. 222:203-214, 1994. 11. Heinemann, U., Aÿ, J., Gaiser, O., Müller, J.J., Ponnuswamy, M.N. Enzymology and folding of natural and engineered bacterial b-glucanases studied by X-ray crystallography. Biol. Chem. 377:447-454, 1996. 12. Hahn, M., Piotukh, K., Borriss, R., Heinemann, U. Nativelike in vivo folding of a circularly permuted jellyroll protein shown by crystal structure analysis. Proc. Natl. Acad. Sci. USA 91:10417-10421, 1994. 13. Teather, R.M., Erfle, J.D. DNA sequence of a Fibrobacter succinogenes mixed-linkage glucanase (1,3-1,4-b-D-glucan 4-glucanohydrolase) gene. J. Bacteriol. 172:3837-3841, 1990. 14. Schimming, S., Schwarz, W.H., Staudenbauer, W.L. Structure of the Clostridium thermocellum gene licB and the encoded b-1,3-1,4-glucanase. Eur. J. Biochem. 204:13-19, 1992. 15. Delarue, M., Poterszman, A., Nikonov, S., Garber, M., Moras, D., Thierry, J.C. Crystal structure of a prokaryotic aspartyl tRNA synthetase. EMBO J. 13:3219-3229, 1994. 16. Ponting, C.P., Russell, R.B. Swaposins: Circular permutations within genes encoding saposin homologues. Trends Biochem. Sci. 20:179-180, 1995. 17. Heinemann, U., Hahn, M. Circular permutations of protein sequence: Not so rare? Trends Biochem. Sci. 20:349-350, 1995. 18. Lupas, A. A circular permutation event in the evolution of the SLH domain? Mol. Microbiol. 20:897-898, 1996. 19. Lindqvist, Y., Schneider, G. Circular permutations of natural protein sequences: Structural evidence. Current Opinion Struct. Biol. 7:422-427, 1997. 20. Luger, K., Hommel, U., Herold, M., Hofsteenge, J., Kirschner, K. Correct folding of circularly permuted variants of a ba barrel enzyme in vivo. Science 243:206-210, 1989. 21. Heinemann, U., Hahn, M. Circular permutation of polypeptide chains: Implications for protein folding and stability. Prog. Biophys. Mol. Biol. 64:121-143, 1996. 22. Vignais, M.-L., Corbier, C., Mulliert, G., Branlant, C., Branlant, G. Circular permutation within the coenzyme binding domain of the tetrameric glyceraldehyde-3-phosphate dehydrogenase from Bacillus stearothermophilus. Protein Sci. 4:994-1000, 1995. 23. Koebnik, R., Krämer, L. Membrane assembly of circularly permuted variants of the E. coli outer membrane protein OmpA. J. Mol. Biol. 250:617-626, 1995. 24. Viguera, A.R., Serrano, L., Willmanns, M. Different folding transitions may result in the same native structure. Nature Struct. Biol. 3:874-880, 1996. 25. Horton, R.M., Hunt, H.D., Ho, S.N., Pullen, J.K., Pease, L.R. Engineering hybrid genes without the use of restric- CIRCULARLY PERMUTED b-GLUCANASES 26. 27. 28. 29. 30. 31. 32. 33. 34. 35. 36. 37. 38. tion enzymes: Gene splicing by overlap extension. Gene 77: 61-68, 1989. Jancarik, J., Kim, S.-H. Sparse matrix sampling: A screening method for crystallization of proteins. J. Appl. Crystallogr. 24:409-411, 1991. Otwinowski, Z. DENZO: An Oscillation Data Processing Program for Macromolecular Crystallography. New Haven, CT: Yale Univ. Press. 1993. Collaborative Computational Project, Number 4. The CCP4 suite: Programs for protein crystallography. Acta Crystallogr. D50:760-763, 1994. Matthews, B.W. Solvent content of protein crystals. J. Mol. Biol. 33:491-497, 1968. Navaza, J. AMoRe: an automated package for molecular replacement. Acta Crystallogr. A50:157-163, 1994. Jones, T.A., Zou, J.-Y., Cowan, S.W., Kjeldgaard, M. Improved methods for building protein models in electron density maps and the location of errors in these models. Acta Crystallogr. A47:110-119, 1991. Bruenger, A.T., Krukowski, A., Erickson, J.W. Slow-cooling protocols for crystallographic refinement by simulated annealing. Acta Crystallogr. A46:585-593, 1990. Murshudov, G.N., Vagin, A.A., Dodson, E.J. Refinement of macromolecular structures by the maximum-likelihood method. Acta Crystallogr. D53:240-255, 1997. Brünger, A.T. Free R-value: A novel statistical quantity for assessing the accuracy of crystal structures. Nature 355: 472-475, 1992. Hutchinson, E.G., Thornton, J.M. HERA - a program to draw schematic diagrams of protein secondary structures. Proteins 8:203-212, 1990. Kabsch, W., Sander, C. Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22:2577-2637, 1983. Kraulis, J.P. MOLSCRIPT: A program to produce both detailed and schematic plots of protein structure. J. Appl. Crystallogr. 24:946-950, 1991. Laskowski, R.A., MacArthur, M.W., Moss, D.S., Thornton, J.M. PROCHECK: A program to check the stereochemical 39. 40. 41. 42. 43. 44. 45. 46. 47. 48. 49. 50. 167 quality of protein structures. J. Appl. Crystallogr. 26:283291, 1993. Engh, R.A., Huber, R. Accurate bond and angle parameters for X-ray protein structure refinement. Acta Crystallogr. A47:392-400, 1991. Luzzati, V. Traitement statistique des erreurs dans la determination des structures cristallines. Acta Crystallogr. 5:802-810, 1952. Evans, S.V. SETOR: Hardware-lighted three-dimensional solid model representations of macromolecules. J. Mol. Graphics 11:134-138, 1993. Kabsch, W. A solution for the best rotation to relate two sets of vectors. Acta Crystallogr. A32:922-923, 1976. Buchwalder, A., Szadkowski, H., Kirschner, K. A fully active variant of dihydrofolate reductase with a circularly permuted sequence. Biochemistry 31:1621-1630, 1992. Graf, R., Schachman, H.K. Random circular permutation of genes and expressed polypeptide chains: Application of the method to the catalytic chains of aspartate transcarbamoylase. Proc. Natl. Acad. Sci. USA 93:11591-11596, 1996. Malet, C., Jiménez-Barbero, J., Bernabé, M., Brosa, C., Planas, A. Stereochemical course and structure of the products of the enzymic action of endo-1,3-1,4-b-D-glucan 4-glucanohydrolase from Bacillus licheniformis. Biochem. J. 296:753-758, 1993. Welfle, K., Misselwitz, R., Welfle, H., Simon, O., Politz, O., Borriss, R. Microcalorimetric determination of the thermostability of three hybrid (1-3,1-4)-b-glucanases. J. Biomol. Struct. Dyn. 11:1417-1424, 1994. Chantrenne, H. The Biosynthesis of Proteins. New York: Pergamon,1961, pp. 92-125. Shortle, D., Wang, Y., Gillespie, J.R., Wrabl, J.O. Protein folding for realists: A timeless phenomenon. Protein Sci. 5:991-1000, 1996. Alexandrov, N. Structural argument for N-terminal initiation of protein folding. Protein Sci. 2:1989-1992, 1993. Schulz, G.E. A critical evaluation of methods for prediction of protein secondary structures. Annu. Rev. Biophys. Bioeng. 12:183-210, 1988.