Download Glucose 6-Phosphate Dehydrogenase Mutations

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Lactoylglutathione lyase wikipedia , lookup

Isomerase wikipedia , lookup

Enzyme kinetics wikipedia , lookup

Alcohol dehydrogenase wikipedia , lookup

Transcript
From www.bloodjournal.org by guest on April 30, 2017. For personal use only.
Glucose 6-Phosphate Dehydrogenase Mutations CausingEnzyme Deficiency
in a Model of the Tertiary Structure of the Human Enzyme
By
C.E.Naylor, P. Rowland, A.K. Basak, S. Gover, P.J. Mason, J.M. Bautista, T.J. Vulliamy, L. Luzzatto,
and M.J. Adams
Human glucose 6-phosphate dehydrogenase (G6PD) has a
particularly large number of variants resulting from
point
mutations; some 60
mutations have been sequenced
to date.
Many variants, some polymorphic, are associatedwith enzyme deficiency. Certain variants have severe clinical manifestations; for such variants, the mutant enzyme almost always displaysareduced
thermal stability. A homology
model of human GGPD has been built, based on the threedimensional structure
of the enzyme from Leuconostoc mes-
enteroides. The model has suggested structural reasons for
the diminished enzymes t a b i l i and hence for deficiency.It
has shown that a cluster of mutations in exon 10, resulting
in severeclinicalsymptoms,occurs at or near the dimer
interface of the enzyme, that the eight-residue deletion in
the variant Nara isat a surface loop, and
that thetwo mutations in the A- variant areclose together in the three-dimensional structure.
0 1996 by The American Societyof Hematology.
G
both polymorphic and sporadic variants there is always some
residual enzyme activity and this is invariably lower in RBCs
than in other cells. The lack of null variants and thelow
activity in RBCs (which cannot make up for enzyme breakdown through de novo protein synthesis) implies that instability of mutant G6PD molecules is probablythe commonest
cause of G6PD deficiency. This has been confirmed in several cases by appropriate biochemical analysis.
Following the determination of the primary structure of
human G6PD through cloning of the corresponding cDNA,’
the molecular lesions of more than 60 G6PD variants have
been dete~mined~.~;
nearly all are missense point mutations
causing single amino acid replacements. These mutations,
with differing clinical manifestations, vary in their effect on
residual enzyme activityand on substrate binding; almost
all cause a decrease in enzyme stability. Figure 1 shows the
distribution of severe (class I) mutations with respect to the
primary structure of the protein; notable is a cluster of severe
mutations between residues 380 and 450. This analysis has
proved most valuable for diagnosis and to define which mutations account for G6PD deficiency in various populations.
However, it has been less successful in defining the role of
different domains and of individual amino acid residues in
the stability of the protein, andin explaining why some
mutants are almost asymptomatic whereas others cause severe CNSHA.
Some of us have recently solved and given
a detailed description of the three-dimensional structure of G6PD from
Leuconostoc mesenteroides.‘ The protein from this species has significant homology to human G6PD, and residues shown to be
important for activity are conserved. Thus, we can use knowledge of the structure of the bacterial enzyme to model normal
human G6PD B.The dimer is shown tobe the smallest molecular form of G6PD that is active in the human enzyme’ as in
all species8 The crystal structure of the L mesenteroides apoenzyme hasa dimer in the asymmetric unit. The
two monomers
are related by an axis which is very close to
a dyad and it is
anticipated that the subunit interface is retained in the human
enzyme. Our homology model allows us to predict, with some
confidence, the position and role of both normal and mutated
residuesinthetertiary
structure ofhuman G6PD,andthus
answer some of these questions.
LUCOSE-6-PHOSPHATE dehydrogenase (G6PD) is a
housekeeping enzyme that catalyzes the first and ratelimiting step in the pentose phosphate pathway. Its key role
in metabolism is to provide reducing power in the form of
NADPH. This role is particularly important in redblood
cells (RBCs) where NADPH is required for detoxification
of hydrogen peroxide and other compounds, via glutathione.
In these cells the reaction catalyzed by G6PD (and by 6phosphogluconate dehydrogenase, which depends on G6PD)
is the only source of NADPH. Human G6PD, a homodimer
encoded by a gene which maps to the region Xq28 on the
X-chromosome, exhibits an extraordinary degree of genetic
variability: more than 100 deficient variants have been reported and characterized.’ These variants are either true polymorphisms, with relatively mild clinical manifestations and
variant alleles reaching frequencies of 1% to 50% in various
parts of the world, or sporadic variants, brought to light
because they cause chronic nonspherocytic hemolytic anemia (CNSHA) in affected males. The former group of G6PDdeficient variants have most likely reached polymorphic frequencies because deficient individuals are protected to some
extent against severe Plasmodium fakiparum malaria.’ In
From the Laboratory of Molecular Biophysics, Universityof Oxford; the Department of Haematology, Royal Postgraduate Medical
School, Hammersmith Hospital, London, UK; Departamento de Binquimica y Biologia Molecular IV, Universidad Complutense de Madrid, Facultad de Vetinaria, Madrid, Spain; and the Department of
Human Genetics,Memorial-Sloan Kettering CancerCenter, New
York, NY.
Submitted March 27, 1995; accepted November 9, 1995.
Supported in part by a programme grant from the MRC (UK) to
L. L. and P.J.M. C.E.N. has a Wellcome Prize Studentship, S.G. was
funded by the Oxford Centre for Molecular Sciences and P.R. was
supported by a Science and Engineering Research Council studentship. M.J.A. is Dorothy Hodgkin-E.P.
Abraham Fellow of Somerville
College and an associate member of OCMS.
Address reprint requests toM.J.Adams,DPhil,Laboratoryof
Molecular Biophysics, University of Oxford, S Parks Rd, Oxford,
OX1 3QU, UK.
The publication costsof this article were defrayed in part by page
chargepayment. This article must therefore be hereby marked
“advertisement” in accordance with 18 U.S.C. section 1734 solely to
indicate this fact.
0 1996 by The American Society of Hematology.
0006-4971/96/8707-0028$3.00/0
2974
MATERIALS AND METHODS
The human model was generated
by homology modeling from
the known structure of L mesenteroides G6PD6; the 33% sequence
Blood, Vol 87,No 7 (April l),1996 pp 2974-2982
From www.bloodjournal.org by guest on April 30, 2017. For personal use only.
10
5
5
20
5
30
5
40
5
50
5
A
M A E Q V A L ~ R T H V C G I L R E E L F Q G D A F H Q S D T H I F I I ~ ' ~ ~ ~ AA SK K*? I Y p T T ' ' 9 W-L
V S E 1 K T L V T F F " G : G T
A K R K L Y . P S V F N"L
5
10
-beta
60
5
5
70
80
5
5
90
5
alpha a-------
20
"""_
A----
100
5
5
P
A T P E E K L K L E D F F A R N S
K. Y V
F R D G L % P E N T F I ! C G Y A W S R L T V A D I R K Q S E P F F K
Y K K G Y L Q K H F A I V G ~ T A ~ R ~ Q A L N D D E F K Q L V R D S I K D F T D D Q A Q A AE F I E H F S Y R
5
70
5
80
30
5
40
5
50
5
60
"-
B""
"-beta
110
5
""""-alpha
120
b-------------
130
5
5
alpha
b.---
"
"
140
150
5
-beta C
160
5
d
A G Q Y D D A A S - Y Q R L N S H M N A L H L G S Q
A N R L F m L A L P @ T V - Y E A V T K N I H E S C M
S
A H D V T D A A S Y A V L K E A I E E A A D K F D I D G N R I F ' Y i M S V A r P , R F F G T I A K Y L K S E G L L A
5
-
5
5
280
5
100
90
5
----------alpha c
""""-
110
290
5
120
5
130
"----alpha d-----"""-
5
-beta D--
300
5
5
320
A A A A
F
K
A
~ K E ~ S A
T N S D D V Y D E
~ K V
v L K c I S Ev Q A N N v
LAIM:EK:P~ESFTDKDIR;AA!~:~NAAFNALKIYDEAEVNKYFVRAQY
v
310
5
5
250
5
260
5
330
270
5
340
5
5
V L G Q Y V G N P D G E G E A T K G
G A G
D S A D F K P
280
5
5
350
290
360
5
5
A"
K
5
370
&y
E R K A
Q * y
H D
v
F Y V ' R S - G ' K . ' R L A A K Q T R V D I V F K A G
340
5
--beta I---
5
390
5
400
5
410
H
C
R E C
H* 4
K
I
D
A G D I F H
QQCKRNELV"I:;RV,Q&NEA?$YTKMMTmKPGMF
T F N
F G S E Q E A Q E A V L S I I I D 5 P i K G A I E L K L N A ? f S V E
380
5
360
430
5
,;.
440
5
370
"-beta
5
L"--
_""
5
450
P".E
alpha m
"
"
"
5
490
5
380
5
beta M------
5
460
5
350
beta K----
420
5
K
F N P E E S E ' : E , D L T Y G N R Y
D A F N T R T I D L G W T V S D E
390
"_ 5
400
beta N------
470
5
5
5
alpha
480
H
V R S D E L R E A ~ R I F T P - ' L L H Q I E L E K P P P I P ~ I
A D W N G V S I A : - W ~ K F V D A ~ I S A V Y T A D K A P L E T : ! ~ K
K N V K L P D A m E R ' * L " T L'.'D V F C* S Q M H
D K K N T P E PZr!E R M f H D T M N;&D G S N
410
5
420
5
430
1
"
5
"-"
--beta J---
-------
500
5
5
"""""""-alpha
440
n
""
-
5
450
5
460
beta0
510
V
Y B S R G R T E A D E L M K R V G F Q Y E G T Y K W
V N P H K L
S:G.S M G'P:E A S D K L L A A N
G
D A W V F K G
5
470
5
480
485
alpha 0"-
_""
*Fully conserved residues
.-Highlyconserved residues
A = deletion
A- mutations in lower case
Fig 1. Sequence alignment of human and L mesenteroides GGPD. The alignment was used for model building of human GGPD. Residues
fully or highly conserved m e r 14 sequencesare indicated in different shades of gray.
The secondary structural elements for the L mesenteroides
enzyme are shown. The locations of amino acid substitutions found in severe (class I) variants are indicated by capital letters above the
alignment. Note the cluster found around amino acid 400.
From www.bloodjournal.org by guest on April 30, 2017. For personal use only.
NAYLOR
2976
ET AL
p + a domain
Fig 2.
6
Fig 4.
Fig 5.
Fig 6.
identity is sufficient to justify the assumption that it has a similar
fold. A multiple sequence alignment of 14 currently known G6PD
sequences (human,' L mesenteroide~,~
Synechococcus PCC 7942,"
Zymomonas mobilis," Saccharomyces cerevisiae," Drosophila melanogasrer.I3 rat,I4 mouse," Erwinia chrysanrhemi m. Hugouvieux-
Cotte-Pattat, J. Robert-Baudouy, P R database,
entry
S370531,
Esherichia coli,16Kluyveromyces marxianus lactis [M. WesolowskiLouvel, C. Tanguy-Rougeau, H. Fukuhau,PIRdatabase,Ihaentry
S 3 13371, Plasmodium falciparum," Pichia jadinii," Bacillus SP
HT-3H. [H Sagai, K Hattori, M Takahashi, US Patent No. 5,137,821,
From www.bloodjournal.org by guest on April 30, 2017. For personal use only.
A 3D MODEL
HUMAN
OF DEFICIENT
IN
MUTANTS
GGPD
2977
Fig 2. Model of the human dimer.In this figure subunit P (equivalentto subunit A in published illustrations of the dimer of L mesenteroides
G6PD') is shown in red, green, and yellow, andQ is in pale blue, pink, andwhite. The monomer consistsof two domains-a smaller coenzyme
domain encompassing residues 1-198 and a larger
p + a domain comprising residues 199-515. The requenca GASGDLA (residues
38-44)is at
the coenzyme binding site (arrow). The G6P binding site includes residues from the perfectly conserved 9-amino acid sequence RIDHYLGKE
(198-206). Three adjacent strands
of the p-sheet of the p + a domain (residues 380-425) arein the area of the dimer interface. Theorientation
used for this and other figures is shown schematically at upper right as a guide for the reader. Figures 2,4, 5, and 6 were drawn using the
program BOBSCRIPT, an enhancedversion of MOLSCRIPT."
Fig 4. Serious mutations at the dimer interface. Amagnification of the dimer interface region of the model. Subunit P is inred, green, and
yellow, Q in pale blue, pink, and
white. The C a carbons of mutations causing seriously deficient (class
I)variants areindicated by black spheres
in subunit P and grey spheresin 0. All variants are labeled andthe region of the dimer enlarged isshown at upper right. The class I variant
residues form two symmetry-relatedclusters at the dimer interface.The cluster at the lower right hand cornerof the diagram involves rasidues
from subunit P 213Val --t Leu (Minnesota=), 216Phe
Leu (HarilaouZ7)(not visible), 274GIu
Lys (Corum"), 278Ser
Phe YWexham'"),
393Arg + His (NashvillP), 394Val- Lys (Alhambra"), 396Pro -,Leu (Bari") and 398Glu
Lys (Puerto Limonu); andfrom subunit 0: 385Cys
+ Arg (Tomaha), 386Lys + Glu (Iowa=), 387Arg --t Cys (Guadalajara3') 387Arg
His (Beverly Hills?, 405Met -t Ile (Clinic?, 410Gly --t Cys
(Riversidea); 41OGly + Asp (Japan3') and 416Glu
Lys (Tokyo3*). The other cluster contains the same residues in the alternative subunit.
Further from the cluster are 363Asn + Lys (Loma Lindaz6), and 439Arg Pro (Pawnee3'). These residues interact across the dimer interfaca.
generating the major interaction energy between the two subunits. Mutations disrupt and weaken the dimer-dimer interactions, leading to
destabilization. Highlighted in the diagram is Lys 386, originally predicted as a residueinteracting with NADP+.
-
--
+
+
+
+
Fig 5. The aight-residue deletion situated in a flexible loop. A magnificationof the region of the eight-residue deletion in the class I variant,
Nara?5which is predicted to be missing residues 318-325. Thisis the only variant identified so far that has more than two residues deleted.
The first and last deletad residues are indicated by black spheres at their C a positions. The chain trace highlighted in red in (A) shows the
modeled conformation of the normal enzyme a t this point. The blue chain trace in (B) shows a suggested alternative conformation after
deletion of residues318-325. This alternative conformationwas createdby removingresidues 318to 325 and then modelinga new conformation
using 0; surrounding residues were altered as little as possible. No energy minimization was performed. lt is possible to model the deletion
without affecting any secondary structural elements or the active site of the protein.
Fig 6. The region including 68Val- Met and 126Asn --t Asp of the A- deficient variant. A magnification of helix ac, and the @sheet of the
coenzyme binding domain. The residues mutated in the variant A- are shown in ball and stick (grey, carbon; blue, nitrogen; red, oxygen;
yellow, sulphur) to indicate their proximity. The left-hand diagram showsthe residues in the common G6PD B (Val 68 and Asn 126). The
righthand diagram shows those in G6PD A- (Met 68 and Ay, 126). Val 68, on pB, is also a valine in L mesenteroides G6PD; the sheet strand is
immediately followed by the conserved Arg 72 and the position of the sidechain of Val68 can thus be modeled with confidence. The common
African variant A (126Asn Asp) is notdeficient nor is the engineered mutant 68Val -t Met. The normal behavior of the mutant 68Val- Met
allows us to assume that pB remains unchanged andthat the sidechain of Met 68 will be on the same side of the sheet as is Val 68. In the
model, the sidechains of Val 68 and Met 125 are in direct contact; direct contact between 68 and 126 may be achieved by a small rotation of
a c on itsaxis.
<
+
19921)wasgenerated,usingAlignment
of MultipleProtein Sequences (AMPS).I9The resulting alignmentis shown for the human
and L mesenteroides enzymes (Fig I). The secondary structural elements for the bacterial enzyme, also given in Fig 1, were identified
from its refinedthree-dimensionalstructureusingtheprogram
DSSP'" as is described in Rowland et al.6 This alignment was used
withtheprogramMUTATE
(R. Esnouf, D. Phil.thesis,Oxford
University, 1992) toexchangethesidechains
of L mesenteroides
G6PD with those in the human enzyme and place each altered sidechain as its most common rotomer; at this stage insertions were not
modeled and residues in deletions were replaced with dummy residues (Ala).
Eight insertions totaling 16 residues were modeled into the structure, and six deletions totaling 14 amino acids were removed, using
the molecular graphics package 0." For each insertion or deletion,
a plausible main chain conformation was chosen that avoided strain
andbad contacts with the surrounding protein. Any severe clashes
between sidechains were remedied on inspectionof the protein. The
first 26 residues of the human protein were not modeled; they form
an amino-terminal tail of unknown structure present in all the mammalianG6PDssequenced so far,butnotpresentinG6PDsfrom
other genera, including L mesenteroides. The final three residues of
human G6PD were also not modeled.
Energyminimizationwasperformedonthemodelusing
XPLOR." The protocol constrained all main chain atoms not in insertions or deletions, and consisted of two stages: an initial minimization with a monomer model, followed by a second using the dimer.
Totalenergydecreasedfrom
mol".
265,116 kcalmol"to
-7,649 kcal
RESULTS
The use of 14 sequences and a multiple alignment
algoof the
rithm such as that in AMPS enhances the reliability
alignment over that from a pairwise comparison of the two
sequences of interest. The method uses a matrixof pairwise
identities and optimizes the conservation between all pairs
of sequences; thus, it can take advantage of subgroups with
higher conservation in comparing two more distantly related
sequences. Although the human sequence has
33.1% identity
with that fromL mesenteroides, uncertainties can be resolved
by using, for instance, the 48.1% identity of human G6PD
withthat of S cerevisiae, which itself has 35.7% identity
with L mesenteroides G6PD. ThealignmentinFig
1 was
achievedwithoutdisruption of secondarystructure except
for a shortening of helices a b and of a k by one turn and a
lengthening of ab' to the same extent. All three helices are
attheextreme edges of thedimer.The multiple sequence
alignmenthasresultedinseveralregionswherethereare
one or two residue insertions and deletions, usually loops.
in
The position of these discontinuities results from optimizing
sequencesimilaritiesover the completeset of sequences;
From www.bloodjournal.org by guest on April 30, 2017. For personal use only.
2978
NAYLOR ET AL
I
coenzyme
I
H HH H H H HHH
H H HH
A a B b b ’ C c D d E e F Y ’ f g C h i
jkH
H
H
H
H
H
H
H
H
B+*
I J K
H H H
HH H H
LMNlm n O o
HHH
H
HELIX
SHEEI
20.0
%
17.5
15.0
3
.-
e
12.5-
v)
10.0U
a
7.55.0-
2.50.Q
Sequence Number
one-to-one correspondence of residues cannot be assumed
in these regions. Most insertions and deletions arise from
preserving the alignment of the L mesenteroides sequence
with those of other prokaryotes and the human sequence
with the eukaryotic sequences. Typical examples are the loop
between PK and @L, whose length varies from 14 to 19
residues, and the insertioddeletion at ab-ab‘ where the exact
position of the comer between the two external helices may
vary between the two groups of structures. After residue 497
there is considerable variation between sequences and the
alignment is rather arbitrary. Although the PM-PN loop is
lengthened by one residue in two of the bacterial sequences
andby three residues in that of Z mobilis, the structural
elements which make up the dimer surface, af, ag, PMand
PN, are not subject to insertions or deletions.
The model shows the extended nature of the dimer (Fig
2) with “P + a” domains forming the dimer interface and
“coenzyme” domains distant from the dyad axis. The following description of the human enzyme is based on the
detailed description given of L rnesenteroides G6PD.6 The
domain structure as defined for the bacterial enzyme is retained; the domain boundary is at the end of sheet strand
@F.This is at the terminus of a standard dinucleotide binding
foldz3and is in a region of high sequence conservation. The
N-terminal dinucleotide binding domain comprises residues
27-200 in the human enzyme (1-177 of the L mesenteroides
enzyme). The conserved NAD(P) fingerprint forms a tight
turn that begins at Gly 38; its position is indicated in the
figure. We have shown this to be the NADP+ binding site
in L mesenteroides G6PD (C.E.N., unpublished results, November 1994). The coenzyme specificity is ensured by the
Fig 3. Schematic
showing
residuesinvolvedin the dimer
interface.ThedHerencein
the
mean accessible surface area of
the atoms in each residuein the
dimer from their accessible surface area in the putative monomer. The dimer surface consists
of the residues with changed
solvent accessibility. Secondary
structure elements are shown.
presence of the conserved Arg 72, which binds the 2”phosphate of NADP. The active site can be identified at the
domain boundary and includes residues from a nonapeptide
(198-206), which is conserved in the 14 G6PD species considered, as well as residues distant in the primary structure.
His 201, whichis predicted to interact with the substrate
(G6P), is 17 A from Gly 38, the neighbor of the adenine
ribose of NADP+.
The dimer interface of human G6PD is formed by association of the sheets and two helices in the second (p + a )
domain of each subunit to form a barrel. The residues involved in the dimer surface (defined as those which contain
atoms which are less accessible to solvent in the dimer than
in the putative monomer) are shown schematically in Fig 3;
they are primarily from af,ag, PM, PN, and loops connecting these elements (see Fig 2). The two active sites (His 201
of the G6P binding sites) in the dimer are more than 50 A
apart, separated by the large, predominantly antiparallel
sheets of the dimer interface which together form a halfbarrel.
Some 29 class I variants have been considered; they comprise two deletions, of 2 (StonybrookZ4)and 8 (Nara”) residues, and 26 point mutations. The mutated positions are
shown in Fig l. Both deletions occur in surface loops distant
from the dimer axis; they are close to each other. Seventeen
of the remaining point mutations (20 variants) are in or close
to the dimer surface (Table 1). The variant residue is defined
as being a part of the surface if one of the atoms in the
corresponding residue in G6PD B has a changed solvent
accessibility as discussed above. The class I residues involved in the dimer surface are 2 13 (Minnesotaz6),2 16 (Hari-
From www.bloodjournal.org by guest on April 30, 2017. For personal use only.
MODEL
A 3D
HUMAN
OF DEFICIENT
IN
MUTANTS
G6PD
2979
Table 1. Class I Mutations Close to the Dimer Surface
Variant
Minnesota
Harilaou
Corum
278 “Wexham”
Tomah
386
Iowa
Guadalajara
Beverley Hills
Nashville
394 Alhambra
Bari
Puerto
398
Limon
Clinic
Riverside
Japan
Tokyo
Pawnee
Kobe
Santiago de Cuba
--
Mutation
Site
213 Val Leu
216Phe Leu
274 Glu Lys
Ser Phe
385 Cys Arg
Lys Glu In
387 Arg Cys
387 Arg His
393 Arg -t His
Val Leu
BL
396 Pro Leu
Glu Lys
405 Met Ile
410 Gly Cys In
410 Gly -t Asp
416 Glu Lys
439 Arg + Pro
440 Leu Phe
447 Gly Arg
-+
-
-+
-+
Secondary
Structure
Element Active
Distance From
Dimer
Distance From
Interface
Reference
Subunit Involved
in Patch
In surface
In surface
6.7 A
In surface
1.4 A
surface
2.2 A
>l0 A
>l0 A
>l0 A
>l0 A
>l0 A
>l0 A
>l0 A
P
P
P
P
>l0
>l0
>l0
>l0
A
A
A
A
>IO A
P
P
P
P
PM--PN
4.2 A
5.2 A
8.6 8,
4.1 A
In surface
surface
>l0 A
Q
Q
BN
am
am
End a m
In surface
In surface
4.8 A
3.2 A
>l0 A
>l0 A
5.9 A
>l0 A
Q
P
P
P
af
af
ai-aj
ai--aj
PK--PL
BK--BL
PK--PL
Q
Q
Q
-+
PL
-+
-
-+
-+
-+
-+
-
-+
BL--BM
DL--PM
BM
laou2’), 278 (“Wexham”28),386 (Iowa29),405 (Clinic3’),
410 (Riverside29 and Japan3’), 416 (Tokyo32), and 439
(Pawnee3’); of those close to the surface, residues 385 (Tomah29),387 (Guadalajara3’),393 (Nashville26),394 (Alhambra3’), 398 (Puerto Limon33),440 (Kobe”) and 447 (Santiago
de Cuba3’) have an atom less than 5 A from one with changed
accessibility.
DISCUSSION
Based on this model of the human G6PD dimer, we have
attempted to rationalize the previously reported pattern of
naturally occurring mutations and to analyze some specific
examples. The model of the human enzyme has allowed
us to locate the different variants on particular secondary
structural elements and to see their spatial relationships to
the coenzyme binding site, the active site and the dimer
interface. Almost all residues seen to be important in binding
NADP in L mesenteroides G6PD, and those predicted to
bind substrate and to promote catalysis, have been conserved
in the human enzyme. None of these residues has been mutated in any of the variants described so far. Coenzyme and
substrate binding have been modeled by Rowland et a16; the
three-dimensional structure of a binary complex of the L
mesenteroides enzyme confirms the features of NADP binding (C.E.N., unpublished results, November 1994) and this
will not be elaborated further here.
Regions of the surface of the dimer that may be of importance for enzyme stability may be recognized in that class I
variants cluster in such an area. It is observations of this
kind which we will now address in this report; it is not our
intention to use our homology model to answer detailed
questions concerning the particular contacts made between
variant residues and their neighbors in the three-dimensional
structure. It is clear that a detailed analysis of the different
26
27
28
28
29
29
31
29
26
31
40
33
30
29
31
32
31
34
35
sidechain-sidechain interactions that arise in each variant
and their consequences in terms of the stability and activity
of the human G6PD molecule must await the three-dimensional structure determination of both human G6PD B and
of the different variants. We will confine our detailed discussion to three particular examples.
Class I variants: Subunit contacts. A striking previous
observation has been that a set of severely deficient variants
associated with CNSHA is clustered in exons 10 and 1l,29
and it has been suggested that they correspond to the NADP+
binding site. It is now clear that the binding site is elsewhere,
and this cluster corresponds in fact to the subunit interface.
These variants are located close to the ends of the sheet
strands of the P + a domain; they form two surfaces, each
shared by both subunits (Fig 4). The surfaces also include
some severely deficient mutations mapping outside exons 10
and 11. The fold brings residues from different regions of
primary structure in the same monomer close together in
space: residues 213 and 216 on a f are a part of the same
surface patch as 393, 394, 396, and 398 on PL and in the
PL-PM turn. Similarly, the dyad axis brings residue 405, at
the end of BM, and 410, in the PM-PN turn, of the second
subunit into the same surface patch. It can be seen that
residues at the two ends of PL contribute to different patches;
it is the residues that precede @Lof the second subunit, 385,
386, and 387, which are in the same cluster as 393, 394,
396, and 398 of the first subunit (Fig 4). It should be pointed
out that, of the class I residues close to or in the dimer
surface, only residue 440 is within 10 A of the modeled
substrate site of its own subunit. None of the residues is
within 10 A of either coenzyme site or of the substrate site
in the second subunit. The large distance between the two
active sites of the dimer has already been noted.
Mutations affect both hydrophobic and charge-charge in-
From www.bloodjournal.org by guest on April 30, 2017. For personal use only.
2980
NAYLOR ET AL
teractions. Salt bridges may be broken: for instance in the
variants G6PD Corum,28274Glu ”+ Lys, and G6PD Iowa,29
386Lys Glu. Glu 274 makes a salt bridge to Arg 348 and
Lys 386 is a neighbor of Glu 417; G6PD Iowa is described
as having greatly decreased thermal stability (E. Beutler, L.
Forman, P.A. Alarcon, unpublished results, 1986). The effect
of NADP+ and NADPH in changing the equilibrium between
monomers, dimers, and tetramed6 together with the possibility of reactivating mutants such as G6PD Iowa in high
concentrations of NADP+ has led to the proposal of Lys 386
as an NADP+ binding re~idue.’~
However, it is 36 W from
His 201, thought to be the site of substrate binding and is
even more distant from the known coenzyme site of the L
rnesenteroides enzyme, and so is unlikely to bind catalytic
NADP+. The rate-limiting dimerization stage in the reactivation of urea denatured L rnesenteroides G6PD has been
shown to be enhanced by addition of coen~yme.’~
Stimulation of these processes by ligands is likely to arise by their
stabilizing an important intermediate (or the end product) in
the folding pathway by forming a binary complex; for L
mesenteroides G6PD an intermediate is indicated. The position of Lys 386 in the model for the human enzyme, in the
dimer interface and distant from the proposed NADP+ site,
would suggest a similar mechanism.
Hydrophobic contact surfaces at the interface are decreased in some variants: for instance, in G6PD Harila~u,’~
216Phe + Leu. Two variants, namely G6PD Clinic,’”
405Met ”* Ile, and G6PD
213Val +Leu, disrupt
the same hydrophobic contact, which involves residues from
exon 7 as well as exon 10. The sidechains of these residues
on different monomers make contact across the dimer interface. The large number of class I variants in this part of the
dimer contact area suggests that the contribution of these
residues to inter-subunit contacts is important for the integrity of the dimer andthat this region of the interface is
particularly sensitive to change and is crucial to stability of
the enzyme.
Class I variant with an eight-residue deletion. Only one
deletion of more than two residues has been described thus
far. In G6PD Nara,” associated with CNSHA, residues 3 18325 are deleted. Some of these residues are part of a surface
loop that was found to be highly mobile in L rnesenteroides
G6PD. There are insertions and deletions in the sequence
alignment in this region (Fig l). The deletion in the human
enzyme at 304 is secure and is a prokaryote/eukaryote difference; the alignment of pH, which is well conserved, generally preserves the amphiphilic character and has equivalent
small and large residues. The residues of the loop between
3 1 1 and 3 17 need notbe equivalent in the three-dimensional
structure, but equivalence is certainly achieved by Tyr 322.
In the crystal structure of the bacterial enzyme, the residues
that are aligned with those in the deletion immediately follow
the only loop for which sidechain conformations are not
clearly identifiable. The loop has the highest temperature
factors seen in the L mesenteroides G6PD structure6; the
average main-chain temperature factor f?r the residues 292297 (312-317 in human G6PD) is 64.5 A? whereas that for
the whole protein is 33.7 A.’ The great flexibility of this
+
loop in G6PD from L mesenteroides suggests that a loop of
different length and conformation as proposed for the normal
human enzyme is readily accommodated (Fig 5A). Removal
of the eight residues 3 18-325, including the totally conserved
Tyr 322, can be accommodated by a local rearrangement of
the main chain (Fig 5B). A direct connection between residues 317 and326canbe
made without disruption of the
structure butat the expense of exposing otherwise buried
residues. The major loss of activity in the human enzyme is
likely to arise from deletion of Tyr 322, which shields these
hydrophobic residues, Val 309 and Gly 310, from solvent.
This would explain why G6PD Nara still has residual activity
but is highly unstable.
Less deleterious variants: G6PD A-. Not all sequenced
variants have been classified in terms of activity. As well as
the class I variants, considered in detail above, we have
considered the locations of those 15 of the least deleterious
variants that have known single-residue mutations, retain at
least 10% residual activity and are not associated with
CNSHA. Withonly a model of the human enzyme, we
should not seek explanations for the ways in which thestructure may alter to accommodate each of these changes. Nonetheless we havenotedthat there is a strong tendency for
these variant residues to be accessible, on external helices
or loops. Only one variant, 285Arg -+ His, involves mutation
ofan inaccessible residue that is well conserved between
species, and our modelwill accommodate a histidine. In
contrast, of the class I variant residues, 12 have verylow
accessibility and 2 of these and 5 others are totally conserved.
Among the less deleterious variants, G6PD A-, the common African deficient variant contains two mutations,
68Val+ Met and 126Asn + Asp relative to G6PD B.?*The
126Asn Asp mutation is found on its own in the nondeficient variant G6PD A.By contrast, the mutation 68Val
Met, which is the commonest in G6PD A-, has not yet been
encountered on its own and when artificiallyengineered into
normal G6PD B it does not cause deficiency.” It has
been concluded that the two mutations 126Asn + Asp and
68Val+ Met act synergistically to cause enzyme deficiency.
This so far unique findingisnow justified by our threedimensional model: indeed, we find that these twy residues,
on OB and (YC, are spatially very close (some 8 A apart, as
illustrated inFig 6). In the absence of the experimentally
determined structure of the human enzyme, it is not reasonable to speculate further on the detailed differences in the
sidechain interactions of the residues intheA-variant.
However, we may surmise that the two substitutions interact
specifically with each other causing disruption in the protein
structure, possibly in the ,&sheet of the coenzyme domain.
To understand infiner detail what amino acid changes
may affect the dimer-tetramer equilibrium, and what weight
must be assigned to different changes that may contribute to
enzyme instability in the many known variants with differing
severity of G6PD deficiency, it will be necessary to solve
the three-dimensional structure of human G6PD. However,
the structure modeled here on the basis of the L rnesenteroides G6PD has already shown that residues in the active site
+
+
From www.bloodjournal.org by guest on April 30, 2017. For personal use only.
A 3 0 MODEL OF DEFICIENT MUTANTS IN HUMAN GGPD
have not been modified in known variants, that a weakened
dimer interaction is responsible for a large class of variants
which give rise to CNSHA, and that residues distant in the
primary structure and from different subunits are involved
in the same sensitive region of the dimer interface.
ACKNOWLEDGMENT
We are grateful to Prof L.N. Johnson for facilities and support
and to Robert Esnouf for use of his program BOBSCRIPT.
REFERENCES
1. Luzzatto L, Mehta A: Glucose-6-phosphate deficency, in
Scriver CR, Beaudet AL, Sly WS, Valle D (eds): The Metabolic
Basis of Inherited Disease (ed 7). NewYork, NY, McGraw-Hill,
1995, p 3367
2. Greene LS: G6PD deficiency as protection against falciparummalaria- An epidemiologic critique of population and experimental
studies. Yearbook Physi Anthropol 36:153, 1993
3. Persico MG, Viglietto G, Martini G, Toniolo D, Paonessa G,
Moscatelli C, Dono R, Vulliamy T, Luzzatto L, D’Urso M: Isolation
of human glucose 6-phosphate (G6PD) cDNA clones: Primary structure of the protein and unusual 5’ non-coding region. Nucleic Acids
Res 14:2511, 1986
4. Vulliamy TJ, Beutler E, Luzzatto L: Variants of glucose 6phosphate dehydrogenase are dueto mis-sense mutations spread
throughout the coding region of the gene. Hum Mutat 2:159, 1993
5. Hirono A, Miwa S: Human glucose-6-phosphate dehydrogenase: Structure and function of normal and variant enzymes. Haematologia (Budap) 25:85, 1993
6. Rowland P, Basak AK, Gover S, Levy HR, Adams MJ: The
3-dimensional structure of glucose 6-phosphate dehydrogenase from
Leuconostoc mesenreroides refined at 2.0 A resolution. Structure
2:1073, 1994
7. Cancedda R, Ogunmola G, Luzzatto L: Genetic variants of
human erythrocyte glucose 6-phosphate dehydrogenase. Discrete
conformational states stabilised by NADP+ and NADPH. Eur J
Biochem 34:199, 1973
8. Levy HR: Glucose 6-phosphate dehydrogenases. Adv Enzymol
Relat Areas Mol Biol 48:97, 1979
9. Lee W, Flynn TG, Lyons C, Levy HR: Cloning of the gene
and amino acid sequence for glucose 6-phosphate dehydrogenase
from Leuconostoc mesenteroides. J Biol Chem 266:13028, 1991
10. Scanlan DJ, Newman J, Sebaihia M, MannNH, Carr NG:
Cloning and sequence analysis of the glucose-6-phosphate dehydrogenase gene from the cyanobacterium Synechococcus PCC 7942.
Plant Mol Biol 19:877, 1992
11. Bamell W, Yi KC, Conway T: Sequence and genetic organisation of a Zymomonas mobilis gene cluster that encodes several
enzymes of glucose metabolism. J Bacteriol 172:7227, 1990
12. Nogae I, Johnston M: Isolation and characterization of the
ZWFI gene of Saccharomyces cerevisiae, encoding glucose 6-phosphate dehydrogenase. Gene 96:161, 1990
13. Fouts D, Ganguly R, Gutierrez AG, Lucchesi JC, Manning
JE: Nucleotide sequence of the Drosophila glucose 6-phosphate dehydrogenase gene and comparison withthe homologous human
gene. Gene 63:261, 1988
14. Ho Y-S, Howard AJ, Crap0 JD: Cloning and sequence of a
cDNA encoding rat glucose 6-phosphate dehydrogenase. Nucleic
Acids Res 16:7746, 1988
15. Toniolo D, Filippi M, Dono R, Lettieri T, Martini G: The
CpG island in the 5’ region of the G6PD gene of man and mouse.
Gene 102:197, 1991
2981
16. Rowley DL, Wolf RE Jr: Molecular characterization of the
Escherichia coli K-l2 zwf gene encoding glucose 6-phosphate dehydrogenase. J Bacteriol 173:968, 1991
16a. Sidman KE, George DE, Barker WC, Hunt L: The Protein
Identification Resource (PIR). Nucleic Acids Res 16(S):1869, 1988
17.O’Brien E, Kurdi-Haidar B, Wanachiwanawin W, Carvajal
J-L, Vulliamy TJ, Cappadoro M, Mason PJ, Luzzatto L: Cloning
of the glucose 6-phosphate dehydrogenase gene from Plasmodium
falciparum. Mol Biochem Parasitol 64:313, 1994
18. Jeffery J, Persson B, Wood I, Bergman T, Jeffery R, Jornvall
H: Glucose-6-phosphate dehydrogenase. Structure-function relationships and the Pichia jadinii enzyme Eur J Biochem 212:41, 1993
19. Barton GJ: Protein multiple sequence alignment and flexible
pattern matching. Methods Enzymol 183:403, 1990
20. Kabsch W, Sander C: Dictionary of protein secondary structure: Pattern recognition of hydrogen bonded and geometrical features. Biopolymers 22:2577, 1983
21. Jones TA, Kjeldgaard M: 0-The manual (Version 5.9). Uppsala University, Sweden, 1993
22. Briinger AT: X-PLOR. Version 3.1. A system for X-ray crystallography and NMR. New Haven, CT, Yale University Press, 1992
23. Rossmann MG, Liljas A, Brandtn CI, Banaszak W:Evolutionary and structural relationships among dehydrogenases, in Boyer
PD (ed): The Enzymes Vol XI (ed 3). New York, NY, Academic,
1975, p 61
24.Xu WM, Westwood B, Bartsocas CS, Malcorraazpiazu JJ,
Indrak K, Beutler E: Glucose-6-phosphate-dehydrogenasemutations
and haplotypes in various ethnic-groups. Blood 85257, 1995
25. Hirono A, Fujii H, Shima M, Miwa S: G6PD Nara-A new
class-l glucose-6-phosphate-dehydrogenasevariant with an 8 aminoacid deletion. Blood 82:3250, 1993
26. Beutler E, Kuhl W, Gelbart T, Forman W: DNA sequence
abnormalities ofhuman glucose-6-phosphate dehydrogenase variants. J Biol Chem 266:4145, 1991
27. Poggi V, Town M, Foulkes NS, Luzzatto L: Identification a
single base change in a newhuman mutant glucose-6-phosphate
dehydrogenase gene by polymerase-chain-amplificationof the entire
coding region from genomic DNA. Biochem J 271:157, 1990
28. Mason PJ, Sonati MF, MacDonald D, Lanza C, Busutil D,
Town M, Corcoran CM, Kaeda JS, Stevens D, Al-Ismail S, Altay
C, Hatton C, Lewis DS, McMullin MF, Meloni T, Paul B, Pippard
M, Prentice AG, Vulliamy TJ, Luzzatto L: New glucose-6-phosphate
dehydrogenase mutations associated with chronic anaemia. Blood
85:1377, 1995
29. Hirono A,Kuhl W, Gelbart T, Forman L, Fairbanks VF,
Beutler E: Identification of the binding domain for NADP of human
glucose-6-phosphate dehydrogenase. Proc NatlAcad
Sci USA
86:10015, 1989
30. Corrons JLV, Rovira A, Pujades A, Vulliamy T, Luzzatto
L: Molecular heterogeneity of glucose-6-phosphate-dehydrogenase
(G6PD) in Spain and identification of 2 new base substitutions in
the G6PD gene. Blood 84:551a, 1994 (abstr, suppl 1)
31. Beutler E, Westwood B, Prchal JT, Vaca CS, Bartocas CS,
Baronciani L: New glucose-6-phosphate dehydrogenase mutations
from various ethnic groups. Blood 80:255, 1992
32. Hirono A, Fujii H, Hirono K, Kanno H, Miwa S: Molecular
abnormality of a Japanese glucose-6-phosphate-dehydrogenasevariant (G6PD Tokyo) associated with chronic non-spherocytic hemolyticanemia. Hum Genet 88:347, 1992
33. Beutler E, Kuhl W, Saenz GF, Rodriguez W Mutation analysis of glucose-6-phosphate dehydrogenase (G6PD) variants in Costa
Rica. Hum Genet 87:462, 1991
34. Hirono A, Nakayama S, Fujii H, Miwa S: Molecular abnormality of a Japanese glucose-6-phosphate dehydrogenase variant
From www.bloodjournal.org by guest on April 30, 2017. For personal use only.
2982
NAYLOR ET AL
(G6PD Kobe) associated with a greatly increased affinity for galac38. Hirono A, Beutler E: Molecular cloning and cDNA sequence
tose-6-phosphate. Am J Haematol 45:185, 1994
for human glucose-6-phosphate dehydrogenase. Proc Natl Acad Sci
35. Vulliamy TJ, D’Urso M, Battistuzzi G, Estrada M, Foulks NS,
USA 85:3951, 1988
MartiniG,Calabro V, Poggi V, Giodono R, Town M, Luzzatto L,
39. Town M, Bautista JM, Mason PJ, Luzzatto L: Both mutations
Persico MG: Diverse point mutations in the human glucose 6-phosphatein G6PD A- are necessary to produce the G6PD deficient phenodehydrogenase gene cause enzyme defieciency and mild or severe he- type. Hum Mol Genet l :17l , 1992
molytic anaemia. Proc Natl Acad Sci USA 85:5171, 1988
40. Filosa S, Cai WW, Galanello R, Cao A, Demattia D, Schettini
36. Bonsignore A, Cancedda R, Nicolini A, Damiani G, De Flora
F, Martini G: A novel single-base mutation in the glucose-6-phosA: Metabolism of human erythrocyte glucose-6-phosphate dehydrophate-dehydrogenase gene is associated with chronic non-spherogenase-VI. Interconversion of multiple molecular forms. Arch Biocytic hemolytic-anemia. Hum Genet 94:560, 1994
chem Biophys 147:493, 1971
41. Kraulis PJ: MOLSCRIPT: A program toproduceboth de37. Haghighi B, Levy HR: Glucose-6-phosphate dehydrogenase
tailed and schematic plots of protein structures. J Appl Crystallogr
from Leuconostoc mesenteroides. Kinetics of reassociation and reac24:946, 1991
tivation from active subunits. Biochemistry 21:6429, 1982
From www.bloodjournal.org by guest on April 30, 2017. For personal use only.
1996 87: 2974-2982
Glucose 6-phosphate dehydrogenase mutations causing enzyme
deficiency in a model of the tertiary structure of the human enzyme
CE Naylor, P Rowland, AK Basak, S Gover, PJ Mason, JM Bautista, TJ Vulliamy, L Luzzatto and
MJ Adams
Updated information and services can be found at:
http://www.bloodjournal.org/content/87/7/2974.full.html
Articles on similar topics can be found in the following Blood collections
Information about reproducing this article in parts or in its entirety may be found online at:
http://www.bloodjournal.org/site/misc/rights.xhtml#repub_requests
Information about ordering reprints may be found online at:
http://www.bloodjournal.org/site/misc/rights.xhtml#reprints
Information about subscriptions and ASH membership may be found online at:
http://www.bloodjournal.org/site/subscriptions/index.xhtml
Blood (print ISSN 0006-4971, online ISSN 1528-0020), is published weekly by the American
Society of Hematology, 2021 L St, NW, Suite 900, Washington DC 20036.
Copyright 2011 by The American Society of Hematology; all rights reserved.