Download Flanking Mutated A and T Nucleotides Reverse Complementarity of

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
This information is current as
of June 16, 2017.
Hypermutation at A-T Base Pairs: The A
Nucleotide Replacement Spectrum Is Affected
by Adjacent Nucleotides and There Is No
Reverse Complementarity of Sequences
Flanking Mutated A and T Nucleotides
Jo Spencer and Deborah K. Dunn-Walters
J Immunol 2005; 175:5170-5177; ;
doi: 10.4049/jimmunol.175.8.5170
http://www.jimmunol.org/content/175/8/5170
Subscription
Permissions
Email Alerts
This article cites 33 articles, 15 of which you can access for free at:
http://www.jimmunol.org/content/175/8/5170.full#ref-list-1
Information about subscribing to The Journal of Immunology is online at:
http://jimmunol.org/subscription
Submit copyright permission requests at:
http://www.aai.org/About/Publications/JI/copyright.html
Receive free email-alerts when new articles cite this article. Sign up at:
http://jimmunol.org/alerts
The Journal of Immunology is published twice each month by
The American Association of Immunologists, Inc.,
1451 Rockville Pike, Suite 650, Rockville, MD 20852
Copyright © 2005 by The American Association of
Immunologists All rights reserved.
Print ISSN: 0022-1767 Online ISSN: 1550-6606.
Downloaded from http://www.jimmunol.org/ by guest on June 16, 2017
References
The Journal of Immunology
Hypermutation at A-T Base Pairs: The A Nucleotide
Replacement Spectrum Is Affected by Adjacent Nucleotides
and There Is No Reverse Complementarity of Sequences
Flanking Mutated A and T Nucleotides1,2
Jo Spencer and Deborah K. Dunn-Walters3
T
he generation of a diverse Ig repertoire is initiated at the
IgH locus by recombination of V, D, and J gene segments.
If recombination is unsuccessful on the first allele of IGH,
rearrangement at the second allele is initiated. If this is successful,
V and J segments rearrange at the Ig L chain loci on each allele in
turn, until a successful rearrangement is achieved. As a consequence of this process, some B cells carry out-of-frame Ig gene
rearrangements. A second wave of diversification of IG genes is
generated by somatic hypermutation (SHM)4 in the germinal centres of B cell follicles. SHM is characterized by predominantly
single base substitutions and results in a characteristic spectrum of
mutations (1–3). Functional IG genes diversified by SHM are seDepartment of Immunobiology, King’s College London School of Medicine at Guy’s,
King’s and St. Thomas’ Hospitals, Guy’s Campus, London, United Kingdom
Received for publication April 14, 2005. Accepted for publication August 11, 2005.
The costs of publication of this article were defrayed in part by the payment of page
charges. This article must therefore be hereby marked advertisement in accordance
with 18 U.S.C. Section 1734 solely to indicate this fact.
1
This work was supported by the Wellcome Trust (to J.S.) and the Medical Research
Council (to D.K.D.-W.).
2
The sequence(s) presented in this article has been submitted to GenBank under
accession number(s). DNA sequences are available in GenBank (具www.ncbi.nlm.
nih.gov/Genbank/index.html典) under the following accession numbers: AJ493283-88,
AJ493291, AJ508940-45, AJ508947-49, AJ535508-11, AJ535514, AJ535516,
AJ576317-19, X87032, Y13167, Y13172, Y16646-49, Z738020-21, Z80389,
Z80708, Z93132, Z93134, Z93138, Z93141, Z93153-54, Z93156, Z93158-59,
Z73863, Z93213-14, Z93216, Y13167-68, Z73839, Z73858, Z73860, Z80570-71,
Z80760, Z93198-99, Z93204, Z80673, X87013, X87064, X87075, X97784, Z80753,
X87035, X87082, Z80396, Z803564, and Z803719.
3
Address correspondence and reprint requests to Dr. Deborah K. Dunn-Walters, Department of Immunobiology School of Medicine at Guy’s, King’s and St. Thomas’
Hospitals, 2nd Floor New Guy’s House, Guy’s Hospital, London SE1 9RT, U.K.
E-mail address: [email protected]
4
Abbreviations used in this paper: SHM, somatic hypermutation; AID, activationinduced cytidine deaminase; BER, base excision repair; MMR, mismatch repair;
UNG, uracil N-glycosylase.
Copyright © 2005 by The American Association of Immunologists, Inc.
lected based on the affinity of the newly encoded Ig for Ag. Thus
the mutations observed in the functional genes are a result of both
SHM and selection. Any out-of-frame IG gene rearrangements
within the same cell also undergo SHM but, because these genes
are not expressed, the distribution of mutations is unaffected by the
selection process. These genes are therefore of use in the study of
the mechanism of hypermutation.
Analysis of the sequence context of mutations in out-of-frame Ig
genes has proved a powerful tool enabling several accurate deductions concerning the mechanism of SHM. Differences in the characteristics of sequences flanking mutations from G and C nucleotides and those flanking A and T nucleotides implied separate
mechanisms for G/C mutations and A/T mutations (4). Moreover,
the remarkable resemblance between the flanking sequence surrounding mutations from G and the reverse complement of flanking sequence surrounding mutation from C lead us to suggest that
either G or C nucleotides were likely to be mutated and not both
(5). Both of these predictions have now been confirmed because
considerable progress in the understanding of the SHM process has
been made. The current, broadly accepted, hypothesis is that SHM
is a two-step process (6). The initiating event is via the action of
activation-induced cytidine deaminase (AID), which is both necessary and sufficient for SHM, class switching and gene conversion (7). AID can directly deaminate C nucleotides in DNA, targeting deamination events to C nucleotides in WRC motifs (8, 9).
This results in the in vivo observations of the RGYW/WRCY hot
spot motifs for mutations from G and C. The second step of SHM
involves error prone repair enzymes and generates further mutations, most notably those from A and T.
Mismatched U-G base pairs caused by cytosine deamination can
be repaired either by base excision repair (BER) or by mismatch
repair (MMR). Both are excision repair processes that require resynthesis of excised nucleotides by polymerases— especially for
0022-1767/05/$02.00
Downloaded from http://www.jimmunol.org/ by guest on June 16, 2017
Hypermutation is thought to be a two-phase process. The first phase is via the action of activation-induced cytidine deaminase
(AID), which deaminates C nucleotides in WRC motifs. This results in the RGYW/WRCY hot spot motifs for mutation from G
and C observed in vivo. The resemblance between the hot spot for C mutations and the reverse complement of that for G mutations
implies a process acting equally on both strands of DNA. The second phase of hypermutation generates mutations from A and T
and exhibits strand bias, with more mutations from A than T. Although this does not concur with the idea of one mechanism acting
equally on both strands, it has been suggested that the AT mutator also has a reversible motif; WA/TW. We show here that the
motifs surrounding the different substitutions from A vary significantly; there is no single targeting motif for all A mutations.
Sequence preferences associated with mutations from A more likely reflect an influence of adjacent nucleotides over what the A
mutates “to.” This influence tends toward “like” replacements: Purines (A or G) in the 5ⴕ position bias toward replacement by
another purine (G), whereas replacement with pyrimidines (C or T) is more likely if the preceding base is also a pyrimidine. There
is no reverse complementarity in these observations, in that similar influences of nucleotides adjacent to T are not seen. Hence,
WA and TW should not be considered as reverse complement hot spot motifs for A and T mutations. The Journal of Immunology,
2005, 175: 5170 –5177.
The Journal of Immunology
a) 20
15
10
5
0
A-C T-G
A-G T-C
A-T T-A
C-A G-T
C-GG-C
C-T G-A
A-C T-G
A-G T-C
A-T T-A
C-A G-T
C-G G-C
C-T G-A
b) 20
15
10
5
0
FIGURE 1. Hypermutation spectra. The spectra of mutations, as percentage of total mutations, is shown for each of two different data sets. H
chain Ig genes (a) and L chain Ig genes (b).
Materials and Methods
Two different sets of data were used in the primary analysis. Both were of
Ig gene rearrangements that were nonproductive due to their V and J regions being in different reading frames. These genes had, however, been
isolated from postgerminal centre cells and therefore these nonproductive
rearrangements carried mutations. The advantage of using such data is that
the mutations have accumulated without any possible confounding effects
of selection. One set of data was of H chain VH-DH-JH rearrangements (4)
and the other was of ␭ L chain VL-JL rearrangements (24). The genes were
arranged according to their IGV gene usage. Eight groups of H chain genes
were collated, containing a total of 55 sequences with 732 mutations from
the germline IGHV sequence. Six groups of L chain genes were collated,
containing a total of 25 sequences with 548 mutations from the germline
IGLV sequence (3). In addition, mutated sequences from four additional
sources were analyzed: 750 mutations from a 580-bp region of the mouse
JH intron (13), 296 mutations from a 320-bp region of the human JH intron
in patients with XPV (26), 282 mutations from a 560-bp region of the S␮
switch region in a Pol␩ ⫺/⫺ mouse (27), and 2324 mutations from the
V␬Ox1 transgene mutated in vivo (916 mutations) or in vitro (1408 mutations) (18).
The mutated sequences were aligned beneath the appropriate germline
sequence and a raw text file of the alignment created. This file was imported into a Microsoft Excel spreadsheet and computations of the number
of each type of nucleotide substitution (e.g., A to T, A to C, A to G, etc.)
and the composition of the flanking sequences around these substitutions
were performed using macros in Excel (Visual Basic). Calculations were
extended to 6 bases either side of the mutated base.
Because the chances of seeing a particular base in each position are not
always 25%, and there are differences in the germline genes of individual
sequences, the composition of each germline gene used in the analysis was
determined in terms of the percentage composition of bases at each position around each separate base (A, C, G, T). The individual compositions
for each IGV gene were compiled into two sets of baseline data for H and
L chain genes, the resulting compositions being the outcome from an analysis of 15,003 and 7,740 bases of sequence respectively. For example, the
percentage composition at position ⫺1 from an A in the H chain IGHV
genes used in this analysis was 17% A, 34% C, 32% G, and 17% T as
compared with the same composition in L chain IGLV genes of 14% A,
41% C, 27% G, and 18% T.
The percentage composition of each base at each position flanking a
particular mutation was determined. The baseline percentage composition
of the germline sequences was subtracted from this to show any differences
particular to that mutation. ␹2 analysis was used to determine whether any
differences seen were significant. Computations of percentage differences
and ␹2 analysis were also performed using Excel.
Downloaded from http://www.jimmunol.org/ by guest on June 16, 2017
long patch repair. Evidence for the involvement of BER and MMR
is strong, both processes have been shown to affect the SHM spectrum if altered. In the case of BER, Uracil N-glycosylase (UNG) is
the key first step in removing mispaired uracil. The absence of
UNG (either in mice, in cultured B cells, or in hyper-IgM syndrome type 4 patients) causes a skewing in the pattern of SHM so
that more transitions at G:C pairs are seen, but the pattern of A and
T mutations appears normal (10, 11). The evidence for the involvement of MMR derives from observations of deficiencies in
the mismatch repair process. Deletions of mismatch repair proteins
MSH2 or MSH6 also causes skewing of the pattern of SHM—in
this case to exclude mutations from A and T (3). Similarly a deficiency in exonuclease 1, another component of the MMR process, can result in a reduction in the number of mutations from A
and T in Ig genes (12). Recent experiments combining deficiency
of both UNG and MSH2 showed an ablation of all mutations except transitions at G:C pairs (13). The error prone polymerase that
appears to be the most important for the secondary phase of SHM
is DNA polymerase ␩ (Pol␩). Pol␩ is absent in patients with
XP-V, and these patients have a reduced (but not completely absent) number of mutations from A and T in their Ig genes (14, 15).
Further in vitro analysis of the Pol␩ error spectrum reports that it
shares many features with the SHM spectrum found in vivo (16 –
18). However, other error prone polymerases such as Pol␨ (19, 20),
Pol␫ (21), and Pol␮ (22, 23) may also be involved.
Targeting of mutations to certain areas of the IGHV sequence is
biologically important because mutations are advantageous predominantly when they modify the hypervariable regions. However, because the mechanisms that mutate G/C and A/T are not the
same, the mechanisms that target mutations to the hypervariable
regions are not likely to be the same either. Unlike the RGYW/
WRCY motifs, which reflect target preferences of the AID complex (8, 9), the significance of the motifs that appear to focus
mutations to A/T nucleotides is unknown. The issue of whether
both processes act on one strand or two has still not been completely resolved. There is no strand bias in G vs C mutations in
most studies. In contrast, the AT mutator consistently shows strand
bias in that there are always more mutations from A than mutations
from T. The lack of strand bias in GC mutations, coupled with the
fact that the RGYW/WRCY motifs are reverse complements of
each other, shows that the GC mutator likely involves one mechanism that acts on both strands of DNA (5, 24). Strand bias in
AT mutations indicates a possible preference for the action of
the mutator for one strand over the other, or the existence of
separate mutators for A and T mutations. The hot spot for mutations from A is generally accepted as being WA (17). Evidence that mutation occurs by the same mechanism on both
strands of DNA would be that mutations from T occur in the
reverse complement hot spot TW. This has been found in some
cases, and has been used as a predictor of mutations from T
(25). However, we have previously shown that the hot spot
motifs for mutations from A and T vary depending whether the
mutations are transitions or transversions (4) and these are not
necessarily reverse complement.
To analyze influences on AT mutations we have analyzed the
hot spot motifs around each individual mutation (A-C, A-G, A-T,
T-A, T-C, T-G). This analysis has been done in out of frame Ig
sequences to exclude any effects of selection. We show that WA is
not a target motif for all mutations from A and that adjacent nucleotides bias the nature of replacements from A. Furthermore
there are qualitative, as well as quantitative, aspects to strand bias.
These results have been confirmed in mouse JH intronic
sequences.
5171
5172
Results
Different motifs favor individual mutations from A and T
transversions from A to T, and AAT for transitions from A to G
(where the mutated base is underlined). Looking at the flanking
bases with a negative influence on a particular mutation, a G in
position ⫺1 from an A has a negative influence on the transversions (A-C and A-T), the influence being much stronger for the
A-T mutations. A transitions (A-G) are negatively influenced by a
C in position ⫺1, as are the A-C transversions, but this is not
apparent in the A-T transversions (Fig. 3).
Adjacent bases influence the spectrum of replacements from A
The data in Fig. 3 are inconsistent with the notion that the flanking
sequences associated with mutations from A are ‘target’ motifs,
because a targeting motif would result in some aspect being common to all three different motifs for mutations from A and this is
not the case. We therefore considered the possibility that what is
perceived to be a motif is in reality a consequence of the influence
of adjacent bases on either the replacement spectra, or the probability of subsequent fixing of mutation or repair. The analysis used
in generating Figs. 2 and 3, in common with other analyses of
hypermutation motifs (4, 5, 28, 29) corrects for the germline composition of the Ig genes in question. To view the influence of
adjacent bases on nucleotide replacements directly, we looked at
the numbers of each substitution without correcting for germline
composition. Fig. 4 shows the nucleotide replacement spectrum
when A or T are flanked by different nucleotides in either the 5⬘ or
the 3⬘ positions.
There are significant differences in the proportions of transition
(e.g., A-G or T-C), complement (e.g., A-T or T-A), or transition
FIGURE 2. Reversibility of A-N and T-N hypermutation motifs. Base preferences at positions ⫺1 (5⬘ side) to ⫹1 (3⬘ side) around mutations from A
and T (a and b). In each case, the motif for one mutation is shown above the reverse complement of the motif for the matching mutation, i.e., the T-N motif
is shown in the opposite direction (3⬘ to 5⬘) with the complementary bases (A is read as T, C is read as G, etc.) as c underneath the A-N motif in a. Similarly,
the reverse complement of the motif for A-N (d) is compared with the T-N motif (b). The bars represent the percentage difference between the base
composition around the mutation compared with the normal germline base composition The levels of significance after ␹2 test are indicated by the shading
of the bars: u, p ⬍ 0.05; o, p ⬍ 0.005; and f, p ⬍ 0.0005. Where the percentage difference is significant this represents a positive or negative influence
of that base in that position over the mutation in question. For example, T and A in position ⫺1 both have a significant positive influence on an A-N
mutation, while G and C both have a significant negative influence at this position. In position ⫹1 there is no significant effect of A or C, but G has a
negative influence and T has a positive influence. Overall this results in a WAT motif for the A-N mutation (where W is A or T and N is any nucleotide).
Downloaded from http://www.jimmunol.org/ by guest on June 16, 2017
1251 nucleotide substitutions in out of frame Ig rearrangements
were analyzed, including 727 from IGVH, and 524 from IGVL.
Consistent with all previous studies, significant strand bias in mutations from A and T was observed in IGVH and IGVL sequences
(Fig. 1). Because the two different data sets revealed very similar
mutation spectra the data were pooled for further analyses. We
determined how the sequence flanking A and T nucleotides influenced the probability of their mutation by analysis of the bases
favored (or not) in positions surrounding a mutated A or T, after
correction for the natural bias present in the germline sequence.
The strongest, and most consistent, influences on A and T mutations were from the bases in positions ⫺1 and ⫹1. This analysis is
presented in graphs that show whether a particular base is over- or
underrepresented (compared with the germline sequence) in these
positions around a mutated A or T.
The motifs thus determined for all mutations from A or T appear
to be the reverse complement of each other, as illustrated in Fig. 2.
Having a T before a mutated A, or an A after a mutated T, are
consistent positive influences on whether A and T are mutated.
However, this reverse complementarity did not always hold true
when individual substitutions from A and T were analyzed separately. Although the motifs around the different substitutions from
T were very similar, those around mutations from the different
substitutions from A showed significant differences (Fig. 3). Looking at the flanking bases with a positive influence on mutation from
A, the motifs are: TAC for transversions from A to C, TAT for
HYPERMUTATION AT A-T BASE PAIRS
The Journal of Immunology
5173
complement (e.g., A-C or T-G) mutations depending on the base in
the 5⬘ or 3⬘ position from the mutations. Because the normal distribution of transition, complement and transition complement mutations in mutated Ig genes is not equal we used the average percentage distribution from all mutations (50.2, 30.9, and 18.9%,
respectively) as the expected values for comparison. The significant changes from these expected values are indicated in Fig. 4.
Strikingly, the normal transition bias observed when all replacements are pooled is not seen when A is preceded by a T, and the
number of transition complements is significantly increased.
Hence, in this sequence context the replacement of A with C, G or
T occurs with approximately equal frequency. The presence of a C
in the 3⬘ position has a similar effect. An A or a G in the 5⬘ position
predisposes the replacements toward transitional (A-G) mutations.
The 5⬘ G also prejudices against complement (A-T) mutations,
while a C in the 5⬘ position favors them. There are no such changes
in the spectra of T mutations, the only point of change that is
significantly different from the normal is that a T in the 5⬘ position
from a T prejudices against transition complement mutations
(T-G).
Microsequence effects around mutations from A and T are not
reverse complements of each other
Reverse complementarity of motifs around complementary mutations can indicate targeting of one base on both strands of DNA.
The data presented in Fig. 4 above shows quite clearly that there
are significant effects of the bases immediately flanking mutations
from A which are not seen for those around T. If the effect were
due to the same mechanism acting on both strands of DNA, then
one would expect the effects seen at the 5⬘ side of A mutations
(Fig. 4a) to be mirrored at the 3⬘ side of T mutations (Fig. 4d), this
is not the case. Moreover, when the individual motifs for the different mutations from A are considered alongside the reverse complement motifs for their corresponding reverse complement mutation it can be seen that there are more points that differ than there
are that match (Fig. 5).
Comparison with mutations from the mouse JH intronic
sequence
Human out-of-frame IGV gene data has the advantage that it is
made up of 14 different IGHV and IGLV genes so that the chances
Downloaded from http://www.jimmunol.org/ by guest on June 16, 2017
FIGURE 3. Hypermutation motifs around individual mutations from A and T. Base preferences at positions ⫺1 to ⫹ 1 around mutations from A and
T. The bars represent the percentage difference between the base composition around the mutation compared with the normal germline base composition.
The levels of significance after ␹2 test are indicated by the shading of the bars: ( ), p ⬍ 0.05, (_), p ⬍ 0.005, and (䡵), p ⬍ 0.0005.
5174
HYPERMUTATION AT A-T BASE PAIRS
of bias arising from multiple copies of a single gene are less.
However, because IG genes have been selected through evolution alongside the hypermutation mechanisms they have characteristics that may cause perceived biases compared with
non-Ig gene sequences. Data are available from the mouse JH
intronic region (13) and we have analyzed this with respect to
the two key findings in this paper. First, with respect to the
effect of the preceding base on the type of mutation that occurs,
our results show that an A in the position immediately preceding the mutated A causes a bias toward transition mutations,
whereas a T in this position causes a bias toward transversions
(Fig. 4a). We found that this was also true in a database of 750
mutations measured in the JH intronic region of the mouse (of
which 280 mutations were from A) and in 916 mutations measured in a VkOx1 transgene (of which 298 mutations were from
A). In both cases the ratio of transitions to transversions was ⬎1
in the motif AA and ⬍1 in the motif TA (Table I). Second, with
respect to the nonreversibility of complementary mutations, the
reverse complements of motifs for mutations at T are not the
same as the motifs for A (Fig. 5). The effect of 5⬘ T on the types
of mutation that occur from A is not mirrored by the effect of A
in the 3⬘ position from a T (Fig. 4a compared with Fig. 4d). This
is also illustrated in Fig. 6a, showing the distribution of the
different types of mutation from A in the TA motif compared
with the reverse complement situation—the distribution of the
different types of mutation from T in the TA motif. Mutations
in the TA motifs of the mouse JH intronic sequences (Fig. 6b)
have the same trend of nonreversibility for transition (A-G vs
T-C) and complement (A-T vs T-A) mutations. However, this
does not hold true for the transition complement (A-C and T-G)
mutations. We investigated whether this may be due to a skewing in the germline composition of the two data sets. The microsequence that favored A-C mutations was TAC and the effect of a 3⬘ C on the types of mutations from A is very striking
Downloaded from http://www.jimmunol.org/ by guest on June 16, 2017
FIGURE 4. Influence of neighboring bases on hypermutation. Base composition of the 5⬘ and 3⬘ positions for each of the three different types of
mutation: Transition (u), complement (f), and transition complement (䡺). In the case of a and b, transition ⫽ A-G, complement ⫽ A-T, and transition
complement ⫽ A-C. In the case of c and d, transition ⫽ T-C, complement ⫽ T-A, and transition complement ⫽ T-G. The germline distribution of bases
adjacent to A for this data is also shown (e and f). Significant differences from the average distribution of all mutations in these data (50.2% transitions,
30.9% complements, and 18.9% transition complements) are indicated as follows: ⴱ, p ⬍ 0.05; ⴱⴱ, p ⬍ 0.005; and ⴱⴱⴱ, p ⬍ 0.0005.
The Journal of Immunology
5175
(Fig. 4b). TAC occurs with a frequency of only 0.52% in the JH
intronic sequence, compared with 2.1% in the human IGV gene
sequences.
Comparisons with Pol␩ activity
Mutations from A and T have been ascribed to the error prone
polymerase activity of Pol␩. The sequence context of in vivo IGV
mutations from A and T has been compared favorably with that
shown by mutations caused by Pol␩ when copying a lac Z gene, or
a mouse transgene in vitro (16 –18), and Pol␩-deficient mice and
humans show a reduced level of mutation from A and T (14, 15,
26, 27). An analysis of the replacement spectrum of mutations
from A caused by Pol␩ in the AA and TA motifs shows a strong
bias toward transition mutations when synthesizing the nontranscribed strand of an IGVK transgene, and a bias toward transversions when synthesizing the transcribed strand. There are differences between the mutations in the AA motif compared with those
in the TA motif, the transition bias is very high for the AA motif
(Table I). In contrast, the transition bias normally seen in the AA
motif as compared with the TA motif is absent in the mutation
spectrum of mice and humans deficient in Pol␩ (Table I).
Discussion
We have analyzed the replacement spectra resulting from the SHM
of A and T nucleotides in the context of local microsequence. We
have observed that adjacent bases 5⬘ and 3⬘ consistently and significantly influence the replacement spectrum of mutations from A.
Most notably we observed random replacement of A (i.e., C, G, or
T occur with equal frequency) when A is flanked by a T in the 5⬘
position, or a C in the 3⬘ position. This is significantly different to
the pattern of mutations from A seen in other sequence contexts,
where the more usual bias toward A-G transitions occurs. Other
studies of hypermutation hot spots, including our data in Fig. 1,
involved the determination of the proportion of mutations occurring in a particular sequence context as compared with the frequency of that sequence context in the germline. Such analysis,
using proportions to control for germline sequence biases, can
mask more simple trends such as the random replacement spectrum from A in the context of TA described above and illustrated
in Fig. 6. Other biases in substitution were apparent, and indicated
an influence of the adjacent nucleotides over the incorporation of
mis-paired nucleotides. This preference tended toward “like” nucleotides: if a purine (A or a G) preceded the target A, then the
replacement base was more likely to be another purine (G). However, if a pyrimidine, T, preceded the target A, then the replacement base was more likely to be the pyrimidines C or T. Similarly,
the presence of a pyrimidine, C, in the 3⬘ position predisposes
toward replacement by pyrimidines C or T. The most striking observation in this regard is that there is a bias in the A-G transitions
from the second A of an AA motif, whereas the bias switches
Table I. Relative numbers of transitions and transversions from A occurring in motif AA as compared with motif TA
Human IGHV
Human IGLV
JH intron (13)
VkOx1 (18)
Transcribed strand (18)b
Nontranscribed strand (18)b
Human JH XPV (31)
Pol ␩⫺/⫺ S␮ (32)
AA
TA
Total No. of Mutations
No. of Mutations
Ts/Tv (ratio)a
Ts/Tv (ratio)a
732
548
750
916
632
776
296
282
209
182
280
298
141
421
25
31
28/20 (1.40)
21/9 (2.33)
56/33 (1.70)
52/25 (2.08)
5/16
147/0
4/7 (0.57)
4/12 (0.33)
19/42 (0.45)
19/36 (0.53)
53/77 (0.69)
32/57 (0.56)
3/48
65/0
1/1 (1)
2/3 (0.67)
a
Transcribed strand and nontranscribed strand are mutations arising during the (in vitro) synthesis (by polymerase ␩) of the transcribed strand and nontranscribed strand of
the VkOx1 gene, respectively (18).
b
The numbers of transitions (Ts) and transversions (Tv) are given. The ratio is of transitions over transversions.
Downloaded from http://www.jimmunol.org/ by guest on June 16, 2017
FIGURE 5. Nonreversibility of hypermutation motifs for A and T transitions. Base preferences at positions ⫺1 to ⫹1 around A-C, A-G, and A-T
mutations compared with the reverse complement of the corresponding T-A, T-C, and T-A motifs, respectively. The bars represent the percentage difference
between the base composition around each mutation, compared with the normal germline base composition. The levels of significance after ␹2 test are
indicated by the shading of the bars: u, p ⬍ 0.05; o, p ⬍ 0.005; and f, p ⬍ 0.0005.
5176
toward transversions (A-C and A-T) in the A of a TA motif. This
observation is consistent between different data sets, including
those from non-Ig sequence (Table I). The influence of adjacent
bases on replacements from A was not mirrored in mutations from
T. The only significant variability in the replacement spectrum
from T occurred when there was a C or a T in the 5⬘ position. The
levels of significance were much smaller in this instance but it is
still possible to see a trend toward replacement by pyrimidine C
when preceded by pyrimidines C or T.
The fact that adjacent bases can influence the type of mutation
is consistent with the involvement of error prone polymerases at
this stage of SHM. Different polymerases have widely differing
abilities to insert, and to extend from, a mispaired insertion and it
has been shown that sequence context can influence the specificity
of a polymerase (30). It is therefore likely that what has hitherto
been perceived as motifs targeting mutation from A and T may
actually be a consequence of the influence of adjacent bases on
substitution preferences by the error prone polymerases. Pol␩ is
thought to be the main error-prone polymerase causing mutation of
A and T nucleotides; the absence of Pol␩ causes a reduced number
of mutations at A and T (14, 15, 26, 27) and it has recently been
shown to have physical and functional links with MSH2 (31). This
polymerase tends to generate transitions as a consequence of misincorporation of dGMP opposite T (16, 32). Transitions from T
predominate when the transcribed strand is synthesized (18), and
are more likely to occur when the base pair preceding the error is
T.A or A.T than when it is G.C or C.G (16). The sequence contextdependent variations in replacement spectrum observed in our
study were not observed in the in vitro models of Pol␩ activity.
However, we did see that the transition bias of mutations from A
as a result of the in vitro Pol␩ activity was much higher in the AA
than in the TA motif. This is in accord with the transition bias seen
in vivo in the human IGV sequences as well as the data from the
VkOx1 mouse transgene (18) and mouse JH intron (13). Also consistent with a role for Pol␩ in AT mutation is the reduction in the
transition bias of mutations from A in the AA motif in the human
and mouse models of Pol␩ deficiency (26, 27), although the numbers of mutations available in this analysis are rather low. However, removal of Pol␩ does not completely abrogate mutations
from A and T and on balance it is likely that Pol␩ is not the only
error prone polymerase involved in the generation of mutations at
A and T in vivo. Other error-prone polymerases (Pol␨, Pol␫ and
Pol␮) have also been implicated (19 –23).
Strand bias, generating more mutations from A than from T, is
a consistent feature of the SHM process in vivo. The implication
has been that this reflects differences in activity of the AT mutator
on the transcribed and non-transcribed strands. The idea that the A
and T mutators are equivalent, but operate unequally on both
strands in vivo, was supported by the apparently reversible motifs WA and TW that are associated with hypermutation hot
spots. A previous study by Cowell and Kepler (33) reported
symmetry under complementation in support of this hypothesis.
However, in this study we show that WA and TW are not truly
reversible motifs for A and T mutations, but only appear so as
a result of pooling transitions and transversions. There are some
striking differences in motifs from A compared with the reverse
complement of the complementary motifs, most notably in the
transitions; A to G mutations occur preferentially in AAT, but
T to C mutations occur preferentially in TTA (Fig. 5). This
finding is in accord with the finding of Cowell and Kepler (33)
that a nucleotide within a homodimer is more likely to undergo
a transition mutation. However, these motifs are not reverse
complements of each other so do not indicate strand symmetry
under complementation. The actual numbers of mutations,
without the proportional correction for germline sequence,
clearly illustrate that the influences of adjacent nucleotides on
the types of mutations from A are not mirrored in mutations
from T (Fig. 4). This is also true for mutations in non-Ig sequence (Fig. 6). Thus there are two aspects to strand bias; a
quantitative bias indicating that a mechanism operates unequally on both strands, and a qualitative difference—in the
type of mutation that is created in a given context—indicating
that the mechanism itself may be different on each strand.
Instances of TA and TAC sequence motifs where the A is in the
second position of a codon are most commonly observed in the
complementarity determining regions of the Ig genes. Hence,
the mutation of A in these circumstances is likely to have functional consequences. For example, the TAC codon is often observed in complementarity determining regions where mutation of
the A would result in replacement of tyrosine with phenylalanine,
serine, or cysteine. This could potentially generate significant
changes in the Ag binding sites.
In summary, we have shown that adjacent nucleotides affect the
substitutions when A nucleotides are mutated. However, mutations
from T are not influenced in the same way. Equivalent, complementary, sequence contexts for A and T mutations result in different replacement spectra. Therefore, not only is there a quantitative difference between mutations from A and T, but there is also
a qualitative difference. This could be a consequence of differential
mutation and/or repair activity on the two DNA strands.
Acknowledgments
We are grateful to Laurent Boursier and Su Wen for producing Ig gene
sequence data, and to Mark Dunn for help with the programming.
Downloaded from http://www.jimmunol.org/ by guest on June 16, 2017
FIGURE 6. Qualitative nonreversibility of the TA motif; the distribution of the three different types of mutation in motif TA from both A and
T mutations. A-G transitions are shown in gray, A-T complement mutations are shown in black, and A-C transition complement mutations are
shown in white. The corresponding mutations from T (T-C transitions, T-A
complement, and T-G transition complement, respectively) are shown in
the adjacent striped bars. The values are given as a percentage of the total
numbers of mutations from A in TA (n ⫽ 116 for human IGV; n ⫽ 130 for
mouse JH intron) or from T in TA (n ⫽ 72 for human IGV; n ⫽ 62 for
mouse JH intron).
HYPERMUTATION AT A-T BASE PAIRS
The Journal of Immunology
Disclosures
The authors have no financial conflict of interest.
References
17. Rogozin, I. B., Y. I. Pavlov, K. Bebenek, T. Matsuda, and T. A. Kunkel. 2001.
Somatic mutation hotspots correlate with DNA polymerase ␩ error spectrum.
Nat. Immunol. 2: 530 –536.
18. Pavlov, Y. I., I. B. Rogozin, A. P. Galkin, A. Y. Aksenova, F. Hanaoka, C. Rada,
and T. A. Kunkel. 2002. Correlation of somatic hypermutation specificity and
A-T base pair substitution errors by DNA polymerase ␩ during copying of a
mouse immunoglobulin ␬ light chain transgene. Proc. Natl. Acad. Sci. USA 99:
9954 –9959.
19. Diaz, M., L. K. Verkoczy, M. F. Flajnik, and N. R. Klinman. 2001. Decreased
frequency of somatic hypermutation and impaired affinity maturation but intact
germinal center formation in mice expressing antisense RNA to DNA polymerase
zeta. J. Immunol. 167: 327–335.
20. Zan, H., A. Komori, Z. Li, A. Cerutti, A. Schaffer, M. F. Flajnik, M. Diaz, and
P. Casali. 2001. The translesion DNA polymerase zeta plays a major role in Ig
and Bcl-6 somatic hypermutation. Immunity 14: 643– 653.
21. Faili, A., S. Aoufouchi, E. Flatter, Q. Gueranger, C. A. Reynaud, and J. C. Weill.
2002. Induction of somatic hypermutation in immunoglobulin genes is dependent
on DNA polymerase ␫. Nature 419: 944 –947.
22. Ruiz, J. F., D. Lucas, E. Garcia-Palomero, A. I. Saez, M. A. Gonzalez,
M. A. Piris, A. Bernad, and L. Blanco. 2004. Overexpression of human polymerase mu (Pol ␮) in a Burkitt’s lymphoma cell line affects the somatic hypermutation rate. Nucleic Acids Res. 32: 5861–5873.
23. Lucas, D., T. Lain de Lera, M. A. Gonzalez, J. F. Ruiz, O. Dominguez,
J. C. Casanova, C. Martinez-A, L. Blanco, and A. Bernad. 2005. Polymerase mu
is up-regulated during the T cell-dependent immune response and its deficiency
alters developmental dynamics of spleen centroblasts. Eur. J. Immunol. 35:
1601–1611.
24. Boursier, L., W. Su, and J. Spencer. 2004. Analysis of strand biased ‘G’.C hypermutation in human immunoglobulin V(␭) gene segments suggests that both
DNA strands are targets for deamination by activation-induced cytidine deaminase. Mol. Immunol. 40: 1273–1278.
25. Michael, N., T. E. Martin, D. Nicolae, N. Kim, K. Padjen, P. Zhan, H. Nguyen,
C. Pinkert, and U. Storb. 2002. Effects of sequence and structure on the hypermutability of immunoglobulin genes. Immunity 16: 123–134.
26. Mayorov, V. I., I. B. Rogozin, L. R. Adkinson, and P. J. Gearhart. 2005. DNA
polymerase ␩ contributes to strand bias of mutations of A versus T in immunoglobulin genes. J. Immunol. 174: 7781–7786.
27. Delbos, F., A. De Smet, A. Faili, S. Aoufouchi, J.-C. Weill, and C.-A. Reynaud.
2005. Contribution of DNA polymerase ␩ to immunoglobulin gene hypermutation in the mouse. J. Exp. Med. 201: 1191–1196.
28. Shapiro, G. S., K. Aviszus, D. Ikle, and L. J. Wysocki. 1999. Predicting regional
mutability in antibody V genes based solely on di- and trinucleotide sequence
composition. J. Immunol. 163: 259 –268.
29. Dorner, T., H. P. Brezinschek, R. I. Brezinschek, S. J. Foster, R. Domiati-Saad,
and P. E. Lipsky. 1997. Analysis of the frequency and pattern of somatic mutations within nonproductively rearranged human variable heavy chain genes.
J. Immunol. 158: 2779 –2789.
30. Strauss, B. S. 2002. The “A” rule re-visited: polymerases as determinants of
mutational specificity. DNA Repair. 1: 125–135.
31. Wilson, T. M., A. Vaisman, S. A. Martomo, P. Sullivan, L. Lan, F. Hanaoka,
A. Yasui, R. Woodgate, and P. J. Gearhart. 2005. MSH2-MSH6 stimulates DNA
polymerase ␩, suggesting a role for A:T mutations in antibody genes. J. Exp.
Med. 201: 637– 645.
32. Kunkel, T. A., Y. I. Pavlov, and K. Bebenek. 2003. Functions of human DNA
polymerases ␩, ␬ and i suggested by their properties, including fidelity with
undamaged DNA templates. DNA Repair 2: 135–149.
33. Cowell, L. G., and T. B. Kepler. 2000. The nucleotide-replacement spectrum
under somatic hypermutation exhibits microsequence dependence that is strandsymmetric and distinct from that under germline mutation. J. Immunol. 164:
1971–1976.
Downloaded from http://www.jimmunol.org/ by guest on June 16, 2017
1. Golding, G. B., P. J. Gearhart, and B. W. Glickman. 1987. Patterns of somatic
mutations in immunoglobulin variable genes. Genetics 115: 169 –176.
2. Betz, A. G., C. Rada, R. Pannell, C. Milstein, and M. S. Neuberger. 1993. Passenger transgenes reveal intrinsic specificity of the antibody hypermutation mechanism: clustering, polarity, and specific hot spots. Proc. Natl. Acad. Sci. USA 90:
2385–2388.
3. Phung, Q. H., D.B. Winter, A. Cranston, R. E. Tarone, V. A. Bohr, R. Fishel, and
P. J. Gearhart. 1998. Increased hypermutation at G and C nucleotides in immunoglobulin variable genes from mice deficient in the MSH2 mismatch repair
protein. J. Exp. Med. 187: 1745–1751.
4. Spencer, J., M. Dunn, and D. K. Dunn-Walters. 1999. Characteristics of sequences around individual nucleotide substitutions in IgVH genes suggest different GC and AT mutators. J. Immunol. 162: 6596 – 6601.
5. Dunn-Walters, D. K., A. Dogan, L. Boursier, C. M. MacDonald, and J. Spencer.
1998. Base-specific sequences that bias somatic hypermutation deduced by analysis of out of frame genes. J. Immunol. 160: 2360 –2364.
6. Rada, C., M. R. Ehrenstein, M. S. Neuberger, and C. Milstein. 1998. Hot spot
focusing of somatic hypermutation in MSH2-deficient mice suggests two stages
of mutational targeting. Immunity 9: 135–141.
7. Durandy, A. 2003. Activation-induced cytidine deaminase: a dual role in classswitch recombination and somatic hypermutation. Eur. J. Immunol. 33:
2069 –2073.
8. Larijani, M., D. Frieder, W. Basit, and A. Martin. 2005. The mutation spectrum
of purified AID is similar to the mutability index in Ramos cells and in
ung⫺/⫺msh2⫺/⫺ mice. Immunogenetics 56: 840 – 845.
9. Pham, P., R. Bransteitter, J. Petruska, and M. F. Goodman. 2003. Processive
AID-catalysed cytosine deamination on single-stranded DNA simulates somatic
hypermutation. Nature 424: 103–107.
10. Rada, C., G. T. Williams, H. Nilsen, D. E. Barnes, T. Lindahl, and
M. S. Neuberger. 2002. Immunoglobulin isotype switching is inhibited and somatic hypermutation perturbed in UNG-deficient mice. Curr. Biol. 12:
1748 –1755.
11. Imai, K., G. Slupphaug, W. I. Lee, P. Revy, S. Nonoyama, N. Catalan, L. Yel,
M. Forveille, B. Kavli, H. E. Krokan, et al. 2003. Human uracil-DNA glycosylase
deficiency associated with profoundly impaired immunoglobulin class-switch recombination. Nat. Immunol. 4: 1023–1028.
12. Bardwell, P. D., C. J. Woo, K. Wei, Z. Li, A. Martin, S. Z. Sack, T. Parris,
W. Edelmann, and M. D. Scharff. 2004. Altered somatic hypermutation and reduced class-switch recombination in exonuclease 1 mutant mice. Nat. Immunol.
5: 224 –229.
13. Rada, C., J. M. Di Nola, and M. S. Neuberger. 2004. Mismatch recognition and
uracil excision provide complementary paths to both Ig switching and the A/Tfocused phase of somatic mutation. Mol. Cell 16: 163–171.
14. Zeng, X., D. B. Winter, C. Kasmer, K. H. Kraemer, A. R. Lehmann, and
P. J. Gearhart. 2001. DNA polymerase ␩ is an A-T mutator in somatic hypermutation of immunoglobulin variable genes. Nat. Immunol. 2: 537–541.
15. Yavuz, S., A. S. Yavuz, K. H. Kraemer, and P. E. Lipsky. 2002. The role of
polymerase ␩ in somatic hypermutation determined by analysis of mutations in
a patient with XPV. J. Immunol. 169: 3825–3830.
16. Matsuda, T., K. Bebenek, C. Masutani, I. B. Rogozin, F. Hanaoka, and
T. A. Kunkel. 2001. Error rate and specificity of human and murine DNA polymerase ␩. J. Mol. Biol. 312: 335–346.
5177
Related documents