Download Structural Domains and Matrix Attachment Regions

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Epigenetics of diabetes Type 2 wikipedia , lookup

Pathogenomics wikipedia , lookup

Genetic engineering wikipedia , lookup

Epigenomics wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

Polycomb Group Proteins and Cancer wikipedia , lookup

Genomics wikipedia , lookup

Ridge (biology) wikipedia , lookup

Transposable element wikipedia , lookup

NEDD9 wikipedia , lookup

Point mutation wikipedia , lookup

Gene desert wikipedia , lookup

Genomic imprinting wikipedia , lookup

Minimal genome wikipedia , lookup

Non-coding DNA wikipedia , lookup

Genome (book) wikipedia , lookup

Gene expression programming wikipedia , lookup

RNA-Seq wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Genomic library wikipedia , lookup

Genome editing wikipedia , lookup

Genome evolution wikipedia , lookup

Gene expression profiling wikipedia , lookup

Gene wikipedia , lookup

Epigenetics of human development wikipedia , lookup

History of genetic engineering wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Microevolution wikipedia , lookup

Designer baby wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Helitron (biology) wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Transcript
The Plant Cell, Vol. 12, 249–264, February 2000, www.plantcell.org © 2000 American Society of Plant Physiologists
Structural Domains and Matrix Attachment Regions along
Colinear Chromosomal Segments of Maize and Sorghum
Alexander P. Tikhonov, Jeffrey L. Bennetzen, and Zoya V. Avramova1
Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907
Although a gene’s location can greatly influence its expression, genome sequencing has shown that orthologous genes
may exist in very different environments in the genomes of closely related species. Four genes in the maize alcohol dehydrogenase (adh1) region represent solitary genes dispersed among large repetitive blocks, whereas the orthologous
genes in sorghum are located in a different setting surrounded by low-copy-number DNAs. A specific class of DNA sequences, matrix attachment regions (MARs), was found to be in comparable positions in the two species, often flanking
individual genes. If these MARs define structural domains, then the orthologous genes in maize and sorghum should
experience similar chromatin environments. In addition, MARs were divided into two groups, based on the competitive
affinity of their association with the matrix. The “durable” MARs retained matrix associations at the highest concentrations of competitor DNA. Most of the durable MARs mapped outside genes, defining the borders of putative chromatin
loops. The “unstable” MARs lost their association with the matrix under similar competitor conditions and mapped
mainly within introns. These results suggest that MARs possess both domain-defining and regulatory roles. Miniature
inverted repeat transposable elements (MITEs) often were found on the same fragments as the MARs. Our studies
showed that many MITEs can bind to isolated nuclear matrices, suggesting that MITEs may function as MARs in vivo.
INTRODUCTION
Our recent results have illustrated that genes residing in the
maize alcohol dehydrogenase (adh1) region display two distribution patterns: individual genes amid a sea of repetitive
DNA and clustered genes occupying a space uninterrupted
by highly repetitive DNAs (Tikhonov et al., 1999). Sorghum, a
close relative of maize with an ⵑ3.5-fold smaller genome
(Laurie and Bennett, 1985), has orthologous genes located
in an environment that lacks highly repeated DNAs
(Tikhonov et al., 1999). Similarly, the maize shrunken2/a1
genes are located ⵑ140 kb apart (Civardi et al., 1994), separated by blocks of repeats of retroelements (P. SanMiguel,
unpublished observations), whereas no large repeats were
found in the orthologous regions of sorghum and rice (Chen
and Bennetzen, 1996; Chen et al., 1997). Given the established importance of the genomic context for the function of
a gene, best illustrated by the phenomenon known as position-effect variegation, these findings point to an apparent
paradox: orthologous genes in related species may show
similar functions in very different chromosomal settings.
However, all results describing the effects of the surrounding DNA on a gene’s behavior have been obtained either in transgenic systems or with translocation events in
which tested genes are placed in an environment that is dif-
1 To
whom correspondence should be addressed. E-mail zavramov
@bilbo.bio.purdue.edu; fax 765-496-1496.
ferent from their wild-type location. Therefore, the chromosomal context and its putative effect on a gene’s function in
its natural environment have not been assessed. The existence of functional genes among blocks of retroelement repeats indicates that natural systems must have evolved
mechanisms and strategies that would allow orthologous
genes to function at their respective, evolutionarily established locations, despite the very different natures of the
neighboring sequences.
A current model accounting for this apparent paradox
suggests that genes and blocks of repetitive DNAs might
exist in different, structurally separated nuclear compartments (Henikoff and Comai, 1998; Lamond and Earnshaw,
1998; Cockell and Gasser, 1999) and that mislocation alters
expression. Each gene in the nucleus may have only one
“address” at which it functions correctly, and during evolution,
natural genes may have acquired “anchors” to stably position
them in the spatial architecture of the nucleus (Flavell, 1994).
A specific class of DNA sequences, matrix attachment regions (MARs), may be involved in this anchoring function.
MARs are operationally defined as DNA sequences that
preferentially bind to the proteins of the nuclear matrix, potentially defining and delimiting individual structural units of
chromatin (reviewed in Bode et al., 1996). One step in testing
such a model therefore would be to screen for the distribution
of MARs along large chromosomal continuums. MAR location would suggest a pattern for a putative folding of the
250
The Plant Cell
region into individual structural domains. Defining how biologically active units are positioned within these domains
would be an obvious next step.
In an earlier study, we reported that the long stretches of
mixed classes of repetitive DNAs in the maize adh1 region were
segregated into topologically sequestered units (Avramova
et al., 1995). The lack of DNA sequence data, however, precluded more detailed comparisons between the possible
structural organization of the region and gene/genome function. Subsequent studies of the region (Avramova et al.,
1996; SanMiguel et al., 1996; Tikhonov et al., 1999) provided
a molecular basis for renewing our pursuit of a possible relationship between gene structure and function. Although some
limitations of our analysis and screening approach, at that
time, have been acknowledged and discussed (Avramova et
al., 1995), most of them have now been overcome, as described below.
Recently, we were able to demonstrate that microcolinearity of gene composition between maize and rice was paralleled by genomic folding into colinear structural units.
More importantly, the pattern of potential structural organization was followed by similar placement of the genes within
the putative loops, suggesting that the positioning of the
genes in their respective structural domains might be conserved in evolution (Avramova et al., 1998). However, the
colinear shrunken2/a1–homologous regions of both rice and
sorghum lack large amounts of repetitive DNA and hence
are in gene-rich environments (one gene per 8 to 10 kb). The
colinear maize and sorghum adh regions differ greatly in the
amount of intervening repetitive DNAs and resulting gene
richness (one gene per 25 kb in maize versus one gene per 6
kb in sorghum; Tikhonov et al., 1999).
Here, we studied a possible structural folding of large
colinear segments carrying orthologous genes in different
genomic environments by using the distribution of MARs
along the regions as genomic elements that may define
structural units. The placement of candidate genes within
the putative structural domains supports a model in which
plant genes have evolved within structural domains separated from neighboring nongenic or genic sequences.
RESULTS
Comprehensive Analysis of MARs in the Maize
adh1 Region
The screening for MARs was conducted by using an approach that tests the capacity of isolated nuclear matrices to
interact with end-labeled tested DNA fragments in the presence of excess unlabeled competitor DNA. In our previous
study, due to the absence of sequence information, MARs
were positioned on the basis of the available map created
by a set of three restriction endonucleases (Springer et al.,
1994). As a result, MARs could have been destroyed by the
digestion (Avramova et al., 1995) or missed because of too
short or too long fragment lengths (Izaurralde et al., 1988;
Mielke et al., 1990). Recent data on the sequence composition of this region (Tikhonov et al., 1999) allowed MAR
screening after digestions with sets of carefully chosen restriction endonucleases (see Methods). This approach, however, generated a very large number of fragments, and we
have chosen to reexamine only the low-copy-number regions of the maize contig under new binding conditions.
Multiple overlapping fragments from the low-copy-number regions were generated and studied. The shortest fragments still displaying binding activities were defined as
MARs. This approach made it possible to identify individual
attachment points within larger matrix binding regions. Each
identified MAR was characterized under several different
competitive concentrations and on several overlapping fragments. These multiple reexaminations allowed independent
confirmation of the presence of a MAR and its characteristics. Moreover, these experiments also permitted a more
precise definition of MAR location and complexity. Knowing
that the largest low-copy-number domain from the maize
adh1 contig was segregated in a putative loop of ⵑ45 kb
(Avramova et al., 1995), we wished to test the possible structural organization within this domain, particularly with regard
to several candidate genes (Tikhonov et al., 1999). A goal of
this study was to compare the possible chromatin organization of orthologous genes in colinear genomic regions of two
species differing in the amount and nature of intervening
DNAs. Hence, we have focused on the possible structural
organization of the putative gene-containing regions.
In our earlier study, MAR screening was conducted under
stringent binding conditions (Avramova et al., 1995). High
concentrations of competitor DNA and nonsaturating quantities of matrices in the assays allowed only fragments with
the highest matrix affinity to be identified. Here, binding assays were performed with saturating amounts of matrix proteins. These have been established empirically by testing
increasing amounts of isolated matrices, until a point was
reached at which all of a labeled MAR fragment was matrix
bound. The concentration of matrix proteins at which all of
the labeled MAR fragments were recovered from the pellet,
with none being found in the supernatant, was defined as
saturating. It corresponded, approximately, to the residual
protein obtained after high-salt extraction of 1 A260 OD of
starting nuclei (see Methods). A higher concentration of matrix proteins provides more binding sites in the system, allowing for the discovery of weaker but specific MARcontaining fragments (Mielke et al., 1990).
Matrices isolated from the nuclei of sorghum seedlings
were used. Matrix proteins are evolutionarily conserved
such that they recognize and bind MARs across species in
test assays conducted in vitro (Cockerill and Garrard,
1986b; Avramova and Bennetzen, 1993). Here, matrix preparations from sorghum seedlings were found to bind maize
MAR-containing fragments with the same specificity shown
earlier for matrices isolated from maize leaves and roots
MARs and Putative Genomic Structures
(Figures 1A and 1B). Also, a MAR flanking a tobacco root–
specific gene, RB7 (Hall et al., 1991), bound with very high
specificity to the matrix isolated from nuclei of sorghum
seedlings (Figure 1C), supporting the idea that matrices from
one species can be used to discover MARs in the genome
of another.
Finally, three different concentrations of competitor DNA
were tested. The results allowed us to evaluate both the relative binding strength of individual MARs and to follow the
profile of their binding to the matrix under competition (Figures 2 and 3). Two classes of MARs have been tentatively
discriminated: “durable” MARs and “unstable” MARs.
Two Classes of MARs
Under the binding conditions described, many fragments
from the tested clones bound specifically to the nuclear matrices. This created ambiguities for defining possible loops.
251
We looked for criteria that would allow us to distinguish between different types of matrix binding DNAs. Usually,
MARs are arbitrarily defined as strong and weak, based on
their different affinities for the nuclear matrix in the presence
of an excess of nonspecific DNA.
Following this criterion, we quantified the amount of remaining matrix-bound DNA after competition with various
concentrations of nonspecific DNA. On the basis of the density-profile peaks of a labeled fragment recovered from the
pellet (MAR-containing fragment), its amount was calculated
as a percentage of the density of each fragment present in
the input fraction, taken as 100% (Figure 2; see Methods).
Restriction fragments recovered from the pellet were quantitatively analyzed for their matrix binding affinity in the
presence of 100 to 400 ␮g/mL competitor DNA. These correspond to 5000- to 20,000-fold excesses of Escherichia
coli DNA over the substrate genomic DNA. Approximately
270 DNA fragments were analyzed and 36 MARs were identified along the two contigs.
Figure 1. In Vitro Matrix Binding Assays near the Orthologous adh Genes.
(A) The sorghum adh-containing clone p13MC (a 13-kb MluI-ClaI fragment in the pGEM7⫹ vector) was digested with two combinations of enzymes: MluI, SacII, Bsu36I, SnaBI, MfeI, and NcoI (left and middle sections) or MluI, SacII, ApaLI, SmaI, MfeI, SpeI, and XbaI (right section). The
amount of competitor DNA is shown above each lane in ␮g/mL.
(B) Maize subclones p5zMAR and p3zMAR from the 5⬘ and 3⬘ regions near adh1-F, respectively. p5zMAR (left two lanes) was digested with XbaI
and XhoI, whereas p3zMAR (right two lanes) was digested with BamHI, BglII, and EcoRI. The amount of competitor DNA is shown on top of each
lane in ␮g/mL.
(C) The tobacco RB7 MAR (Hall et al., 1991) as a heterologous control. Clone pGHNC5, containing the matrix attachment fragment as a dimer,
was digested with NotI and Bsp120I. The amount of competitor DNA is shown on top of each lane in ␮g/mL.
In, the input DNA before binding (one-twentieth of the amount used for binding was loaded); M, a 1-kb ladder DNA marker; Pellet, the DNA
bound to the matrix proteins; Supernatant, the unbound DNA (one-tenth of the supernatant was loaded); v, the fragments of the cloning vector
used as a negative control; v⫹, vector fragment with an attached cloned fragment. Black arrows indicate matrix-bound fragments; open-headed
arrows indicate fragments resulting from partial star activity.
252
The Plant Cell
The matrix binding curves drop steeply under increasing
concentrations of competitor DNA (Figure 3B). Although
some of the MARs in this class could qualify as strong at
lower excesses of E. coli DNA (e.g., Figure 3B, curves 1 to 3),
their binding to the matrix was blocked by higher concentrations of competitor DNA. These less competitive, unstable
MARs mapped to regions inside genes and, predominantly,
within putative introns (Tikhonov et al., 1999).
A notable exception is the dissociation curve of a MAR located internal to maize gene 334B7.4; this MAR displays a
durable profile (Figure 3C). It is discussed in more detail later
in the article.
Orthologous adh Genes of Maize and Sorghum and
Their Structural Domains
Figure 2. Matrix Binding Analysis.
(A) X-ray film image of a representative matrix binding assay. The input lane shows a complete set of fragments, from the maize clone
pL347, tested for binding activity. The pellet lanes show the DNA
fragments recovered from the pellet, that is, bound to the matrix, under competition with 100, 200, or 400 ␮g/mL of E. coli DNA. v, mobility of vector fragments; v⫹, vector fragments with adjacent
genomic sequences.
(B) Density-profile plots of the gel in (A) representing the average
gray value of the pixels across the gel lane. The scale is different for
each plot.
After plotting the amount of recovered matrix-bound fraction over the concentration of the competitor DNA, we were
able to follow the changes in binding of the different MARs.
Analyzing the profiles of the curves, we observed that MARs
could be divided into two classes. A common feature of the
first class, as illustrated by the representative profiles of several MARs (Figure 3A), is that these MARs continue to bind
the matrix in the presence of the highest tested concentrations of competitor DNA. Some of these MARs, if considered at only one competitor DNA concentration (even the
lowest, i.e., a 5000-fold excess), could have been classified
as weak and possibly dismissed from further analysis. However, these MARs still could bind the matrix (15 to 30% recovered from the pellet) under higher (up to 20,000-fold)
excesses of competitor DNA (Figure 3A, curve 5). This class
represents the durable MARs, and 16 MAR fragments were
found to belong to this category. All but one of them
mapped in the space outside genes.
The MARs in the second class exhibit different profiles.
The maize adh1 gene is an example of a lone gene amid
large blocks of highly repetitive elements. Previous studies
have localized a MAR 5 ⬘ to the gene (Avramova and
Bennetzen, 1993; Paul and Ferl, 1993) and two MARs at its
3⬘ end (Avramova et al., 1995). The precise location of the
weaker MAR closer to the 3 ⬘ end, however, has remained
unclear. Thus, the size of the putative adh1-containing loop
could vary from 3 to 9 kb, depending on the position of the
MAR inside a 7-kb restriction fragment. Also, a small MARcontaining fragment (⬍500 bp) from the region was not positioned on the previously available map (Avramova et al.,
1995).
Here, the 5⬘ and 3⬘ regions flanking maize adh1 have been
examined in detail after being subcloned and digested with
a combination of restriction endonucleases. The recovery of
the tested fragments from the pellet (MARs) and the supernatant (nonbound DNA) is shown (Figure 1B). The gradual
increase in the percentage of the fragment in the supernatant, as a result of increasing concentration of competitor, illustrates the different affinities for the matrix proteins
displayed by the various MAR-containing and non-MAR
fragments.
Analysis of the matrix-bound fragments indicated that
they were adjacent on the contig, probably constituting multiple attachment points. A model for the structure of a putative (ⵑ4 kb) adh1 domain, bordered by the two flanking
MARs, is shown (Figure 4). Thus, both MARs surrounding
the adh1 gene appear to be composite and durable (Figure
3A, curves 1 and 5) and to be tightly bordering the long terminal repeats of the flanking retroelements ( Cinful-2 and
Milt). These complex MARs are positioned to precisely separate the gene from the neighboring 70-kb (5 ⬘) and 14-kb
(3⬘) blocks of highly repetitive DNAs.
It was of interest to determine the structure of the region
around the orthologous sorghum adh gene because the regions occupied by MARs in maize did not exhibit obvious
sequence homology with sorghum and because there are
no large repetitive DNA blocks around the sorghum adh
gene (Tikhonov et al., 1999). A 13-kb region containing the
MARs and Putative Genomic Structures
253
sorghum adh gene was subcloned and digested with two
different sets of restriction endonucleases, and the overlapping fragments were tested for binding activity. The results
from the binding of two different sets of fragments are
shown (Figure 1A). Both 5 ⬘ and 3⬘ MARs flanking the sorghum adh1 homolog were found to be durable (data not
shown).
Analysis of the matrix-bound fragments suggested a potential structural organization of the sorghum adh gene–containing domain (Figure 4). The similarity in the sizes of the
putative loops, defined by the positioning of the matrix binding regions, is quite remarkable, given the absence of MAR
sequence homology. In the different chromosomal contexts
in which the orthologous genes exist in the two species, the
preservation of the structural organization of the gene may
be relevant to its expression.
Putative Genes and Putative Domains along the
Maize Contig
Figure 3. Competition Profiles for Various MARs.
(A) Matrix associations of durable MAR fragments under three different concentrations of E. coli competitor DNA: 100, 200, 300, or 400
␮g/mL. Representative graphs are shown for MARs from both maize
and sorghum: curve 1, a 1.2-kb MAR fragment 5⬘ of the maize adh1
gene; curve 2, a 245-bp MAR fragment between the sorghum
110K5.10 and 110K5.11 genes; curve 3, a 2.7-kb MAR fragment between the maize 334B7.7 and 334B7.8 genes; curve 4, a 160-bp MAR
between the sorghum 110K5.8 and 110K5.9 genes; curve 5, a 2.5-kb
MAR 3⬘ of the adh1 gene; and curve 6v, a control vector fragment.
(B) Matrix associations of intronic MARs under similar binding conditions. Curves 1 and 2 are 2.3- and 2.2-kb MAR fragments, respectively, located in introns of the sorghum 110K5.4 gene; curve 3, a
440-bp MAR fragment inside the sorghum 110K5.11 gene; curve 4,
a 2.3-kb MAR fragment in the last intron of the maize 334B7.7 gene;
and curve 5v, a control vector fragment.
The possible chromatin-loop arrangements of the gene candidates recently identified on the maize contig (Tikhonov et
al., 1999) were examined. Four genes, 334B7.2, 334B7.3,
334B7.4, and 334B7.9, are individual loci, surrounded by
blocks of retroelement DNAs. Screening for MARs has identified MARs around the genes, suggesting structural organization similar to the 334B7.3 ( adh1) gene. Putative gene
334B7.2, discovered at one end of the contig, is separated
from its nearest neighbor, adh1, by ⵑ70 kb of retroelements
(Figure 5). A MAR marks one end of its putative structural
loop, whereas the other end was not defined because it was
outside the contig. Downstream of the adh1-containing
loop, after an ⵑ14-kb block of highly repeated retroelements, another solitary putative gene, 334B7.4, was discovered amid repetitive DNAs (Tikhonov et al., 1999). In this
region, both a newly found and a previously discovered
MAR (fragment 47 in Avramova et al., 1995) mapped inside
the gene (Figure 5). One of the internal MARs, overlapping
putative exon sequences of the gene, displayed a matrixaffinity profile relating it to the durable, possibly loop-defining MARs (Figure 3C). The curve of the other was somewhat
intermediate (see Discussion). The closest MARs located
outside the 334B7.4 gene sequence were, on the one side,
the MAR from the 3⬘ end region of adh1 and, on the other,
two MARs on adjacent fragments (see below). These would
define a putative loop of ⵑ60 kb. This predicted loop is unusual because it is composed of various highly repetitive
(C) Matrix affinity of a MAR located inside the maize 334B7.4 gene.
Curve 1, a 1.4-kb MAR fragment found inside a putative exon sequence (this fragment also contains a MITE); and curve 2v, a control
vector fragment.
254
The Plant Cell
Figure 4. Comparison of the Orthologous adh Domains in Maize
and Sorghum.
The maize yeast artificial chromosome (YAC) bar (top) represents a
225-kb region of maize chromosome 1 containing adh1. The empty
numbered boxes correspond to the nested retrotransposon blocks.
The open boxes with the arrows inside and numbers on top indicate
gene-containing regions. The arrows designate putative directions
of transcription of identified candidate genes. The sorghum bacterial
artificial chromosome (BAC) bar (bottom) represents the orthologous
78-kb region of sorghum. The figures in the middle illustrate the putative adh1 chromatin domains. The potential chromatin loops, defined by MARs (empty boxes), are shown as dotted loops. The
composite structures of the MARs are shown by open boxes, indicating individual matrix binding fragments. The asterisks designate
identified MITEs in the MAR-containing regions. The boxes containing triangles, flanking the maize adh1 gene domain, correspond to
the long terminal repeats (LTR) of the neighboring Cinful-2 and Milt
retrotransposons.
retrotransposons and a gene. The maize 334B7.4 gene had
been considered, hitherto, as an example of a plant gene
“broken” by the insertion of three large blocks of highly repetitive elements, scattering the genic sequence over 42 kb
of genomic space (Tikhonov et al., 1999). Here, we show
that 334B7.4 may have an unusual domain organization as
well.
The last gene from the maize contig surrounded by repetitive DNAs is putative gene 334B7.9. However, its apparent
loop also includes remnants of two members of retroelement families, Cinful-1 and Kake-1 (Figure 5). The possible
loop-defining MARs of this (ⵑ10 kb) domain have been established previously (Avramova et al., 1995) and have not
been retested in this study.
The remaining four genes, 334B7.5, 334B7.6, 334B7.7,
and 334B7.8, are organized in a cluster, uninterrupted by repetitive retroelement DNA (Tikhonov et al., 1999). Multiple
overlapping fragments from the region were generated and
tested for matrix binding. The pattern of MAR distribution
was rather complex, and the durable MARs were considered
domain defining.
The first gene of the cluster, 334B7.5, is positioned in a
putative 5-kb loop. The upstream binding region seems to
contain at least two attachment points that, together, sepa-
rate the low-copy-number region from the highly repetitive
DNA block upstream. Each of these two attachment fragments behaved as weak MARs, but, nonetheless, they
bound the matrix under the stringent assay conditions of the
earlier study (Avramova et al., 1995) and displayed durable
MAR profiles under the highest competition conditions of
this assay (not shown).
The MAR closing the loop containing the 334B7.5 gene
and defining the beginning of the adjacent loop was found
positioned very near the 3⬘ end of the neighboring 334B7.6
gene. The other end was located ⵑ20 kb downstream,
where a durable MAR was found between the 334B7.7 and
334B7.8 genes (Figure 3A, curve 3). An unusual feature of
this putative structural loop is that it encompasses two
genes, because no MAR was found in the space separating
the 334B7.6 and 334B7.7 loci. These two putative genes,
apparently transcribed in opposite directions, were found as
an unusual example of closely positioned maize genes that
are not gene duplication products (Tikhonov et al., 1999).
They are separated by only 200 bp of DNA and apparently
share the same structural domain. Gene 334B7.7 contains
several unusually large introns and two internal MARs (Figure 3B, curve 4). The last gene from the cluster, 334B7.8, is
positioned in a putative loop defined by two durable MARs.
An unstable MAR was found inside a putative intron (data
not shown).
Putative Folding of the Colinear Region of Sorghum
As a result of the absence of retrotransposons from the homologous region of sorghum, the genes are more densely
arranged, with an average density of one gene per 6 kb. An
unexpected observation was that three putative genes, in
the otherwise colinear sorghum region, were missing in
maize (Tikhonov et al., 1999). The differences in the genomic
structures, at a sequence level, prompted us to compare the
possible domain-folding patterns of the two regions. The
distribution of the MARs along the 78-kb sorghum continuum colinear with the 225-kb maize contig was examined.
Each of seven subclones was digested with combinations of
restriction nucleases (see Methods), and overlapping fragments were tested for binding capacity under the same conditions as described for maize (Figure 6).
The distribution of the intergenic MARs along the sorghum contig outlines a pattern, placing most of the genes in
apparently well-defined putative loops. Notable exceptions
are two genes, 110K5.9 and 110K5.10, that appear to share
the same loop. The ends of this putative structural domain
are defined by durable MARs positioned between 110K5.8
and 110K5.9 (Figure 3A, curve 4) and between 110K5.10
and 110K5.11 at the other end (Figure 3A, curve 2). The region separating putative gene 110K5.9 from 110K5.10 displayed no matrix binding potential. Putative intronic MARs
were identified in both 110K5.10 (not shown) and 110K5.11
(Figure 3B, curve 3).
MARs and Putative Genomic Structures
255
Figure 5. MARs and Predicted Loops in the Maize adh1 Region.
Maize YAC 334B7 is represented by a bar with open boxes (genic DNA). Putative exons are shown as vertical boxes. Diamonds and filled arrowheads designate MITES and long interspersed nuclear element–like retroelements, respectively. The thin lines correspond to regions occupied
by repetitive retrotransposon DNAs. Putative chromatin loops are shown by curves above the boxes. The anchors and the asterisks indicate the
positions of domain-defining and intragenic MARs, respectively. The boxes below the YAC designate the restriction enzyme fragments tested in
the binding assays: multiple overlapping fragments were tested, and the MAR signs were placed above the shortest restriction fragments recovered from the matrix-bound fraction. The scale is in kilobases. The numbers above the boxes correspond to the putative genes as defined in
Tikhonov et al. (1999) and as referred to in the text.
MARs and Miniature Inverted Repeat
Transposable Elements
Based on the molecular analysis of two different orthologous chromosome segments, the shrunken2/a1 regions of
rice and sorghum and the adh1 regions of maize and sorghum, ⵑ80 miniature inverted repeat transposable elements
(MITEs; reviewed in Wessler et al., 1995) were identified
along the four genomic continuums (Chen and Bennetzen,
1996; Chen et al., 1997; Tikhonov et al., 1999). Screening
the regions for MARs revealed ⵑ40 restriction fragments
with matrix affinity that also contained a MITE, raising the
question of whether MITEs carry the MAR function and/or
whether MITEs have inserted preferentially around MARs.
To study this question, we tested the capacity of isolated
MITEs to bind to the matrix. Approximately 20 elements
from the four genomic regions were examined under the
binding conditions described in this study. The MITEs, in all
tested cases, displayed matrix binding activity (Figure 7).
The MITEs were obtained either by excision with restriction
endonucleases or by polymerase chain reaction (PCR) amplification, using devised primers (shown in Methods).
In approximately a dozen cases, the presence of convenient restriction sites allowed the isolation of MITEs as separate recognizable fragments from the surrounding genomic
sequences. The binding profiles of a few examples are
shown in Figures 7A to 7E.
Alternative digestions with restriction nucleases of the region containing two nested Tourist elements 3⬘ to the sorghum 110K5.3 gene (Figure 6) released either the whole set
(Figure 7A) or an individual element (Figure 7B; for detailed
nucleotide information on the structure and location of the
elements and the position of the restriction sites, see the
legend to Figure 7 as well as GenBank accession number
AF124945 and Tikhonov et al. [1999]). The matrix binding
tests showed that both nested and individual Tourist elements can bind the matrix (Figures 7A and 7B). In accordance, an individual Tourist near the 5⬘ end of the 110K5.5
gene from the same contig (Figure 6) binds also to the matrix (Figure 7C, marked by the arrow). The higher molecular
weight bands belong to adjacent genomic sequences not
discussed here.
The matrix binding of two MITEs of maize origin are
shown as well: the MITE from the putative exon of maize gene
334B7.4 (Figure 7D) and the Tourist-Zm1 element (Bureau
and Wessler, 1992; Figure 7E). The latter element is very
short, ⵑ130 bp, which may be a reason for its apparent
weaker binding. It is possible that it is a component of a
larger binding region at its genomic location.
Conveniently located restriction sites, however, often
would leave flanking genomic sequences or eliminate sequences from the tested elements, which could impact the
MAR activity of the tested fragments. To avoid this possibility, we examined several MITEs after PCR amplification of
their sequences. The 894-bp fragment containing three complex MITEs, between maize genes 334B7.7 and 334B7.8
(Figure 5), has been PCR amplified using primers tightly
flanking the whole set. The primers and their genomic positions are shown in Methods and provided under GenBank
accession number AF123535. The matrix binding capacity
of the whole fragment is shown in Figure 7G (arrow 1).
The potential involvement of individual elements in the
256
The Plant Cell
Figure 6. MARs and Predicted Loops in the Sorghum adh Region.
A region of 78 kb from sorghum BAC 110K5 is shown as a bar containing the exon–intron structures of candidate genes. All designations, including MITEs, long interspersed nuclear element–like elements, intragenic MARs, MARs outside genes, and putative chromatin loops, are as
given in Figure 5. The boxes below the graph designate the tested restriction fragments. The seven BAC subclones (p11CM, p13MC, p5.4, p17,
p2, p16, and p18) are shown; their cloning sites are marked by vertical dotted lines.
matrix binding interaction is illustrated for two adjacent
Tourist elements revealed in the sorghum shrunken2/a1
region, close to the 3⬘ end of the shrunken2 gene homolog
(Chen et al., 1997; GenBank accession number AF010283).
Their structure, the location of the various primers used, and
the amplified regions are illustrated in Figures 8A and 8B.
The binding capacity of the individually amplified fragments
are shown in Figures 7F and 7G. Bands 2 and 4, corresponding to the individual MITEs, can bind the matrix; an
amplified fragment containing only the terminal repeats at
the junction of the two elements (band 5) does not bind to
the matrix. The whole fragment (band 1) binds as well, although with an apparent lower affinity. The reason for this is
not clear at present, although one possibility is the lower
overall outcome of this particular labeling (as determined by
the intensity of the input fragment). Nonetheless, this apparently weaker fragment retained persistent matrix binding
capacity when tested under three different competition concentrations. Its matrix-affinity curve as well as the binding
curves of all examined MITEs with MAR activity place them
in the class of the durable MARs (data not shown).
DISCUSSION
9), and two genes that apparently cohabit with retroelements inside their structural domains.
The genic sequence of the maize 334B7.4 gene has been
scattered over 42 kb of genomic space as a result of the insertion of three large blocks of retroelements (Tikhonov et
al., 1999). The closest external MARs would delimit a putative structural domain of ⵑ60 kb (Figure 5). However, a MAR
located inside the gene displayed matrix-affinity characteristics of the durable MARs (Figure 3C). This could indicate
that structural attachment point(s) might exist inside the
genic sequence. Thus, this region might be involved in a
structural role, particularly if the gene is not functional anymore, as suggested by the location of a MAR and a MITE inside an exon (Tikhonov et al., 1999). In the absence of
expression data regarding the functional capacity of the
334B7.4 gene, however, it is not possible to decide whether
it is located within a large structural domain, flanked by and
interrupted by retroelements, or whether it serves as an attachment structure itself.
The only other gene in the studied region sharing a loop
with retrotransposable elements is the putative 334B7.9
gene (Figure 9, loop l⬘). There are no data regarding whether
this is a functional gene, but both 334B7.4 and 334B7.9
genes provide attractive systems to further pursue a relationship between genes’ structural organization and their
expression.
Domain Organization along the Maize Contig
Domain Structures in Sorghum
Comprehensive analysis of MAR distribution in the lowcopy-number regions of the maize contig provided an insight into the potential of these regions to fold into putative
structural loops. The general arrangement along the maize
contig shows genes occupying individual loops that are
separated from neighboring retroelement repeats. Three exceptions have been observed: the 334B7.6 and 334B7.7
genes, which are positioned in the same loop (Figures 5 and
In the 78-kb colinear region of sorghum, 18 MARs were
identified. Most of the genes, with the apparent exception of
genes 110K5.9 and 110K5.10, seem to be positioned in individual loops (Figures 6 and 9). Seven of the MARs colocalize
with fragments containing MITEs. It is interesting that a fragment in the space between genes 110K5.9 and 110K5.10
did not bind to the matrix, despite the fact that it contained
MARs and Putative Genomic Structures
257
a MITE; thus, the two genes apparently share the same
domain.
Comparison of the Predicted Domain Structures in
Maize and Sorghum
The degree of similarity between the orthologous adh-containing regions in maize and sorghum has been reported at
the DNA sequence level (Tikhonov et al., 1999). The possible
folding into structural domains (loops) of these colinear continuums has been assessed here. Despite the presence of
more genes (nine and 14 putative genes on the maize and
sorghum contigs, respectively), the sorghum contig contains
approximately three times less DNA than does the orthologous maize region. Of the 225 kb surrounding maize adh1,
Figure 7. In Vitro Nuclear Matrix Binding Assays of MITEs.
Fragments corresponding to MITEs are shown by arrows in (A) to
(G). Open arrowheads denote fragments from the vector; m, I, and P
denote lanes containing size-marker fragments, input fragments,
and fragments recovered from the pellet, respectively.
(A) Clone p13MC (Table 1), digested with PmlI, SnaBI, and ScaI, released nested Tourist elements located 3⬘ to the sorghum 110K5.3
gene.
(B) Alternative digestion of the same clone with StuI alone, showing
the binding activity of one of the Tourist elements.
(C) Clone p17 (Table 1) after digestion with NotI and StuI, generating
a solo Tourist at the 5⬘ of the sorghum 110K5.5 gene.
(D) Maize clone p95d (Table 1) after digestion with AflII and StuI, releasing the MITE inside the putative exon of maize gene 334B7.4.
(E) Clone pB2wx carries the 128-bp Tourist-Zm1 element (Bureau
and Wessler, 1992; GenBank accession number S48688). Digestion
with EcoRI released a 184-bp fragment; its binding is shown by the
arrow.
(F) and (G) PCR-amplified MITEs. Fragment 1 contains three MITEs
(two nested and a closely adjacent one) from the spacer region between maize genes 334B7.7 and 334B7.8 (see Figure 8B). The primers are shown in Table 3. Fragments numbered 2 to 4 correspond to
amplified elements, as shown in Figure 8A. Fragment 5 is a short region containing the terminal inverted repeats of two adjacent Tourist
elements and does not display binding activity.
Figure 8. Schematic Representation of the MITEs Shown in Figure 7.
(A) The region containing two closely positioned Tourist elements at
the 3⬘ end of the sorghum shrunken2 homolog (Chen et al., 1997;
GenBank accession number AF010283).
(B) The region containing three MITEs (two nested and a closely adjacent one) from the spacer region between maize genes 334B7.7
and 334B7.8 (Tikhonov et al., 1999).
The individual MITEs are shown by the boxes, and the terminal inverted repeats as blank arrows. The position of the primers is shown
by the black arrowheads. The numbers at the primers correspond to
the genomic location of the respective sequence. The PCR products
are shown by the numbered lines under the boxes, corresponding to
the numeration of the binding fragments in Figures 7F and 7G. The
cross-hatched box inside the empty box illustrates a MITE nested
within another MITE.
160 kb is occupied by highly repetitive DNAs represented by
23 long terminal repeat retrotransposons, organized in six
blocks of various sizes (Tikhonov et al., 1999). To be able to
compare the gene-containing structures of the two regions,
we have graphically “subtracted” the retrotransposon blocks
from the maize contig (Figure 9).
It is shown that the gene-containing domains and the
sizes of the putative loops containing orthologous genes in
the two species are comparable. This fact suggests that the
structural folding of the homologous genomic regions, excluding the blocks of highly repetitive DNA, might be very
similar. Thus, despite an enormous difference in the sizes of the
loops occupied by the two orthologous genes, 110K5.4 and
334B7.4 (7 and 60 kb, respectively), if the retroelement blocks
in the maize gene were excluded, the genic sequence would
be located in comparable loops (Figure 9, d and d⬘).
A notable difference is that in sorghum, a gene-containing
loop (Figure 9, loop e) immediately adjacent to the 110K5.4
gene has been deleted from what would have been its location in maize (Figure 9). A relevant question is whether the
perturbation of the maize 334B7.4 domain structure by the
inserted large blocks of retroelements might be related to
the deletion of its immediate neighbor.
Another major difference is that two adjacent genes in
sorghum, 110K5.8 and 110K5.9, are also missing from the
258
The Plant Cell
Figure 9. Comparison of Putative Chromatin Domain Structures of the Orthologous adh Regions in Maize and Sorghum.
The maize YAC represents 225 kb of the maize adh1-F region, with 22 identified long terminal repeat retroelements and eight genes. The
sorghum BAC bar represents 78 kb of the sorghum adh region and 14 candidate genes. The black arrows in the boxes designate the putative direction of both gene and retroelement transcription. The genes in sorghum are designated with letters, whereas the homologous
maize genes are denoted by the same letters with the prime signs. Nonhomologous genes are designated with open arrows. Dashed lines
connecting the bars indicate the nonconserved (deleted) regions between maize and sorghum. The anchors in the boxes indicate durable
domain-defining MARs.
maize region. These regions differ in their domain organization as well: 110K5.8 is predicted to be in an individual small
loop (Figure 9, loop h) on the sorghum contig, whereas
110K5.9 shares a putative loop with 110K5.10 (Figure 9,
loop i to j). The orthologous maize gene, 334B7.7, also is
positioned in a loop as a pair. However, because the two
corresponding neighboring genes 5⬘ to 334B7.7 are missing
in maize, its partner in the loop is its current 5⬘ neighbor,
gene 334B7.6 (Figure 9, loop g⬘ to j⬘). Therefore, sharing a
structural domain with a closely positioned 5⬘ neighboring
gene seems to be a preserved feature for the two orthologous genes, 334B7.7 and 110K5.10. The possible significance of such an arrangement is unclear, but it points again
to a possible relationship between the positioning of a gene
within a structural unit and its functional requirements.
The apparent tendency of high-copy-number retroelements to insert into other retroelements, rather than into
genic sequences, was reported by SanMiguel and co-workers (SanMiguel et al., 1996). Here, we noted that the insertion sites in the maize genome of the initial retroelements
(those at the base of the stacks [SanMiguel et al., 1996])
map right at, or in a very close proximity to, MARs that flank
genes. This observation suggests that MARs may act both
as potential target sites and as barriers for the genes against
invasion by transposable elements.
Intronic MARs
It has been reported that under conditions of nonsaturating
amounts of matrix proteins (i.e., binding sites), only DNAs
with the highest affinity would exhibit binding. Equilibrium
binding studies, at saturating levels of matrix proteins, have
revealed that the process is cooperative and reversible
(Mielke et al., 1990). Competitor DNA, added to prevent
nonspecific binding, is used to measure the matrix affinity of
a MAR-containing fragment. On this basis, MARs have been
arbitrarily defined as strong and weak, and it has been suggested that they may have different functions (Allen et al.,
1993; Bode et al., 1996). However, the criteria for defining
strong and weak MARs have not been unified.
The detection of MARs in the introns of several genes
(Cockerill and Garrard, 1986a; Mielke et al., 1990; Avramova
and Paneva, 1992; Romig et al., 1994) pointed to yet another feature that divides MARs into different categories.
Intronic MARs, by definition, cannot block progression by polymerase II, implying either that they do not function in vivo
or that their binding to the matrix is regulated. In any case,
intronic MARs are not considered to be defining structural domains (Bode et al., 1996). An important question,
therefore, is what makes the two types of MARs different.
It has been shown that a simple classification of MARs as
strong and weak would not discriminate between intronic
and external MAR elements (Romig et al., 1994). We therefore chose to examine the various MARs for their affinity to
the nuclear matrix, under increasing concentrations of competitor DNA, instead of making a one-point strong/weak
comparison. Tentatively, the tested MARs were divided into
two classes on the basis of their binding patterns. One
group contains elements that could bind the matrix in the
presence of the highest competitor concentrations. These
MARs were termed durable, and they were usually found
MARs and Putative Genomic Structures
outside the genes. The second group, unstable MARs, were
competed off their matrix binding sites under similar conditions. Most of the intronic MARs were found in this class. It
should be emphasized that the difference in the competition
curves is not regarded as an absolute criterion separating the
two classes of MARs. Some of the tested MAR fragments,
particularly longer restriction fragments and fragments encompassing internal and external genic sequences, exhibited curves intermediate to the two profiles, indicating a
continuum of MAR affinities. The continuous series of MAR
binding behavior suggest that various components collaborate and contribute to the interactions. Therefore, the difference in the curves suggests that intronic and external MARs
might differ in the nature of their association contacts with
the nuclear matrix. Persistent binding activity might be more
relevant for MARs with a structural function, whereas a capacity for an easier dissociation of even strong MARs might
be a prerequisite for a controlled association with the matrix.
Intronic MARs were found in several sorghum and maize
genes. It is interesting that orthologous loci in maize/sorghum (334B7.4/110K5.4, 334B7.7/110K5.10, and 3334B7.8/
110K5.11) have preserved MARs in their respective introns.
These MAR-containing introns also display extensive sequence conservation (Tikhonov et al., 1999), distinguishing
them from the external MARs.
Little is known about the role, if any, of intronic MARs. It
has been suggested that MARs, particularly those in introns,
may act as a sink for absorbing the positive supercoils generated in front of a progressing polymerase transcription
complex, increasing the efficiency of transcript elongation
by removing a geometric impediment to progress (Benham
et al., 1997). Other data show an involvement of the intronic
␬-immunoglobulin MAR in the regulated demethylation of
the gene upon activation (Kirillov et al., 1996) and in somatic
hypermutation (Betz et al., 1994). An intronic MAR from a
potato gene, when placed between the gene encoding human interferon ␤ and neo, caused activation of transcription
in transfected mouse L cells (Mielke et al., 1990). The findings that intronic MARs are preserved in homologous genes
(Avramova and Paneva, 1992) and in orthologous introns in
maize and sorghum (see earlier text) support the idea of a
conserved function, possibly in gene regulation. Specific
MAR binding soluble factors might be involved in mediating
a regulated binding of intronic MARs to the nuclear matrix
(Boulikas and Kong, 1993; Bode et al., 1996).
MARs and MITEs
Inverted repeats are cruciform-forming sequences (Schroth
and Ho, 1995), and MARs are often enriched in inverted repeats (Boulikas and Kong, 1993). Plant genomes harbor
short interspersed repeats, MITEs, which have terminal inverted repeats and have a potential to form secondary
stem–loop structures (Bureau and Wessler et al., 1994;
Charrier et al., 1999). Nineteen of the 36 MARs discovered in
259
the adh-homologous genomic regions of the two grasses
were found to colocalize on the same fragments with MITEs.
Frequent close associations between MITEs and MARs in
the shrunken2/a1–homologous regions of rice and sorghum
have been noticed as well (Avramova et al., 1998), suggesting a possible correlation between the potential secondary
DNA structure of these elements and MAR function.
Hence, the capacity of isolated MITEs to bind to the matrix was tested. Approximately 20 elements contained within
MAR fragments from the four genomic regions were examined. The MITEs were obtained either by excision with restriction nucleases or by PCR amplification, and in all tested
cases, the MITEs displayed matrix binding activity. In several cases, MITEs were found in a nested arrangement or
immediately adjacent to each other, and our results showed
that the set as a whole and its individual components display a capacity for matrix binding. This had been observed
earlier for the maize 5⬘ adh1 MAR, before recognizing that this
region was made of two nested Tourist elements. Each of
these elements bound to the matrix, although with reduced
activity compared with the nested structure (Avramova and
Bennetzen, 1993).
The apparent involvement of these mobile elements in a
genome structural function concerns the basic question of
whether transposable elements and repetitive DNAs may
play any role in host genomes. As is also true of the MARs
associated with some members of retroelement families, it
does not seem likely that mobile DNAs would be key determinants of chromosome structure. However, once present at
a new genomic location, a mobile DNA element might be
selected for new local functions, such as gene regulation, recombinational initiation (Galliano et al., 1995), telomere structure
(Danilevskaya et al., 1994; Eickbush, 1997), or MAR function.
Loop Organization of Plant Genes
A major consequence of the model suggesting that chromatin in the nucleus is folded into discrete structural domains
(loops) is the attractive possibility that each domain constitutes an independent unit of genetic function. Several
classes of structural elements have been suggested to play
this role (reviewed in Geyer, 1997), but there is controversy
regarding the chromatin-boundary function for some of
them (Avramova and Tikhonov, 1999).
MARs are good candidates for playing a structural, domain-defining role, and their distribution in eukaryotic genomes typically places them at the borders of gene domains.
Adjacent MARs commonly outline potential loops of 3 to
400 kb (Boulikas and Kong, 1993), exhibiting a general inverse relationship between domain size and gene activity
(reviewed in Bode et al., 1996). In plants, relatively small (3
to 10 kb) loops seem to represent a common pattern for
the organization of plant genes (Breyne et al., 1992; van
der Geest et al., 1994; Chinn and Comai, 1996; Avramova
et al., 1998). In a 16-kb region of the small Arabidopsis
260
The Plant Cell
genome, for example, loops averaging ⵑ5 kb were found to
harbor two genes each (van Drunen et al., 1997).
Recently, it was reported that the maize adh1 gene is located within a predicted 90-kb loop, based on the size of a
genomic fragment obtained by preferential cleavage at superhypersensitive sites (Paul and Ferl, 1998). Although it is
not possible to interpret this result in terms of a structural
loop, it localizes the closest nuclease-susceptible sites to
the adh1 gene. Our sequence data indicate that ⵑ90 kb is
the approximate distance between the promoter of adh1
gene and the closest flanking promoter of maize gene
334B7.5, suggesting that the nuclease hypersensitive sites
map at potentially active promoter regions that often colocalize with MARs (reviewed in Bode et al., 1996).
A difficult unresolved matter is the relationship between
genome structure and regulated gene expression. An approach providing information on a possible link between
chromatin folding into structural units and the organization
of the genes inside has been reported here. From this perspective, MARs have been analyzed and considered as
structural elements only outlining putative domains. The
comparative analysis enabled us to observe preservation of
common structural features of gene organization, despite
the very different nature of the surrounding sequences. An
important corollary is that if MARs define structural do-
mains, then the orthologous genes in maize and sorghum
should experience similar chromatin environments. The
preservation of MAR function around orthologous genes, including cases of genes sharing a domain (334B7.7 and
110K5.10), points to functional significance.
Although numerous MARs have been found involved in
a variety of regulated phenomena (Allen et al., 1993;
Mlynarova et al., 1995; Iglesias et al., 1997; Holmes-Davis
and Comai, 1998; Matzke and Matzke, 1998; Ulker et al.,
1999; Vain et al., 1999), others have failed to play the role
expected of them in some transgenic studies. This suggests
that the role of MARs may be more elusive and complicated
than initially postulated. To understand how a genome functions, it is necessary to know how it is constructed, and a
description of the organization of complex native systems is
a step in that direction.
METHODS
DNA Clones
The yeast artificial chromosome (YAC) 334B7, contains an ⵑ225-kb
insert from the maize adh1 F region (Edwards et al., 1992). The bac-
Table 1. Subclones Analyzed in the Matrix Binding Assays
Clone
Sorghum
p11CM
P13MC
p5.4
p17
p2
p16
p18
p5sMARda
p3sMARa
Maize
pL243d
p5zMARa
p3zMAR1a
p3zMAR2a
pL95d
pL201
pL347
pL21
p3.3BamHI
pB2wxb
Tobacco
pGHNC5a,c
a Subclones
Position (bp)
Size (bp)
Cloning Sites
Vector
Source
1–10,408
10,403–22,564
22,559–27,930
27,923–45,076
45,069–47,184
47,177–64,422
64,417–78,195
8,096–10,362
14,433–16,851
10,408
12,161
5,371
17,153
2,115
17,245
13,778
2,266
2,418
ClaI-MluI
MluI-ClaI
ClaI-NotI
NotI-NotI
NotI-NotI
NotI-ClaI
ClaI-NotI
XbaI-XbaI
SacII-MunI
pGEM7⫹
pGEM7⫹
pBSIIKS⫹
pMOB
pBSIIKS⫹
pMOB
pMOB
pBSIIKS⫹
pGEM-TEazy
This study
This study
This study
This study
This study
This study
This study
This study
This study
1–6,028
65,831–67,056
71,014–73,511
71,014–73,333
81,429–93,499
144,388–159,323
154,148–172,938
169,902–186,579
186,298–189,670
N/A
7,000
1,225
2,496
2,319
12,070
14,935
18,790
16,676
3,272
184
BsiWI-BsiWI
XhoI-XbaI
PstI-BamHI
PstI-SacI
NotI-XhoI
NotI-NotI
NotI-NotI
NotI-NotI
BamHI-BamHI
EcoRI-EcoR
pGEM7⫹
pBSIIKS⫹
pBSIIKS⫹
pBSIIKS⫹
pBSIIKS⫹
pMOB
pMOB
pMOB
pBSIIKS⫹
pCRII
This study
This study
This study
This study
This study
This study
This study
This study
This study
Bureau and Wessler (1992)
N/A
2.2 kb
NotI-Bsp120I
pBSIISK⫹
Hall et al. (1991)
with minimal MAR fragments.
b Tourist-Zm1 element of wx-B2 locus of maize (GenBank accession number S48688; Bureau and Wessler, 1992).
c Dimer RB7 scaffold attachment regions (Hall et al., 1991).
MARs and Putative Genomic Structures
261
Table 2. Restriction Enzyme Digestions for Matrix Binding Assays
Clone
Assay 1
Assay 2
Assay 3
p11CM
XbaI, EcoRI, ApaLI, ClaI, MluI
MluI, MfeI, BamHI, NruI
MfeI, EcoRI, XbaI, BsiWI, NcoI, NruI
P13MC
MluI, XbaI, Bsu36I
MluI, ClaI, ApaLI, SpeIa
p5.4
p17
p2
p16
p18
pL243d
ClaI, SacI
BsiWI, PmeI, ApaLI, XbaI, NotI
NotI
HindIII, Ecl136, StuI, ClaI
Bsp120I, HindIII, MfeI, NdeI, NotI
NcoI, NotI, XhoI, BamHI
p5zMAR
p3zMAR1
pL95d
XhoI, XbaI
BamHI, BglII, EcoRI
Bsu36I, HindIII, SalI, XbaI, XhoI
pL201
pL347
pL21
p3.3BamHI
PB2wx
pGHNC5
Bsu36I, XhoI, XbaI, NotI
Bsu36I, XbaI, XhoI, NotI
Bsu36I, XhoI, XbaI, NotI
AatII, SnaBI, EcoRI
EcoRI
NotI, Bsp120I
ClaI, BstXI
Bsp120I, SacII, XhoI
NotI, StuI
ApaLI, NdeI, NotI, ClaI
ApaLI, NotI, StuI
Bsu36I, PstI, NotI, XhoI,
BamHI
BamHI, XhoI, XbaI, BsrGI
BamHI, EcoRI
SpeI,a SphI, StuI, BsiWI,
XhoI
BsrGI, BsiWI, EcoRI, NotI
ApaLI, AvrII, NotI
ApaLI, AvrII, NotI
BglII, NcoI, StuI, BamHI
a Enzyme
Assay 4
BsiWI, ClaI, MluI, NcoI,
PacI, SpeIa
MluI, SacII, Bsu36I, SnaBI, MfeI, NcoI MluI, SacII, ApaLI, SmaI,
MfeI, SpeI, XbaI
StuI, NotI
BsiG, EcoRI, NarI, NotI, ClaI
BsrGI, SalI, XbaI, NotI
EcoRI, BspHI, ClaI
StuI, AscI
AflIII, StuI
BspHI, SpeI, PstI, NotI
MfeI, NcoI, SpeI, NotI
BglII, BspHI, NotI
BglII, StuI, AatII
that produced partial digestion.
terial artificial chromosome (BAC) 110K5 contains ⵑ165 kb of the
colinear sorghum region (Woo et al., 1994). YAC 334B7 and BAC
110K5 were subcloned, and the clones analyzed here are shown in
Table 1. All 225 kb of the maize YAC and the colinear 78 kb from the
sorghum BAC have been screened for matrix attachment region
(MAR) distribution, after their complete sequence analysis (Tikhonov
et al., 1999). The sizes of the cloned fragments, the cloning sites, and
the vectors they were cloned in are shown in Table 1.
The clones were subjected to digestions with various combinations of restriction endonucleases, as shown in Table 2. The purpose
was to create multiple overlapping fragments, all of which were
tested for matrix binding activity. Their positions on the respective
contigs are shown in Figures 5 and 6. Restriction endonucleases and
T4 polynucleotide kinase were obtained from New England Biolabs
(Beverly, MA) or Promega (Madison, WI). Calf intestinal alkaline
phosphatase used for the dephosphorylation of DNA fragments was
obtained from Pharmacia (Piscataway, NJ).
Isolation of Miniature Inverted Repeat Transposable Elements
The location of the genomic clones containing colocalized miniature
inverted repeat transposable elements (MITEs) and MARs is shown
in Table 1. Restriction digestion of the clones was conducted as indicated in the legend to Figure 7.
Oligonucleotide primers (summarized in Table 3) were synthesized
by Integrated DNA Technologies, Inc. (Coralville, IA). Polymerase
chain reactions (PCRs) were performed as follows: one cycle at 95⬚C
for 3 min; 35 cycles at 55⬚C for 30 sec, 72⬚C for 2 min, 94⬚C for 45
sec; and one cycle at 75⬚C for 5 min. Samples were stored at 4⬚C.
Isolation of Nuclei and Nuclear Matrices
Nuclei were prepared following the procedure described in Avramova
and Bennetzen (1993). Sorghum nuclei were isolated from 15-dayold etiolated seedlings. Ten grams of seedling tissue was frozen in
liquid nitrogen and ground in three portions to a fine powder in a
mortar. All subsequent steps were performed at 0 to 4⬚C. Intactness
and purity of nuclei were monitored under a light or fluorescent microscope after staining with 4⬘,6-diamidino-2-phenylindole. The nuclear preparation from the final pellet was adjusted to 70% glycerol,
aliquoted at 3 A260 ODs, and kept at ⫺70⬚C.
Nuclear matrices were isolated by the 2 M NaCl extraction protocol, as described in Cockerill and Garrard (1986a), with minor modifications. Three A260 units of nuclear preparation were thawed on
ice and spun down for 3 min at maximum speed in a microcentrifuge, and the pellet was washed in 1 mL of 10 mM NaCl, 3 mM
MgCl2, 10 mM Tris-HCl, pH 7.4, 0.5 mM phenylmethylsulfonyl fluoride, with the addition of leupeptin and aprotinin (1 ␮g/mL each),
pelleted at maximim speed, and resuspended in 250 ␮L of the
same buffer. DNase I and RNase A were added (5 ␮g of each), and
the nuclei were incubated for 1 hr at 37⬚C. An equal volume of icecold 4 M NaCl was added, and the tube was incubated on ice for
10 to 20 min with two to three inversions of the tube. The nuclear
proteins that were insoluble in 2 M NaCl were pelleted by centrifugation at maximum speed for 3 min at room temperature. The
supernatant was removed carefully, and the pellet (nuclear matrix)
was washed twice with 1 mL of 2 M NaCl as described previously.
The nuclear matrices were washed three times, as described previously, in matrix binding buffer (MBB; 10 mM Tris-HCl, pH 7.4, 50
mM NaCl, 20 mM EDTA, 250 mM sucrose, and 0.25 mg/mL BSA).
262
The Plant Cell
Table 3. Clones and DNA Primers Used for Amplification of MITEs
PCR
Clones
Primers
Sequencesa
1
pL21
Z180369, Z181263
2
pSh2-1b
Sb7106, Sb7475
3
pSh2-1
Sh7384, Sh7784
4
5
pSh2-1
pSh2-1
Sb7106, Sh7784
Sh7384, Sb7475
cctacgtaTGCATTACTCATCTGTCAATGAC
cctgatcagCTGAGCCCGCAGAAAATCGCA
gagctcgaGTACTCAAACCTTAGACCCTG
cacgtgtcGAcAGGACTACAGTCTAAGGCCC
ccgaaTTCATGCATGTGTCCCAAAGATTCG
cctcgcgaATTCAACCTCCCACTGTTGG
See above
See above
a Capital
letters designate genomic sequence. Underlined nucleotides designate sequences from terminal inverted repeats. Double-underlined
nucleotides are direct duplications at the insertion site.
b Clone pSh2-1 containing a 13,945-bp BamHI fragment from the sorghum Shrunken2-homologous region (GenBank accession number
AF010283; Chen and Bennetzen, 1996).
Finally, matrices from 3 A260 units of the nuclear preparation were
dispersed in 100 ␮L of MBB.
Matrix Binding Assay
An aliquot (30 ␮L) of the matrix preparation described above was
used in each binding assay, in a total volume of 50 ␮L. Sheared (0.3
to 2 kb) competitor Escherichia coli DNA was added to the suspension of nuclear matrices to a final concentration of 50 to 400 ␮g/mL.
Depending on the size of the tested plasmid, 20 to 100 ng of DNA
was dephosphorylated and end-labeled with ␥-32P-ATP by T4 polynucleotide kinase. Five percent of the labeling reaction was added to
the binding assay, and the volume was adjusted to 50 ␮L with MBB.
Microcentrifuge tubes were placed in a horizontal position on a tube
rotary shaker and incubated at room temperature. After 5 to 12 hr,
the matrices were collected by centrifugation at maximum speed for
3 min, and the supernatants were saved. The pellets were washed by
resuspending once in 1 mL of MBB, followed by centrifugation as described earlier, and finally washed once with 1 mL of Tris-EDTA, pH
8.0. Final pellets were dispersed in 25 ␮L of 0.5% SDS and 2 mg/mL
proteinase K in Tris-EDTA. After incubation at 37⬚C for 5 to 12 hr, the
reactions were incubated at 70⬚C for 15 min. One-half of each reaction was loaded onto 0.7 to 1.2% agarose gels in 0.5 ⫻ Tris-borateEDTA buffer. After electrophoresis, agarose gels were dried onto nylon membranes in a vacuum gel drier at 80⬚C. Dried gels were quantitatively analyzed either after exposing x-ray film to them and
scanning by densitometer or by exposing them directly to a PhosphorImager plate (Molecular Dynamics, Sunnyvale, CA). Both methods yielded comparable results.
Image Analysis
Exposed x-ray films were analyzed by ImageQuant version 3.3 (Molecular Dynamics, Sunnyvale, CA) and/or Scion Image version 3, a
Windows version of NIH Image (Scion Corp., Frederick, MD). The
mean gray value of each fragment recovered from the pellet was
computed, and the values for the background were subtracted. The
mean gray values for the background were determined by measuring
a band of the same size from a region adjacent to the respective
fragment in the pellet. The mean gray value of each respective frag-
ment from the “input” lanes was used as a reference value, taken as
100%. MAR activity was quantified as a percentage of each matrixbound fragment relative to its value in the input fraction. These values were then plotted against the step-gradient concentration of the
competitor DNA.
ACKNOWLEDGMENTS
The authors are grateful to Phillip SanMiguel (Purdue University) for
subcloning the maize YAC and to Steve Spiker (North Carolina State
University, Raleigh) and Susan Wessler (University of Georgia,
Athens) for providing the clones containing the MAR from the tobacco root–specific gene RB7 and the clone pB2wx carrying the
Tourist-Zm1 element, respectively. This research was supported by
the U.S. Department of Agriculture Cooperative State Research Education and Extension Service grants No. 98-35300-6167 to Z.V.A.
and No. 97-35300-4594 to J.L.B.
Received September 14, 1999; accepted December 5, 1999.
REFERENCES
Allen, G.C., Hall, G.E., Childs, L.C., Weissinger, A.K., Spiker, S.,
and Thompson, W.F. (1993). Scaffold attached regions increase
reporter gene expression in stably transformed plant cells. Plant
Cell 5, 603–613.
Avramova, Z., and Bennetzen, J.L. (1993). Isolation of nuclear
matrices from maize leaves: Identification of a matrix binding site
adjacent to the maize Adh1 gene. Plant Mol. Biol. 22, 1135–1143.
Avramova, Z., and Paneva, E. (1992). Matrix attachment sites in the
murine ␣-globin gene. Biochem. Biophys. Res. Commun. 182,
78–85.
Avramova, Z., and Tikhonov, A.P. (1999). Are scs and scs⬘ ‘neutral’
chromatin domain boundaries of the 87A7 locus in vivo? Trends
Genet. 15, 138–139.
MARs and Putative Genomic Structures
Avramova, Z., SanMiguel, P., Georgieva, E., and Bennetzen, J.L.
(1995). Matrix attachment regions and transcribed sequences
within a long chromosomal continuum containing maize adh1.
Plant Cell 7, 1667–1680.
Avramova, Z., Tikhonov, A., SanMiguel, P., Jin, Y.-K., Liu, C.,
Woo, S.-S., Wing, R., and Bennetzen, J.L. (1996). Gene identification in a complex chromosomal continuum containing maize
adh1. Plant J. 10, 1163–1168.
Avramova, Z., Tikhonov, A., Chen, M., and Bennetzen, J.L.
(1998). Matrix-attachment regions in colinear segments of the sorghum and rice genomes. Nucleic Acids Res. 26, 761–767.
Benham, C., Kohwi-Shigematsu, T., and Bode, J. (1997). Stressinduced duplex DNA destabilization in scaffold/matrix attachment
regions. J. Mol. Biol. 274, 181–196.
Betz, A., Milstein, C., Gonzalez-Fernandez, A., Panneli, R., Larson,
T., and Neuberger, M. (1994). Elements regulating somatic
hypermutation of the immunoglobulin ␬ gene: Critical role for the
intron enhancer/matrix attachment region. Cell 77, 239–248.
Bode, J., Iber, M.C., Kay, V., Schlake, T., and Dietz-Pfeilstetter,
A. (1996). Scaffold/matrix–attached regions: Topological switches
with multiple regulatory functions. Crit. Rev. Eukaryotic Gene
Expression 6, 115–138.
Boulikas, T., and Kong, C. (1993). Multitude of inverted repeats
characterize a class of anchorage sites of chromatin loops to the
nuclear matrix. J. Cell. Biochem. 53, 1–12.
Breyne, P., van Montague, M., Depicker, A., and Gheysen, G.
(1992). Characterization of a plant scaffold-attachment region in
DNA fragment that normalizes transgene expression in tobacco.
Plant Cell 4, 463–471.
Bureau, T.E., and Wessler, S. (1992). Tourist: A large family of small
inverted repeat elements frequently associated with maize genes.
Plant Cell 4, 1283–1294.
Bureau, T.E., and Wessler, S. (1994). Stowaway: A new family of
inverted repeat elements associated with the genes of both monocotyledonous and dicotyledonous plants. Plant Cell 6, 907–916.
Charrier, B., Foucher, F., Kondorsi, E., d’Aubenton-Carafa, Y.,
Thermes, C., Kondorsi, A., and Rater, P. (1999). Bigfoot: A new
family of MITE elements characterized from the Medicago genus.
Plant J. 18, 431–441.
Chen, M., and Bennetzen, J. (1996). Sequence composition and
organization in the sh2/a1-homologous region in rice. Plant Mol.
Biol. 32, 999–1001.
Chen, M., SanMiguel, P., and Bennetzen, J. (1997). Sequence
organization and conservation in sh2/a1-homologous regions of
the sorghum and rice genomes. Genetics 148, 435–443.
Chinn, A.M., and Comai, L. (1996). The heat shock cognate 80
gene of tomato is flanked by matrix attachment regions. Plant
Mol. Biol. 32, 959–968.
263
enhancer in a region containing topoisomerase II sites. Cell 44,
273–282.
Cockerill, P.N., and Garrard, W.T. (1986b). Chromosomal loop
anchorage sites appear to be evolutionarily conserved. FEBS Lett.
204, 5–7.
Danilevskaya, O., Slot, F., Pavlova, M., and Pardue, M.-L. (1994).
Structure of the Drosophila HeT-A transposon: A retrotransposonlike element forming telomeres. Chromosoma 103, 215–224.
Edwards, K.J., Thompson, H., Edwards, D., de Saizieu, A.,
Sparks, C., Thompson, J.I., Greenland, A.J., Eyers, M., and
Schuch, W. (1992). Construction and characterization of a yeast
artificial chromosomal library containing three haploid maize
genome equivalents. Plant Mol. Biol. 19, 299–308.
Eickbush, T.H. (1997). Telomerase and retrotransposons: Which
came first? Science 277, 911–912.
Flavell, R.B. (1994). Inactivation of gene expression in plants as a
consequence of specific sequence duplication. Proc. Natl. Acad.
Sci. USA 91, 3490–3496.
Galliano, H., Muller, A.E., Lucht, J.M., and Meyer, P. (1995). The
transformation booster sequence from Petunia hybrida is a retrotransposon derivative that binds to the nuclear scaffold. Mol.
Gen. Genet. 247, 614–622.
Geyer, P.K. (1997). The role of insulator elements in defining
domains of gene expression. Curr. Opin. Genet. Dev. 7, 242–248.
Hall, G.E., Allen, G.C., Loer, D.S., Spiker, S., and Thompson, W.F.
(1991). Nuclear scaffolds and scaffold-attachment regions in
higher plants. Proc. Natl. Acad. Sci. USA 88, 9320–9324.
Henikoff, S., and Comai, L. (1998). Trans-sensing effects: The ups
and downs of being together. Cell 93, 329–332.
Holmes-Davis, R., and Comai, L. (1998). Nuclear matrix attachment regions and plant gene expression. Trends Plant Sci. 3, 91–97.
Iglesias, V.A., Moscone, E.A., Papp, I., Neuhuber, F., Michalowski,
S., Phelan, T., Spiker, S., Matzke, M.A., and Matzke, A.J.M.
(1997). Molecular and cytogenetic analysis of stably and unstably
expressed transgene loci in tobacco. Plant Cell 9, 1251–1264.
Izaurralde, E., Mirkovitch, J., and Laemmli, U.K. (1988). Interaction
of DNA with nuclear scaffolds in vitro. J. Mol. Biol. 200, 111–125.
Kirillov, A., Kistler, B., Mostoslavsky, R., Cedar, H., Wirth, T., and
Bergman, Y. (1996). A role for nuclear NF-␬B in B-cell-specific
demethylation of the Igk locus. Nat. Genet. 13, 435–441.
Lamond, A.I., and Earnshaw, W.C. (1998). Structure and function
in the nucleus. Science 280, 547–553.
Laurie, D.A., and Bennet, M.D. (1985). Nuclear DNA content in the
genra Zea and Sorghum. Intergenic, interspecific and intraspecific
variation. Heredity 55, 307–313.
Matzke, J.M., and Matzke, M.A. (1998). Position effects and epigenetic
silencing of plant transgenes. Curr. Opin. Plant Biol. 1, 142–148.
Civardi, L., Xia, Y., Edwards, K.J., Schnable, P., and Nikolau, B.J.
(1994). The relationship between genetic and physical distances
in the cloned a1-sh2 interval of the Zea mays L. genome. Proc.
Natl. Acad. Sci. USA 91, 8268–8272.
Mielke, C., Kohwi, Y., Kohwi-Shigematsu, T., and Bode, J. (1990).
Hierarchical binding of DNA fragments derived from scaffoldassociated regions: Correlation of properties in vitro and function
in vivo. Biochemistry 29, 7475–7485.
Cockell, M., and Gasser, S.M. (1999). Nuclear compartments and
gene regulation. Curr. Opin. Genet. Dev. 9, 199–205.
Mlynarova, L., Jansen, R.C., Conner, A.J., Stiekema, W.J., and
Nap, J.-P. (1995). The MAR-mediated reduction in position effect
can be uncoupled from copy-number dependent expression in
transgenic plants. Plant Cell 7, 599–609.
Cockerill, P.N., and Garrard, W.T. (1986a). Chromosomal loop
anchorage of the kappa immunoglobulin gene occurs next to the
264
The Plant Cell
Paul, A.-L., and Ferl, R.J. (1993). Osmium tetroxide footprinting of a
scaffold attachment region in the maize adh1 promoter. Plant Mol.
Biol. 22, 1145–1151.
Paul, A.-L., and Ferl, R.J. (1998). Higher order chromatin structures
in maize and Arabidopsis. Plant Cell 10, 1349–1359.
Romig, H., Ruff, J., Fackelmayer, F., Patil, M., and Richter, A.
(1994). Characterization of two intronic nuclear matrix-attachment
regions in the human DNA topoisomerase I gene. Eur. J. Biochem.
221, 411–419.
SanMiguel, P., Tikhonov, A., Jin, Y.-K., Motchoulskaya, N.,
Zakharov, D., Berhan, A., Springer, P., Edwards, K., Lee, M.,
Avramova, Z., and Bennetzen, J. (1996). Nested retrotransposons in the intergenic regions of the maize genome. Science
274, 765–768.
Schroth, G.P., and Ho, P.S. (1995). Occurrence of potential cruciform and H-DNA forming sequences in genomic DNA. Nucleic
Acids Res. 23, 1977–1983.
Ulker, B., Allen, G.C., Thompson, W.F., Spiker, S., and Weissinger,
A.K. (1999). A tobacco matrix attachment regions increase transgene expression levels and stability in transgenic tobacco plants.
Plant J. 18, 253–263.
Vain, P., Worland, B., Kohl, A., Snape, J., Allen, G.C., and
Thompson, W.F. (1999). Matrix attachment regions reduces the
loss of transgene expression in the progeny of transgenic tobacco
plants. Plant J. 18, 233–242.
van der Geest, A.H.M., Hall, G.E., Spiker, S., and Hall, T.C. (1994).
The ␤-phaseolin gene is flanked by matrix-attachment regions.
Plant J. 6, 413–423.
van Drunen, C., Oosterling, R., Keultjes, G., Weisbeek, P., van
Driel, R., and Smeekens, C.M. (1997). Analysis of the chromatin
domain organization around the plastocyanin gene reveals a
MAR-specific sequence in Arabidopsis thaliana. Nucleic Acids
Res. 25, 3904–3911.
Springer, P., Edwards, K.S., and Bennetzen, J.L. (1994). DNA
class organization on maize Adh1 yeast artificial chromosomes.
Proc. Natl. Acad. Sci. USA 91, 863–867.
Wessler, S.R., Bureau, T.E., and White, S.E. (1995). LTR-retrotransposons and MITEs: Important players in the evolution of
plant genomes. Curr. Opin. Genet. Dev. 5, 814–821.
Tikhonov, A.P., SanMiguel, P.J., Nakajima, Y., Gorenstein, N.,
Bennetzen, J.L., and Avramova, Z.V. (1999). Colinearity and its
exceptions in orthologous adh regions of maize and sorghum.
Proc. Natl. Acad. Sci. USA 96, 7409–7414.
Woo, S.-S., Jiang, J., Gill, B.S., Paterson, A.H., and Wing, R.A.
(1994). Construction and characterization of a bacterial artificial
chromosome library for Sorghum bicolor. Nucleic Acids Res. 22,
4922–4931.
Structural Domains and Matrix Attachment Regions along Colinear Chromosomal Segments of
Maize and Sorghum
Alexander P. Tikhonov, Jeffrey L. Bennetzen and Zoya V. Avramova
Plant Cell 2000;12;249-264
DOI 10.1105/tpc.12.2.249
This information is current as of August 3, 2017
References
This article cites 52 articles, 17 of which can be accessed free at:
/content/12/2/249.full.html#ref-list-1
Permissions
https://www.copyright.com/ccc/openurl.do?sid=pd_hw1532298X&issn=1532298X&WT.mc_id=pd_hw1532298X
eTOCs
Sign up for eTOCs at:
http://www.plantcell.org/cgi/alerts/ctmain
CiteTrack Alerts
Sign up for CiteTrack Alerts at:
http://www.plantcell.org/cgi/alerts/ctmain
Subscription Information
Subscription Information for The Plant Cell and Plant Physiology is available at:
http://www.aspb.org/publications/subscriptions.cfm
© American Society of Plant Biologists
ADVANCING THE SCIENCE OF PLANT BIOLOGY