Download Target selected insertional mutagenesis on chromosome IV of

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Epigenetics of diabetes Type 2 wikipedia , lookup

Metagenomics wikipedia , lookup

Copy-number variation wikipedia , lookup

Epigenetics of neurodegenerative diseases wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

Public health genomics wikipedia , lookup

Oncogenomics wikipedia , lookup

NEDD9 wikipedia , lookup

Ridge (biology) wikipedia , lookup

Gene therapy wikipedia , lookup

Polyploid wikipedia , lookup

Non-coding DNA wikipedia , lookup

Human genome wikipedia , lookup

Genomic library wikipedia , lookup

Genetically modified crops wikipedia , lookup

Polycomb Group Proteins and Cancer wikipedia , lookup

Gene nomenclature wikipedia , lookup

Genomics wikipedia , lookup

Point mutation wikipedia , lookup

Gene desert wikipedia , lookup

Pathogenomics wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Genetic engineering wikipedia , lookup

Genomic imprinting wikipedia , lookup

X-inactivation wikipedia , lookup

Gene expression programming wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Epigenetics of human development wikipedia , lookup

No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup

RNA-Seq wikipedia , lookup

Gene wikipedia , lookup

Transposable element wikipedia , lookup

Minimal genome wikipedia , lookup

Gene expression profiling wikipedia , lookup

Genome evolution wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Genome (book) wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Genome editing wikipedia , lookup

History of genetic engineering wikipedia , lookup

Designer baby wikipedia , lookup

Microevolution wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Helitron (biology) wikipedia , lookup

Transcript
Journal of Biotechnology 78 (2000) 301 – 312
www.elsevier.com/locate/jbiotec
Target selected insertional mutagenesis on chromosome IV
of Arabidopsis using the En–I transposon system
Elly Speulman, Ronald van Asperen, Jessica van der Laak,
Willem J. Stiekema, Andy Pereira *
CPRO, Department of Molecular Biology, PO Box 16, 6700 AA Wageningen, The Netherlands
Received 9 February 1999; received in revised form 29 November 1999; accepted 3 December 1999
Abstract
Reverse genetics using insertional mutagenesis is an efficient experimental strategy for assessing gene functions. The
maize Enhancer–Inhibitor (En–I) transposable element system was used to develop an effective reverse genetics
strategy in Arabidopsis based on transposons. To generate insertion mutations in a specific chromosomal region we
developed a strategy for local transposition mutagenesis. A small population of 960 plants, containing independent
I transpositions was used to study local mutagenesis on chromosome IV of Arabidopsis. A total of 15 genes, located
on chromosome IV, were tested for I insertions and included genes identified by the European ESSA I sequencing
programme. These genes were of particular interest since homologies to other genes and gene families were identified,
but their exact functions were unknown. Somatic insertions were identified for all genes tested in a few specific plants.
Analysis of these progeny plants over several generations revealed that the ability to generate somatic insertions in the
target gene were heritable. These genotypes that show high levels of somatic insertions can be used to identify
germinal insertions in the progeny. © 2000 Elsevier Science B.V. All rights reserved.
Keywords: Reverse genetics; En-I; Transposon; Pectinesterase; Arabidopsis
1. Introduction
Arabidopsis thaliana is used as a model in plant
science to study many aspects of plant biology. Due
to its small genome and its well-studied genetics,
Arabidopsis was the first plant species chosen to
generate a complete physical map of the genome
(Choi et al., 1995; Liu et al., 1995; Schmidt et al.,
* Corresponding author. Tel.: +31-31-777114; fax: + 3131-7418094.
E-mail address: [email protected] (A. Pereira)
1996, 1997; Zachgo et al., 1996; Camilleri et al.,
1998; Kaneko et al., 1998; Mozo et al., 1998). This
effort was followed by the multinational co-ordinated sequencing of the Arabidopsis genome using
bacterial artificial chromosomes (BACs) and P1
clones which had been mapped to defined regions
of the genome (Goodman et al., 1995; Bevan et al.,
1998). However, while the complete nucleotide
sequence of the Arabidopsis genome is expected to
become available at the end of the year 2000
(Meinke et al., 1998) the function of most of the
unravelled gene sequences remains unclear.
0168-1656/00/$ - see front matter © 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 1 6 8 - 1 6 5 6 ( 0 0 ) 0 0 2 0 3 - 0
302
E. Speulman et al. / Journal of Biotechnology 78 (2000) 301–312
Information on the function of a particular
gene might be obtained from the analysis of mutants in which the function of the gene is abolished. Insertional mutagenesis is a convenient
method to create knockout mutants to study gene
function. Transposon or T-DNA insertions can
result in complete inactivation of the target gene
and have been used for the isolation and identification of genes (Feldmann, 1991; Koncz et al.,
1992; Aarts et al., 1993; Bancroft et al., 1993;
Jones et al., 1994; Okuley et al., 1994; AzpirozLeehan and Feldmann, 1997). Insertional mutagenesis can also be used to identify insertions in
specific target genes with known sequence but
unknown function. This reverse genetics procedure termed target selected insertional mutagenesis uses T-DNA or transposable elements to
mutagenize the entire genome with insertions at
random positions in the genome. This is followed
by the identification of an insertion in the gene of
interest making use of the sensitivity of the polymerase chain reaction (PCR). A primer, specific
for the gene of interest is used in combination
with primers specific for the insertion sequence
and amplification of a gene-specific product occurs when an insert is present in the gene. The
efficiency of these screens can be improved by
dividing a population of mutagenized individuals
into pools and using specific pooling strategies.
Individual plants identified in such a screen are
subsequently analysed phenotypically to determine the function of the disrupted gene. This
technique has been shown to work in Drosophila,
Caenorhabditis, maize and petunia populations
saturated with transposable elements in which
knockout mutations in specific genes were identified (Ballinger and Benzer, 1989; Kaiser and
Goodwin, 1990; Rushforth et al., 1993; Zwaal et
al., 1993; Das and Martienssen, 1995; Koes et al.,
1995; Mena et al., 1996).
For reverse genetics purposes in Arabidopsis,
populations of T-DNA insertions have been used
successfully (McKinney et al., 1995; Krysan et al.,
1996; Azpiroz-Leehan and Feldmann, 1997;
Bouchez and Höfte, 1998; Winkler et al., 1998).
Although T-DNA insertions are in general single
or low copy and stable, an advantage of using
transposon- over T-DNA insertions is their ability
to transpose. Transposons can excise from the
gene of interest and the resulting reversion of a
mutation can be used to confirm that it was
caused by the transposon. Also, characteristic of
the maize Ac/Ds and En/Spm transposable elements is their preference to transpose to linked
sites and this property can be advantageously
used in a local mutagenesis strategy to tag genes
located in the vicinity of the donor site (Peterson,
1970; Bancroft and Dean, 1993; Cardon et al.,
1993; Aarts et al., 1995; James et al., 1995).
In order to further establish a system that could
be used in reverse genetics screens in Arabidopsis,
to complement the already available T-DNA populations, we used a modified two component
maize Enhancer– Inhibitor (En– I or Spm– dSpm)
transposable element system. A stable transposase
locus T-En5, was shown to mediate frequent
transposition of I elements which occurred continuously throughout plant development (Aarts et
al., 1993, 1995). The high levels of activity could
result in rapid amplification and proliferation of
insertions to linked and unlinked sites. These
characteristics were utilised for the generation of a
small population of 960 lines containing multiple
I elements per line. This population was in particular established to study local transposition of the
En– I system by using an I element (I-201 ), which
had previously been mapped on chromosome IV,
as a ‘launch pad’ for further transposition events.
The genes tested for I insertions were therefore
located at relatively near distances (0–15 cM)
from I-201 and included genes identified by the
ESSA I (European Scientists Sequencing Arabidopsis) programme. These genes were of particular interest since they are homologous to other
genes and gene families but their function remains
to be analysed. Two other well-characterised
genes, GA1 and APETALA 2 (AP2 ) located at
much further distances (31–55 cM) from I-201,
were used to test the long distance mutagenesis
potential of this transposon system. Insertions
were identified for all genes tested, and transpositions of I-201 had therefore occurred across the
chromosome. However, all insertions were identified as somatic insertions and the tendency to
insert somatically was inherited in some plants
and their progeny. Insertion rates and insertion
E. Speulman et al. / Journal of Biotechnology 78 (2000) 301–312
stability are influenced by many factors and include the proximity of the transposon to the
target, the timing and level of transposase expression and other element-encoded gene products,
and also host-encoded products and regulation
mechanisms (Fedoroff and Banks, 1988; Donlin et
al., 1995; Eisses et al., 1997). Somatic insertions
will be transmitted germinally to the progeny at a
lower frequency and can be identified when more
progeny of plants are screened and selected for
inheritance.
2. Material and methods
2.1. Plant material
In earlier experiments described by Aarts et al.
(1995), the ‘in cis two-element En – I’ transposon
system was introduced into the A. thaliana ecotype Landsberg erecta using a T-DNA construct
(cwEnN::I), which contained an unmarked 2.2 kb
I element inserted in the open reading frame of an
NPTII marker gene. This construct also harboured an En element as stable transposase
source, from which the 5% and 3% ends were removed and replaced by the cauliflower mosaic
virus (CaMV) 35S promoter (5%) and terminator
(3%) sequences. A primary transformant displaying
active transposition was used as source of the
transposon population. This transformant contained two T-DNA loci harbouring the transposase T-En5 on chromosome II, and T-En2 on
chromosome I, conferring 10 – 30% and 2 – 5%
transposition, respectively. Plants containing the
T-En5 locus were selfed over several generations
resulting in lines in which new transpositions of
the original I element(s) had taken place. The
positions of I elements were determined for several of such lines by using inverse PCR to isolate
flanking sequences of I elements and subsequently
mapping these products on the genome using
recombinant inbred lines (RILs) (Lister and
Dean, 1993; Aarts et al., 1995). One line harboured I-201 on chromosome IV and was chosen
for further experiments. A F2 segregating population was obtained after a cross with wildtype
Landsberg erecta. One plant homozygous for I-
303
201, and hemizygous for T-En5 was selected, after
probing genomic Southern blots with an I-201
and T-En5 flanking probe. A total of 50 hygromycin resistant progeny plants were transferred to the greenhouse and used as pollen
donors in crosses with a marker stock (F2 of
ecotype Columbia X Landsberg erecta; phenotype
erecta, cer2, ms1 ). Around 100 crosses were made
with every pollen donor plant, resulting in a total
of 5000 siliques with different genetic backgrounds. Seed (960), from these crosses was chosen randomly and grown in the greenhouse as a
F1 population and planted in ten trays/blocks,
and young inflorescence material was harvested in
a three dimensional pooling strategy (as outlined
in Results and Discussion (Section 3)). As the
pollen parent was a hygromycin resistant selfed
progeny of a T-En5 hemizygous plant, the resultant F1 population consisted of one third T-En5
transposase free stable plants and 2/3 hemizygous
for the T-En5 locus. The CIC-library (Creusot et
al., 1995) was later used to fine map I-201 on
CIC1H1 and CIC11C1 at position 62 cM on
chromosome IV to establish a more precise correlation with the genes and sequences which were
tested for insertions and which had also been
anchored to CICs on chromosome IV (Schmidt et
al., 1996).
2.2. Isolation of genomic DNA
Inflorescence or leaf material of a single plant
was used for DNA extraction. Extractions were
performed in Eppendorf tubes using liquid nitrogen and twice a volume of 250 ml of DNA extraction buffer (0.3 M NaCl, 50 mM Tris pH 7.5, 20
mM EDTA, 2% sarkosyl (v/v), 0.5% SDS (w/v),
5% phenol (v/v), 5% H2O (v/v)) as described by
Pereira and Aarts (1998). Pools of inflorescence
material were ground with a mortar and pestle in
liquid nitrogen and twice a volume of 2.5 ml of
DNA extraction buffer. After phenol/chloroform
extraction and centrifugation at 4000 rpm for 15
min, the DNA was precipitated after adding 0.8
volumes of isopropanol to the supernatant, incubation at RT for 10 min and centrifugation at
4000 rpm for 10 min. The pellet was washed twice
with 70% ethanol and dissolved in 1 ml TE pH
304
E. Speulman et al. / Journal of Biotechnology 78 (2000) 301–312
8.0 and 10 mg ml − 1 RNase. After a second round
of phenol/chloroform extractions, the pellet was
dissolved in 200–400 ml TE pH 8.0 to a final
concentration of 50 – 100 ng − 1 ml of DNA, depending upon the size of the pool.
2.3. Detection and analysis of I insertions
To detect I insertions in the genes tested, two I
specific primers were used; itir2 (CTT TGA CGT
TTT CTT GTA GTG) and itir3 (CTT GCC TTT
TTT CTT GTA GTG) (Life Technologies), complementary to the 5% and 3% terminal inverted
repeats of I. Both primers were used in one PCR
reaction together with one gene-specific primer.
Gene-specific primers were designed complementary to 5% and 3% regions of the coding sequence, in
such a way that putative insertions could be identified within the gene. The gene-specific primers
used were: Ag-6 (ACTCCA GGC CAT TTC CTT
CAG) for AGAMOUS, primer 1010 (CGT CCC
AAA AAT GCT CTG TTC) and primer 1011
(TGC TAC ACG TCA TGT TCG TTG) for
pectinesterase (accession number: Z97340g2244956), primer 1014 (CTA CCA AGA GAC
AAT ATC ATG) and primer 1015 (TCC GGC
CAC ACG TTC ACA CGA) for HAT-1 (access.
no: Z97343-g2245105), primer 1016 (GGG TGA
CTC CTT TAG TGT AGC) and primer 1017
(GTC TCC GTA GAT GAG TCC AGC) for
membrane channel protein (access. no: Z97343g2245093), primer 1030 (GCA GTC GGA TGT
GAT GAG GTT) and 1031 (CAA CCC TAA
GGC CTC CCA GTT) for glycerol-3-phosphate
permease (access. no: Z97343-g2245113), primer
1022 (CGGCAG GCC AGG AAC GTT TCA)
and primer 1023 (TCG GAT CTG GAC CGT
TGG TGG) for GTP-binding protein (access. no:
Z97343-g2245111), primer 1040 (CTT CAC CAA
GTA AAC GGG TTG G) and primer 1041
(GTG ATG ATGAGG AGA GAA TCC) for
AP-2, primer 1052 (CGCCGA CGG AAC TCG
AAG GGG) and primer 1053 (CAA GAT AAA
CCT CGC CGG GTG) for CH42, primer 1050
(TCA TCT GGC CTG ACG ACG AAA) and
primer 1051 (GCC CGA GAA GCT CCA TGA
TCT) for GA5, primer 1032 (TGT CGC TAG
AGA CAA ATC CAG) and primer 1034 (GTG
AGT TTG GAG ATG ATC GCC) for GA1 and
primer 1048 (TGG AGG GAA GCC CAG TGA
CCA) and primer 1049 (TCC GAC CCC ATG
AAT CGT GTA) for Cer-2. Primers for PRHA,
Mek1 and DD-1 were sent by collaborators for
insertion screens. PCR reactions were performed
in a total volume of 50 ml (0.5 mg of each primer,
250 mM of each nucleotide, 1× Supertaq buffer),
containing 50–100 ng of DNA and 1U Taq DNA
polymerase (Supertaq, HT Biotechnology Ltd,
UK). The reactions were performed in a PTC-200
Peltier thermal cycler (MJ Research), using 1 cycle
at 94°C for 4 min and 31 cycles of 45 s at 94°C,
45 s at 60°C, 3 min at 72°C, followed by a 7 min
extension step at 72°C. Amplification products
were size-separated in 1% TBE agarose gels containing ethidium-bromide, alkali treated, then
vacuum transferred onto Hybond-N+ membranes
and hybridised to a gene-specific probe. Primers
corresponding to both 5% and 3% regions of the
coding sequence, as mentioned above, were used
in combination to amplify genomic DNA products, which were used as probes. For AGAMOUS,
its corresponding cDNA clone was used as a
probe. After gel-electrophoresis and isolation of
the DNA with a DNA gel-purification kit (QIAGEN), the DNA was used as a probe. Membranes were pre-hybridised at 65°C in 1 M NaCl,
1% SDS, 10% dextran sulphate for 1–4 h and
subsequently hybridised with a 32P labelled probe
at 65°C, o/n in the same solution. The membranes
were washed at room temperature in 2× SSC for
5 min and at 65°C in 2× SSC, 1% SDS for 1 h.
After identification of a positive three-dimensional insertion address, 12 seeds of each three-dimensional address were sown as individuals in the
greenhouse and one leaf was isolated from all 12
individuals and pooled. After reconfirmation of
an insertion in a pool of a putative three-dimensional address by DNA extraction, PCR, Southern blotting and hybridisation; inflorescences of
individual plants were subsequently analysed
separately.
Amplification products from DNA of individual plants, obtained with one gene-specific primer
and both I primers were gel-purified (QIAGEN)
and subjected to another round of PCR amplifications. The purified fragments were cloned into the
E. Speulman et al. / Journal of Biotechnology 78 (2000) 301–312
PGEM-T-vector (Promega), sequenced via an automated sequencing system (Applied Biosystems
373 DNA Sequencer), and analysed with the
BLAST algorithm at the National Centre for
Biological Information (NCBI).
To determine if individual plants were either
stable or homo- or hemizygous for the T-En5
locus, primers were designed which were complementary to the flanking sequences of T-En5 using
primers Ten3 (GGA CCA CAA AAC ATG GAA
CAT G) and Ten5 (CAC AGC ACA TGT ACA
CAA CTG GCG) and a third primer TEn (CAC
CAG TTG CTA TCA TTG CGG CC) complementary to sequences within T-En5. PCR conditions were as described above with three primers
in one reaction, using 1 cycle at 94°C for 1 min
and 35 cycles of 30 s at 94°C, 30 s at 60°C and 3
min at 68°C.
305
3. Results and discussion
3.1. Three-dimensional pooling and PCR
strategies
To identify insertions in genes of interest using
PCR-based reverse genetics screens, the population of 960 plants was divided into three-dimensional pools in order to minimise the amount of
PCRs necessary to identify an insertion. In a
three-dimensional pooling strategy, one or more
plants carrying an insertion in the gene of interest
can be identified directly in a large population by
using just one round of PCRs on all the pools
representing the population. The population was
divided over ten blocks of 96 plants each (8
rows×12 columns) shown in Fig. 1A. A total of
six flower buds were taken from the main inflorescence of each plant and divided over three pools.
Fig. 1. Schematic representation of three-dimensional pooling and PCR strategies. A small population of 960 I element containing
Arabidopsis lines was divided over 10 blocks (trays) and each block contained 96 plants (8 rows and 12 columns). Inflorescence
material of every plant in the population was harvested in three dimensions. Two flower buds of one particular plant in the
population were taken for the pool of its block; another two were taken for the pool of its row and two more were used for the
pool of its column. A. This resulted in 30 DNA pools (12 columns + 8 rows + 10 blocks). For the identification of an I insertion
in a gene of interest, one gene-specific and two I specific primers were used in 30 PCR reactions. B. After gel-electrophoresis, blotting
and hybridisation with a gene-specific probe, three-dimensional bands can be observed. The autoradiogram shows the result of a
PCR reaction with a primer (1010) specific for pectinesterase. Putative three-dimensional insertions were identified, but numbered
are only the three-dimensional insertions of 800 bp identified in column 9 and 10, block IV and VI and row B and F (L represents
the negative control Landsberg erecta). These positives correspond to plants IV-9-F, IV-9-B, IV-10-F, IV-10-B, VI-9-F, VI-9-B,
VI-10-F and VI-10-B in the population. C. PCR analysis of progeny plants (F2) of putative positives IV-9-F, VI-10-F and VI-10-B
(plant 3 of IV-9-F, and plant 12 of VI-10-B did not germinate).
306
E. Speulman et al. / Journal of Biotechnology 78 (2000) 301–312
Two flower buds were used to represent the pool
of its block (96 plants), another two were taken
for the pool of its row (10 ×12 =120 plants) and
the last two for the pool of its column (8 ×10 =
80 plants). In this way material from each plant is
represented in a unique combination of one block,
one row and one column. A total of 30 pools were
obtained after DNA extraction of all pooled material. To further minimise the effort needed to
identify an insertion in a gene of interest we used
two I element and one gene-specific primer in one
reaction. The I primers were complementary to
the left and right terminal inverted repeat ends of
the I element. After gel-electrophoresis and blotting, the membrane was hybridised with a genespecific probe (Fig. 1B). Hybridising bands are the
result of an amplification reaction between one of
the I-element primers and the gene specific primer
and indicate the number of different insertion
positions in the target gene. However, only those
bands which appear to be of equal size in a
column, a block as well as a row can be considered as a three-dimensional insertion and correspond to one particular plant in the population.
3.2. Identification and analyses of insertion sites
To identify insertions in all genes of interest on
chromosome IV, we used two gene-specific
primers that were complementary to 5% and 3%
regions of the coding sequence. Insertions were
found in all genes tested and a relatively high
number of independent three-dimensional insertions (hybridising bands of independent sizes)
were obtained as displayed in Table 1. Given the
nature of the population used, we did not expect
to find any more than one three-dimensional insertion per gene tested. Putative three-dimensional
insertions were usually identified in a number of
blocks, columns and row pools. Consequently, all
possible combinations of one block, one row and
one column (=one three-dimensional address
and one plant in the population) had to be considered and this resulted on average a total of 15
plants with a putative insertion in the target gene
(Table 1, c total insertions). When positives
were identified in e.g. three different blocks, one
row and one column, three different combinations
of one block, one row and one column had to be
analysed, although only one such address could
have been positive, unless multiple insertions were
present in these pools. Because of the high number of putative positives found in these screens, 12
progeny plants of each address were grown in the
greenhouse the leaves harvested of all plants and
analysed as a pool. Pools of addresses that were
positive after a new round of PCR reactions and
Southern analysis were subsequently chosen for
further individual analysis. Insertions were identified in progenies of 90% of all genes tested.
These insertions usually differed in size from the
ones identified in the three-dimensional screen.
Multiple putative three-dimensional insertions
were also identified in a screen with a gene-specific
primer (see Material and Methods) for the putative pectinesterase enzyme (Fig. 1B). The results
obtained with this gene will be used further to
describe this population.
One independent insertion was identified in a
number of three-dimensional pools and was located at a distance of 800 bp from the gene-specific
primer.
Pectinesterases
and
pectin
methylesterases exist as multi-gene families in the
genome (Richard et al., 1996) and the multiple
insertions found for this pectinesterase could
therefore also result from insertions in close family members. However, genomic Southern analysis
showed that this gene was not a member of a
multi-gene family. Homology searches in the
databases also did not identify DNA sequence
homologies of any significance to other known
pectinesterases although amino acid homologies
did exist and were as high as e − 54 over 165 amino
acids for a pectinesterase on chromosome IV
(BAC T24M8). A maximum of two homologous
genes is present in the genome according to
Southern analysis, and insertions are therefore
most likely insertions in the pectinesterase gene
that was tested.
A total of 25 possible three-dimensional addresses were tested for insertions in pectinesterase,
which corresponds to an initial insertion frequency of 2.6% (25 addresses/960 lines). After
PCR and Southern analysis 11 progeny pools
were positive and this corresponds to a frequency
of 44% (11 positive pools/25 putative positive
E. Speulman et al. / Journal of Biotechnology 78 (2000) 301–312
307
Table 1
Genes tested for insertionsa
Gene
c Insertions
c Total insertions
Insertions in
progeny
Function
References
AGAMOUS
2
19
No
Yanofsky et al.
(1990)
Pectinesterase (ESSA I)
HAT-1 (ESSA I)
2
1
25
12
Yes
Yes
Palmitoyl protein thioesterase
(ESSA I)
Membrane channel protein
(ESSA I)
Glycerol-3-phosphate permease
(ESSA I)
GTP-binding protein (ESSA I)
DD1
Mek1
PRHA (a)
3
3
Nt
Flower development
Unknown
Plant development
Unknown
4
12
Yes
Unknown
2
15
Yes
Unknown
2
4
5
3
28
13
\30
\30
Yes
Yes
Yes
Yes
Unknown
Unknown
MAP-kinase
Unknown
PRHA (b)
Ap-2 (3%)
3
2
15
9
Yes
Yes
Ap-2 (5%)
CH42
1
2
1
4
Yes
Yes
GA5
1
4
Nt
GA1
1
1
Yes
Cer-2
2
\30
Yes
Schena and Davis
(1992)
Morris et al. (1997)
Korfhage et al.
(1994)
Flower development
Jofuku et al.
(1994)
Chloroplast
protein
Gibberellin
biosynthesis
Gibberellin
biosynthesis
Lipid synthesis
Koncz et al. (1990)
Xu et al. (1995)
Sun and Kamiya
(1994)
Xia et al. (1997)
a
The total number of three-dimensional insertions of distinct sizes obtained ( c insertions), and the total number of
corresponding putative three-dimensional combinations ( c total insertions) are indicated. Progenies of three-dimensional insertions
were tested and it is indicated when insertions were identified.
pools). However, the previously obtained 800 bp
band was identified in only 10% of the progeny
pools, while bands of different sizes and thus
different insertion sites, were identified in the
other positive pools (results not shown). In this
way 14 addresses could be discarded. Of the 11
positive pools, five that showed strong hybridisation signals were further analysed on individual
plant level and PCR progeny analyses of addresses IV-9-F, VI-10-F and VI-10-B (F2 generation) are shown in Fig. 1C. Insertions were
detected in the progenies of all five positive pools
and this corresponds to a frequency of 100%
(5/5). Insertions were confirmed in 25 – 50% of the
individuals at these addresses. However, as was
found for the pools, most of these insertions were
present at other positions in the gene. Bands of
around 800 bp were identified in 10–67% of positive individuals of an address. The frequencies
with which insertions in pectinesterase recur in the
progenies suggest that the initial three-dimensional insertion was somatic and that the property
to insert somatically was heritable. These were
insertion events that had occurred in at least
several cells of the flower and hence insertions
could be visualised only after hybridisation with a
gene-specific probe. To assess whether the insertions were somatic, we analysed the individual
progeny plants by genomic Southern analysis.
Insertions that are transmitted through the
gametes to one or more of the progeny are considered germinal events and can be identified as
308
E. Speulman et al. / Journal of Biotechnology 78 (2000) 301–312
band shifts of the wild-type allele (the size of an I
element is 2.2 kb) when restriction enzymes are
used to digest the genomic DNA that do not cut
within the I element. No band-shifts were identified for any of the progeny plants and this
confirms the somatic but heritable nature of these
insertions.
Progeny of plant IV-9-F-4 (F3 generation), that
was homozygous for the T-En5 locus, were also
analysed (results not shown) and also displayed a
similar pattern of somatic insertions. The F2 generation segregated as 1:2:1 for the T-En5 locus. It
was found that transposase free plants did not
display somatic or stable insertions; thus the somatic insertions are a characteristic of transposase
containing families.
To identify the position of one such insertion in
pectinesterase and also to confirm that this insertion was not present in a homologue, a weak band
of 400 bp was isolated from the gel and after
purification re-amplified using one gene-specific
primer and both I element primers. These products
were cloned and sequenced. Sequence analysis
showed that in one clone the I element insertion
was present in an exon of the pectinesterase gene
identified by ESSA I and had generated a 3 bp
target site duplication, characteristic for the En/
Spm transposon family (Fig. 2A and B) (Aarts et
al., 1993).
Other clones identified five different sequences
of 400 bp between two I elements that were in the
same orientation, since itir2 and itir3 sequences
were found at either end of the fragment. This data
suggests that transposition had resulted in clustering of I-elements. Analysis of the sequences
showed that these insertions were present on chro-
mosome II (BAC T3F17) in a putative Ap2 domain containing protein, on chromosome II (BAC
T11A7) in an unknown protein, on chromosome V
(P1 clone MTH16) in a gene which has similarity
to the transcriptional activator Ra, and two insertions were identified in gene sequences which have
not yet been identified by the Arabidopsis sequencing programmes but which showed homologies to
a phenylalanine ammonia lyase and a protein
serine/threonine kinase like gene, respectively.
3.3. Local mutagenesis on chromosome IV
The other genes that were tested for insertions
on chromosome IV are APETALA2 (AP2 ),
PRHA, DD1, Mek1, GA5, Cer2, CH42, AGAMOUS (AG), GA1 and genes identified by ESSA
I (Fig. 3). The genes described by ESSA I were
annotated as having a ‘strong similarity’or ‘similarity’ to known genes. The three-dimensional insertion patterns obtained for all these genes were
similar to what was observed for pectinesterase.
All insertions identified were also somatic and
heritable and patterns were complex. The number
of insertions identified varied between the genes,
but a correlation between the relative distance of
I-201 and the tested genes and the number of
insertions identified was not observed. The total
number of putative three-dimensional insertions
was low for GA1 and AP2, positioned at 45 cM
and 31 cM from I-201, respectively. However,
equally low numbers were also observed for GA5,
very near (3cM) I-201 and CH42 at the same
genetic position as I-201. This might be due to
PCR primer discrepancies or insertion-site preferences of I-elements.
Fig. 2. Analysis of a pectinesterase::I tagged site. A. Drawing of the pectinesterase gene showing the position and orientation of an
I insertion and the relative positions of both gene-specific primers. The bar represents 125 bp. B. Nucleotide sequences surrounding
the insertion site in the somatic pectinesterase::I allele. The target site duplication (TSD) is underlined.
E. Speulman et al. / Journal of Biotechnology 78 (2000) 301–312
Fig. 3. Genes tested for I insertions. Schematic representation
of chromosome IV. Genes tested for I insertions are shown in
bold. Sequences representing genes found by ESSA I are
shown on the left. Numbers represent the position (cM) of the
gene or marker on chromosome IV (Lister and Dean, 1997).
To determine if similar patterns of somatic
insertions were obtained with a population which
also contained multiple I insertions per line but
with 75 000 I insertions distributed randomly over
the genome (Speulman et al., 1999), the genes
HAT-1 and pectinesterase were tested for insertions using identical PCR conditions as described
here. Similar multiple patterns of somatic insertions were obtained, suggesting that the obtained
patterns are gene-specific and or a consequence of
the multiple I element lines, c.q. the genetic background used.
The most likely explanation for heritability of
somatic insertions is that a donor I element, possibly originating from I-201 has transposed close to
the target gene and is selected for in the process of
the population screen. In the next generation,
transposition events from this linked site will
again produce somatic insertion events at random
in the gene tested when the transposase source is
present, although certain positions might be preferred. Overall, this data indicates that I-201
transposed locally as well as over longer distances
and that the En–I system can be used in Arabidopsis for local as well as long-distance
mutagenesis.
309
A high degree of somatic insertions does not
guarantee detection of germinal insertions at a
high frequency in the next generation. However,
use of DNA from germinal tissue for testing
would greatly increase the likelihood of finding
progeny with germinal insertions. Although we
used inflorescence material for the analysis of the
progeny, it might be necessary to isolate DNA
from pollen of plants showing strong hybridisation signals as described for the isolation of Dsinsertions in the gene polygalacturonase, using
similar multiple insertion lines of tomato (Cooley
and Yoder, 1998). Alternatively, it might be sufficient to analyse very small flower buds for insertions in the gene of interest, followed by selective
crosses of several flowers, originating from the
same inflorescence meristem and located in the
near vicinity of the analysed flower bud, with
wild-type pollen to segregate out the transposase.
When using very small flowerbuds, the amount of
germline specific tissue and the probability of
finding a germinal insertion is considerably higher
as opposed to older flower buds.
Systematic selection for germinal events as a
system was demonstrated by Cooley and Yoder
(1998), who isolated germinal insertions in the
polygalacturonase (PG) gene from plants that displayed high somatic transposition. Four thousand
progeny were screened in this experiment and five
were found containing germinal transmitted Ds
insertions in PG. This procedure can also be
applied to the population we produced for chromosome IV transpositions.
The population we generated was established to
identify germinal insertions in genes located on
chromosome IV using active transposition from
the I-201 launching pad. In addition to the I-201
element, four other I elements, including one
mapped on chromosome V and another on chromosome III are present in the background. Assuming uniform transposition in the target region
on chromosome IV, from I-201 with 50% transposition frequency (data not shown) and the other
elements, we estimate about 500 independent germinal insertions in this region of approximately
21 000 kb (http://genome-www3.stanford.edu/
Arabidopsis/chromosome). If insertions are distributed randomly over this chromosome, one
310
E. Speulman et al. / Journal of Biotechnology 78 (2000) 301–312
insertion should be present every 40 kb. For a
target of an average gene of about 2 kb (using
PCR from one side) this gives a chance of about
1/20 or 5% to hit a specific gene. The targeted
genes in the ESSA I region are located in a 430 kb
interval, with a total of 121 genes. The average
length of a gene in this region is 2.3 kb and an
average of one insertion every 2 kb is necessary to
be able to identify an insertion in any random
gene chosen on chromosome IV. As the chance of
identifying a germinal insertion within the coding
sequences of any of the genes in this region is only
about 5%, our failure to identify germinal insertions directly is therefore due to a low genome
coverage of the transposons. In a larger population for randomly distributed genome wide coverage of transposons with multiple transposons per
plant (Speulman et al., 1999), about 50% insertions in genes were found that were heritable. The
evidence of these insertion mutants was also provided with phenotypes of tagged mutants, showing that this transposon system can be useful for
reverse genetics methods.
Despite an absence of germinal insertions in
any of the genes tested, we show that this population contains a genetic background with a very
high insertion activity that can be used for insertional mutagenesis of any gene of interest on
chromosome IV. However, more refined and thorough screening procedures and larger progeny
populations will be necessary to subsequently
identify germinal insertions in the gene of interest.
Acknowledgements
The authors would like to thank Bert Schipper
and Bas te Lintel Hekkert for technical assistance.
References
Aarts, M.G.M., Corzaan, P., Stiekema, W.J., Pereira, A.,
1995. A two-element enhancer–inhibitor transposon system
in Arabidopsis thaliana. Mol. Gen. Genet. 247, 555–564.
Aarts, M.G.M., Dirkse, W.G., Stiekema, W.J., Pereira, A.,
1993. Transposon tagging of a male sterility gene in Arabidopsis. Nature 363, 715–717.
Azpiroz-Leehan, R., Feldmann, K.A., 1997. T-DNA insertion
mutagenesis in Arabidopsis: going back and forth. Trends
Genet. 13, 152 – 156.
Ballinger, D.G., Benzer, S., 1989. Targeted gene mutations in
Drosophila. Proc. Natl. Acad. Sci. USA 86, 9402 – 9406.
Bancroft, I., Dean, C., 1993. Factors affecting the excision
frequency of the maize transposable element Ds in Arabidopsis thaliana. Mol. Gen. Genet. 240, 65 – 72.
Bancroft, I., Jones, J.D.G., Dean, C., 1993. Heterologous
transposon tagging of the DRL1 locus in Arabidopsis.
Plant Cell 5, 631 – 638.
Bevan, M., Bancroft, I., Bent, E., et al., 1998. Analysis of a 1.9
Mb of contigous sequence from chromosome 4 of Arabidopsis thaliana. Nature 391, 485 – 488.
Bouchez, D., Höfte, H., 1998. Functional genomics in plants.
Plant Physiol. 118, 725 – 732.
Camilleri, C., Lafleuriel, J., Macadré, C., Varoquaux, F.,
Parmentier, Y., Picard, G., Caboche, M., Bouchez, D.,
1998. A YAC contig map of Arabidopsis thaliana chromosome 3. Plant J. 14, 633 – 642.
Cardon, G.H., Frey, M., Saedler, H., Gierl, A., 1993. Mobility
of the maize transposable element En/Spm in Arabidopsis
thaliana. Plant J. 3, 773 – 784.
Choi, S., Creelman, R.A., Mullet, J.E., Wing, R.A., 1995.
Construction and characterisation of a bacterial artificial
chromosome library of Arabidopsis thaliana. Plant Mol.
Biol. Rep. 13, 124 – 128.
Cooley, M.B., Yoder, J.I., 1998. Insertional inactivation of the
tomato polygalacturonase gene. Plant Mol. Biol. 38, 521 –
530.
Creusot, F., Fouilloux, E., Dron, M., et al., 1995. The CIC-library: a large insert YAC library for genome mapping in
Arabidopsis thaliana. Plant J. 8, 763 – 770.
Das, L., Martienssen, R., 1995. Site-selected transposon mutagenesis at the hcf 106 locus in maize. Plant Cell 7, 287 – 294.
Donlin, M.J., Lisch, D., Freeling, M., 1995. Tissue-specific
accumulation of MURB, a protein encoded by MuDR, the
autonomous regulator of the Mutator transposable element
family. Plant Cell 7, 1989 – 2000.
Eisses, J.F., Lafoe, D., Scott, L.A., Weil, C.F., 1997. Novel,
developmentally specific control of Ds transposition in
maize. Mol. Gen. Genet. 256, 158 – 168.
Fedoroff, N.V., Banks, J.A., 1988. Is the Suppressor – mutator
element controlled by a basic developmental regulatory
mechanism? Genetics 120, 559 – 577.
Feldmann, K.A., 1991. T-DNA insertion mutagenesis in Arabidopsis: mutational spectrum. Plant J. 1, 71 – 82.
Goodman, H.M., Ecker, J.R., Dean, C., 1995. The genome of
Arabidopsis thaliana. Proc. Natl. Acad. Sci. USA 92,
10831 – 10835.
James, D.W., Lim, E.J., Keller, J., Plooy, I., Ralston, E.,
Dooner, H.K., 1995. Directed tagging of the Arabidopsis
fatty acid elongation (FAE1) gene with the maize transposon activator. Plant Cell 7, 309 – 319.
Jofuku, K.D., den Boer, B.G.W., van Montagu, M., Okamuro, J.K., 1994. Control of Arabidopsis flower and seed
development by the homeotic gene Apetala2. Plant Cell 6,
1211 – 1255.
E. Speulman et al. / Journal of Biotechnology 78 (2000) 301–312
Jones, D.A., Thomas, C.M., Hammond-Kosack, K.E., BalintKurti, P.J., Jones, J.D.G., 1994. Isolation of the tomato
Cf-9 gene for resistance to Cladosporium ful6um by transposon tagging. Science 266, 789–793.
Kaiser, K., Goodwin, S.F., 1990. Site-selected transposon
mutagenesis of Drosophila. Proc. Natl. Acad. Sci. USA 87,
1686 – 1690.
Kaneko, T., Kotani, H., Nakamura, Y., Sato, S., Asamizu, E.,
Miyajima, N., Tabata, S., 1998. Structural analysis of
Arabidopsis thaliana chromosome 5. Sequence features of
the regions of 1 381 565 bp covered by twenty one physically assigned P1 and TAC clones. DNA Res. 5, 131–145.
Koes, R., Souer, E., van Houwelingen, A., et al., 1995.
Targeted gene inactivation in petunia by PCR-based selection of transposon insertion mutants. Proc. Natl. Acad.
Sci. USA 92, 8149 – 8153.
Koncz, C., Mayerhofer, R., Koncz-Kalman, Z., Nawrath, C.,
Reiss, B., Redei, G.P., Schell, J., 1990. Isolation of a gene
encoding a novel chloroplast protein by T-DNA tagging in
Arabidopsis thaliana. EMBO J. 9, 1337–1346.
Koncz, C., Nemeth, K., Redei, G.P., Schell, J., 1992. T-DNA
insertional mutagenesis in Arabidopsis. Plant Mol. Biol. 20,
963 – 976.
Korfhage, U., Trezzini, G.F., Meier, I., Hahlbrock, K.,
Somssich, I.E., 1994. Plant homeodomain protein involved
in transcriptional regulation of a pathogen defense-related
gene. Plant Cell 6, 695–708.
Krysan, P.J., Young, J.F., Tax, F., Sussman, M.R., 1996.
Identification of transferred DNA insertions within Arabidopsis genes involved in signal transduction and ion
transport. Proc. Natl. Acad. Sci. USA 93, 8145–8150.
Lister, C., Dean, C., 1993. Recombinant inbred lines for
mapping RFLP and phenotypic markers in Arabidopsis
thaliana. Plant J. 4, 745–750.
Lister, C., Dean, C., 1997. The latest RI map using Lister and
Dean RI lines. Weeds World 4, 22–32.
Liu, Y.G., Mitsukawa, N., Vazquez-Tello, A., Whittier, R.F.,
1995. Generation of a high quality P1 library of Arabidopsis thaliana suitable for chromosome walking. Plant J. 7,
351 – 358.
McKinney, E.C., Ali, N., Traut, A., Feldmann, K.A., Belostotsky, D.A., McDowell, J.M., Meagher, R.B., 1995.
Sequence-based identification of T-DNA insertion mutations in Arabidopsis actin mutants act2 -1 and act4 -1. Plant
J. 8, 613 – 622.
Meinke, D.W., Cherry, J.M., Dean, C., Rounsley, S.D.,
Koorneef, M., 1998. Arabidopsis thaliana: a model plant
for genome analysis. Science 282, 662–682.
Mena, M., Ambrose, B.A., Meeley, R.B., Briggs, S.P., Yanofsky, M.F., Schmidt, R.J., 1996. Diversification of C-function activity in maize flower development. Science 274,
1537 – 1540.
Morris, P.C., Guerrier, D., Leung, J., Giraudat, J., 1997.
Cloning and characterisation of MEK1, an Arabidopsis
gene encoding a homologue of MAP kinase kinase. Plant
Mol. Biol. 35, 1057 – 1064.
311
Mozo, T., Fischer, S., Meier-Ewert, S., Lehrach, H., Altmann,
T.S.O., 1998. Use of the IGF BAC library for physical
mapping of the Arabidopsis thaliana genome. Plant J. 16,
377 – 384.
Okuley, J., Lightner, J., Feldmann, K., Yadav, N., Lark, E.,
Browse, J., 1994. Arabidopsis FAD2 gene encodes the
enzyme that is essential for polyunsaturated lipid synthesis.
Plant Cell 6, 147 – 158.
Pereira, A., Aarts, M.G.M., 1998. Transposon tagging with
the En – I system. In: Martinez-Zapater, J., Salinas, J.
(Eds.), Arabidopsis Protocols. Humana Press Inc, Totowa,
pp. 329 – 338.
Peterson, P., 1970. The En mutable system in maize. Theor.
Appl. Genet. 40, 367 – 377.
Richard, L., Qin, L.X., Goldberg, R., 1996. Clustered genes
within the genome of Arabidopsis thaliana encoding pectin
methylesterase-like enzymes. Gene 170, 207 – 211.
Rushforth, A.M., Saari, B., Anderson, P., 1993. Site-selected
insertion of the transposon Tc1 into a Caenorhabditis
elegans myosin light chain gene. Mol. Cell. Biol. 13, 902 –
910.
Schena, M., Davis, R.W., 1992. HD-Zip proteins: members of
an Arabidopsis homeodomain protein superfamily. Proc.
Natl. Acad. Sci. USA 89, 3894 – 3898.
Schmidt, R., Love, K., West, J., Lenehan, Z., Dean, C., 1997.
Description of 31 YAC contigs spanning the majority of
Arabidopsis thaliana chromosome 5. Plant J. 11, 563 – 573.
Schmidt, R., West, J., Cnops, G., Love, K., Balestrazzi, A.,
Dean, C., 1996. Detailed description of four YAC contigs
representing 17 Mb of chromosome 4 of Arabidopsis
thaliana ecotype Columbia. Plant J. 9, 755 – 765.
Speulman, E., Metz, P.L.J, van Arkel, G., te Lintel Hekkert,
B., Stiekema, W.J., Pereira, A., 1999. A two component
En– I transposon mutagenesis system for functional analysis of the Arabidopsis genome. Plant Cell 11, 1853 – 1866.
Sun, T.P., Kamiya, Y., 1994. The Arabidopsis GA1 locus
encodes the cyclase ent-kaurene synthetase A of gibberellin
biosynthesis. Plant Cell 6, 1509 – 1518.
Winkler, R.G., Frank, M.R., Galbraith, D.W., Feyereisen, R.,
Feldmann, K.A., 1998. Systematic reverse genetics of
transfer-DNA-tagged lines of Arabidopsis. Plant Physiol.
118, 743 – 750.
Xia, Y., Nikolau, B.J., Schnable, P.S., Xia, Y.J., 1997. Developmental and hormonal regulation of the Arabidopsis
CER2 gene that codes for a nuclear-localized protein required for the normal accumulation of cuticular waxes.
Plant Physiol. 115, 925 – 937.
Xu, Y., Li, L., Wu, K., Peeters, A.J.M., Gage, D.A., Zeevaart,
J.A.D., 1995. The GA5 locus of Arabidopsis thaliana encodes a multifunctional gibberellin 20-oxidase: molecular
cloning and functional expression. Proc. Natl. Acad. Sci.
USA 92, 6640 – 6644.
Yanofsky, M.F., Ma, H., Bowman, J.L., Drews, G.N., Feldmann, K.A., Meyerowitz, E.M., 1990. The protein encoded
by the Arabidopsis homeotic gene agamous resembles transcription factors. Nature 346, 35 – 39.
312
E. Speulman et al. / Journal of Biotechnology 78 (2000) 301–312
Zachgo, E.A., Wang, M.L., Dewdney, J., Bouchez, D., Camilleri, C., Belmonte, S., Huang, L., Dolau, M., Goodman,
H.M., 1996. A physical map of chromosome 2 of Arabidopsis
thaliana. Genome Res. 6, 19–25.
Zwaal, R.R., Broeks, A., van Meurs, J., Groene, J.T.M., Plasterk, R.H.A., 1993. Target-selected gene inactivation in Caenorhabditis elegans by using a frozen transposon insertion
mutant bank. Proc. Natl. Acad. Sci. USA 90, 7431 – 7435.
.
.