Download A Large Rearrangement Involving Genes and Low-Copy DNA

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

The Selfish Gene wikipedia , lookup

Microsatellite wikipedia , lookup

Helitron (biology) wikipedia , lookup

Transcript
Copyright  2003 by the Genetics Society of America
A Large Rearrangement Involving Genes and Low-Copy DNA Interrupts the
Microcollinearity Between Rice and Barley at the Rph7 Locus
S. Brunner, B. Keller and C. Feuillet1
Institute of Plant Biology, University of Zürich, CH-8008 Zürich, Switzerland
Manuscript received November 1, 2002
Accepted for publication February 19, 2003
ABSTRACT
Grass genomes differ greatly in chromosome number, ploidy level, and size. Despite these differences,
very good conservation of the marker order (collinearity) was found at the genetic map level between
the different grass genomes. Collinearity is particularly good between rice chromosome 1 and the group
3 chromosomes in the Triticeae. We have used this collinearity to saturate the leaf rust resistance locus
Rph7 on chromosome 3HS in barley with ESTs originating from rice chromosome 1S. Chromosome
walking allowed the establishment of a contig of 212 kb spanning the Rph7 resistance gene. Sequencing
of the contig showed an average gene density of one gene/20 kb with islands of higher density. Comparison
with the orthologous rice sequence revealed the complete conservation of five members of the HGA gene
family whereas intergenic regions differ greatly in size and composition. In rice, the five genes are closely
associated whereas in barley intergenic regions are ⬎38-fold larger. The size difference is due mainly to
the presence of six additional genes as well as noncoding low-copy sequences. Our data suggest that a
major rearrangement occurred in this region since the Triticeae and rice lineage diverged.
RASSES evolved from a common ancestor ⵑ50–80
million years ago (Wolfe et al. 1989). Comparative
mapping studies have demonstrated a good conservation of the marker order (collinearity) along the chromosomes in grass species although they differ greatly
in genome size (Moore et al. 1995; Gale and Devos
1998; Keller and Feuillet 2000). Therefore, comparative genomics in grasses was suggested to be an excellent
tool for map-based cloning of agronomically important
genes from large and complex genomes using the small
rice genome as a vehicle. However, several studies have
shown that, although collinearity is generally good at
the genetic map level, numerous small rearrangements
at the gene level disrupt microcollinearity between rice
and large genome species such as barley (Kilian et al.
1997; Han et al. 1998, 1999; Druka et al. 2000), wheat
(Li and Gill 2002; SanMiguel et al. 2002), and maize
(Chen et al. 1997, 1998; Tarchini et al. 2000; for recent
reviews see Bennetzen and Ramakrishna 2002; Feuillet and Keller 2002). These data have demonstrated
that rice can be very useful in providing numerous markers to saturate genetic maps in other cereals and that mapbased cloning must be performed in the species of interest. As a consequence, large efforts were undertaken to
develop efficient tools for gene isolation from barley
and wheat. In barley, a number of detailed genetic maps
(http://wheat.pw.usda.gov/ggpages/maps.shtml#barley),
a bacterial artificial chromosome (BAC) library (Yu et al.
G
1
Corresponding author: Institute of Plant Biology, University of Zürich,
Zollikerstr. 107, CH-8008 Zürich, Switzerland.
E-mail: [email protected]
Genetics 164: 673–683 ( June 2003)
2000), ⬎240,000 expressed sequence tags (ESTs) (http://
www.ncbi.nlm.nih.gov/dbEST/dbEST_summary.html),
and a gene tagging system (Koprek et al. 2000) were
generated in the last years.
The development of BAC libraries allows the analysis
of the gene organization in this large and complex genome as well as a comparison with the recently released
rice genome sequences (http://rgp.dna.affrc.go.jp/; Goff
et al. 2002; Yu et al. 2002) for a better understanding of
genome evolution in grasses. Few detailed studies of
large contiguous barley genomic sequences have been
published. Two fragments of ⬍100 kb have been analyzed at the Rar1 locus on chromosome 2H (Shirasu
et al. 2000) and at the Mlo locus on chromosome 4H
(Panstruga et al. 1998). Very recently, full sequence
analyses of a 102.4-kb sequence on chromosome 5H
(Dubcovsky et al. 2001), 417.5 kb (Rostoks et al. 2002)
on four BAC clones from different chromosomal locations, and 261 kb at the Mla gene cluster on chromosome 1H (Wei et al. 2002) were released. In these studies, the gene density was ⵑ10-fold higher than expected
from an equidistant gene distribution in the barley genome. Moreover, it was found that the genes were organized mostly in gene islands separated by large blocks
of nested retrotransposons. In most cases, retrotransposons occupied ⵑ70% of the intergenic regions, confirming data obtained in maize and wheat that indicate
that they are the main cause for genome expansion
during evolution (SanMiguel et al. 1996; SanMiguel
and Bennetzen 1998; Bennetzen 2000b; Dubcovsky
et al. 2001; Wicker et al. 2001). So far, a single study has
compared large orthologous DNA stretches of barley
674
S. Brunner, B. Keller and C. Feuillet
(102.4 kb) and rice (50 kb) at the sequence level (Dubcovsky et al. 2001). At the Xwg644 locus, four genes
were completely conserved with one inversion and one
duplication in barley compared to rice. Large size differences in the intergenic regions were associated with the
insertion of different transposable elements (Dubcovsky et al. 2001).
Here, we describe fine mapping of the barley leaf
rust resistance gene Rph7 on chromosome 3HS and the
establishment of a 212-kb barley BAC contig spanning
the resistance gene locus. Sequence analysis revealed a
high density of genes and an unusually low amount of
repetitive DNA in this region. Sequence comparison
with the orthologous rice region showed that microcollinearity is restricted to the conservation of one gene
family and that a large rearrangement occurred at the
Rph7 locus after the divergence of the Triticeae and
rice genomes.
MATERIALS AND METHODS
Plant material: A mapping population of 1157 F2 plants was
generated from a cross between the Rph7-containing barley
cultivar Cebada Capa and the susceptible cultivar Bowman.
Phenotypic characterization of each individual plant was performed as previously described (Brunner et al. 2000). A subpopulation consisting of 105 plants showing recombination
between the flanking markers S6246 and C10991 was selected
and F3 seeds were produced for each of the recombinant
plants. The phenotype of each recombinant F2 plant was confirmed by artificial infection of 15 F3 seedlings. Seeds of the
barley cultivar Morex were kindly provided by Dr. A. Graner
(IPK Gatersleben, Germany).
Restriction fragment length polymorphism (RFLP) analysis:
DNA isolation, Southern blotting, and labeling experiments
were performed as described by Graner et al. (1990). Genomic DNA was digested with seven restriction endonucleases
(BamHI, BglII, DraI, EcoRI, EcoRV, HindIII, and XbaI). Hybridization and washes (in 0.5⫻ SSC, 0.1% SDS) were performed
at 65⬚. The rice cDNA clones were kindly provided by Dr. T.
Sasaki (National Institute of Agrobiological Resources, Tsukuba, Japan). The barley EST clone HY04A02T (AL506706)
was kindly provided by Dr. A. Graner (IPK Gatersleben).
BAC library screening: Filters of the barley Morex BAC
library providing 6.3 haploid genome equivalents were
screened by hybridization (Yu et al. 2000). DNA for fingerprint
analysis and BAC end cloning was prepared by standard procedures (Sambrook et al. 1989). Positive clones were analyzed
by fingerprinting with the restriction enzymes HindIII and
NotI. BAC end sequences were generated by inverse PCR,
plasmid rescue (Woo et al. 1994), or direct cycle sequencing
(Thermo sequenase premixed cycle sequencing kit, Amersham Pharmacia) of 4 ␮g of pure BAC DNA [QIAGEN (Chatsworth, CA) plasmid midi kit] using an automated DNA Sequencer 4200 (Li-Cor).
Shotgun and low-pass sequencing: Shotgun libraries were
constructed for the barley BAC clones 211E24 and 252N19
as previously described (Stein et al. 2000). A total of 1256
shotgun clones were sequenced from both ends (average
length: 696 bp), resulting in an eightfold coverage of a
211,664-bp sequence. The assembly of the shotgun clones
resulted in 12 individual subcontigs. Gaps were due to either
stretches of poly(G) or the presence of repetitive DNA. Gaps
between the subcontigs were closed by PCR using 18- to 20-
mer oligonucleotides. PCR fragments were cloned into the
pGEM-T Easy vector system I (Promega, Madison, WI) and
sequenced. The predicted error rate (1/1 kb) was based on
the PHRED quality score and sequence coverage. The assembly was checked by comparison with the pattern of restriction
enzyme digestion, and sequences corresponding to coding
regions were resequenced to achieve an error rate of 1/10 kb.
Sequence analysis and phylogenetic analyis: The 211,664bp sequence was compared with National Center for Biotechnology Information dbEST and nonredundant databases using BLASTN, BLASTX, and TBLASTX algorithms (Altschul
et al. 1997) to identify putative genes and known repetitive
elements. Repetitive elements were also detected by comparison with a local database of repetitive elements from wheat
and barley of which an annotated version can be found at the
Triticeae repeat database (TREP; http://wheat.pw.usda.gov/
ITMI/Repeats/index.shtml). In addition, the sequence was
analyzed using the rice Genomic Automated Annotation System (http://RiceGAAS.dna.affrc.go.jp/), which combines programs for coding region prediction, homology search analysis,
repetitive DNA analysis, and protein motif prediction. Finally,
comparative analysis of the barley sequence against itself and
against the rice P1 artificial chromosome (PAC) sequence
(AP002868) using the program Dotter (Sonnhammer and
Durbin 1995) allowed the identification of direct repeats,
such as the long terminal repeats of retroelements, gene duplications, and deletions as well as miniature inverted repeat
transposable elements (MITEs). The GCG software programs
COMPOSITION, BLAST, BESTFIT, PILEUP, FIGURE, and
MAP were also used to characterize the sequence. Cross-analysis of the information obtained by these different approaches
was essential to validate the sequence analysis and annotations.
Predicted genes were considered as putative genes only if
significant homology was obtained with ESTs and/or proteins
(including predicted ones) in the databases using BLASTn,
tBLASTn, or BLASTx programs. Genes and proteins are
named with the two first letters representing the initial letters
of the Latin binomial followed by the original symbol.
Alignment of the amino acid sequences from the Arabidopsis
thaliana, rice, and barley HGA genes was performed using the
ClustalX program (Thompson et al. 1997). A neighbor-joining
(NJ) method was then applied to produce a phylogenetic tree.
The optimal NJ tree was found using heuristic searches in
PAUP (Phylogenetic Analysis Using Parsimony), version 4.04a,
(Sinaur Associates, Sunderland, MA). The relative degree of
branch support was determined within the NJ framework using
the bootstrap procedure (Felsenstein 1985) in PAUP 4.04a.
The original data set was resampled 1000 times.
RESULTS
Fine mapping of the Rph7 leaf rust resistance gene in
barley: We have recently mapped the leaf rust resistance
gene Rph7 on barley chromosome 3HS using a small
segregating population (112 F2 plants) derived from a
cross between the resistant cultivar Cebada Capa and
the susceptible cultivar Bowman (Brunner et al. 2000).
Rph7 lies in a genetic interval of 19 cM between the
barley cDNA ABC171 and the rice cDNA S1543 (Figure
1A). An additional rice cDNA C970 maps at 2.7 cM distal
to S1543 (Figure 1A; Brunner et al. 2000). The two rice
cDNAs were located on rice chromosome 1 by the Rice
Genome Project (http://rgp.dna.affrc.go.jp/). Thirtyfour ESTs identified on rice yeast artificial chromosomes
and PACs derived from this region were used as RFLP
Genome Rearrangement at the Rph7 Locus
675
Figure 1.—Genetic and physical
mapping of the Rph7 leaf rust resistance gene on barley chromosome
3HS using collinearity with rice chromosome 1S. (A) Genetic map of the
Rph7 locus on barley chromosome
3HS (linkage analysis on 112 segregating F2 plants). The solid bar above
the interval between XS6246 and
Rph7 represents the interval that is
described at high resolution at the
genetic and physical levels in D and
E, respectively. (B) Genetic map of
the orthologous region on rice chromosome 1S. Rice ESTs marked with
asterisks are mapped physically but
not genetically in rice. (C) Physical
map of the orthologous region of rice
chromosome 1S. All rice ESTs used
for mapping in barley are physically
located on a rice PAC contig of 982
kb, which includes the PACs 439B06,
436F06, 698G03, 494A10, 698A04,
684C01, 445D12, and 37C04. Physical
distances (in kilobases) are indicated
between the rice markers. A possible
rearrangement resulting from an inversion or a deletion at the XC10991
and XE1072 loci was observed. Probes S10623 and G107 were monomorphic between the parents of the barley mapping population
and therefore could not be mapped. (D) High-density genetic map of the Rph7 locus based on a set of 105 F2 individuals showing
recombination between the S6246 and C10991 markers. (E) Physical map at the Rph7 locus on barley chromosome 3HS. BACs
222G2 and 252N19 were isolated using the probes S6246 and HYO4A0T2, respectively. BAC 211E24 was identified with the lowpass sequencing probes HvA10, HvC11, and HvE6. Genetic distances are indicated in centimorgans.
probes to saturate the Rph7 locus in barley. Twenty-nine
of them (85%) gave a clear hybridization pattern on barley
genomic DNA. Only three probes (S6246, C10991, and
E1072), or 9%, identified polymorphic fragments between the parents of our mapping population. They
all mapped close to Rph7 (Figure 1A). S6246, which is
located 47 kb proximal to S1543 in rice (Figure 1C),
revealed at least six bands on barley genomic DNA (data
not shown). Only one was polymorphic and it was mapped
at 0.4 cM distal to the Rph7 gene and 1.7 cM proximal
to S1543 in barley (Figure 1A). In rice, the single-copy
EST C10991 is present 365 kb proximal to S6246 (Figure
1C). It maps as a single locus 5 cM proximal to Rph7
in barley (Figure 1A). The single-copy EST E1072 is
located between the ESTs S6246 and C10991 at a distance of 114 kb from S6246 in rice (Figure 1C). In
barley, E1072 identifies a fragment that maps 3.6 cM
proximal to C10991 (Figure 1A). Therefore, C10991
lies between E1072 and S6246, suggesting a local rearrangement (inversion or deletion) in barley compared to rice.
For the map-based cloning of Rph7, a recombinant
population consisting of 105 F2 plants showing recombination between the flanking markers S6246 and C10991
was selected out of 1157 F2 plants and was used for highresolution mapping. A barley homolog to S6246 (62%
identity), HY04A02T (AL506706), was identified by
BLAST search against the barley EST database at IPK
Gatersleben (http://hordeum.ipk-gatersleben.de/blast/
blast_server.html). The HY04A02T sequence (2.2 kb)
is longer than S6246 (750 bp), which corresponds to
the very conserved 3⬘ end region of HGA genes (see
below). Therefore, Southern hybridization in barley
showed more specific signals with HY04A02T than with
S6246. Out of four strongly hybridizing fragments, one
that was not detected by the probe S6246 was polymorphic and was completely linked to Rph7 (Figure 1D).
Thus, the homologous cDNAs S6246 from rice and
HY04A02T from barley identify different genetic loci,
suggesting the presence of a gene family at the Rph7
locus in barley. Here, the use of rice ESTs allowed the
saturation of the Rph7 locus and the delimitation of a
genetic interval of 5.4 cM around the resistance gene.
Identification of a 212-kb physical contig spanning
the Rph7 gene in barley: The analysis of translocation
breakpoints in the barley genome showed high recombination frequency in the telomeric region of 3HS with
a physical-to-genetic distance ratio of 100 kb/cM (Kunzel et al. 2000). Our mapping data showed that the
Rph7 locus lies in this region in an interval of 5.4 cM.
Therefore, a physical distance of ⵑ600 kb, which is
compatible with map-based cloning, can be expected
between the flanking markers. The barley BAC library
of the susceptible cultivar Morex (Yu et al. 2000) was
676
S. Brunner, B. Keller and C. Feuillet
Figure 2.—Schematic representation of
the physical contig of 212 kb at the Rph7
resistance locus. The genes are represented
as colored boxes with an arrow indicating
the transcription orientation. Asterisks indicate pseudogenes. Black/yellow patterned
boxes represent full or partial retrotransposons whose names are indicated in the figure. Elements found as fragments are indicated only with the generic name of the element; full element names have been given
according to the convention Name_BAC
number_rank in the sequence. Red bars
represent MITEs and transposons are indicated as blue striped white boxes. Microsatellites are represented as vertical blue arrowheads. Small horizontal black arrowheads indicate direct repeats or inverted repeats. New
unknown elements (XH_211E24 and XI211E24) are represented as orange bars. The
duplication of a fragment of HvHGA5 preceding the insertion of the Katarina_211E24_1
element is indicated. The dotted line below
the contig indicates the physical region
spanning the Rph7 resistance gene. The red
arrowheads under the contig indicate the
positions of the shotgun clones used to assess the copy number of the uncharacterized fraction of the sequence. Annotations
of the different elements can be found under accession no. AF521177.
screened with the probes S6246 and HY04A02T (Figure
1D). Six and seven positive BAC clones were identified
with the S6246 and HY04A02T probes, respectively.
BAC-end and inverse-PCR fragment sequencing resulted only in repetitive sequences, which could not
be used for further mapping. A low-pass sequencing
strategy (Stein et al. 2000) was then performed on BAC
252N19 (Figure 1E). Of 29 nonrepetitive clones, 11
revealed polymorphisms between the parental lines and
were mapped in the recombinant population. Nine of
the nonrepetitive clones were completely linked with
Rph7 whereas two (Hv283 and Hv480) mapped at one
recombination event proximal to it (Figure 1D). A first
step of chromosome walking was then performed by
screening the barley BAC library with three of the linked
markers (HvA10, HvC11, and HvE6; Figure 1E). They
all identified the same six BAC clones, which overlap
BAC 252N19 on 40 kb. BAC 211E24 (Figure 1E) was
chosen for further analysis. The left BAC end (Hv211L)
was nonrepetitive and mapped to the same locus as the
marker S6246, i.e., at 0.3 cM (seven recombinants) distal
to Rph7 (Figure 1D). A nonrepetitive low-pass clone,
Hvgad1, which is a fragment of a putative glutamate
decarboxylase gene, was mapped as a single locus at
0.09 cM (two recombinants) distal to the resistance gene
(Figure 1D). Both the Hv211L and the Hvgad1 probes
gave a hybridization signal on the BAC clone 222G2,
which was isolated after screening the BAC library with
S6246. The two BAC clones 211E24 and 222G2 share
ⵑ100 kb of sequence (Figure 1E). Thus, with one step
of chromosome walking, a contiguous BAC contig of
ⵑ217 kb spanning the Rph7 resistance gene was established.
Sequencing of a 212-kb BAC contig in barley: The
BAC clones 252N19 and 211E24 were completely sequenced, providing 211,664 bp of contiguous sequence
(AF521177) with an approximately eightfold coverage.
The 212-kb contiguous sequence was annotated using
a combination of comparative and prediction tools (see
materials and methods).
Repetitive elements: Three full-length Ty1-copia-like
retrotransposons were identified on the 212-kb contig
(Figure 2). They correspond to a BARE-1 element (BARE1_211E24_1) and two new retroelements, Katarina_
211E24_1 and Bianca_252N19_1. Amino acid sequence
comparison showed that the reverse transcriptase (RT)
encoded by Katarina_211E24_1 is 47.5% similar to the
one of Opie-1 in maize (SanMiguel et al. 1996) and
46.7% similar to the BARE-1 RT. Bianca_252N19_1
shows less similarity to the BARE-1 RT (41%) than do
the Katarina_211E24_1 and Opie elements. Interestingly, Katarina_211E24_1 is found inserted into a gene
(HvHGA5) and not into another retroelement, as frequently found for the Opie-1 elements in maize (SanMiguel et al. 1996). Seven additional incomplete retroelements that show homology to already characterized
wheat and barley copia, gypsy, and TRIM retroelements
(Dubcovsky et al. 2001; Wicker et al. 2001; http://
Genome Rearrangement at the Rph7 Locus
677
TABLE 1
Characteristics of the predicted genes found on the 212-kb contig spanning the Rph7 locus in barley
Name
Length (bp)
HvHGA1
HvHGA2
HvHGA4
HvHGA5
Hvgad1
Hvhel1
3362
2990
2235
2002
3151
4452
Hvrh2
Hvpg1
Hvpg3
Hvpg4
783
4893
5421
2510
Structurea
5E,
5E,
4E,
2E,
7E,
2E,
4I
4I
3I
1I
6I
1I
1E
8E, 7I
14E, 13I
5E, 4I
Proteinb
Predicted function
505
536
525
427
490
1423
Unknown
Unknown, pseudogene
Unknown
Unknown, disrupted
Glutamate decarboxylase
Helicase, pseudogene
210
1049
689
226
Ring-H2 finger motif, pseudogene
Unknown, Gly-Cys rich domain
Unknown, DNA repair domain
Unknown
Rice 1 orthologsc
698A04.11
698A04.11
698A04.9
698A04.8
No
No
No
No
No
No
Rice homologsd
Chromosomes
1, 2, 3, 4, 5, 6, 10
C terminus only e
Chromosomes 2, 3, 4
Chromosomes 1, 2, 3,
4, 5, 6, 8, 10
Chromosome 2
Chromosome 10
Chromosomes 2, 4
Chromosome 3
a
The number of exons (E) and introns (I) deduced either from comparison with cDNAs or from prediction programs.
The number of amino acid residues.
c
For each barley gene, orthologs found on rice chromosome 1 are indicated with the names given in the GenBank annotation.
d
Homologs to the 10 barley genes identified in the rice databases are indicated with their chromosomal location as annotated
in the database.
e
Homology was found with rice genes other than the OsHGA orthologs but was limited to the very conserved C-terminal region
of the HGA genes.
b
wheat.pw.usda.gov/ITMI/Repeats/nrTREP_list.html)
were detected on the sequence (Figure 2). A new
transposon, Caspar_211E24_1, was also identified. It is
a complete TNP2-like transposon of 9103 bp with a
flanking CACTG sequence resembling the CACTA motif found in this type of transposon (Nacken et al. 1991)
and a 3-bp target site duplication (TSD). It is putatively
active as it contains an open reading frame [1105 amino
acids (aa)] that shows ⬎73% similarity to TNP2-like rice
transposases such as RIM2 (He et al. 2000). Incomplete
retroelements were found nested mainly around the
Caspar_211E24_1 transposon and the BARE-1_211E24_1
and Bianca_252N19_1 retrotransposons (Figure 2). A
total of 13 MITEs of the Stowaway family were identified.
All except one (Ceres_211E24_1) are similar to already
characterized wheat MITES (TREP database, http://
wheat.pw.usda.gov/ITMI/Repeats/nrTREP_list.html).
Seven of them are in close vicinity in genes or in introns
(Figure 2). Interestingly, the Ceres_211E24_1 MITE is
located at the 5⬘ end of the HvHGA4 gene and is interrupted by a 638-bp element (XH_211E24) (Figure 2).
XH_211E24 is flanked by a 9-bp TSD and is also found
in the 5⬘ untranslated region of two other barley genes
in the databases. Three microsatellites of ⬎15 (TA)
repeats are located on the sequence. Two of them are
associated with newly characterized elements of 370 bp
(XI_211E24), which have TA target site duplications
(Figure 2).
Ten genes are present on the barley contig: Ten genes
were identified on the 212-kb contiguous barley sequence. Four of them, HvHGA1, HvHGA2, HvHGA4,
and HvHGA5, belong to the same gene family (HGA).
The six additional genes are all different and are located
between the HvHGA4 and HvHGA2 genes (Figure 2).
The length and the characteristics of the genes and the
corresponding deduced proteins are given in Table 1.
The intron/exon structure of the HvHGA genes has
been deduced from predictions and comparisons with
barley ESTs, including HY04A02T, which corresponds
to a cDNA highly similar to HvHGA2. The HvHGA1 and
HvHGA2 proteins are 80% similar and the intron/exon
structure of the two genes is identical (Figure 3, Table
1). HvHGA4 shows ⵑ60% similarity to HvHGA1 and
HvHGA2. It has only four introns whose positions are
not conserved with the HvHGA1 and HvHGA2 genes
(Figure 3). The structure of HvHGA5 is more difficult
to assess as the gene is interrupted by the insertion
of the retrotransposon Katarina_211E24_1 into the last
exon (Figure 3). In addition, HvHGA5 is probably truncated and is partially duplicated (Figure 2). The 2-kb
duplicated fragment does not contain any sequence of
the Katarina_211E24_1 retrotransposon, indicating that
the duplication predated the insertion of the retroelement (Figure 2). A hypothetical gene sequence was reconstructed from the HvHGA5 sequence (Table 1). The
predicted protein shows 54% similarity to the other
members of the HvHGA gene family. In general, the
C-terminal region of the putative HvHGA proteins is
more conserved than the N-terminal region. One domain (GVHGAGLTN) is particularly well conserved and
it is also found in several putative proteins from rice
and A. thaliana. No known functional domain could be
detected in the HvHGA proteins and no function so far
has been assigned to any of the homologs of the HvHGA
genes.
Out of the six other genes detected on the barley contig,
three either showed homology to known proteins
(Hvhel1, Hvgad1) or contained well-characterized motifs
678
S. Brunner, B. Keller and C. Feuillet
Figure 3.—Schematic representation of the HGA genes
from barley and rice. The exon (solid)/intron (open) structure is represented for each orthologous pair of genes of the
HGA gene family in barley (HvHGA) and rice (OsHGA). No
barley ortholog of OsHGA3 was found on the 217-kb physical
contig and the structure of the putative HvHGA3 gene is not
known so far. The retrotransposon Katarina_211E24_1, which
is inserted into the last exon of the HvHGA5 gene, is represented as a diagonally striped box. HvHGA5 is truncated and
only the last exon can be compared with the predicted
OsHGA5 orthologous rice gene. Vertical bars delimit the sequences, which are perfectly conserved between the different
genes.
(Hvrh2; Table 1). Hvhel1 encodes a putative helicase
and is a pseudogene because of three frameshift mutations and one stop codon. Hvgad1 codes for a putative
glutamate decarboxylase and Hvrh2 is a pseudogene
that would encode a protein containing a very conserved
RING-H2 finger motif. HvRH2 shows ⵑ60% similarity
to the EL5 protein of rice (Takai et al. 2001), which is
induced by a biotic elicitor and was recently shown to
be a ubiquitin ligase (Takai et al. 2002). The three other
genes (Hvpg1, Hvpg3, Hvpg4) are putative genes that
do not have any identified function in any of the organisms in which homologs were found. Interestingly, two
of them (Hvpg1 and Hvpg4) are well conserved in animal
species such as Caenorhabditis elegans, Homo sapiens, and/
or Drosophila melanogaster. For all of the predicted genes,
at least two ESTs were found to confirm the gene prediction.
With 10 genes within 212 kb, the gene density found
at the Rph7 locus is one gene every 20 kb. However,
the genes are not homogeneously distributed along the
contig. Most of them are present in two clusters—cluster
1 (Hvrh2, Hvpg3, Hvgad1) and cluster 2 (Hvpg4, Hvhel1,
HvHGA2, HvHGA1)—with a higher gene density of one
gene every 6–9 kb. In contrast, the intergenic regions
located between HvHGA4 and Hvrh2 as well as between
Hvgad1 and Hvpg1 are larger than 34 and 43 kb, respectively.
Ratio of physical-to-genetic distances: All the RFLP
probes used for mapping showed an identical hybridization pattern for the susceptible parent Bowman and
Morex, the variety used to construct the BAC library.
This allowed the estimation of the correlation between
physical and genetic distances by comparing the RFLP
fragments with the BAC sequence of Morex. From S6246
to Hv480, an overall physical-to-genetic distance ratio
of 588 kb/cM was estimated (Figure 1, D and E). This
is over five times less than 3.7–4.2 Mb/cM, which was
estimated for the barley genome (Graner et al. 1991;
Kleinhofs et al. 1993). The distribution of the recombination frequency was analyzed in detail. From Hv211L
to Hvgad1, 88 kb represents a genetic distance of 0.21
cM, resulting in a ratio of 419 (⫾214) kb/cM. A similar
ratio was estimated in the interval between Hvgad1 and
the HvC11 (Figure 1, D and E). In contrast, around the
Rph7 gene, no recombination was found over a distance
of ⵑ110 kb (⬎2.5 Mb/cM). On the proximal side of
Rph7, a ratio of 75 (⫾37.5) kb/cM, which is very similar
to the 100 kb/cM estimated by Kunzel et al. (2000) for
this region of chromosome 3H, was found.
Regions without genes or repetitive elements: After
extensive sequence analysis, ⵑ130 kb of the total sequence did not show homology to any known genes or
repetitive elements and remained uncharacterized. This
represents ⵑ60% of the sequence while the coding regions and the identified repetitive elements represent
15 and 25%, respectively. This contrasts with previous
analysis of large genomic fragments in barley, wheat,
and maize where ⵑ70% of the sequence was repetitive
and composed mainly of retrotransposons (SanMiguel
et al. 1996, 2002; Tikhonov et al. 1999; Dubcovsky et
al. 2001; Wicker et al. 2001). Thus, the nonannotated
regions correspond to either uncharacterized or extremely degenerated repetitive elements or low-copy regions without any identified equivalent in the large grass
genomes. To test these hypotheses, 15 randomly chosen
shotgun clones corresponding to unidentified regions
(Figure 2) were used as probes on Southern blots of barley
genomic DNA. Four were highly repetitive whereas the
others showed 2–10 discrete bands with little background signal (data not shown). On the basis of these
results, we estimate that ⵑ25% of the nonannotated
130-kb sequence still corresponds to uncharacterized
repetitive elements. Consequently, ⵑ45% of the 212kb contig might correspond to noncoding, low-copy
sequence, suggesting that the Rph7 locus is unusually
rich in noncoding low-copy sequence.
All the unidentified sequences were assembled into
a single stretch of 130 kb and compared to the 907 kb
(eight BACs) of barley genomic sequences currently
Genome Rearrangement at the Rph7 Locus
Figure 4.—Unrooted phylogenetic tree of the HGA genes
from A. thaliana (atxg), rice (OsHGA), and barley (HvHGA).
ClustalX amino acid sequence alignment was used to produce
the best tree after neighbor-joining analysis. Branches that
had bootstrap values ⬍100 are indicated.
available in the public databases. Four stretches of 301–
1282 bp (for positions see AF521177) showing 75–86%
of identity were identified in at least another BAC sequence. Three of them are present at the Mla locus in
predicted gene regions (Wei et al. 2002) as well as in
other BACs in nonannotated regions. The frequency of
these sequences in the small subset of the barley genome
covered by these BACs strongly suggests that they belong
to the pool of repetitive sequences, which remain to
be identified. This will require further sequencing and
comparison of barley genomic sequences.
The HvHGA gene family is conserved in rice and A.
thaliana: Five predicted genes (P0698A04.6, P0698A04.8,
P0698A04.9, P0698A04.11, P0698A04.13) showing homology to the HvHGA genes are present in a tandem array
spanning 35.8 kb on rice chromosome 1 (AP002868). To
simplify the analysis, P0698A04.6, P0698A04.8, P0698A04.9, P0698A04.11, and P0698A04.13 were renamed
OsHGA6, OsHGA5, OsHGA4, OsHGA1, and OsHGA3, respectively. OsHGA6 corresponds to the rice EST S6246,
which mapped 0.3 cM distal to Rph7 (Figure 1). Phylogenetic analysis showed that each barley HvHGA gene is
more similar to one rice OsHGA homolog than to one
another (Figure 4), indicating orthologous relationships between the rice and barley genes. The only exception concerns HvHGA1 and HvHGA2, which are more
similar to each other than to the rice ortholog OsHGA1
(Figure 4). The intron/exon structure of the barley
and rice HGA orthologs is generally well conserved, in
particular for the last large exon, which contains highly
conserved domains of the gene family (Figure 3).
No orthologs were found on the contiguous 212-kb
barley sequence for the two other rice genes, OsHGA6
(EST S6246) and OsHGA3, which are present in the rice
gene cluster (Figure 5A). However, a sequence strongly
hybridizing with S6246 was detected on BAC 222G2
679
(Figure 1), suggesting the presence of a gene homologous to OsHGA6 (S6246) on this BAC. A HindIII fragment of 5 kb hybridizing with S6246 was subcloned from
BAC 222G2 and sequenced. It contains a gene of 2689
bp, which we named HvHGA6 (Figure 3, AF525024).
The predicted 637-aa protein shows the highest similarity
(65%) to OsHGA6 (Figure 4), confirming that HvHGA6
is orthologous to OsHGA6 (Figure 5A). The estimated
physical distance between HvHGA5 and HvHGA6 is ⵑ25
kb (Figure 5A). Thus, our data show that each barley
HvHGA gene has an ortholog in rice. The order of
the genes is completely conserved and the transcription
orientation is identical except for HvHGA4, which is
inverted compared to OsHGA4 (Figure 5A).
In A. thaliana, BLASTp and BLASTx searches identified seven HvHGA gene homologs. They are located
on chromosome 2 (atg2g03360, at2g03370, at2g41640)
and chromosome 3 (at3g10320, at3g57380, at3g18170,
at3g18180; Figure 5B). at3g18170 and at3g18180 as well
as atg2g03360 and at2g03370 are arranged in tandem
on chromosome 3 and 2, respectively (Figure 5B). This
and their close relationships in the phylogenetic analysis
(Figure 4) suggest that they originate from recent duplications. The at2g41640 and at3g57380 genes are located
in regions of chromosome 2 and 3, which are known
to result from a ancient genome duplication (Vision et
al. 2000), and they are closely related in the phylogenetic analysis (Figure 4). Although the intron numbers
and positions differ among the HvHGA homologs in
barley, rice, and A. thaliana, the position and the length
of the last exon is highly conserved (data not shown).
Phylogenetic analysis indicated that the at3g18170 and
at3g18180 gene products have the highest similarity
(ⵑ53%) to the HvHGA protein sequences of rice and
barley (Figure 4). This suggests that a single gene, which
resembled the at3g18170/80 genes, was present in a
common ancestor of the monocots and dicots.
Microcollinearity is lost in the HGA intergenic regions:
In addition to the insertion of a retrotransposon in
HvHGA5 and the inversion of HvHGA4, very few rearrangements affected individual members of the HGA
gene family during evolution in rice and barley. In contrast, major changes were observed in the intergenic regions. The distance between the barley genes HvHGA5
and HvHGA4 (30 kb) as well as between the HvHGA4
and HvHGA2 genes (153 kb) is ⬎38-fold greater than
the distance between the orthologous rice genes (700
bp and 4 kb, respectively; Figure 5A). In the HvHGA4/
HvHGA2 interval, two full-length retrotransposons, six
truncated retrotransposons, two transposons, and six
MITEs contribute to ⵑ30% (45 kb) of this size difference. However, the most striking difference resides in
the presence of six additional single genes (Hvrh2,
Hvpg3, Hvgad1, Hvpg1, Hvpg4, Hvhel1) and of a substantial proportion of noncoding low-copy sequence in barley compared to rice (Figure 5A). Southern hybridization and BLAST search against the recently published rice
680
S. Brunner, B. Keller and C. Feuillet
Figure 5.—Comparison of the gene organization at the HGA gene loci in rice, barley, and A. thaliana. (A) Microcollinearity
is retained only for the barley HvHGA genes and the orthologous rice OsHGA genes. A large rearrangement interrupting the
microcollinearity between rice and barley occurred in the HGA5/HGA4 and the HGA4/HGA2 intergenic regions. (B) Organization
of the HGA gene homologs in A. thaliana. Seven genes are distributed on chromosomes 2 and 3 in the A. thaliana genome. Two
pairs of paralogs, highlighted with a red circle, are found on each chromosome. The genes At2g41640 and At3g57380 are located
in chromosomal regions known to have originated from a large duplication (interrupted circle). The HGA genes are presented
as green boxes whereas the additional genes are shown in different colors. Arrows indicate the direction of transcription except
for HvHGA6, which is located on BAC 222G2 and whose orientation respective to the 212-kb contig is not known.
genome drafts of Syngenta (http://www.tmri.org) and the
Beijing rice genome project (http://210.83.138.53/rice/;
Goff et al. 2002; Yu et al. 2002) showed that, except for
the helicase-like gene that belongs to a multigene family,
one to four copies of each of the additional barley genes
are present elsewhere in the rice genome. Their chromosomal location was estimated on the basis of the
results of BLAST search against rice BAC and PAC sequences in public databases (Table 1). None of these
genes were found on PACs located at the distal end of
chromosome 1 in the vicinity of the OsHGA genes. In
some cases, homologs were found on the same chromosome (i.e., Hvpg3, Hvgad1, and Hvrh2 on chromosome
2) but never on the same BAC/PAC clones. These data
indicate that the gene cluster found in barley between
the HvHGA4 and HvHGA2 genes has no equivalent in
rice and that a large rearrangement occurred at the
Rph7 locus since the divergence of the two species. We
have used the HvHGA1, HvHGA2, Hvpg1, Hvpg3, Hvpg4,
and Hvgad1 genes as probes for hybridization on wheat
genomic DNA of the nulli-tetrasomic lines of the variety
Chinese Spring. All the genes hybridized to single fragments on each homeologous chromosome 3 in hexaploid wheat (data not shown), suggesting that the additional genes found in barley are also present in the
orthologous region in wheat. Thus, the rearrangement
likely predates the Bambusoideae/Poideae divergence.
DISCUSSION
Characterization of the genetic and physical interval
spanning the Rph7 locus in barley: Mapping of ESTs
originating from rice chromosome 1S allowed the development of a high-density genetic map and the initiation
of chromosome walking at the Rph7 resistance locus on
barley chromosome 3HS. Our data confirmed the good
synteny between these chromosomes (Smilde et al.
2001) and the usefulness of rice as a source of new
probes for saturating genetic regions in other grasses. A
physical contig of ⵑ217 kb spanning the Rph7 resistance
gene was established in a single step of chromosome
walking. The average physical-to-genetic distance ratio
of 588 kb/cM that was estimated at this locus is higher
than the 100 kb/cM that was previously estimated by
cytogenetic analysis (Kunzel et al. 2000). This is probably due to different levels of resolution in the two techniques. Nevertheless, it confirms that in this region of
chromosome 3H, the ratio of physical-to-genetic distance is lower than that expected from a genome-wide
average (3–4 Mb/cM) and that gene-rich regions are
Genome Rearrangement at the Rph7 Locus
associated with a higher recombination frequency
(Faris et al. 2000; Rostoks et al. 2002; Sandhu and
Gill 2002a). The recombination frequency was unevenly distributed (from 75 kb/cM to ⬎2.5 Mb/cM). It
is known that the relationship between physical and
genetic distance varies throughout the genome but very
few studies have reported on the variation of this ratio
at the molecular level in wheat and barley. In wheat,
local variations from 600 kb/cM to 12 Mb/cM were
found at the Lr10 locus (Stein et al. 2000), and in barley,
variations from 176 kb/cM to 5 Mb/cM were observed
in eight intervals spanning the Mla cluster (Wei et al.
1999). Thus, large local variations in recombination
frequencies might be a feature of these large genomes.
As in the Lr10 and Mla disease resistance loci, suppression of recombination was observed at the Rph7 resistance locus. In these cases, suppressed recombination
was postulated to be due to extreme polymorphism associated with the origin of the resistant region. Here,
although a high level of polymorphim was observed
between Cebada Capa and the susceptible cultivars Bowman and Morex, no clear correlation could be established between the degree of polymorphism and the
recombination rate.
High gene density and diversity is found at the Rph7
locus in barley: Together with the 212 kb of genomic
sequence analyzed in this study, ⬎1.1 Mb of barley genomic sequence distributed over nearly all the chromosomes has been studied by now. General patterns of
genome organization have started to emerge from the
analysis of these sequences. For example, the average
gene density found at the Rph7 locus (one gene/20 kb)
seems to be very representative of the gene density in
the gene-containing regions of the barley genome (Panstruga et al. 1998; Shirasu et al. 2000; Dubcovsky et
al. 2001; Rostoks et al. 2002). It is also commonly observed that the average gene density value does not
reflect the unhomogeneous distribution of the genes.
At the Rph7 locus, two clusters of three and four genes
with a density of one gene/6–9 kb are present, whereas
the Hvpg1 is the only gene present in a 60-kb region
rich in repetitive elements. Similar gene clustering has
been observed at different loci in barley, wheat, and
maize and it has become clear that high-gene-density
islands and single genes separated by large regions of
repetitive elements are both found in gene-rich regions
of the grass genomes (Feuillet and Keller 1999, 2002;
Rostoks et al. 2002; Sandhu and Gill 2002b; Wei et
al. 2002).
Seven different types of gene were found at the Rph7
locus. Except for the glutamate decarboxylase (Hvgad1),
the helicase (Hvhel1), and the ring-H2 (Hvrh2)-like
genes, no function based on similarity with known proteins could be assigned to the different genes. Linkage
analysis indicates that five genes (Hvpg1, Hvpg4, Hvhel1,
HvHGA2, and HvHGA1) are putative candidates for
Rph7. Remarkably, none shows similarity to already char-
681
acterized disease resistance genes (Hulbert et al. 2001).
For each candidate gene detected in the susceptible
variety, an allele was detected in the resistant cultivar
Cebada Capa. However, we cannot exclude the possibility that an additional gene is present in the resistant haplotype. We are currently constructing a genomic library
from Cebada Capa to test this possibility and to identify
the Rph7 resistance gene.
Microcollinearity at the Rph7 locus in barley and rice
is interrupted by a large genic rearrangement: Each
member of the barley HGA gene family has a clear
ortholog in rice, suggesting that this gene family arose
through tandem duplications before the Bambusoideae/Poideae divergence, i.e., ⵑ40–50 million years
ago. In barley, two genes, HvHGA1 and HvHGA2, are
orthologous to the rice gene OsHGA1. This suggests
that either gene duplication occurred in barley or one
paralog was deleted in rice after the barley-rice divergence. The fact that HvHGA1 and HvHGA2 are more
similar to each other than to OsHGA1 favors the first
hypothesis. A similar local gene duplication in barley
compared to rice has also been found at the Xwg644
locus (Dubcovsky et al. 2001). Thus, the HGA genes did
not undergo massive rearrangements during evolution,
resulting in an apparently good microcollinearity between barley and rice in this region. In contrast, considerable differences in size and composition were found in
the intervals between the HGA5/HGA4 and the HGA4/
HGA2 genes in barley compared to rice. In the HGA4/
HGA2 intergenic region, collinearity is interrupted by
the presence of six additional genes and a large proportion of noncoding low-copy sequences in barley. Microcollinearity studies at orthologous loci between rice and
other grass genomes have already shown many small
rearrangements (deletions, duplications, translocations, inversions; for review see Bennetzen 2000a; Bennetzen and Ramakrishna 2002; Feuillet and Keller
2002). However, they generally involved one to three
genes and the rearrangement of a segment with six
genes has not been reported so far.
Three different scenarios can explain such a rearrangement. In one scenario, it is possible that the
current gene organization in barley reflects the ancestral locus organization. In this case, the organization
observed in rice could result from the deletion of a
large fragment containing the six genes or from their
relocation to nonorthologous loci during evolution. In
an alternative scenario, the ancestral locus was similar
to the modern rice locus in only the HGA gene family,
and additional sequences including the six genes were
inserted during barley genome evolution.
The presence of homologs of the six additional barley
genes in low-copy numbers at nonorthologous loci in
the rice genome supports the hypothesis of rearrangements in rice. It is not possible to discriminate between
the deletion or relocation hypothesis by comparing the
degree of divergence between the six additional genes
682
S. Brunner, B. Keller and C. Feuillet
and the HGA orthologs in rice and barley. The similarity
at the amino acid level is comparable between the two
groups of genes. The deletion scenario would imply that
more than one copy of each gene was originally present
in the ancestral genome and that only the copies located
at the Rph7 locus were deleted in rice. Large deletions
of repetitive regions caused by unequal intrastrand recombination or illegitimate recombination have been
found to partially counteract genome expansion in
plants (Bennetzen 2002; Devos et al. 2002). However,
such a mechanism has not yet been shown to affect
large genic regions. The alternative mechanism of rearrangement requires the relocation of six rice genes
to different nonhomeologous positions. Recent studies
suggest that single genes (Bennetzen and Ramakrishna 2002) or small blocks of adjacent genes (Li
and Gill 2002) have moved to new locations during
evolution in grass genomes.
Finally, as a third scenario, we cannot exclude that a
new and not-yet-discovered mechanism might have been
involved in shaping this genomic region in barley. The
presence of a large amount of noncoding, low-copy DNA
beside the six additional genes is intriguing and might
result from such a new mechanism. Further comparative
analysis of the Rph7 locus in other members of the
Poaceae family, such as maize, will be needed to address
the question of the origin of this rearrangement and
its molecular basis.
The amount of low-copy sequence strongly contrasts
with previous analysis of large genomic fragments in
grasses where ⵑ70% of the sequences corresponded to
repetitive elements (SanMiguel et al. 1996, 2002; Chen
et al. 1997; Tikhonov et al. 1999; Dubcovsky et al. 2001;
Wicker et al. 2001). Here, 60% of the sequence could
not be annotated. Recently, Rostoks et al. (2002) found
on average 37% of uncharacterized sequence but they
did not indicate if this was potentially repetitive. We
have estimated that 25% of the nonannotated sequence
likely corresponds to uncharacterized repetitive elements and that 45% of the total sequence is low copy.
It is well established that the insertion of repetitive elements has largely contributed to genome expansion in
grasses (SanMiguel et al. 1996; SanMiguel and Bennetzen 1998; Bennetzen 2000b; Wicker et al. 2001).
Our findings raise the question of whether the rearrangement of genes and low-copy sequences can also
participate in local genome expansion.
We conclude that multiple rearrangements involving
not only repetitive elements but also coding regions and
low-copy sequences have reshaped the barley and rice
genomes since their divergence. Thus, grass genome
evolution is certainly more complex than expected on
the basis of collinearity assessments, and microcollinearity between rice and the Triticeae genome is possibly
not very high. Our results also support the idea that
the development of efficient structural and functional
genomic tools should be pursued in the Triticeae to
achieve isolation of genes of agronomic importance.
The authors thank Dr. Sasaki and Dr. Yano from the Rice Genome
Project for kindly providing the EST sequences and unpublished
mapping information and for mapping of the ABC 171 RFLP probe
in rice. We thank Dr. N. Yahiaoui and Dr. C. Ringli for critical review
of the manuscript. This project was supported by the Swiss National
Science Foundation grant no. 31-53597.98.
LITERATURE CITED
Altschul, S. F., T. L. Madden, A. A. Schaffer, J. H. Zhang, Z. Zhang
et al., 1997 Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs. Nucleic Acids Res. 25: 3389–
3402.
Bennetzen, J. L., 2000a Comparative sequence analysis of plant
nuclear genomes: microcolinearity and its many exceptions. Plant
Cell 12: 1021–1029.
Bennetzen, J. L., 2000b Transposable element contributions to
plant gene and genome evolution. Plant Mol. Biol. 42: 251–269.
Bennetzen, J. L., 2002 Mechanisms and rates of genome expansion
and contraction in flowering plants. Genetica 115: 29–36.
Bennetzen, J. L., and W. Ramakrishna, 2002 Numerous small rearrangements of gene content, order and orientation differentiate grass genomes. Plant Mol. Biol. 48: 821–827.
Brunner, S., B. Keller and C. Feuillet, 2000 Molecular mapping
of the Rph7.g leaf rust resistance gene in barley (Hordeum vulgare
L.). Theor. Appl. Genet. 101: 783–788.
Chen, M., P. SanMiguel, A. C. deOliveira, S. S. Woo, H. Zhang et
al., 1997 Microcolinearity in sh2 homologous regions of the
maize, rice, and sorghum genomes. Proc. Natl. Acad. Sci. USA
94: 3431–3435.
Chen, M. S., P. SanMiguel and J. L. Bennetzen, 1998 Sequence
organization and conservation in sh2/a1 homologous regions of
sorghum and rice. Genetics 148: 435–443.
Devos, K. M., J. K. M. Brown and J. L. Bennetzen, 2002 Genome
size reduction through illegitimate recombination counteracts
genome expansion in Arabidopsis. Genome Res. 12: 1075–1079.
Druka, A., D. Kudrna, F. Han, A. Kilian, B. Steffenson et al., 2000
Physical mapping of the barley stem rust resistance gene rpg4.
Mol. Gen. Genet. 264: 283–290.
Dubcovsky, J., W. Ramakrishna, P. J. SanMiguel, C. S. Busso, L. L.
Yan et al., 2001 Comparative sequence analysis of colinear barley
and rice bacterial artificial chromosomes. Plant Physiol. 125:
1342–1353.
Faris, J. D., K. M. Haen and B. S. Gill, 2000 Saturation mapping
of a gene-rich recombination hot spot region in wheat. Genetics
154: 823–835.
Felsenstein, J., 1985 Confidence limits on phylogenies: an approach using the bootstrap. Evolution 39: 783–791.
Feuillet, C., and B. Keller, 1999 High gene density is conserved
at syntenic loci of small and large grass genomes. Proc. Natl.
Acad. Sci. USA 96: 8265–8270.
Feuillet, C., and B. Keller, 2002 Comparative genomics in the
grass family: molecular characterization of grass genome structure and evolution. Ann. Bot. 89: 3–10.
Gale, M. D., and K. M. Devos, 1998 Comparative genetics in the
grasses. Proc. Natl. Acad. Sci. USA 95: 1971–1974.
Goff, S. A., D. Ricke, T. H. Lan, G. Presting, R. L. Wang et al.,
2002 A draft sequence of the rice genome (Oryza sativa L. ssp
japonica). Science 296: 92–100.
Graner, A., H. Siedler, A. Jahoor, R. G. Herrmann and G. Wenzel,
1990 Assessment of the degree and the type of restriction fragment length polymorphism in barley (Hordeum vulgare). Theor.
Appl. Genet. 80: 826–832.
Graner, A., A. Jahoor, J. Schondelmaier, H. Siedler, K. Pillen et
al., 1991 Construction of an RFLP map of barley. Theor. Appl.
Genet. 83: 250–256.
Han, F., A. Kleinhofs, S. E. Ullrich, A. Kilian, M. Yano et al., 1998
Synteny with rice: analysis of barley malting quality QTLs and
rpg4 chromosome regions. Genome 41: 373–380.
Han, F., A. Kilian, J. P. Chen, D. Kudrna, B. Steffenson et al., 1999
Genome Rearrangement at the Rph7 Locus
Sequence analysis of a rice BAC covering the syntenous barley
Rpg1 region. Genome 42: 1071–1076.
He, Z. H., H. T. Dong, J. X. Dong, D. B. Li and P. C. Ronald, 2000
The rice Rim2 transcript accumulates in response to Magnaporthe
grisea and its predicted protein product shares similarity with
TNP2-like proteins encoded by CACTA transposons. Mol. Gen.
Genet. 264: 2–10.
Hulbert, S. H., C. A. Webb, S. M. Smith and Q. Sun, 2001 Resistance gene complexes: Evolution and utilization. Annu. Rev. Phytopathol. 39: 285–312.
Keller, B., and C. Feuillet, 2000 Colinearity and gene density in
grass genomes. Trends Plant Sci. 5: 246–251.
Kilian, A., J. Chen, F. Han, B. Steffenson and A. Kleinhofs, 1997
Towards map-based cloning of the barley stem rust resistance
genes Rpg1 and rpg4 using rice as an intergenomic cloning vehicle.
Plant Mol. Biol. 35: 187–195.
Kleinhofs, A., A. Kilian, M. A. S. Maroof, R. M. Biyashev, P. Hayes
et al., 1993 A molecular, isozyme and morphological map of
the barley (Hordeum vulgare) genome. Theor. Appl. Genet. 86:
705–712.
Koprek, T., D. McElroy, J. Louwerse, R. Williams-Carrier and
P. G. Lemaux, 2000 An efficient method for dispersing Ds elements in the barley genome as a tool for determining gene
function. Plant J. 24: 253–263.
Kunzel, G., L. Korzun and A. Meister, 2000 Cytologically integrated physical restriction fragment length polymorphism maps
for the barley genome based on translocation breakpoints. Genetics 154: 397–412.
Li, W. L., and B. S. Gill, 2002 The colinearity of the Sh2/A1 orthologous region in rice, sorghum and maize is interrupted and accompanied by genome expansion in the Triticeae. Genetics 160:
1153–1162.
Moore, G., K. M. Devos, Z. Wang and M. D. Gale, 1995 Cereal
genome evolution: grasses, line up and form a circle. Curr. Biol.
5: 737–739.
Nacken, W. K. F., R. Piotrowiak, H. Saedler and H. Sommer, 1991
The transposable element Tam1 from Antirrhinum majus shows
structural homology to the maize transposon En/Spm and has no
sequence specificity of insertion. Mol. Gen. Genet. 228: 201–208.
Panstruga, R., R. Buschges, P. Piffanelli and P. Schulze-Lefert,
1998 A contiguous 60 kb genomic stretch from barley reveals
molecular evidence for gene islands in a monocot genome. Nucleic Acids Res. 26: 1056–1062.
Rostoks, N., Y. Park, W. Ramakrishna, J. Ma, A. Druka et al., 2002
Genomic sequencing reveals gene content, genomic organization, and recombination relationships in barley. Funct. Integr.
Genomics 2: 51–59.
Sambrook, J., E. F. Fritsch and T. Maniatis, 1989 Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press,
Cold Spring Harbor, NY.
Sandhu, D., and K. S. Gill, 2002a Gene containing regions of wheat
and the other grass genomes. Plant Physiol. 128: 803–811.
Sandhu, D., and K. S. Gill, 2002b Structural and functional organization of the ‘1S0.8 gene-rich region’ in the Triticeae. Plant Mol.
Biol. 48: 791–804.
SanMiguel, P., and J. L. Bennetzen, 1998 Evidence that a recent
increase in maize genome size was caused by the massive amplification of intergene retrotransposons. Ann. Bot. 82: 37–44.
SanMiguel, P., A. Tikhonov, Y. K. Jin, N. Motchoulskaia, D. Zakharov et al., 1996 Nested retrotransposons in the intergenic regions of the maize genome. Science 274: 765–768.
SanMiguel, P. J., W. Ramakrishna, J. L. Bennetzen, C. S. Busso
and J. Dubcovsky, 2002 Transposable elements, genes and recombination in a 215-kb contig from wheat chromosome 5Am.
Funct. Integr. Genomics 2: 70–80.
Shirasu, K., A. H. Schulman, T. Lahaye and P. Schulze-Lefert,
683
2000 A contiguous 66 kb barley DNA sequence provides evidence for reversible genome expansion. Genome Res. 10: 908–
915.
Smilde, W. D., J. Halukova, T. Sasaki and A. Graner, 2001 New
evidence for the synteny of rice chromosome 1 and barley chromosome 3H from rice expressed sequence tags. Genome 44:
361–367.
Sonnhammer, E. L. L., and R. Durbin, 1995 A dot matrix program
with dynamic threshold control suited for genomic DNA and
protein sequence analysis. Gene 167: 1–10.
Stein, N., C. Feuillet, T. Wicker, E. Schlagenhauf and B. Keller,
2000 Subgenome chromosome walking in wheat: a 450 kb physical contig in Triticum monococcum L. spans the Lr10 resistance
locus in hexaploid wheat (Triticum aestivum L.). Proc. Natl. Acad.
Sci. USA 97: 13436–13441.
Takai, R., K. Hasegawa, H. Kaku, N. Shibuya and E. Minami, 2001
Isolation and analysis of expression mechanisms of a rice gene,
EL5, which shows structural similarity to ATL family from Arabidopsis, in response to N-acetylchitooligosaccharide elicitor. Plant
Sci. 160: 577–583.
Takai, R., N. Matsuda, A. Nakano, K. Hasegawa, C. Akimoto et
al., 2002 EL5, a rice N-acetylchitooligosaccharide elicitorresponsive RING-H2 finger protein, is a ubiquitin ligase which
functions in vitro in co-operation with an elicitor-responsive ubiquitin-conjugating enzyme, OsUBC5b. Plant J. 30: 447–455.
Tarchini, R., P. Biddle, R. Wineland, S. Tingey and A. Rafalski,
2000 The complete sequence of 340 kb of DNA around the
rice Adh1-Adh2 region reveals interrupted colinearity with maize
chromosome 4. Plant Cell 12: 381–391.
Thompson, J. D., T. J. Gibson, F. Plewniak, F. Jeanmougin and D. G.
Higgins, 1997 The CLUSTAL_X windows interface: flexible
strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 25: 4876–4882.
Tikhonov, A. P., P. J. SanMiguel, Y. Nakajima, N. M. Gorenstein,
J. L. Bennetzen et al., 1999 Colinearity and its exceptions in
orthologous adh regions of maize and sorghum. Proc. Natl. Acad.
Sci. USA 96: 7409–7414.
Vision, T. J., D. G. Brown and S. D. Tanksley, 2000 The origins
of genomic duplications in Arabidopsis. Science 290: 2114–2117.
Wei, F. S., K. Gobelman-Werner, S. M. Morroll, J. Kurth, L. Mao
et al., 1999 The Mla (powdery mildew) resistance cluster is associated with three NBS-LRR gene families and suppressed recombination within a 240-kb DNA interval on chromosome 5S (1HS)
of barley. Genetics 153: 1929–1948.
Wei, F. S., R. A. Wing and R. P. Wise, 2002 Genome dynamics and
evolution of the Mla (powdery mildew) resistance locus in barley.
Plant Cell 14: 1903–1917.
Wicker, T., N. Stein, L. Albar, C. Feuillet, E. Schlagenhauf et
al., 2001 Analysis of a contiguous 211 kb sequence in diploid
wheat (Triticum monococcum L.) reveals multiple mechanisms of
genome evolution. Plant J. 26: 307–316.
Wolfe, K. H., M. L. Gouy, Y. W. Yang, P. M. Sharp and W. H. Li,
1989 Date of the monocot dicot divergence estimated from
chloroplast DNA sequence data. Proc. Natl. Acad. Sci. USA 86:
6201–6205.
Woo, S. S., J. M. Jiang, B. S. Gill, A. H. Paterson and R. A. Wing,
1994 Construction and characterization of a bacterial artificial
chromosome library of Sorghum bicolor. Nucleic Acids Res. 22:
4922–4931.
Yu, J., S. N. Hu, J. Wang, G. K. S. Wong, S. G. Li et al., 2002 A draft
sequence of the rice genome (Oryza sativa L. ssp indica). Science
296: 79–92.
Yu, Y., J. P. Tomkins, R. Waugh, D. A. Frisch, D. Kudrna et al., 2000
A bacterial artificial chromosome library for barley (Hordeum
vulgare L.) and the identification of clones containing putative
resistance genes. Theor. Appl. Genet. 101: 1093–1099.
Communicating editor: A. Paterson