Download Figure S1: kmer spectra at K=25 for filtered fragment reads (red) and

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Polycomb Group Proteins and Cancer wikipedia , lookup

Polyploid wikipedia , lookup

X-inactivation wikipedia , lookup

Non-coding DNA wikipedia , lookup

Quantitative trait locus wikipedia , lookup

Oncogenomics wikipedia , lookup

Essential gene wikipedia , lookup

Whole genome sequencing wikipedia , lookup

History of genetic engineering wikipedia , lookup

Microevolution wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Designer baby wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Genomic library wikipedia , lookup

Gene wikipedia , lookup

Genomics wikipedia , lookup

Public health genomics wikipedia , lookup

Genome editing wikipedia , lookup

Human genome wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Metagenomics wikipedia , lookup

Genomic imprinting wikipedia , lookup

Pathogenomics wikipedia , lookup

Ridge (biology) wikipedia , lookup

Human Genome Project wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

Gene expression profiling wikipedia , lookup

Genome (book) wikipedia , lookup

RNA-Seq wikipedia , lookup

Minimal genome wikipedia , lookup

Genome evolution wikipedia , lookup

Transcript
Figure S1: kmer spectra at K=25 for filtered fragment reads (red) and for fragment reads after error
correction (blue). The single peak is indicative of a high degree of homozygosity.
Figure S2: Synteny between ryegrass and barley. Synteny between the ryegrass and barley genomes for
each of the seven corresponding chromosomes as revealed on basis of ryegrass genetic marker information
and the POPSEQ-based ordering of barley whole genome sequence assemblies. The orange highlighted
regions indicate the translocated and inverted genome segment (segment was translocation from the long
arm of chromosome L5 to the short arm of chromosome L4).
Figure S3: Coverage of the Brachypodium genome by assembled Lolium WGS scaffolds. The coverage of
the Brachypodium genome was determined by comparing the assembled ryegrass WGS scaffolds against
the Brachypodium gene set. The heat maps measure the proportion of detected orthologous Brachypodium
genes along the five chromosomes (Bd 1 to Bd 5) by using a sliding window approach (window size 0.5Mb
and window shift 0.1Mb). The colored boxes above each chromosome indicate the syntenic relationship to
Lolium. Assembled Lolium genome scaffolds matching genes within these regions were assigned to
individual chromosomes according to the synteny relationships. The syntenic regions corresponding to the
L4/L5 translocation is highlighted by the orange rectangle.
Figure S4: Estimated repeat branch rate as a function of k-mer length., for perennial ryegrass (purple) in the
context of the repeat branch rate for yeast (Saccharomyces cerevisiae, pink), snake (Boa constrictor
constrictor, red), oyster (Crassostrea gigas, yellow), human (dark green), fish (Maylandia zebra, light green),
bird (Melopsittacus undulates, blue). Perennial ryegrass has the highest frequency of variant branches in the
assembly graph.
Figure S5: Comparison of density plots of log10(average TPM) for leaf (red) and pollen (blue) samples.
Figure S6: MA-plots and Volcano plots for the differentially expressed genes during a compatible pollination
reaction.
Figure S7: MA-plots and Volcano plots for the differentially expressed genes during an incompatible
pollination reaction.
Figure S8: Histogram of the number of the number of PacBio reads addressing closed gaps in the Lolium
perenne genome assembly.
Table S1: Summary of Illumina sequencing data for P226/135/16.
Table S2: Summary of gap, contig, and scaffold statisitcs before and after gap filling with PacBio reads.
Gap filling was performed with PBJelly.
Table S3: Summary statistics of the ALLPATHS-LG assembly.
Table S4: Statistics on the completeness of the genome based on 248 ultra-conserved Core Eukaryotic
Genes (CEGs). Results are shown for complete coverage and partial coverage of CEGs. Group 1 CEGs are
the least conserved, and group 4 CEGs are the most conserved across species.
Table S5: GenomeZipper statistics for individual linkage perennial ryegrass linkage groups.
Table S6: Summary statistics of the Abyss unitig assembly of the perennial ryegrass paired-end fragment
reads.
Table S7: Details of pollen allergens from various species. This was the starting point for developing hmm
profiles to identify homologs in the perennial ryegrass genome.
Table S8: Average Transcripts Per Million (TPM) for genes with > 1000 TPM in mature pollen grains.
Average TPM in leaf samples is also shown.
Table S9: How each library was used in the ALLPATHS-LG assembly, and the percentage of duplicates for
each library type.
Table S10: The number of Illumina reads used for the nine de-novo transcriptome assemblies generated,
and the source of the material.
Table S11: Summary statistics of PacBio data, before and after error correction, and after sampling a one
times genome equivalent.
Supporting Information Data S1-S7: spreadsheets with the perennial ryegrass GenomeZipper data, one
file for each of the seven linkage groups.
Supporting Information Data S8: spreadsheet showing the expression levels (as TPM) of the 630 genes
with sequence homology to known pollen allergens, in both mature pollen and leaf samples.
Supporting Information Data S9: spreadsheet showing genes significantly down regulated in both a
compatible and incompatible pollination reaction.
Supporting Information Data S10: spreadsheet showing genes significantly down regulated in a
compatible pollination reaction.
Supporting Information Data S11: spreadsheet showing genes significantly down regulated in an
incompatible pollination reaction.
Supporting Information Data S12: spreadsheet showing genes significantly up regulated in both a
compatible and incompatible pollination reaction.
Supporting Information Data S13: spreadsheet showing genes significantly up regulated in a compatible
pollination reaction.
Supporting Information Data S14: spreadsheet showing genes significantly up regulated in an
incompatible pollination reaction.