Download ppt - Sol Genomics Network

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Genetic engineering wikipedia , lookup

DNA sequencing wikipedia , lookup

Y chromosome wikipedia , lookup

Short interspersed nuclear elements (SINEs) wikipedia , lookup

Chromosome wikipedia , lookup

Essential gene wikipedia , lookup

Karyotype wikipedia , lookup

Polycomb Group Proteins and Cancer wikipedia , lookup

Quantitative trait locus wikipedia , lookup

NUMT wikipedia , lookup

Oncogenomics wikipedia , lookup

Gene desert wikipedia , lookup

Non-coding DNA wikipedia , lookup

Polyploid wikipedia , lookup

Copy-number variation wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Transposable element wikipedia , lookup

X-inactivation wikipedia , lookup

Gene expression programming wikipedia , lookup

Public health genomics wikipedia , lookup

Segmental Duplication on the Human Y Chromosome wikipedia , lookup

Ridge (biology) wikipedia , lookup

Genomic imprinting wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Microevolution wikipedia , lookup

Gene wikipedia , lookup

History of genetic engineering wikipedia , lookup

Gene expression profiling wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Helitron (biology) wikipedia , lookup

Neocentromere wikipedia , lookup

Designer baby wikipedia , lookup

Human genome wikipedia , lookup

Metagenomics wikipedia , lookup

Genome (book) wikipedia , lookup

Pathogenomics wikipedia , lookup

RNA-Seq wikipedia , lookup

Whole genome sequencing wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Minimal genome wikipedia , lookup

Genome editing wikipedia , lookup

Human Genome Project wikipedia , lookup

Genomic library wikipedia , lookup

Genomics wikipedia , lookup

Genome evolution wikipedia , lookup

Transcript
Estimate of tomato euchromatin and heterochromatin genome fractions
Based on 50 independent measurements of stained tomato chromosomes
Relative chromosome length
Relative bivalent diameter
Relative area
Relative optical density
Relative OD X relative area
Total OD X area
Fraction of genome
Heterochromatin Euchromatin
0.36
0.64
X 1.23
X 1.00
0.44
0.64
X 4.78
X 1.00
2.10
0.64
/ 2.74
/ 2.74
0.77
0.23
Approximately 23% of the tomato genome
is in the form of euchromatin
Mb size of tomato euchromatin based
on cytogenetic measurements
0.95 pg / tomato genome X 0.23 (euchromatin fraction) =
0.22 pg
965 X 106 pb/pg = 2.12 x 108 bp or 212 Mb
(705 Mb heterochromatin)
Estimate of gene space missed in this approach:
Genes missed in centromere (rice chromosome 8 - 86 genes)
12 x 86 = 1032 centromere genes
Exelisis heterochromatin BACs - 2 BACs representing 200 kb
were sequenced and one gene identified.
705,000 kb in herterochromatin (slide 2)
705,000 / 200 = 3525 heterochromatin genes
35,000 estimated tomato genes - 1032 - 3525 = 30,500 genes
(87%)
Correcting for 3% euchromatin gaps (as in rice) results in 85% of
total tomato gene space is anticipated to be recovered under the
International Tomato Genome Sequencing Project.
Estimate of tomato euchromatin size based on
available EST and genome sequence
15.5 Mb available sequence (August 2006)
8,097 high quality unigene set
- all available full-length tomato genes in GENBANK
- TIGR full-length cDNA sequences (redundantly sequenced)
- SGN unigene contigs with 5 or more ESTs
- redundnacy correction
456 of 8,097 genes found in available genome sequence (5.6%)
Correcting for 85% expectation yields 6.6% of target gene space
15.5/0.066 = 234 Mb tomato euchromatin target
Sequencing standards
A “finished BAC” is defined as……
• it contains an error rate of less than 1:10,000 bases and
continuous sequence across the entire BAC (HTGS phase 3)
• has an average of 8-fold redundancy in sequencing coverage
with a minimum of one high quality read in both directions at any
specific sequence
• all reasonable state of the art approaches available at the time
for gap filling will be used
Tomato euchromatin completion criteria:
1) complete sequencing of the major euchromatin “arms” flanking
each of the 12 tomato chromosomes
2) to a degree of completion comparable to the standards of
completion used to guide the international rice genome
sequencing project (IRGSP, 2005) ---- e.g. anticipate 4 - 6 gaps
per chromosome.
Furthermore:
1) Sequence to at least the closest mapped marker to the euchromatin /
heterochromatin border .
2) Attempt to walk until characteristic heterochromatin repeats are
identified and at minimum define the size of the remaining gap
In summary, the target of the
international genome sequencing
effort is sequencing of the euchromatin
arms of all twelve tomato
chromosomes which we estimate
will represent approximately 85%
of the tomato gene space.