Download lecture28_Sequencing.. - University of Alberta

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

DNA sequencing wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Human–animal hybrid wikipedia , lookup

Genome (book) wikipedia , lookup

Oncogenomics wikipedia , lookup

Human genetic variation wikipedia , lookup

Transposable element wikipedia , lookup

DNA barcoding wikipedia , lookup

No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Minimal genome wikipedia , lookup

NUMT wikipedia , lookup

Microevolution wikipedia , lookup

Designer baby wikipedia , lookup

Public health genomics wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

History of genetic engineering wikipedia , lookup

Non-coding DNA wikipedia , lookup

RNA-Seq wikipedia , lookup

Helitron (biology) wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Pathogenomics wikipedia , lookup

Human genome wikipedia , lookup

Genome editing wikipedia , lookup

Genomic library wikipedia , lookup

Whole genome sequencing wikipedia , lookup

Genome evolution wikipedia , lookup

Human Genome Project wikipedia , lookup

Genomics wikipedia , lookup

Metagenomics wikipedia , lookup

Transcript
historical costs to sequence the
3 billion bp of a human genome
Gordon Moore
costs to sequence a human genome
10
log10 (US dollar)
9
8
$3 billion
7
$300 million
6
5
$300,000
two vendors
4
1989
1991
1993
1995
1997
1999 2001
year sequenced
2003
competition
2005
2007
2009
BGI Offers Next-Gen Sequencing Service: Kicks Off 100Genome Sequencing Project [8 January 2008]
Knome, BGI Forge Sequencing Alliance; GATC Spins Off
Personal Genomics Unit [January 15 2008]
Google
580,000 SNPs
BGI-Shenzhen
1 million SNPs
whole genome
YanHuang and the panda genome
(raising awareness for the new technologies)
Emperor’s Yan and Huang were
the first rulers of ancient China,
so modern Chinese say that they
are descendants of YanHuang.
The panda is a Chinese national treasure
and the logo for the World Wildlife Fund.
While not the first endangered species to
be sequenced (chimp was first), it will be
the first with a conservation focus.
Whole genome shotgun assembly is nontrivial for 35 bp reads even with paired end
information and 50x redundancy.
BGI-Shenzhen and allies in the US and UK will be
sequencing 1000 human genomes in the next 3 years
Nature: 17 January 2008
Science: 25 January 2008
expressed gene sequences of 1000
medicinal plants for only $2 million
There are 96 plant species with more than
20,000 expressed sequence tags (ESTs),
but most are crop plants. If we count only
medicinal plants, generously defined to
include makers of secondary metabolites
with purported health benefits, such as
lycopene for tomatoes and resveratrol for
grapes, there are 16 plant species with
more than 20,000 ESTs. If we use a strict
definition of medicinal, there are just 4
plant species with more than a mere 5000
ESTs. They are artemesia, Madagascar
periwinkle, gingko, and ginseng.
the triad of biological causation
gene hunts vs
synthetic biology
human
genome
infectious
agents
environmental
factors
cultures vs
metagenomics
questionnaires vs
molecular detection
http://www.cancer.gov/cancertopics/factsheet/Risk/
BRCA
Among individuals of Ashkenazi Jewish descent,
researchers have found that about 2.3 percent have
an altered BRCA1 or BRCA2 gene. This frequency
is about 5 times higher than that of the general
population.
http://www.who.int/mediacentre/factsheets/fs297/en
/index.html
One-fifth of cancers worldwide are due to chronic
infections, mainly from hepatitis B viruses HBV
(causing liver), human papilloma viruses HPV
(causing cervix), Helicobacter pylori (causing
stomach), schistosomes (causing bladder), the liver
fluke (bile duct) and human immunodeficiency
virus HIV (Kaposi sarcoma and lymphomas).
we can cure, eradicate, or at least treat almost all infectious diseases
metagenomics: DNA sequencing
analysis of viral and microbial worlds
without requiring a pure culture
classical methods
metagenomics
biological sample
biological sample
grow as a
pure culture
molecular biology
known sequence
targeted
amplification
partial purification
(e.g. by size)
just sequence it
we have little information on the vast majority of viral and microbial species because of
our inability to culture them in the lab; the estimate is fewer than 1% have been cultured
metagenomics of the “virome”
tangential flow filter,
high speed centrifuge
0.45 m
filter
viral
particles
viral genomes (103 bp)
are very small compared
to bacterial (106 bp) or
human (109 bp) genomes
and it helps to remove
non-viral contamination
human specimen
DNA
RNA
virome
sequence
February 6, 2008 (10.1056/NEJMoa073785)
A New Arenavirus in a Cluster of Fatal Transplant-Associated Diseases
Gustavo Palacios, Julian Druce, Lei Du, … and W. Ian Lipkin
Center for Infection and Immunity, Mailman School of Public Health, Columbia University, New
York; Victorian Infectious Diseases Reference Laboratory, Victoria, Australia; 454 Life Sciences,
Branford, CT; Centers for Disease Control and Prevention, Atlanta.
Viral and bacterial cultures; polymerase-chain-reaction assays for known pathogens; viral and
panmicrobial microarrays  revealed no plausible candidates.
However, BLASTX analysis of the deduced proteins for 94,043 reverse transcribed sequences
revealed 14 fragments consistent with Old World arenaviruses.
22 February 2008: Vol. 319. no. 5866, pp. 1096 - 1100
Clonal Integration of a Polyomavirus in Human Merkel Cell Carcinoma
Huichen Feng, Masahiro Shuda, Yuan Chang, Patrick S. Moore
Molecular Virology Program, University of Pittsburgh Cancer Institute, University of Pittsburgh,
5117 Centre Avenue, Suite 1.8, Pittsburgh, PA 15213, USA.
99.4% of 382,747 tumor derived
sequences aligned to human RefSeq
RNA, mitochondrial, assembled
chromosomes, or immunoglobulin
sequences. Two sequences were
used to define a previously unknown
human polyomavirus.
proof of causation and other steps
after candidate pathogen is identified
synthesis costs
$0.50 per bp
genome
synthesis
disease
tissue
viral
fragment
genome
sequence
population
case-controls
compound
screening
develop
vaccines
develop
diagnostics
develop
treatments
computationally
accelerated
metagenomics of idiopathic inflammatory diseases is a high-risk extremely-high-payoff activity, but
there is a low-risk moderately-high-payoff activity that will put Alberta “on the map” internationally