Download Identifying Mutations Responsible for Rare Disorders Using New

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Genetic engineering wikipedia , lookup

Genomic imprinting wikipedia , lookup

Pharmacogenomics wikipedia , lookup

Population genetics wikipedia , lookup

Gene expression profiling wikipedia , lookup

Non-coding DNA wikipedia , lookup

Quantitative trait locus wikipedia , lookup

History of genetic engineering wikipedia , lookup

Cell-free fetal DNA wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

Gene wikipedia , lookup

No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup

Bisulfite sequencing wikipedia , lookup

Neuronal ceroid lipofuscinosis wikipedia , lookup

Designer baby wikipedia , lookup

Minimal genome wikipedia , lookup

DNA sequencing wikipedia , lookup

Human genome wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Medical genetics wikipedia , lookup

Epigenetics of neurodegenerative diseases wikipedia , lookup

Epistasis wikipedia , lookup

Helitron (biology) wikipedia , lookup

Genome editing wikipedia , lookup

Public health genomics wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Microevolution wikipedia , lookup

Genome (book) wikipedia , lookup

Pathogenomics wikipedia , lookup

Oncogenomics wikipedia , lookup

Mutation wikipedia , lookup

Human Genome Project wikipedia , lookup

Genome evolution wikipedia , lookup

Genomic library wikipedia , lookup

Frameshift mutation wikipedia , lookup

Point mutation wikipedia , lookup

Metagenomics wikipedia , lookup

Whole genome sequencing wikipedia , lookup

RNA-Seq wikipedia , lookup

Genomics wikipedia , lookup

Exome sequencing wikipedia , lookup

Transcript
Identifying Mutations Responsible for
Rare Disorders Using New
Technologies
Jacek Majewski,
Department of Human Genetics, McGill University, Montreal, QC Canada
Mendelian Diseases
• Clear mode of inheritance – dominant of
recessive
• High penetrance – having the mutation
determines the phenotype with near certainty
• Clear phenotypic consequences
• Low environmental influence – lack of
phenocopies
• Examples – Tay-Sachs disease, Cystic Fibrosis…
Finding causes of Mendelian
Disorders
• Although many of those disorders are rare (1:2000 –
1:100,000 incidence), taken together they constitute
a substantial health burden
• It is estimated that over 90% of Mendelian disorders
are caused by mutations in the coding regions
(missense, nonsense, frameshifts, in/dels, splicing)
• Traditionally, the approach was via genetic linkage
analysis, followed by targeted sequencing of
candidate genes.
• In view of new technologies – can we do better?
1993
Homozygosity mapping Today
The American Journal of Human Genetics(2010)doi:10.1016/j.ajhg.2010.09.005
“Conventional” mutation mapping strategy for autosomal recessive disorders. Here we collected four
patients from consanguinous Bedouin families in Qatar with the Van den Ende-Gupta Syndrome
(VDEGS). Genotyping using SNP arrays identified a region of shared homozygosity – presumably inherited
from a common ancestor and containing the mutation - 2.4 mb in length, containing 44 genes.
Sanger Sequencing and Exome Capture
We concurrently conducted
conventional Sanger sequencing of
candidate genes and whole-exome
sequencing. In this case, the targeted
candidate approach was the winner,
identifying two distinct pathogenic
mutations in the SCARF2 gene. A few
days later, results from exome
sequencing confirmed this result and
helped to exclude all other candidate
genes in the region.
A. Presence
and
effect
c.1328_1329delTG mutation.
of
B. Presence and effect of c.773G>A
mutation.
C. Conservation of residues near 773G>A
(p.C258Y) mutation.
D. Exome capture and sequencing results
in the mother of patient 1.
High Throughput Sequencing
• E.g. Illumina HiSeq, or ABI Solid
• Main characteristics – typically produce
millions to hundreds of millions short (50100bp) sequencing reads per application
• Reads can be aligned to the reference
genome, and variants identified
• Currently, sequencing of the entire human
genome is still quite expensive - $9k today.
• Can we sequence a subset of the genome?
What is Exome Capture?
•
A method that uses specific probes to capture only the coding portions of all
annotated exons within the genome. This is followed by high throughput shotgun
sequencing. Our group uses the Agilent SureSelect Human All Exon Kit (in solution
beads), followed by Illumina GAIIx, 76 bp read sequencing (single lane).
Coverage Statistics
Exome coverage obtained by Agilent All
Exon in solution capture process, followed by
Illumina sequencing. The bars represent
coverage from 1, 2, and 3 lanes of 76 base
reads. E.g., two lanes of sequence produce
an average 52.3X coverage of the exome,
90% of the exome is covered at >10X level,
and 95% at > 5X. In our experience, 2 lanes
of sequencing provide excellent coverage at
a
reasonable
cost.
Further,
slight
improvement in coverage upon increasing
the number of lanes is not warranted.
Current bottom line – should be
able to get high quality 50X
exome for around $2k
Current Analysis Pipeline
•
•
•
•
•
•
•
•
Exome Capture and Sequencing by Genome Quebec Platforms
Base calling, QC – standard Illumina Pipeline
Alignment to reference genome – BWA
Retain unique alignments only
SNV and in/del calls (SAMTools, GATK)
CNV calls (in house)
Annotate functional variants as (ANNOVAR)
Filtering
– Other exomes (~50) – Key!!!
– dbSNP
– 1000 Genomes
• Relax filtering criteria if necessary
END USER OUTPUT
Filtering The Data
Filtering The Data
Visualization, manual QC and
inspection
Visualization, manual QC and
inspection
ONGOING Projects
• Part of the FORGE consortium, Canada-wide
(GC/CIHR)
• Part of the IGNITE project, Dalhousie-Atlantic
Canada (GC)
• RaDiCAL (McGill/MUHC)
• Numerous collaborations with individual
researchers – McGill, Quebec, Canada, France,
Poland, Qatar, Lebanon…
Successes so far - new genes
•
•
•
•
•
•
•
•
•
Fowler Syndrome
Mitochondrial Disease (2)
Vitamin B12 metabolism (2)
Nephrotic Syndrome
LCA
Hajdu-Cheney Syndrome
Hyper IgM (2)
Novel developmental dysmorphisms (2)
…
Two unrelated patients – total study time = 2 weeks from DNA sample to gene
Schematic of the FLVCR2 mutations found in two patients (F1 and F3).
A. The four mutations in FLVCR2 identified in F1 and F3 using whole exome sequencing as visualized using the Integrative
Genomics Viewer from the Broad Institute (http://www.broadinstitute.org/igv).
B Mutations in FLVCR2 are shown relative to the protein domains.
Mol Cell Biol. 2010 Sep 7. [Epub ahead of print]
The Fowler Syndrome associated protein FLVCR2 is an importer of heme.
Duffy SP, Shing J, Saraon P, Berger LC, Eiden MV, Wilde A, Tailor CS.
Program in Cell Biology, The Hospital for Sick Children, Toronto, ON M5G 1X8, Canada; Department of Molecular Genetics,
University of Toronto, Toronto, ON M5G, Canada; National Institute of Mental Health, Laboratory of Cellular & Molecular
Regulation, Bethesda, MD 20892 USA.
Hajdu-Cheney Syndrome
(with Mark Samuels and Jeremy Schwartzentruber)
• Rare dominant disease, characterized by bone
deterioration (our first dominant, FORGE project)
• < 100 patients known worldwide
• DNA from 3 affected family members, + 3 unrelated
individuals
• Exome sequencing
• Identification of variants
• 2 days after sequence data available – gene found
• All patients have truncating mutations in the last exon of
the gene Notch2
• Paper prepared for publication during the next week
• But...
Monday...
Summary
• Exome Sequencing is a fast and efficient way to
identify new disease genes
• In our hands, success rates vary from 100% (families)
to 33% (single individuals)
• Other applications:
- genomic molecular diagnosis
- genetic testing
Summary
• Exome sequencing is a rapid and increasingly affordable method of
identifying disease mutations
• CNVs (large structural variants) can be detected
• We have successfully found mutations in a number of additional recessive
disorders, using only a single individual
• In the next few months dozens of new disease genes/mutations will be
identified
• Next step - Genomic Molecular Diagnosis