Download Coordination of CNV analysis for Schizophrenia

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Call Details
DATE: Friday, November 9, 2012
START: We will begin promptly on the hour
1100 US eastern
0800 US west coast
1600 UK
1700 Central Europe
An ungodly hour in Australia
DURATION: 1 hour
TELEPHONE:
- US Toll free: 1 877 280.4962
- International direct: +1 857 244.7319
- Toll-free number? See http://www.btconferencing.com/globalaccess/?bid=75_public
- We will pay for an operator to help people having problems
- The line can handle up to 300 participants (or so we are told)
PASSCODE: 544 704 02
Mute your line NOW
Press *6 to mute your line.
It is possible for just one person to ruin the
call for everyone due to background noise,
electronic feedback, crying children, wind,
typing, etc.
PLEASE Out of courtesy for your
colleagues, mute your line now
How to “ask” questions
• Operators will be on standby to assist with
questions at the end of the talk.
• Please listen to operator instructions
closely. They will direct you to press
certain keys on your phone if you have a
question and bring you into the conference
one at a time.
• The speaker will respond to questions as
time allows.
PGC2 CNV analysis
Jonathan Sebat
PGC lab meeting
November 9, 2012
CNV analysis subgroup
•
Goals
– Identify novel risk factors for
mental illness by direct detection
of structural variants, both rare
and common
– Apply a CNV-based approach
within the collaborative
framework of the Psychiatric
GWAS Consortium
•
Initial Objectives
– Evaluate available software tools
for the analysis of Affymetrix and
Illumina Array data
– Develop a pipeline for CNV
analysis that is optimized for
each platform
– Develop novel statistical methods
for the “mega-analysis” of CNVs
in the large PGC dataset
Partek
GenoCN
iPattern
C-Score
Penn-CNV
Birdsuite
Overview
• Goals of the PGC-CNV effort
• Analysis Plan
– Core analyses
– other (disease-specific or exploratory)
analyses
• Publication
• Timeline
Goals of the PGC CNV group
• Develop a pipeline for processing array
data
• Generate a high-quality set of CNV calls
for each disorder
• Develop (in collaboration with
stats/analysis group) rigorous metaanalytical approach for testing association
with disease
Goals of the PGC CNV group
• Develop a pipeline for processing array
data
• Generate a high-quality set of CNV calls
for each disorder
• Develop (in collaboration with
stats/analysis group) rigorous metaanalytical approach for testing association
with disease
Variable results from different CNV
methods
• The default
outputs of different
methods differ
considerably
• Why?
A composite pipeline
BirdSuite Pipeline
BirdSuite
Preprocessing
BirdSuite
Auxiliary Files
Intensity
Files
Ipattern Pipeline
Ipattern
Preprocessing
C_Score Pipeline
Parse
C_Score
Output
C_Score
Auxiliary Files
PennCNV Pipeline
PennCNV
Preprocessing
Parse
Ipattern
Output
Ipattern
Auxiliary Files
Auxiliary
Files
C_Score
Preprocessing
Parse
BirdSuite
Output
PennCNV
Auxiliary Files
Parse
PennCNV
Output
StandardFormat
Call Files
Status of PGC-SCZ dataset
28,000 arrays have been processed (90%)
Study
MGS
Omar Gustafsson
Jennifer Moran
Brien P. Riley
Moran_Sweden
Elvira Bramon
Total (Affy6.0)
Platform
Affy_6.0
Affy_6.0
Affy_6.0
Affy_6.0
Affy_6.0
Affy_6.0
Case
4195
263
1170
3510
1089
1019
11246
Control
3804
369
1762
2048
1529
3124
12636
Processed
Yes
Yes
Yes
Yes
Yes
Yes
Total
7999
632
2932
5558
2618
4143
23882
J.E. Buizer
Rujescu, Dan
Thomas Werge
Total (Illumina)
Illum_550K
Illum_300K
Illum_550K
843
437
623
1903
683
353
498
1534
Yes
Yes
Yes
1526
790
1121
3437
Jennifer Moran
Jennifer Moran
George Kirov
Total (Others)
Affy_5.0
Affy_500
Affy_500
2133
0
494
2627
1624
169
0
1793
No
No
No
3757
169
494
4420
15776
15963
Total (PGC)
31739
Goals of the PGC CNV group
• Develop a pipeline for processing array
data
• Generate a high-quality set of CNV calls
for each disorder
• Develop (in collaboration with
stats/analysis group) rigorous metaanalytical approach for testing association
with disease
Performance of the composite
pipeline
GLM score
Intersection of
multiple methods
Default
output of
Individual
Methods
Notes: BPIC: Birdseye + PennCNV + Ipattern + Cscore; CSC: Cscore ; BDE: Birdseye; UPC: PennCNV ;
IPT: Ipattern .
Merging Calls from multiple methods
(from a single individual) into a “consensus” call
• A consensus call is
defined as a contiguous
region called by >1
method
• The consensus genotype
(“Del” or “Dup”) is decided
by majority vote
• vote is (with some
specific exceptions*)
virtually always
unanimous
*When consensus overlaps
substantially (>50%) with a
canary call. In this case
(only), canary has veto
power
Goals of the PGC CNV group
• Develop a pipeline for processing array
data
• Generate a high-quality set of CNV calls
for each disorder
• Develop (in collaboration with
stats/analysis group) rigorous metaanalytical approach for testing association
with disease
Core Analyses
• It’s advisable to use a consistent methodology across
disorders for the “core” analyses, which would likely
include
– merging and QC of CNV call sets
– Point-wise and Regional association tests
– Mutational burden analysis
• Note: PGC2 PIs must adhere to the 1 year timeline for
selecting targets for replication (for all disorders)
• How best to coordinate this?
A proposal to use model similar to what’s been done for GWAS
– Implementation of core case/control methodologies by analysis
group (members of CNV, stats and disease group)
– Further analysis by disease group
Core Analyses (cont.)
• Analysis of CNV/gene burden (what
proportion of risk can be explained by
CNVs?)
– CNVs will be subclassified by type, size, gene
content
• locus association tests, options for this
include
– regional test of CNV loci (predefined regions)
• Can be defined based on frequency in the
combined sample
• also may carry out separate test of candidate loci
from prior studies
– gene based or exon based
Meta-analytic method issues
• -Burden and association tests will be
carried out (using R or PLINK) controlling
for several covariates (known to influence
observed CNV burden or allele frequency)
– Ancestry as defined by PCA of ancestry
informative SNP markers (these labels should
already be available for most samples)
– Platform
– Study
– Plate
Many other (non-core) analyses
• Pathway-based association
• Cross-disorder/genetic overlap
• Analysis of clinical data
– Correlating clinical metrics (e.g. head
circumference, IQ, others) with CNV genotype
– CNV burden in clinical subgroups
• Other?
Insert preliminary data here
• Proof of principle. The following
preliminary analyes are in progress
– Consensus calls have been generated on all
Affy6 arrays from the SCZ dataset
– Calls will be compiled for select SCZassociated loci at 1q21.1, 15q13.3, 16p11.2,
22q11.2, NRXN1 and VIPR2
– Concordance will be determined with calls
from published studies (ISC and MGS)
– Any (sizable) discordant calls will be tested
experimentally
Next Steps
• Processing of Bipolar dataset (November 2012 –
February 2013)
• Complete Pipeline for Core Analyses of SCZ
(November-December 2013)
• Preliminary SCZ results on genome wide
mutational burden (Jan-Feb 2013)
• Preliminary SCZ results on regional association
(Jan-Feb 2013)
• Inclusion of other disorders in primary analysis
(for psych chip) will depend on whether data
assembly & data processing can be completed
in 1 year
Psych Chip
• In late 2013, the CNV and SNP content of
the “psych chip” will be finalized.
• Genotyping will be carried out in large
replication samples
Summary
• Affy & Illumina pipeline construction is
complete
• Processing of SCZ dataset is 90%
complete
• Consensus SCZ CNV calls assembled
• Beginning now
– SCZ analysis
– BD data processing
Next CNV call
• Thursday Nov 15, 2pm EST
• Phone: 866-740-1260
• Access code: 5342332
Thank You
•
University of Toronto
–
–
–
–
•
Pat Sullivan
Jin Peng Szatkiewicz
Laura Todd
•
Brien Riley
Ken Kendler
Broad/MGH
–
–
–
–
–
Sarah Bergen
Steve McCarroll
Ben Neale
Jennifer Moran
Stephan Ripke
Danielle Posthuma
University of Colorado
–
•
Elliot Gershon
Ailley Rodriguez
VU
–
•
Doug Greer
Wenting Wu
Madhu Gujral
Abhishek Bhandari
Dheeraj Malhotra
University of Chicago
–
–
VCU
–
–
•
•
University of North Carolina
–
–
–
•
Dalila Pinto
Pamela Sklar
UCSD
–
–
–
–
–
Steve Scherer (co-chair)
Christian Marshall
Bhooma Thiruvahindrapuram
Zhuozhi Wang
Mt. Sinai College of Medicine
–
–
•
•
Matthew Keller
And many more…
–
–
–
–
Doug Levinson
Mick Odonovan
Steve Faraone
George Kirov