Download Integrating Gene Expression Analysis into Genome-Wide

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Saethre–Chotzen syndrome wikipedia , lookup

Oncogenomics wikipedia , lookup

Cancer epigenetics wikipedia , lookup

Genomics wikipedia , lookup

Epigenetics in learning and memory wikipedia , lookup

NEDD9 wikipedia , lookup

Primary transcript wikipedia , lookup

Non-coding RNA wikipedia , lookup

Epitranscriptome wikipedia , lookup

MicroRNA wikipedia , lookup

Genomic imprinting wikipedia , lookup

Epigenetics of depression wikipedia , lookup

Pharmacogenomics wikipedia , lookup

Pathogenomics wikipedia , lookup

Epigenetics of neurodegenerative diseases wikipedia , lookup

Non-coding DNA wikipedia , lookup

Gene nomenclature wikipedia , lookup

Gene desert wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

RNA silencing wikipedia , lookup

Genetic engineering wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Gene wikipedia , lookup

Gene therapy wikipedia , lookup

Metagenomics wikipedia , lookup

History of genetic engineering wikipedia , lookup

Human genetic variation wikipedia , lookup

Gene therapy of the human retina wikipedia , lookup

Long non-coding RNA wikipedia , lookup

Genome (book) wikipedia , lookup

Public health genomics wikipedia , lookup

Genome evolution wikipedia , lookup

Epigenetics of diabetes Type 2 wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Microevolution wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Gene expression profiling wikipedia , lookup

Gene expression programming wikipedia , lookup

Mir-92 microRNA precursor family wikipedia , lookup

Designer baby wikipedia , lookup

RNA-Seq wikipedia , lookup

Quantitative trait locus wikipedia , lookup

Transcript
technical note: rna analysis
Integrating Gene Expression Analysis
into Genome-Wide Association Studies
Combining multiple forms of data in a single analysis enables more informative characterization
of complex traits.
introduction
the potential value that genome-wide expression screens
The recent availability of high-throughput, whole-genome
may add to clinical trait-based GWAS is beginning to be
genotyping arrays such as Illumina’s Infinium® HD DNA
realized and is cause for growing excitement in the genetics
Analysis BeadChips has enabled researchers to efficiently
field5–8. In part for this reason, Illumina has developed
screen for associations between variations in the genome
the HumanHT-12 Gene Expression BeadChip, a 12-sample
and phenotypes of interest. To date, hundreds of genome-
array that presents a low cost solution for enhancing
wide association studies (GWAS) have identified quantita-
GWAS with whole-genome expression analysis. This
tive trait loci (QTL) underlying many common complex
document highlights some of the potential benefits of
diseases, demonstrating the utility of this approach for
incorporating whole-genome gene expression data into a
dissecting the genetic basis of polygenic traits1–3.
clinical trait-based GWAS.
Although GWAS can effectively map loci contributing to
phenotypes of interest, they offer limited insight as to the
eQTL Analysis: gene expression as a quantitative trait
causative genetic variation or the mechanism by which
In a traditional GWAS, the trait being investigated is as-
it confers its effect. As a result, the research community
sociated with a region in the genome. This is also the case
is recognizing the utility of integrating multiple forms
with eQTL (expression QTL) analysis, which treats mRNA
of data into a single analysis. For example, the wealth of
abundance as a trait in a GWAS. An eQTL screen identifies
information that whole-genome expression profiling is
loci, or eQTL, that may contribute directly or indirectly to
capable of uncovering has been well-documented4, but
expression levels. Expression QTL can be divided broadly
Figure 1: Illumina solutions for integrated analysis
rs7597595
2.00
1.80
1.60
1.40
Norm R
1.20
1
0.80
0.60
0.40
0.20
0
-0.20
5
0
18
0.20
0.40
0.60
101
0.80
1
Norm Theta
4117526127
Infinium HD DNA Analysis BeadChip
HumanHT-12 Gene Expression BeadChip
The 12-sample HumanHT-12 Gene Expression BeadChip (right) targets more than 48,000 transcripts in the RefSeq database (Build
36.2, Release 22). This multi-sample whole-genome expression BeadChip matches the throughput of Illumina’s Infinium HD DNA Analysis BeadChip (left) product line. Illumina’s user-friendly analysis software permits evaluation of both genotype and expression data.
technical note: rna analysis
Genome Position (transcript)
fIGURE 2: eqtl analysis data
X
22
21
20
19
18
17
16
15
14
13
12
11
10
9
8
7
6
5
4
3
2
1
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 X
Genome Position (eQTL)
This graph illustrates observations from eQTL analysis that have been reported in the current literature. The diagonal band indicates
cis-eQTL. These eQTL are detected when the locus that affects mRNA abundance overlaps the location of the affected gene. The
horizontal band represents a trans-band or “eQTL hot spot,” which suggests that expression of multiple genes map to the same
single-nucleotide polymorphism (SNP).
into two categories based on their position relative to the
Cis-eQTL and candidate gene filtering
gene whose expression is being measured. If the eQTL and
Although the process of mapping clinical QTL has become
gene positions overlap, the eQTL is considered to be
very efficient, the transition from QTL to the identification
cis-acting and the gene it overlaps to be cis-regulated
and validation of the quantitative trait genes (QTGs)
(Figure 2). If the eQTL and gene location are non-overlapping,
containing the causative genetic variant has been
the eQTL is considered to be trans-acting. Cis-eQTL are
severely rate-limiting9,10. While informatics tools such as
usually believed to result from a variant in a regulatory
SNP and functional pathway databases have utility in
region of the gene that affects its level of abundance.
filtering QTG candidates, biological evidence supporting
The causal mechanisms behind trans-eQTL can be con-
these data is often lacking. Adding expression signatures
siderably more variable. For instance, a trans-eQTL could
to a GWAS may serve to bridge this gap by enabling the
reflect a variant that affects the abundance or activity
detection of cis-regulated QTG within clinical QTL6,8,11.
of a transcription factor. Alternatively, a trans-eQTL
Cis-eQTL suggest a relationship between a local genetic
could result from a variation in a component of a signaling
variant and phenotypic variation that is specific to a
cascade that ultimately affects the abundance of the
unique QTG candidate and could be capable of driving
mRNA being measured (Figure 2).
the clinical trait difference. If a cis-regulated QTG
candidate were supported by informatics-based evidence
technical note: rna analysis
fIGURE 3: potential outcomes of combining gene expression and genotype data
Identify distinct expression
patterns within populations
Expression
Screen for differential expresssion
between cases and controls and identify
functional pathway enrichments
Identify cis-eQTL and trans-eQTL
and screen for functional
pathway enrichments
Genotype
Phenotype
Discover clinical QTL
and candidate QTG
such as known function relative to the clinical trait of
Comparing gene expression signatures within clinical
interest or the presence of candidate causative SNPs, the
trait populations may provide value to GWAS as well.
data would present a compelling case for prioritizing the
For example, although the phenotype exhibited by the
QTG candidate for further experimental validation.
study population may appear uniform across affected
individuals, its genetic basis may not be. The detection
Differential expression, trans-eQTL, and functional
pathway analysis
of discernible expression profiles within a sample
Adding expression analysis to a GWAS may also be useful
subphenotype groups. These subgroups could then be
in the identification of differential expression patterns
analyzed independently, potentially reducing the level
across study populations. Screening for functional
of noise and enabling the detection of clinical QTL that
category enrichment among genes that are differentially
might otherwise be missed6,8.
population may be useful for identifying distinct
regulated between case and control populations has been
used to identify pathways that may be involved in
SUMMARY
conferring traits of interest6,12. In some cases, subsets of
Combining expression and genotype data sheds greater
these genes might be under coordinated regulatory
light on the biological context of QTL. It enables the
control, such that a single genetic variant is responsible
generation of better-informed hypotheses and provides
for the differential mRNA abundance detected across
additional filtering tools for prioritizing candidate QTGs.
them. When integrated with genotype data in the context
The rapidly growing number of examples highlighting the
of a GWAS, such gene sets may map to eQTL trans-bands,
importance of interplay among genotype, DNA methyla-
or “ eQTL hot spots,” that overlap a QTL for the clinical
tion status, miRNA, and mRNA abundance in determining
trait6. In such instances, any genes in the co-regulated set
the incidence, nature, and severity of clinical phenotypes
that are not annotated in the enriched functional
serves as an indication of the broad range of experiments
category may be excellent candidates for downstream
that future studies will employ. Just as methods used in
validation as novel members of the functional pathway
GWAS can be applied to expression data to generate eQTL
and modulators of the clinical trait.
results, they can also be applied to data from methylation,
technical note: rna analysis
miRNA, and similar experiments to create a more complete
characterization of the molecular basis of complex traits.
Clearly, an effective tool for characterizing a multifaceted
trait is a multifaceted analysis. To that end, Illumina is
building a growing portfolio of products that facilitate
cost-effective and efficient data integration for the future
of genetic discovery.
References
(1)
Pearson TA, Manolio TA (2008) How to Interpret a Genome-wide Assocation Study.
JAMA 299(11):1335–1344.
(2)
McCarthy MI, Abecasis GR, Cardon LR, Goldstein DB, Little J, et al. (2008) Genomewide association studies for complex traits: consensus, uncertainty and challenges.
Nat Genet 9:356–369.
(3)
http://www.illumina.com/pagesnrn.ilmn?ID=89
(4)
Segal E, Friedman N, Kaminski N, Regev A, Koller D (2005) From signatures to models:
understanding cancer using microarrays. Nat Genet 37:S38–S45.
(5) Dermitzakis ET (2008) From gene expression to disease risk. Nat Genet 40:492–493.
(6)
Drake TA, Schadt EE, Lusis AJ (2006) Integrating genetic and gene expression
data: application to cardiovascular and metabolic traits in mice. Mamm Genome
17(6):466–79.
(7)
Additional Information
For more information about Illumina Gene Expression
and Infinium HD DNA Analysis BeadChips, please visit
www.illumina.com or contact us at the address below.
Gilad Y, Rifkin SA, Pritchard JK (2008) Revealing the architecture of gene regulation:
the promise of eQTL studies. Trends Genet 24(8):408–15. (8)
Schadt EE, Monks SA, Friend SH (2003) A new paradigm for drug discovery: integrating clinical, genetic, genomic and molecular phenotype data to identify drug targets.
Biochem Soc Trans 31(2):437–43.
(9)
DiPetrillo K, Wang X, Stylianou IM, Paigen B. (2005) Trends Genet. Bioinformatics
toolbox for narrowing rodent quantitative trait loci. 21(12):683–92.
(10) Korstanje R and Paigen B (2002) From QTL to gene: the harvest begins. Nat Genet
31:235–236.
(11) Stranger BE, Forrest MS, Clark AG, Minichiello MJ, Deutsch S, et al. (2005) Genomewide associations of gene expression variation in humans. PLoS Genet 1(6):e78.
(12) Kloth JN, Gorter A, Fleuren GJ, Oosting J, Uljee S et al. (2008) Elevated expression of
SerpinA1 and SerpinA3 in HLA-positive cervical carcinoma. J Pathol 215(3):222–30.
Illumina, Inc.
Customer Solutions
9885 Towne Centre Drive
San Diego, CA 92121-1975
1.800.809.4566 (toll free)
1.858.202.4566 (outside North America)
[email protected]
www.illumina.com
For research use only
© 2008 Illumina, Inc. All rights reserved.
Illumina, Solexa, Making Sense Out of Life, Oligator, Sentrix, GoldenGate, DASL, BeadArray, Array of Arrays, Infinium, BeadXpress, VeraCode, IntelliHyb, iSelect, CSPro,
iScan, and GenomeStudio are registered trademarks or trademarks of Illumina. All other brands and names contained herein are the property of their respective owners.
Pub. No. 470-2008-008 Current as of 05 August 2008