Download Expression profiling reveals off

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Zinc finger nuclease wikipedia , lookup

Metagenomics wikipedia , lookup

Genetic engineering wikipedia , lookup

Cancer epigenetics wikipedia , lookup

Epitranscriptome wikipedia , lookup

Minimal genome wikipedia , lookup

Non-coding RNA wikipedia , lookup

History of genetic engineering wikipedia , lookup

Genomics wikipedia , lookup

Short interspersed nuclear elements (SINEs) wikipedia , lookup

No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup

Point mutation wikipedia , lookup

Public health genomics wikipedia , lookup

Epigenetics in learning and memory wikipedia , lookup

Pathogenomics wikipedia , lookup

Transposable element wikipedia , lookup

Epigenetics of neurodegenerative diseases wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

Ridge (biology) wikipedia , lookup

MicroRNA wikipedia , lookup

X-inactivation wikipedia , lookup

Gene therapy wikipedia , lookup

Gene therapy of the human retina wikipedia , lookup

Genomic imprinting wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Gene desert wikipedia , lookup

Gene nomenclature wikipedia , lookup

Polycomb Group Proteins and Cancer wikipedia , lookup

Genome (book) wikipedia , lookup

NEDD9 wikipedia , lookup

Epigenetics of diabetes Type 2 wikipedia , lookup

Genome evolution wikipedia , lookup

Primary transcript wikipedia , lookup

Long non-coding RNA wikipedia , lookup

Gene wikipedia , lookup

Genome editing wikipedia , lookup

Microevolution wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Helitron (biology) wikipedia , lookup

Mir-92 microRNA precursor family wikipedia , lookup

Gene expression programming wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Designer baby wikipedia , lookup

RNA silencing wikipedia , lookup

RNA interference wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Gene expression profiling wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

RNA-Seq wikipedia , lookup

Transcript
© 2003 Nature Publishing Group http://www.nature.com/naturebiotechnology
B R I E F C O M M U N I C AT I O N S
Expression profiling reveals
off-target gene regulation by RNAi
Aimee L Jackson1,2, Steven R Bartz1,2, Janell Schelter1,
Sumire V Kobayashi1, Julja Burchard1, Mao Mao1, Bin Li1,
Guy Cavet1 & Peter S Linsley1
RNA interference is thought to require near-identity between
the small interfering RNA (siRNA) and its cognate mRNA.
Here, we used gene expression profiling to characterize the
specificity of gene silencing by siRNAs in cultured human
cells. Transcript profiles revealed siRNA-specific rather than
target-specific signatures, including direct silencing of
nontargeted genes containing as few as eleven contiguous
nucleotides of identity to the siRNA. These results
demonstrate that siRNAs may cross-react with targets of
limited sequence similarity.
RNA interference (RNAi) is a potent method to suppress gene expression in mammalian cells, and has generated much excitement in the
scientific community1. Specific gene silencing promises the potential
to harness human genome data to elucidate gene function, identify
drug targets and develop more specific therapeutics. Numerous
reports in the literature describe the exquisite specificity of siRNAs,
suggesting a requirement for near-perfect identity with the siRNA
sequence2–4. However, most of the published analyses of siRNAinduced gene silencing have examined only one or a few genes in
addition to the targeted gene, an approach not unlike ‘looking for
keys under the lamppost.’ Cross-hybridization with transcripts containing partial identity to the siRNA sequence may elicit phenotypes
reflecting silencing of unintended transcripts in addition to the target
gene. We used expression profiling for an unbiased, genome-wide
analysis of the efficacy and specificity of siRNA-induced silencing of
two genes involved in signal transduction, the insulin-like growth factor receptor (IGF1R) and mitogen-activated protein kinase 1
(MAPK14, also known as p38α).
Using standard selection rules5, we designed 16 siRNAs to target the
coding region of IGF1R and 8 siRNAs to target MAPK14 (see
Supplementary Table 1 online). After the siRNAs were transfected into
HeLa cells, we compared the expression profiles resulting from silencing of the same target gene by different siRNAs (see Supplementary
Methods online). Surprisingly, our analysis revealed few genes regulated in common by different siRNAs to the same target gene. Each of
the 8 siRNA duplexes targeted to MAPK14 produced a distinct expression pattern (Fig. 1a). Likewise, each of the 16 siRNA duplexes to
IGF1R produced a unique expression pattern (Fig. 1b). Virtually identical gene expression patterns were observed in three independent
experiments, demonstrating that gene regulation resulting from a particular siRNA was reproducible. Detailed analysis suggests that
different siRNAs to the same target transcript elicit a small number of
gene regulations in common (data not shown), but the vast majority
of the transcript expression patterns were siRNA-specific rather than
target-specific. The number and identity of altered transcripts did not
correspond to the ability of the siRNA to silence the target gene. Target
mRNA levels were correlated with protein levels and the extent of
mRNA silencing measured by TaqMan was equivalent to that measured by array profiling. An siRNA targeted to luciferase reproducibly
regulated the expression of several genes despite the lack of a homologous target in the human genome. Thus, we have observed patterns of
gene regulation that are specific for the siRNA sequence used for
silencing, rather than for the intended target.
We subsequently performed detailed concentration and kinetic
analysis of MAPK14 protein and RNA knockdown by siRNA
MAPK14-1. Although target gene silencing was detectable when the
siRNA concentration was decreased 1,000-fold, off-target gene regulation was also detectable (Fig. 1c). Many of these genes showed
nearly identical half-maximal responses with respect to siRNA concentration as MAPK14 (∼1nM). We were unable to titrate the offtarget gene regulation from silencing of the intended target, indicating
that off-target gene regulation is not simply an artifact of high
siRNA concentration. We then analyzed temporal gene expression
patterns. The MAPK14 protein demonstrated a half-life of approximately 40 h after siRNA transfection (see Supplementary Fig. 1
online). In contrast, the MAPK14 transcript demonstrated halfmaximal degradation approximately 11 h post-transfection (see
Supplementary Fig. 1 online). Through expression profiling, we
observed a surprising degree of gene regulation at early time points
(6–12 h) well before any observable decrease in the MAPK14 protein
(see Supplementary Fig. 1 online). These gene expression changes
therefore were unlikely to be secondary events resulting from loss of
MAPK14 function. The expression signature could be divided into
several temporally distinct groups of transcripts based on timing of
half-maximal gene regulation (Fig. 2a). Group 1 contains a single
transcript, the intended target MAPK14. Group 2 contains nine transcripts demonstrating similar kinetics of silencing to MAPK14, with
half-maximal degradation at 7–13 h as determined by microarray.
This same group of transcripts was downregulated with rapid kinetics
in a separate experiment, demonstrating that these genes were reproducibly silenced by this siRNA. The rapid kinetics of transcript regulation suggests that these are direct transcript degradation events.
This is in contrast to kinetic groups 3 and 4, for which half-maximal
degradation occurs at approximately 40 h and therefore likely represents secondary gene expression changes.
None of the genes in kinetic group 2 (Fig. 2b) are known to function in the MAPK14 pathway. Remarkably, all of these transcripts
were found to contain regions of partial sequence identity to the
siRNA duplex (Fig. 2b). Sequence alignment demonstrated that these
genes could be divided into two subgroups. One subgroup contained
a core of 14–15 nucleotides of similarity encompassing the central
1Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, Washington 98034, USA. 2These authors contributed equally to this work. Correspondence should be
addressed to A.L.J. ([email protected]) or S.R.B. ([email protected]).
NATURE BIOTECHNOLOGY ADVANCE ONLINE PUBLICATION
1
B R I E F C O M M U N I C AT I O N S
a
–1.0
0
Figure 1 Expression profiling reveals siRNA sequence-specific and
concentration-dependent gene expression patterns. (a) Eight different
siRNA duplexes targeted to the MAPK14 coding region were used for gene
silencing in HeLa cells. Luc, siRNA targeted to luciferase. Green indicates
decreased expression relative to mock transfection, red indicates increased
expression. The bar graph represents the fraction of target protein (red) and
RNA (black) remaining 48 h after siRNA transfection (see Supplementary
Methods and ref. 8 for details on microarray analysis). (b) Sixteen different
siRNA duplexes targeted to the IGF1R coding region were used for gene
silencing. Asterisks indicate the IGF1R siRNA duplexes that reduce protein
level by at least 60%. (c) HeLa cells were transfected with the indicated
concentrations of MAPK14-1 siRNA.
1.0
Log 10 ratio
MAPK14-8
MAPK14-7
MAPK14-6
MAPK14-5
MAPK14-4
© 2003 Nature Publishing Group http://www.nature.com/naturebiotechnology
MAPK14-3
MAPK14-2
MAPK14-1
Luc
0 0.2 0.4 0.6 0.8 1.0
Fraction MAPK14 remaining
b
–1.0
0
1.0
Log 10 ratio
Luc
IGF1R-16
IGF1R-15
IGF1R-14
IGF1R-13
IGF1R-12
IGF1R-11
IGF1R-10
IGF1R-9
IGF1R-8
IGF1R-7
IGF1R-6
IGF1R-5
IGF1R-4
IGF1R-3
IGF1R-2
IGF1R-1
*
*
*
*
*
*
*
*
0
0.4 0.8
*
*
*
1.2
1.6 2.0
Fraction IGF1R remaining
Mapk14
c
100 nM
20 nM
4 nM
0.8 nM
0.16 nM
–1.0
0
1.0
Log 10 ratio
region of the siRNA sequence. The second subgroup contained a
smaller core of similarity encompassing the nine nucleotides at the 3′
end of the siRNA sense-strand sequence. This is in contrast to transcripts in kinetic groups 3–5. Many of these genes lacked substantial
sequence similarity to the siRNA, as determined by FASTA analysis,
and those that did displayed only short stretches (<6–8 nucleotides)
of similarity distributed randomly throughout the siRNA sequence
(data not shown.) Thus, the bias for a core of sequence similarity
encompassing the 3′ end of the siRNA is unique to the rapidly
silenced transcripts. The same pattern of sequence similarity to genes
silenced with rapid kinetics was also observed with two different
siRNAs to IGF1R (data not shown.) Interestingly, for one of these
siRNAs, the off-target gene silencing was directed by the antisense
strand of the siRNA, whereas for the other siRNA the off-target gene
silencing appeared to be directed by the sense strand. This suggests
that both the sense and antisense strands of an siRNA duplex can contribute to transcript silencing.
2
On the basis of published reports that gene silencing was abolished
by single nucleotide changes in the siRNA sequence2,6, we would not
have predicted that this limited degree of sequence similarity would be
sufficient for transcript silencing. However, to test this possibility, we
systematically substituted the nucleotide at each position of the siRNA
sequence and determined the effect of the altered sequence on the
expression signature. Although a comprehensive discussion of this
analysis is beyond the scope of this report, representative results are
presented in Fig. 2c. A single nucleotide substitution at position 4
reduced silencing of MAPK14, and abolished silencing of the off-target
genes in subgroup 1 that contain similarity to MAPK14 at this position. However, silencing was not abolished for the six off-target genes
in subgroup 2 that do not contain similarity to MAPK14 in this region,
confirming that silencing of these off-target genes is independent of
loss of MAPK14 expression. A single nucleotide substitution at position 15 reduced MAPK14 silencing, and abolished silencing of all nine
off-target genes, presumably because all nine transcripts contain similarity to MAPK14 in this region.
An siRNA duplex designed to target a different sequence in
MAPK14 (MAPK14-2) silenced MAPK14 but not any of the group 2
transcripts, which are not similar to the new siRNA sequence (Fig. 2c).
Finally, siRNAs for two off-target transcripts, KPNB3 and FLJ20291,
silenced the expression of MAPK14 in addition to their intended targets (Fig. 2d). The KPNB3 siRNA shares 14 contiguous nucleotides,
and a total of 15 nucleotides, of identity with MAPK14. The FLJ20291
siRNA shares only 11 contiguous nucleotides, and a total of
15 nucleotides, of identity with MAPK14-1. In conclusion,
15 nucleotides, and perhaps as few as 11 contiguous nucleotides, of
sequence identity are sufficient to direct silencing of nontargeted transcripts and therefore, although RNA interference results in robust
silencing of the desired target, off-target gene regulation can occur as a
result of degradation of mRNA transcripts with partial identity to the
siRNA sequence.
Although the early regulated genes show sequence similarity to the
siRNA, not all possible transcripts with this level of sequence similarity
were silenced. Despite repeated attempts, we have thus far been unable
to identify patterns that could help predict off-target activity of
siRNAs. Furthermore, the transcripts silenced with early kinetics are
only a portion of the total signature. Although we cannot explain all of
the off-target gene regulation, we believe that at least a portion of it can
be explained by cross-hybridization to transcripts of similar sequence.
The finding of a core of sequence similarity to the 3′ end of the
siRNA sense strand (the 5′ end of the antisense strand) invites speculation that this region of the siRNA plays an important role in transcript
silencing. This is strikingly reminiscent of the finding that the 5′ ends
of many microRNAs (miRNAs) contain seven nucleotides of sequence
that are complementary to target regulatory motifs in the 3′ untranslated regions of mRNA7. Sequence complementarity to these regula-
ADVANCE ONLINE PUBLICATION NATURE BIOTECHNOLOGY
Contig56528_RC
AF093680
RPA2
DKFZp564J157
RRAD
FLJ20291
RAP2A
MAPK14
of expression ratio
KPNB3
c
Group 1 (MAPK14)
Group 2 (9 genes)
Group 3 (8 genes)
Group 4 (2 genes)
Group 5 (1 gene)
Group 6 (5 genes)
MAPK14-1
pos. 4 mut
pos. 5 mut
10
pos. 15 mut
MAPK14-2
Log
Log10 (ratio)
Time after transfection ( h)
b
P ≤ 0.01
2.0
d
1.0
0.0
–1.0
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19
Accession
NM_001315
NM_002271
Hs13_10109_29_13_1603
NM_017748
A133672
NM_004165
NM_002946
NM_018457
NM_013242
AW237459
Gene name
MAPK14
KPNB3
RAP2A
FLJ20291
Contig53709_RC
RRAD
RPA2
DKFZp564J157
AF093680
Contig56528_RC
G C T A
T C T A
T T T A
CC T C
A G G C
G A A G
A T G A
A A T A
A G G A
(MAPK14-2)
MAPK14
A T G T G A T T G G T C T G T T T T A
CC T AC
C
C
C
A
A
A
A
A A
CT
CA
G
G
G
G
A
A
A
A
G
G
G
G
A
A
A
A
A G A A
C AG A
GG G A
A
A
A
A
C
A
A
G C T T T G A
T T T C T T C
G A A A T G A
C
C
C
C
C
C
C
C
C
C
T
T
T
T
T
T
T
T
T
T
G
G
G
T
G
G
C
C
C
C
C
C
T T
G C
G C
G C
G G T T
A T C T
A G C C
G G T A
G G T T
G G G T
G G T G
G G T T
G G T T
G G T A
Identity
0.0
14/15
14/15
11/15
8/13
10/12
5/11
9/10
8/10
8/11
2.0
Log10 (ratio)
© 2003 Nature Publishing Group http://www.nature.com/naturebiotechnology
a
Contig53709_RC
B R I E F C O M M U N I C AT I O N S
Log10 (intensity)
2.0
P ≤ 0.01
1.0
0.0
–1.0
0.0
2.0
Log10 (intensity)
Figure 2 Kinetic and sequence analysis of MAPK14 knockdown by RNAi. (a) RNA extracts were harvested at the indicated times after transfection of HeLa
cells and processed for expression profiling. Transcript data from the microarray analysis were analyzed by trend plot to determine timing of half-maximal
transcript degradation. Data are presented as log10 of expression ratio plotted as a function of time after transfection. (b) Sequence alignment of genes
regulated with similar kinetics to MAPK14. Nucleotides with perfect identity to the MAPK14 sequence are indicated in bold, mismatched nucleotides are
indicated in small font. Accession numbers listed are from the NCBI GenBank database. Gene names are from LocusLink except for the EST contig
designations, which are from www.pharp.org/est_assembly/human/gene_number_methods.html. The degree of sequence identity to MAPK14-1 is indicated
as the number of contiguous identical nucleotides /the total number of identical nucleotides. (c) HeLa cells were transfected with MAPK14-1 siRNA, or
MAPK14-1 siRNA containing a single nucleotide substitution. (see Supplementary Table 1 online for siRNA sequences). (d) Gene expression resulting from
silencing by siRNAs to the off-target genes KPNB3 and FLJ20291. Microarray data are plotted as log10 of expression ratio versus log10 of fluorescence
intensity. Blue represents transcripts with unaltered expression. The targeted genes, as well as MAPK14, are indicated.
tory motifs was located exclusively at the 5′ ends of the miRNAs, suggesting a role in recognition or stabilization of the miRNA-mRNA
interaction. The same may be true for the siRNA-mRNA interaction.
Transcripts containing sequence similarity to this portion of the
siRNA could potentially be collateral targets for silencing.
Given the small degree of similarity implicated in off-target gene
regulation, it may be difficult to select an siRNA sequence that will be
absolutely specific for the target of interest. Until siRNA design can be
improved to convincingly reduce this off-target activity, incorporating
multiple siRNA duplexes to silence a target gene of interest will
increase the confidence with which an observed phenotype and
expression pattern can be linked to target gene silencing. In this way,
expression profiling in conjunction with gene silencing by RNAi will
provide an effective means to identify and characterize gene function
in cultured mammalian cells.
Note: Supplementary information is available on the Nature Biotechnology website.
NATURE BIOTECHNOLOGY ADVANCE ONLINE PUBLICATION
ACKNOWLEDGMENTS
We thank G. Schimmack, S. Kim, M. Imakura and J. McLarnan for expert technical
assistance, M. Biery for assistance with computer graphics and the Rosetta Gene
Expression Laboratory for microarray work.
COMPETING INTERESTS STATEMENT
The authors declare that they have no competing financial interests.
Received 10 February; accepted 18 March 2003
Published online 18 May 2003; doi:10.1038/nbt831
1. Couzin, J. Science 298, 2296–2297 (2002).
2. Elbashir, S.M., Martinez, J., Patkaniowska, A., Lendeckel, W. & Tuschl, T. EMBO
J. 20, 6877–6888 (2001).
3. Tuschl, T., Zamore, P.D., Lehmann, R., Bartel, D.P. & Sharp, P.A. Genes Dev. 13,
3191–3197 (1999).
4. Hutvagner, G. & Zamore, P.D. Science 287, 2056–2060 (2002).
5. Elbashir, S.M., Harborth, J., Weber, K. & Tuschl, T. Methods 26, 199–213
(2002).
6. Martinez, L.A. et al. Proc. Natl. Acad. Sci. USA 99, 14849–14854 (2002).
7. Lai, E.C. Nat. Genetics 30, 363–364 (2002).
8. Hughes, T.R. et al. Nat. Biotechnol. 19, 342–347 (2001).
3