Download Alternative Splicing

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Alternative Splicing
Genomic DNA Sequence
Transcription
pre-mRNA
Intron
Exon
Exon
Exon
Intron
Exon
Intron
Exon
RNA Processing
Gm
mRNA
Gm
AAAAA
AAAAA
mRNA
Alternative Splicing Data
Sources are Large and Growing
Curated databases
SWISS-PROT and RefSeq both support annotation
of experimentally supported alternative splicing
cDNA Sequencing Projects
RIKEN sequenced >21000 full length mouse cDNAs
Many other projects underway (human, fly, plants,…)
Shinagawa et al. (2001) Nature 409:685-90
Microarray detection
Direct or indirect alternative splicing detection
Hu et al. (2001)Genome Res 11:1237-45
Yeakley et al. (2002) Nat Biotech 20:353-9
Public EST data sources (dbEST)
>4.5 million human EST sequences
>12 million total EST sequences
About 1000 new sequences per day
Boguski et al. (1993) Nat
Gen 4:332-3
Nonsense-Mediated mRNA Decay
Genomic DNA
pre-mRNA
Gm
mRNA
Exon junction complex
AAAAAAAAA
Leeds et al. (1991) Genes Dev 5:2303-14
Nagy and Maquat (1998) TIBS 23:198-9
Le Hir et al. (2000) Genes & Dev 14:1098-1108
Mitchell and Tollervey (2001) Curr Opin Cell Biol 13:320-5
Ishigaki et al. (2001) Cell 106:607-17
Lykke-Andersen et al. (2001) Science 293:1836-9
Kim et al. (2001) EMBO 20:2062-68
Termination codon is
on the last exon
(not premature)
Nonsense-Mediated mRNA Decay
Interaction between EJC and
release factors triggers NMD
Decapping and degradation
Gm
mRNA
AAAAAAAAA
Termination codon > 50nt before
last exon junction
(Premature Termination Codon)
Leeds et al. (1991) Genes Dev 5:2303-14
Nagy and Maquat (1998) TIBS 23:198-9
Le Hir et al. (2000) Genes & Dev 14:1098-1108
Mitchell and Tollervey (2001) Curr Opin Cell Biol 13:320-5
Ishigaki et al. (2001) Cell 106:607-17
Lykke-Andersen et al. (2001) Science 293:1836-9
Kim et al. (2001) EMBO 20:2062-68
Nonsense-Mediated mRNA Decay
Translated normally
Gm
AAAAAAAAA
Degraded by NMD
Gm
AAAAAAAAA
NMD is Pervasive
1498 of 1500 genes surveyed from
fungi, plants, insects and
vertebrates obey the PTC rule
Nagy and Maquat (1998) TIBS 23:198-9
4.3% of reviewed RefSeqs have PTCs
34% have start codon after first exon
V(D)J recombination
Wang et al. (2002) J Biol Chem
277:18489-93
“NMD is a critical process in
normal cellular developement”
Wagner and Lykke-Andersen (2002) J Cell Sci 115:3033-8
Renders recessive many otherwise
dominant mutations
Cali and Anderson (1998) Mol Gen Genet 260:176-84
Transcriptional
Regulation
Gene locus
transcription
pre-mRNA
productive
splicing
productive
mRNA
RUST
translation
Protein
Transcriptional
Regulation
RUST
Gene locus
Gene locus
transcription
pre-mRNA
pre-mRNA
productive
splicing
Productive
mRNA
Productive
mRNA
Alternative Splicing Can Yield Isoforms
Differentially
Subjected to Nucleus
NMD
Nucleus
DNA
pre-mRNA
mRNA
DNA
pre-mRNA
mRNA
Premature
termination
codon
NMD
SC35 Auto-regulation
SC35 Locus
transcription
SC35 pre-mRNA
alternative
splicing
splicing
Productive SC35 mRNA
translation
SC35 protein
Sureau et al. (2001) EMBO J 20:1785-96
SC35 Locus
SC35 pre-mRNA
Productive SC35 mRNA
SC35 protein
SC35 Auto-regulation
Alternative splicing coupled with
nonsense-mediated decay
ORF
SC35 pre-mRNA
Gm
AAAAA SC35 mRNA
SC35
protein
SC35 pre-mRNA
SC35 mRNA
AAAAA (with premature
termination codon)
Sureau et al. (2001) EMBO J 20:1785-96
Gm
EST-inferred human isoforms
0
2000
NMD
Candidates
4000
6000
8000
10000
1989 (35 % of 5693)
Alternative isoforms
All isoforms, including canonical
5693
8820
Canonical Splice Forms
Refseq
mRNAs
Pruitt, K.D. et al (2001)
NAR 29: 137-40
Extract coding
regions
Lander et al. (2001) Nature 409:
860-921
Coding
Refseqs
Genomic DNA Sequence
Genomic
Contigs
Association
via LocusLink
Exon 1
Exon 2
Exon 3
Exon 4
Refseq-Contig Pairs
align w/
Spidey
≥98% id,
no gaps
Wheelan et al. (2001)
Gen Res 11:1952-7
Refseq-coding gene
mRNA
Construct genes from
aligned Refseq exons
& intervening genomic
introns (overlap
choose mRNA w/ largest
number of exons)
Refseq-coding genes
Refseq-coding genes
ESTs from dbEST
Boguski et al., (1993) Nat Genet 4, 332-3.
Cluster ESTs w/ WU-BLAST2
≥92% id, allow gaps
Florea, et al.,(1998)
Gen Res 8, 967-74.
Kan, et al. (2001)
Gen Res 11, 889-900.
Gish,(2002)(Wash.Univ.)
Align ESTs w/ sim4
Use TAP to infer
alternative mRNAs
Identification of
Alternative Isoforms
Alternative Isoforms of Refseq-coding genes
>92%
identity,
gaps
allowed
Aligned EST 5’ end
does not indicate
reading frame
Previous and new RUST targets
Class
Splicing
Factors
Ribosomal
Proteins
Experimental Evidence
AUF1, SC35
SRP20, SRP30b (in C. elegans)
Sureau et al. (2001) EMBO J 20:1785-96
Wilson et al. (1999) Mol Cell Bio 19:4056-64
Morrison et al. (1997) PNAS 94:9782-9785
L3, L7a, L10a, L12 (in C. elegans)
L30, S14B (in S. cerevisiae)
Mitrovich & Anderson (2000) GenesDev 14:2173-84
Among Our Results
AUF1, *10 new
L3, L7a, L10a,
L12, *11 new
Alternative Splicing
not
integer #
codons
Recruitment of
Sequence.
Deletion of
Sequence.
*Frameshift and
Truncation.
Premature Stop Codons
EST Limitations
Single pass
sequencing errors
Incompletely
processed transcripts
3’ end bias
Library
contamination
Thanaraj (1999) NAR 27:2627-37
Number of EST splice forms
Alternative Splicing EST Analysis
2000
1500
1000
500
0
Did not affect
Inserted stop
Changed
reading frame
codon
reading frame
From data in Brett et al. (2000) FEBS Lett 474:83-6
Alternative Isoform Inference
from Splice Pairs
Alternative Splice Pairs, by Mode
Alternative Splice Pairs, by Mode
Splice Pairs Generating Premature Stops
EST coverage and premature stops
For 76% of isoforms with premature stops:
RefSeq mRNA
Alternatively spliced
EST, reading frame 0
ESTs cover a PTC & splice junction downstream
In 80% of these isoforms, there is a PTC in every reading frame:
Alternatively spliced
EST, reading frame 1
Alternatively spliced
EST, reading frame 2
Alternative polyadenlyation signals are biased against recovery
Related documents