Download Powerpoint - University of British Columbia

Genomics
1
Genomics relies on high-throughput
technologies
• Automated sequencers
• Robotics
– Colony pickers and Arrayers for sequencing
– Microarray spotters
– Protein Spot-Picker and In-gel proteolytic
digestion
• High-throughput genetics
2
Areas of Genomics
• Genome Sequencing (Reference)
• Gene Expression
• Functional Genomics
• Proteomics
• Comparative Genomics
• Metabolomics
3
DNA Libraries
• Genomic libraries [for genome sequencing]
• cDNA libraries [for EST sequencing]
– EST sequences
– Full-length cDNA sequences
• cDNA libraries are needed for EST sequencing
4
EST Library
• Expressed Sequence Tags
• mRNA is isolated from specific tissues and developmental stages, then converted to cDNA
• Requires cloning of the cDNAs
• Requires many different tissues for good coverage of genomic information
• Usually sequenced from only the 5’ or the 3’ end (single-pass reads); sequencing both ends of a clone is known as paired-end or mate-end sequencing
• Sequencing both ends costs more $$
• Usually covers less than 60% of genes
• Contains no information on regulatory elements
• Paralogs complicate annotation and functional assignment of genes
5
FL cDNA Library
• Full Length cDNA library
• Requires intact, full-length mRNA
• Sequenced from both ends of the cDNA clone using primers in the vector sequences
• Clone size selection (>800 bases)
• Larger clones require finishing with additional primers designed internal to the clone insert
• Should translate to a full protein
• Requires more $$$ and effort to obtain the data
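The "should translate to a full protein" criterion can be illustrated with a small check for a complete open reading frame. This is only a minimal sketch, not the pipeline used for real FL cDNA projects: the function name `find_complete_orf`, the forward-strand-only search, and the reuse of the 800-base size cut-off as a parameter are assumptions made for illustration.

```python
# Sketch: does a cDNA insert contain a complete ORF (ATG ... stop codon)?
# Assumptions (not from the slides): forward strand only, standard stop codons,
# and the >800 base clone size cut-off applied to the insert sequence itself.

STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_complete_orf(cdna, min_insert=800):
    """Return (start, end) of the longest ATG..stop ORF, or None if absent."""
    seq = cdna.upper()
    if len(seq) <= min_insert:
        return None  # clone fails the size selection
    best = None
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first start codon in this frame
            elif start is not None and codon in STOP_CODONS:
                end = i + 3  # end index includes the stop codon
                if best is None or end - start > best[1] - best[0]:
                    best = (start, end)
                start = None  # look for a further ORF downstream
    return best


if __name__ == "__main__":
    # Toy insert: 2 nt of 5' UTR, ATG, 300 alanine codons, stop, 2 nt of 3' UTR
    toy = "GG" + "ATG" + "GCT" * 300 + "TAA" + "CC"
    print(find_complete_orf(toy))
```

A clone with a start codon but no in-frame stop returns `None`, which corresponds to a truncated insert that would need further finishing.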
6
Next Generation (NG)
Sequencing
• Sanger sequencing dominated for the past 25 years
• Expensive, low coverage, limited number of nucleotides (nt) per run
• NGseq (NG 2nd generation) has greatly reduced the cost per amount of data generated and gives greater coverage
• Technologies differ depending on the company producing them (Metzker 2010, Nature Reviews Genetics)
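The two quantities compared above, coverage and cost per amount of data, come down to simple arithmetic. The sketch below uses the usual mean-coverage estimate (number of reads × read length ÷ genome size). The function names and all numbers in the usage lines are hypothetical placeholders, not figures for any real platform.

```python
# Sketch: mean coverage and cost per megabase of sequence data.
# All input values used below are made-up illustrations, not platform specs.


def mean_coverage(num_reads, read_length, genome_size):
    """Average number of times each base is sequenced (C = N * L / G)."""
    return num_reads * read_length / genome_size


def cost_per_megabase(run_cost, num_reads, read_length):
    """Run cost divided by the megabases of raw sequence produced."""
    megabases = num_reads * read_length / 1e6
    return run_cost / megabases


if __name__ == "__main__":
    # Hypothetical run: 1 million reads of 100 nt on a 10 Mb genome
    print(mean_coverage(1_000_000, 100, 10_000_000))
    print(cost_per_megabase(500.0, 1_000_000, 100))
```

More reads or longer reads per run raise coverage and lower the cost per megabase at a fixed run cost, which is the trade-off the bullets describe.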
7
8
9
Next Generation (NG)
Sequencing
• NGseq (NG 3rd generation) has greatly reduced the cost per amount of data generated and gives greater coverage
• Much longer reads than NG 2nd generation
• Technologies differ depending on the company producing them (Metzker 2010, Nature Reviews Genetics)
• Ion Torrent, PacBio, Oxford Nanopore
10
PacBio NG 3rd
11
Genome Annotation
Requiring Bioinformatics
• Ref = Koonin, E.V. and Galperin, M.Y. 2004. In: Sequence-Evolution-Function, p. 196
12
Bioinformatics
• Biology, computer science, and information technology merge into a single discipline
• Creating and maintaining databases to store biological information, e.g. nucleotide and amino acid sequences
• Developing complex interfaces through which researchers can both access existing data and submit new or revised data
• Developing a comprehensive picture of normal cellular activities
• Involves analysis and interpretation of various types of data, including nucleotide and amino acid sequences, protein domains, and protein structures
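As a small illustration of storing and analysing nucleotide sequences, the sketch below reads FASTA-formatted text into a dictionary and computes GC content. The helper names `parse_fasta` and `gc_content` are hypothetical and the parser is deliberately minimal; an established library would be the usual choice for real data.

```python
# Sketch: parse FASTA text and compute GC content per sequence.
# Helper names are illustrative; quality values, ambiguity codes and
# duplicate record names are not handled.


def parse_fasta(text):
    """Return {record_name: sequence} from FASTA-formatted text."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            parts = line[1:].split()
            name = parts[0] if parts else "unnamed"  # first word is the ID
            records[name] = []
        elif name is not None:
            records[name].append(line)  # sequence may span several lines
    return {key: "".join(chunks).upper() for key, chunks in records.items()}


def gc_content(seq):
    """Fraction of G and C bases in a sequence (0.0 for an empty one)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    demo = ">seq1 demo record\nATGC\nGGCC\n>seq2\nAATT\n"
    for record_name, sequence in parse_fasta(demo).items():
        print(record_name, len(sequence), gc_content(sequence))
```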
13