Download Hong - Gene Ontology Consortium

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Pharmacogenomics wikipedia , lookup

Gene desert wikipedia , lookup

Non-coding DNA wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Metabolic network modelling wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Oncogenomics wikipedia , lookup

Human genome wikipedia , lookup

Quantitative trait locus wikipedia , lookup

Gene expression programming wikipedia , lookup

Polycomb Group Proteins and Cancer wikipedia , lookup

Essential gene wikipedia , lookup

History of genetic engineering wikipedia , lookup

Public health genomics wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Genome editing wikipedia , lookup

Metagenomics wikipedia , lookup

Genomics wikipedia , lookup

Microevolution wikipedia , lookup

Genomic imprinting wikipedia , lookup

Pathogenomics wikipedia , lookup

Designer baby wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Gene wikipedia , lookup

Ridge (biology) wikipedia , lookup

Genome (book) wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

RNA-Seq wikipedia , lookup

Genome evolution wikipedia , lookup

Minimal genome wikipedia , lookup

Gene expression profiling wikipedia , lookup

Transcript
Manually curated and computationally
predicted GO annotations at the
Saccharomyces Genome Database
http://www.yeastgenome.org/
Eurie L. Hong, Ph.D.
Department of Genetics • Stanford University School of Medicine
Scientific community
Integrated data
Analysis tools
Data from
traditional
experiments
Data from high
through-put
experiments
Genome
sequence
CHS6/YJL099W Locus Summary Page
Nomenclature
Summary of
published data
Curated data
from
published
literature
Sequence
Information
Links to other
databases
Links to SGD tools
and other databases
Data from
high throughput
experiments
Accessing the data via files
ftp://ftp.yeastgenome.org/yeast/
Display of GO Annotations
Status of GO Annotations at SGD
All protein and RNA gene products have been annotated with
GO terms
All GO annotations are manually curated from literature (no IEA)
Genes without published characterization data
Molecular Function
2112 genes (33.6% of all genes)
Biological Process
1448 genes (23.0% of all genes)
Cellular Component
864 genes (13.7% of all genes)
from Genome Snapshot 8/23/2006
Sources of Computationally Predicted
GO Annotations
1. InterPro domain matches in S. cerevisiae proteins
source: GOA project
2. Integrated analysis of multiple datasets
source: publications, external databases
CHS6/YJL099W Locus Summary Page
Identifying Types of GO Annotations
CHS6/YJL099W GO Annotation Page
Core GO Annotations
GO Annotations from
Large Scale Experiments
Computationally Predicted
GO Annotations
{
{
{
Changes to GO Term Finder
Current functionality
{
{
Specify background set
Refine annotations used
by annotation source or
evidence codes
{
Improving GO Annotations
Computationally predicted GO annotations
Manually curated GO annotations
1. Computational predictions may indicate publications that were
overlooked
2. Review inconsistencies between computationally predicted and
manually curated GO annotations to improve mappings and manually
curated annotations
3. Review inconsistencies between computationally predicted and
manually curated GO annotations to improve ontology
Additional Annotations Using Interpro2GO
Information added to genes with
no published characterization data
Molecular Function
468 genes
Biological Process
316 genes
Cellular Component
207 genes
from gene_association.goa_uniprot 7/2006
Preliminary Comparison:
Cellular Component Annotations
Interpro2go annotation
is ancestor of curated
annotation
43%
Shared parent
is root term
2%
Other
38%
Interpro2go
annotation for
an unknown
4%
Interpro2go
annotation matches
curated annotation
15%
5946 IEA
9059 IC+IDA+IEP+IGI+IMP+IPI+ISS+NAS+RCA+TAS
Shared parent
is child of root
term
18%
Other shared
parent term
18%
Summary
1. Currently, all GO annotations for S. cerevisiae gene
products are manually curated from literature
2. SGD will incorporate computationally predicted GO
annotations that will provide additional information for a
gene product’s role in biology
3. Computationally predicted GO annotations will be used
to refine and improve manually curated GO annotations
at SGD
[email protected]