Download - Cal State LA - Instructional Web Server

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Epigenetics of neurodegenerative diseases wikipedia , lookup

Copy-number variation wikipedia , lookup

Saethre–Chotzen syndrome wikipedia , lookup

Metagenomics wikipedia , lookup

Genetic engineering wikipedia , lookup

Epigenetics of diabetes Type 2 wikipedia , lookup

Neuronal ceroid lipofuscinosis wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Gene wikipedia , lookup

Pathogenomics wikipedia , lookup

Public health genomics wikipedia , lookup

History of genetic engineering wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Gene therapy of the human retina wikipedia , lookup

The Selfish Gene wikipedia , lookup

Genome (book) wikipedia , lookup

Genome editing wikipedia , lookup

Gene therapy wikipedia , lookup

Genome evolution wikipedia , lookup

Gene desert wikipedia , lookup

Gene expression profiling wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Gene expression programming wikipedia , lookup

Helitron (biology) wikipedia , lookup

Gene nomenclature wikipedia , lookup

RNA-Seq wikipedia , lookup

Microevolution wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Designer baby wikipedia , lookup

Transcript
The Wold Lab
BioHub
Cory Tobin
Collaborators
Brandon King
Joe Roden
Diane Trout
Dr. Barbara
Goal
•
Standardize the relationship between biological data
•
Integrate all of the data seamlessly
•
Provide novel methods to search for and analyze data
Adapted from http://woldlab.caltech.edu/biohub/
My Contribution
Implement a database for homology data
Background
Species A
Gene
Gene
Paralogs
The more general
term is “homology”
Orthologs
Species B
Gene
Requirements
•
Be more accurate and flexible than HomoloGene
•
Work in real time
•
Make sense of HomoloGene’s misleading data
Rationale
BioHub
HomoloGene
Gene
Gene
Gene
Gene
Gene
Gene
Gene
Gene
Gene
Gene
They are similar
They are related like this
Rationale Continued
Seq A
Seq B
Human Genome
BioHub
will BLAST
A BLAST
against
HomoloGene
wouldseq
BLAST
also
seq
mouse,
find
seq
C,
then
BLAST C
A against
seq
B against
mouse
mouse
and
and
determine
back
against
human
to an
see
that seq
detrmine
Cthat
is an
seq
ortholog
C is
of ifseq
there
areof
any
matches. It
A.
ortholog
seqbetter
B.
will find seq B to be better.
Seq C
Mouse Genome
Methods
•
Design data relationships that make sense biologically
•
Generate the low-level database interaction code
•
Parse and load HomoloGene’s data into our database
•
Write biologically useful functions
•
Create a web-based interface for easy use
Materials
•
ArgoUML – Design Aid
•
Pymerase – Design Implementation
•
PostgreSQL – Database
•
HomoloGene – Data Source
•
Python – Programming Language
Current State
•
Design data relationships that make sense biologically
•
Generate the low-level database interaction code
•
Parse and load HomoloGene’s data into our database
•
Write biologically useful functions
•
Create a web-based interface for easy use
Example Usage
Sequence of Interest
…GGATACAAAATTCCTC…
Are there any known genes in
this sequence?
acetyl - coenzyme A
dehydrogenase ( Human )
(cont.)
acetyl - coenzyme A
dehydrogenase ( Human )
Are there any homologs?
Mouse
Rat
Mosquito
Fruit fly
Nematode
(cont.)
How are those genes related?
Where do you
want to go?
More Info
BioHub
woldlab.caltech.edu / biohub
HomoloGene
www.ncbi.nlm.nih.gov
Python
python.org
Pymerase
pymerase.sf.net
PostgreSQL
postgresql.org