Bioinformatics: One Minute and One Hour at a Time
Laurie J. Heyer
L.R. King Asst. Professor of Mathematics
Davidson College
[email protected]
What is Bioinformatics?
[Diagram: Bioinformatics at the overlap of Computer Science, Mathematics, and Biology]
Genomics, Proteomics and Systems Biology
• Primary audience
– Junior bio majors
• Prerequisites
– Bioinformatics and intro molecular biology
or
– One of several 300-level biology courses
• Course home page:
– http://www.bio.davidson.edu/genomics
• “Math Minutes”
• Taught by A. Malcolm Campbell (Biology)
Sample Topic: DNA Microarrays
Plotting Expression Data
• One highlighted gene is induced 16-fold
• One highlighted gene is repressed 16-fold
• But induction looks much more dramatic
Log Transformation
• Calculate log2 of each ratio
• Ratio of 16 becomes value of 4
• Ratio of 0.0625 (1/16) becomes value of –4
• Induction and repression look equal, but with opposite sign
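The transformation can be checked with a few lines of Python. This is a minimal sketch; the function name `log2_ratios` is chosen here for illustration and is not from the course materials.

```python
import math

def log2_ratios(ratios):
    """Return log2 of each expression ratio (every ratio must be positive)."""
    return [math.log2(r) for r in ratios]

# 16-fold induction, no change, and 16-fold repression
print(log2_ratios([16.0, 1.0, 1 / 16]))  # [4.0, 0.0, -4.0]
```

After the transform, a 16-fold induction and a 16-fold repression sit the same distance from zero.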
Hierarchical Clustering
• Join two most similar genes
• Join next two most similar “objects” (genes or clusters of genes)
• Distance from one gene to a set of genes is minimum of all distances from the gene to the individual members (Single Linkage)
• Repeat until all genes have been joined
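The joining steps above can be sketched as a naive agglomerative loop. The slide does not say which distance is used between expression profiles; Euclidean distance is an assumption here, and the names `euclidean` and `single_linkage` are illustrative only.

```python
import math

def euclidean(a, b):
    # Assumed distance between two expression profiles (not specified on the slide).
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def single_linkage(profiles):
    """Naive single-linkage clustering.

    profiles: dict mapping gene name -> list of expression values.
    Returns the merges in order as (cluster_a, cluster_b, distance),
    where each cluster is a tuple of gene names.
    """
    clusters = [(name,) for name in profiles]
    merges = []
    while len(clusters) > 1:
        best = None  # (distance, index_i, index_j)
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                # Single linkage: the minimum distance over all member pairs.
                d = min(euclidean(profiles[g], profiles[h])
                        for g in clusters[i] for h in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        merges.append((clusters[i], clusters[j], d))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return merges

genes = {"A": [0, 0], "B": [0, 1], "C": [5, 5], "D": [5, 7]}
for a, b, d in single_linkage(genes):
    print(a, b, round(d, 2))
```

This version recomputes every pairwise distance on each pass, which is fine for a classroom-sized example but too slow for a full microarray.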
Genome Consortium for Active Teaching (GCAT)
http://www.bio.davidson.edu/GCAT
High School Chips
See Kathy Gabric’s page:
http://cstaff.hinsdale86.org/~kgabric/honorscalendar.html
Bioinformatics Course
• Prerequisites
– Genomics or experience with modeling and “algorithmic thinking”
• Goals:
– To understand and apply various algorithms and statistical tests for analyzing DNA, RNA and protein sequences, and DNA microarray data.
– To gain practical experience with Perl, a programming language widely used in molecular biology, web design, and text processing.
• Course home page
– http://gcat.davidson.edu/bioinformatics/bioinf.html
Bioinformatics Topics
• Determining sequences
• Comparing sequences
• Finding genes
• Predicting structure
• Comparing genomes
• Inferring phylogenies
• Analyzing images
• Clustering gene expression patterns
• Designing experiments
Bioinformatics Projects
Image Segmentation
• Locate spot (signal) pixels
• Measure intensity of signal and background in each channel
• Compute ratio
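As a rough sketch of the last two steps, suppose the spot pixels have already been located and are given as a boolean mask. The slide does not say how signal and background are combined; subtracting the mean background from the mean signal in each channel is an assumption made here, and the function names are hypothetical.

```python
def mean(values):
    return sum(values) / len(values)

def channel_signal(image, mask):
    """Mean spot intensity minus mean background intensity for one channel.

    image: 2D list of pixel intensities.
    mask:  2D list of booleans, True where the pixel belongs to the spot.
    Background subtraction is an assumption, not taken from the slide.
    """
    signal, background = [], []
    for row, mask_row in zip(image, mask):
        for pixel, is_spot in zip(row, mask_row):
            (signal if is_spot else background).append(pixel)
    return mean(signal) - mean(background)

def spot_ratio(channel_1, channel_2, mask):
    """Ratio of background-corrected signal between the two channels."""
    return channel_signal(channel_1, mask) / channel_signal(channel_2, mask)

mask = [[False, False, False],
        [False, True,  False],
        [False, False, False]]
channel_1 = [[10, 10, 10], [10, 170, 10], [10, 10, 10]]
channel_2 = [[10, 10, 10], [10, 20, 10], [10, 10, 10]]
print(spot_ratio(channel_1, channel_2, mask))  # 16.0
```

A real pipeline would also need to handle spots whose corrected signal in the second channel is zero or negative, which this sketch ignores.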
Adaptive Circle Algorithm
• Specify threshold % between darkest and lightest pixel
• Pixels above threshold are “on”, others are “off”
• Combine two binary images
– if pixel is “on” in either image, it is “on” in combined image
• Search for radius and center that maximize percent of “on” pixels
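The steps above can be sketched with a brute-force search. Several details are assumptions, since the slide leaves them open: the score is the fraction of “on” pixels inside the circle, candidate radii are supplied by the caller, centers are restricted to pixel positions, and ties go to the larger radius. All function names are illustrative.

```python
def threshold_image(image, percent):
    """Binary image: True where a pixel lies above `percent` of the
    range between the darkest and lightest pixel."""
    flat = [p for row in image for p in row]
    lo, hi = min(flat), max(flat)
    cutoff = lo + (hi - lo) * percent / 100.0
    return [[p > cutoff for p in row] for row in image]

def combine(binary_a, binary_b):
    """A pixel is "on" in the result if it is "on" in either input."""
    return [[a or b for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(binary_a, binary_b)]

def best_circle(binary, radii):
    """Exhaustive search; returns (score, radius, center_row, center_col)."""
    rows, cols = len(binary), len(binary[0])
    best = None
    for radius in radii:
        for cr in range(rows):
            for cc in range(cols):
                inside = [binary[r][c]
                          for r in range(rows) for c in range(cols)
                          if (r - cr) ** 2 + (c - cc) ** 2 <= radius ** 2]
                score = sum(inside) / len(inside)
                candidate = (score, radius, cr, cc)
                # Higher score wins; on equal score, the larger radius wins.
                if best is None or candidate[:2] > best[:2]:
                    best = candidate
    return best

channel_a = [[0] * 5 for _ in range(5)]
channel_b = [[0] * 5 for _ in range(5)]
for r, c in [(1, 2), (2, 2), (3, 2)]:
    channel_a[r][c] = 100
for r, c in [(2, 1), (2, 2), (2, 3)]:
    channel_b[r][c] = 100

combined = combine(threshold_image(channel_a, 50), threshold_image(channel_b, 50))
print(best_circle(combined, radii=[1, 2]))
```

The exhaustive search is easy to follow but slow on real spot images; it is meant only to make the steps concrete.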
Adaptive Circle V2 (Dapple)
• Compute 4-neighbor second-difference approximation to the Laplacian
• Find sharply defined “upper” edge by convolving Laplacian with annular filters
From “Dapple: Improved Techniques for Finding Spots on DNA Microarrays” UW CSE Technical Report UWTR-2000-08-05
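The first step can be illustrated directly: the 4-neighbor second-difference approximation adds the four neighbors of a pixel and subtracts four times the pixel itself. The sketch below covers only that step, not the annular filtering. Leaving border pixels at zero is an assumption made here, not a detail from the Dapple report.

```python
def laplacian_4(image):
    """4-neighbor second-difference approximation to the Laplacian.

    image: 2D list of pixel intensities.
    Border pixels are left at 0 (an assumption of this sketch).
    """
    rows, cols = len(image), len(image[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            out[r][c] = (image[r - 1][c] + image[r + 1][c]
                         + image[r][c - 1] + image[r][c + 1]
                         - 4 * image[r][c])
    return out

# A single bright pixel: strongly negative at the peak, positive beside it.
spot = [[0] * 5 for _ in range(5)]
spot[2][2] = 1
for row in laplacian_4(spot):
    print(row)
```

The sign change around the bright pixel is the kind of edge response that an edge-finding filter can then look for.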
Quality Clustering: QT Clust
1. Each gene builds a supervised cluster
2. Gene with “best” list, and genes in its list, become next cluster
3. Remove these genes from consideration, and repeat
4. Stop when all genes are clustered, or largest cluster is smaller than user-specified threshold
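The four steps can be sketched as follows. The slide does not define “best” or the quality criterion, so this sketch assumes that each gene grows its list by repeatedly adding the gene that keeps the cluster diameter (largest pairwise distance) smallest, stopping at a caller-supplied `max_diameter`, and that the “best” list is the longest one. The distance function and all names are supplied for illustration.

```python
def farthest(gene, members, distance):
    """Largest distance from `gene` to any current cluster member."""
    return max(distance(gene, m) for m in members)

def qt_clust(genes, distance, max_diameter, min_size=1):
    """Sketch of quality-threshold clustering.

    Returns (clusters, unclustered). Stops when every gene is clustered
    or the largest candidate cluster has fewer than `min_size` genes.
    """
    remaining = list(genes)
    clusters = []
    while remaining:
        best = []
        for seed in remaining:
            # Step 1: each gene builds its own candidate cluster.
            candidate = [seed]
            others = [g for g in remaining if g != seed]
            while others:
                nxt = min(others, key=lambda g: farthest(g, candidate, distance))
                if farthest(nxt, candidate, distance) > max_diameter:
                    break
                candidate.append(nxt)
                others.remove(nxt)
            # Step 2: keep the longest list seen so far (assumed meaning of "best").
            if len(candidate) > len(best):
                best = candidate
        if len(best) < min_size:
            break
        clusters.append(best)
        # Step 3: remove the clustered genes and repeat.
        remaining = [g for g in remaining if g not in best]
    return clusters, remaining

values = {"a": 0.0, "b": 1.0, "c": 2.0, "d": 10.0, "e": 11.0, "f": 50.0}
dist = lambda g, h: abs(values[g] - values[h])
print(qt_clust(list(values), dist, max_diameter=2.5, min_size=2))
```

With these toy values, gene f ends up unclustered because no other gene lies within the diameter limit.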
Why teach Bioinformatics?
• Critical thinking
• Interdisciplinary
• Integrative
– Modeling
– Data analysis
– Computational science
– Discrete math
– Probability and statistics
• Student research opportunities