Bioinformatics: One Minute and One Hour at a Time

Laurie J. Heyer
L.R. King Asst. Professor of Mathematics
Davidson College
[email protected]

What is Bioinformatics?
[Diagram labels: Computer Science, Mathematics, Biology, Bioinformatics]

Genomics, Proteomics and Systems Biology
• Primary audience
  – Junior bio majors
• Prerequisites
  – Bioinformatics and intro molecular biology, or
  – One of several 300-level biology courses
• Course home page:
  – http://www.bio.davidson.edu/genomics
• “Math Minutes”
• Taught by A. Malcolm Campbell (Biology)

Sample Topic: DNA Microarrays

Plotting Expression Data
• One highlighted gene is induced 16-fold
• One highlighted gene is repressed 16-fold
• But induction looks much more dramatic

Log Transformation
• Calculate log2 of each ratio
• Ratio of 16 becomes value of 4
• Ratio of 0.0625 (1/16) becomes value of –4
• Induction and repression look equal in magnitude, but opposite in sign

Hierarchical Clustering
• Join the two most similar genes
• Join the next two most similar “objects” (genes or clusters of genes)
• The distance from a gene to a set of genes is the minimum of the distances from that gene to the individual members (Single Linkage)
• Repeat until all genes have been joined

Genome Consortium for Active Teaching (GCAT)
http://www.bio.davidson.edu/GCAT

High School Chips
See Kathy Gabric’s page: http://cstaff.hinsdale86.org/~kgabric/honorscalendar.html

Bioinformatics Course
• Prerequisites
  – Genomics or experience with modeling and “algorithmic thinking”
• Goals:
  – To understand and apply various algorithms and statistical tests for analyzing DNA, RNA and protein sequences, and DNA microarray data.
  – To gain practical experience with Perl, a programming language widely used in molecular biology, web design, and text processing.
• Course home page
  – http://gcat.davidson.edu/bioinformatics/bioinf.html

Bioinformatics Topics
• Determining sequences
• Comparing sequences
• Finding genes
• Predicting structure
• Comparing genomes
• Inferring phylogenies
• Analyzing images
• Clustering gene expression patterns
• Designing experiments

Bioinformatics Projects

Image Segmentation
• Locate spot (signal) pixels
• Measure intensity of signal and background in each channel
• Compute ratio

Adaptive Circle Algorithm
• Specify threshold % between darkest and lightest pixel
• Pixels above threshold are “on”, others are “off”
• Combine two binary images: if a pixel is “on” in either image, it is “on” in the combined image
• Search for radius and center that maximize percent of “on” pixels

Adaptive Circle V2 (Dapple)
• Compute 4-neighbor second-difference approximation to the Laplacian
• Find sharply defined “upper” edge by convolving Laplacian with annular filters
From “Dapple: Improved Techniques for Finding Spots on DNA Microarrays”, UW CSE Technical Report UWTR-2000-08-05

Quality Clustering: QT Clust
1. Each gene builds a supervised cluster
2. Gene with “best” list, and genes in its list, becomes next cluster
3. Remove these genes from consideration, and repeat
4. Stop when all genes are clustered, or largest cluster is smaller than user-specified threshold

Why teach Bioinformatics?
• Critical thinking
• Interdisciplinary
• Integrative
  – Modeling
  – Data analysis
  – Computational science
  – Discrete math
  – Probability and statistics
• Student research opportunities
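
Worked sketch: Log Transformation

The arithmetic on the Log Transformation slide can be checked in a few lines. This is a minimal Python sketch, not course material; the function name log2_ratio is hypothetical, and Python is used here only for illustration (the course itself uses Perl).

```python
import math


def log2_ratio(ratio: float) -> float:
    """Return the base-2 logarithm of an expression ratio."""
    if ratio <= 0:
        raise ValueError("expression ratios must be positive")
    return math.log2(ratio)


# A 16-fold induction and a 16-fold repression, as on the slide.
induced = log2_ratio(16)
repressed = log2_ratio(1 / 16)

# After the transform the two values have equal magnitude and opposite sign.
print(induced, repressed)
```

On the raw scale the two genes sit at 16 and 0.0625, so the induced gene dominates a plot; on the log2 scale they are symmetric about zero.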
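
Worked sketch: Hierarchical Clustering (Single Linkage)

The four steps on the Hierarchical Clustering slide can be sketched as a naive loop. The slides do not say which distance measure is used, so Euclidean distance between expression profiles is an assumption here, as are the names euclidean and single_linkage and the toy gene names in the usage note. The sketch favors readability over speed.

```python
import math
from itertools import combinations


def euclidean(a, b):
    """Euclidean distance between two expression profiles (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def single_linkage(profiles):
    """Agglomerate genes until one cluster remains.

    profiles maps a gene name to a list of expression values.
    Returns the merges in order as (members_a, members_b, distance).
    """
    clusters = [frozenset([name]) for name in profiles]
    merges = []
    while len(clusters) > 1:
        best = None
        for a, b in combinations(clusters, 2):
            # Single linkage: distance between two objects is the minimum
            # distance over all pairs of their individual members.
            d = min(euclidean(profiles[g], profiles[h]) for g in a for h in b)
            if best is None or d < best[0]:
                best = (d, a, b)
        d, a, b = best
        # Replace the two joined objects with their union.
        clusters = [c for c in clusters if c not in (a, b)]
        clusters.append(a | b)
        merges.append((sorted(a), sorted(b), d))
    return merges
```

For example, with profiles {"g1": [0, 0], "g2": [0, 1], "g3": [5, 5], "g4": [5, 7]}, the first merge joins g1 and g2, the second joins g3 and g4, and the last joins the two pairs at the distance between g2 and g3, their closest members.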
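
Worked sketch: QT Clust

The four QT Clust steps can be sketched as follows. Several details are assumptions, since the slide does not spell them out: each gene's candidate list is grown greedily while the cluster diameter stays within a limit, the “best” list is taken to be the longest one, and distance is Euclidean. The names qt_clust, candidate_cluster, max_diameter and min_size are hypothetical.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two expression profiles (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def farthest_member(cluster, gene, profiles):
    """Largest distance from gene to any current member of cluster."""
    return max(euclidean(profiles[gene], profiles[m]) for m in cluster)


def candidate_cluster(seed, remaining, profiles, max_diameter):
    """Step 1: grow a candidate cluster around one seed gene."""
    cluster, diameter = [seed], 0.0
    pool = set(remaining) - {seed}
    while pool:
        # Choose the gene that enlarges the diameter least (ties: by name).
        gene = min(sorted(pool), key=lambda g: farthest_member(cluster, g, profiles))
        new_diameter = max(diameter, farthest_member(cluster, gene, profiles))
        if new_diameter > max_diameter:
            break
        cluster.append(gene)
        diameter = new_diameter
        pool.remove(gene)
    return cluster


def qt_clust(profiles, max_diameter, min_size=1):
    """Return (clusters, unclustered gene names)."""
    remaining = sorted(profiles)
    clusters = []
    while remaining:
        # Step 2: the seed with the longest list (assumed "best") wins.
        best = max(
            (candidate_cluster(s, remaining, profiles, max_diameter) for s in remaining),
            key=len,
        )
        # Step 4: stop when the largest cluster is below the threshold.
        if len(best) < min_size:
            break
        clusters.append(sorted(best))
        # Step 3: remove the clustered genes and repeat.
        remaining = [g for g in remaining if g not in best]
    return clusters, remaining
```

Unlike hierarchical clustering, this procedure may leave some genes unclustered; they are returned as the second element of the result.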
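
Worked sketch: Adaptive Circle Algorithm

The bullets on the Adaptive Circle Algorithm slide can be sketched on small images stored as lists of rows. This is an illustration under stated assumptions, not the published method: the threshold is taken as a fraction of the way from the darkest to the lightest pixel, the search is a brute-force scan over integer centers and a caller-supplied list of radii, and ties in the “on” percentage are broken in favor of the larger radius (otherwise a tiny circle would always win). The names binarize, combine, fraction_on and best_circle are hypothetical.

```python
def binarize(image, fraction):
    """Turn pixels brighter than the cutoff 'on' (True), others 'off'."""
    flat = [p for row in image for p in row]
    darkest, lightest = min(flat), max(flat)
    cutoff = darkest + fraction * (lightest - darkest)
    return [[p > cutoff for p in row] for row in image]


def combine(mask_a, mask_b):
    """A pixel is 'on' in the result if it is 'on' in either mask."""
    return [[a or b for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(mask_a, mask_b)]


def fraction_on(mask, cy, cx, r):
    """Fraction of pixels inside the circle (cy, cx, r) that are 'on'."""
    inside = [
        mask[y][x]
        for y in range(len(mask))
        for x in range(len(mask[0]))
        if (y - cy) ** 2 + (x - cx) ** 2 <= r * r
    ]
    return sum(inside) / len(inside)


def best_circle(mask, radii):
    """Return (score, radius, center_row, center_col) of the best circle.

    Only circles that fit entirely inside the image are considered.
    Assumed tie-break: equal scores prefer the larger radius.
    """
    best = None
    for r in radii:
        for cy in range(r, len(mask) - r):
            for cx in range(r, len(mask[0]) - r):
                score = fraction_on(mask, cy, cx, r)
                if best is None or (score, r) > (best[0], best[1]):
                    best = (score, r, cy, cx)
    return best
```

In use, each channel would be binarized separately, the two masks combined, and best_circle called on the combined mask with a plausible range of spot radii.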