Download What is Bioinformatics I?

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Gene desert wikipedia , lookup

Gene nomenclature wikipedia , lookup

History of genetic engineering wikipedia , lookup

Transposable element wikipedia , lookup

Public health genomics wikipedia , lookup

NEDD9 wikipedia , lookup

Genome (book) wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup

Microevolution wikipedia , lookup

Minimal genome wikipedia , lookup

Gene wikipedia , lookup

Designer baby wikipedia , lookup

Point mutation wikipedia , lookup

Non-coding DNA wikipedia , lookup

Gene expression profiling wikipedia , lookup

Human Genome Project wikipedia , lookup

Genomic library wikipedia , lookup

Multiple sequence alignment wikipedia , lookup

Human genome wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

RNA-Seq wikipedia , lookup

Sequence alignment wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Genome evolution wikipedia , lookup

Computational phylogenetics wikipedia , lookup

Helitron (biology) wikipedia , lookup

Metagenomics wikipedia , lookup

Pathogenomics wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Genome editing wikipedia , lookup

Genomics wikipedia , lookup

Transcript
BIO/STAT 597F, CSE 598F
Bioinformatics I: basic analysis of DNA and protein sequences
Fall semester, 2004
Tues/Thurs 2:30-3:45, 111 Boucke Building
3 credits
Schedule numbers: BIO 597F - 3777311; CSE 598F - 455170; STAT 597F - 447556
Web site: http://www.bx.psu.edu/courses/bx-fall04/
Prerequisites: Graduate standing or consent of instructors. Familiarity with molecular biology or computer
methods is not assumed (though a willingness to learn a little about them is required).
Topics: (estimated time for discussion)
•
Discussion of potentential class projects. Introduction to molecular biology for the non-biologist. (1
week)
•
Survey of techniques used to analyze mammalian genome sequences. (0.5 week)
•
World Wide Web resources for identifying genes in a genomic sequence and for predicting a gene’s
function. Students will use RepeatMasker, GenScan, Blast, ClustalW, Pfam or Interpro, and PipMaker. Comparative analysis of the human, mouse, rat, and chicken genome sequences will serve as
a focal point for illustrating concepts and methods. (1.5 weeks)
•
Search methods for sequence databases. The Blast family of programs, including psi-blast and phiblast. Introduction to dynamic programming and determination of substitution scores. (1.5 weeks)
•
Multiple sequence alignment, including ClustalW. (0.5 week)
•
Phylogenetic analysis of molecular sequences with an emphasis on methods of phylogenetic inference and hypothesis testing. Gene and genome history, gene family evolution, inference of ancestral
proteins, and phylogenetic analysis as a predictive tool. (3 weeks)
•
Introduction to hidden Markov models. (1 week)
•
Methods to search databases of protein motifs or domains, particularly Pfam. (0.5 week)
•
Ab initio gene prediction, i.e., predicting the location of genes within a given genomic sequence
using only intrinsic sequence properties. Methods employed by GenScan (currently the best program
of this type) will be studied. (1.0 weeks)
•
Students’ in-class reports on their term projects. (3 weeks).
The course will cover neither protein structure prediction nor ‘‘post-sequencing’’ bioinformatics, such as
analysis of expression data (e.g., from micro-arrays), proteomics, and analysis of regulatory networks. Protein structure prediction is covered in courses offered by the Chemistry Department, and post-sequencing
bioinformatics is the topic of Bioinformatics II, which is taught most Spring Semesters (but not in 2005).
Grading will be based on (1) a term project of the student’s choice (60%), (2) an exam over the lecture
material (30%), and (3) a few homework assignments, e.g., using World Wide Web resources for analyzing
genomic sequences (10%).
Required text:
Bioinformatics: Sequence and Genome Analysis (Second Edition)
David W. Mount
Cold Spring Harbor Laboratory Press, 2004
ISBN 0-87969-687-7 (paperback)
For more information:
Webb Miller, 506B Wartik Lab, 865-4551, [email protected]
Claude dePamphilis, 212 Mueller Lab (soon to change), 863-6412, [email protected]