Survey
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
Overview of Bioinformatics 1 Module Denis Manley. Contact Details • Lecturer Name: Denis Manley • Room number: KE-1-013a • Email : [email protected] • Website: www.comp.dit.ie/dmanley • Phone: 01 402 4949 What is bioinformatics • Bioinformatics is the use of computers and computational methods to analyse large sets of molecular biological data that is used for : – The investigation of “living organisms” and their evolution. – The discovery of genes, gene regulation; genetic networks and protein functionality, which can be used to understand: human disease; human development (conception to adulthood) etc . – the results of which can facilitate our understanding of diseases like cystic fibrosis; suggest therapies; and the development of cures such as drug development, viral therapy… Reading DNA novels: “bioinformatics” • Analysing large sets of data is “equivalent” to reading and understanding a book (similar to Computational linguistics): – The first step in reading a book involves looking at letters [ including spaces and punctuation] to determine the words. Bioinformatics is the reading of DNA (alphabet referred to by letters ATGC) and determining location of important regions of the sequence; e.g. genes, gene “expression” control regions…. Reading DNA novels: “bioinformatics” – The next step in reading involves determining if the words are nouns/verbs/adverbs etc… In general each language has rules: • “what are these rules” – Bioinformatics involves determining what the “important elements” correspond to: e.g. genes; gene promoters…. – However, clearly the rules to determine “genes” and other elements are no as well defined as in a natural language and more importantly are constantly being modified and updated . Reading DNA novels: “bioinformatics” – syntax: – The next step in determining the sequence of the words.; e.g. should it be“what are the rules of english grammar”; are “what the rules of grammar english“ – Bioinformatics involves determining the sequence of “important elements”; e.g. promoter are “upstream of genes and not the other way around. Reading DNA novels: “bioinformatics” – Symantics: – What does the set of words (sentence) mean. “what is your purpose?” what processes do humans use to interpret this sentence – Bioinformatics attempts to analyse the function of DNA/genetic sequences by: e.g. • comparing the sequences to sequences whose function is already known. • By converting the sequence into its equivalent “protein” and comparing it to known proteins • determining 3-D structure of proteins and looking for known structural components. Reading DNA novels: “bioinformatics” • Bioinformatics also focuses on the IT aspects of the discipline such as: – Setting up databases – Writing code to perform analysis – Determining and Utilisation of known computational techniques to improve analysis of the biological data. • Bioinformatics, covers a very large area but this particular module will focus on the “computational analysis of genetic systems” and will be referred to as Bioinformatics 1. Bioinformatics 1: module syllabus. • Part 1:Introduction to fundamental of genetic systems: • Principles of inheritance and evolutions: essential criteria for our evolution and existence. • Basic Molecular cell biology: DNA , Genes and Amino acids (proteins) . • The relationship between a gene and its physical manifestation (proteins); The central “dogma” of Genetics: DNA -> RNA>Proteins • Introduction to structural elements of genetic systems • Gene “expression” regulation Bioinformatics 1: module syllabus. • Part 2: Fundamentals of “ bioinformatics” PERL: a common scripting language used in the field of bioinformatics • Fundamentals of Perl: read/write, loops…. • Fundamental Perl data structures: “bioinformatics“ data files; dynamic arrays and hash tables. • Perl Pattern matching techniques used in bioinformatics: searching for a pattern (e.g. ATG); extract a pattern from a sequence; substitute one pattern for another (e.g. replace T with a U) • Create perl sub-routines and Perl modules and use them in other perl programs • Development of “basic” bioinformatics data sequences analytical tools using perl and core computational algorithms [these algorithms will be covered in the computational element of the module]. What is bioinformatics – Part 3: Introduction to online bioinformatics resources; • How and where to obtain “bioinformatics” DNA data sequences and data relevant to these sequences • Explanation of the different elements of these data sets “data annotation” or (meta data). • Fundamentals of common online DNA analytical tools (such as sequence alignment measurement ) What is bioinformatics • Part 3: – Computational bioinformatics techniques , including Artificial intelligent techniques, used to: • Principles of DNA pattern matching: global/local/multiple • Principles of Align DNA sequences: e.g. Pairwise alignment • Application of alignment principles using basic computational methods • Reconstruct genomes (large DNA sequences) using “shotgun” alignment techniques • Principles of searching for “matching” DNA (gene) sequences in large online databases. • How to utilise and interpret findings of DNA database searches: e.g. gene functionality and evolutionary gene markers. Assignment and exam • 1 Assignment (40%): – Developing an application to analyse “small” DNA data sequences – A report discussing the findings of the applications when applied to known DNA sequences • Exam: question 1 + 2 out of 4 other question (60%) – Question 1 compulsory: Perl programming . – Other questions related to other areas in the module. Proposed schedule • Week 2 to Week 6 (Wednesday 16:00 to 18:00): “Part 1 Fundamental of genetic systems.” • Week 2 to Week 6 (Monday 9:00 to 11:00) – Fundamentals of perl • Week 7 review week [submit assignment part 1] Proposed schedule • Week 8 to Week 12 (Wednesday 16:00 to 18:00): “Computational techniques and their application to bioinformatics” • Week 8 to Week 12 (Monday 9:00 to 11:00) – Online bioinformatics databases and analytical applications (approx 2 weeks). – Development of fundamental computational applications using perl (approx 3 weeks) • Week 10 submission of assignment part 2 • Week 13 review of course and sample exam paper Assignment content • Assignment 1: – A report on the analysis on the biological impact of developing “ a bioinformatics applications. – Development of the fundamental functionality of the application based on the findings of the report • Assignment 2: – Using the application from assignment 1: Analysis and development of the analytical component “computational analysis” of the application. – A report on the findings of applying the final application to a given dataset obtained from online bio-informatic databases.