Download lecture2

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

DNA nanotechnology wikipedia , lookup

Transcript
An Introduction to Bioinformatics Algorithms
www.bioalgorithms.info
Algorithms for Molecular Biology
CSCI 4314-001
Elizabeth White
[email protected]
CSCI 4314/5314, Algorithms for Molecular Biology
DNA revisited
•
•
•
•
Double helix
A, C, G, T “letters”
A::T = 1
G::C = 1
Image from http://en.wikipedia.org/wiki/DNA
CSCI 4314/5314, Algorithms for Molecular Biology
Why these ratios?
• Same size pairs
• Matching shapes
• Matching charges
Image from http://en.wikipedia.org/wiki/DNA
CSCI 4314/5314, Algorithms for Molecular Biology
Base pairs bond to give double helix
Image from http://en.wikipedia.org/wiki/DNA
CSCI 4314/5314, Algorithms for Molecular Biology
DNA is designed to be replicated
• Each of the 2 strands can specify the other one
• Result is 2 helices, each with one old strand and
one new
Image from
http://www.emc.maricopa.edu/faculty/farabee/biobk/BioBookDNAMOLGEN.html
CSCI 4314/5314, Algorithms for Molecular Biology
Replication bubbles (made of 2 forks)
Image from http://www.mun.ca/biochem/courses/3107/images/lambda_0017a.jpg
CSCI 4314/5314, Algorithms for Molecular Biology
DNA is designed to be transcribed
• Similar mechanism to replication
• Result is original helix + RNA strand
Image from
http://www.bioedonline.org/slides/slide01.cfm?q=%22transcription%22&dpg=11
CSCI 4314/5314, Algorithms for Molecular Biology
DNA transcription to RNA
Image from http://en.wikipedia.org/wiki/Transcription_%28genetics%29
CSCI 4314/5314, Algorithms for Molecular Biology
RNA molecule is processed further
• Except in bacteria, RNA molecule contains
• Introns: do not code for protein
• Exons: protein-coding regions
• Introns are spliced out of the RNA transcript
• Result is “messenger RNA”, mRNA
Image from http://en.wikipedia.org/wiki/Splicing_(genetics)
CSCI 4314/5314, Algorithms for Molecular Biology
RNA splicing can give many variants
Image from http://en.wikipedia.org/wiki/Alternative_splicing
CSCI 4314/5314, Algorithms for Molecular Biology
RNA specifies 3-base codons
Image from http://en.wikipedia.org/wiki/Genetic_code
CSCI 4314/5314, Algorithms for Molecular Biology
3-letter codons map to amino acids
Image from http://www.pangloss.com/seidel/Protocols/codon.html
CSCI 4314/5314, Algorithms for Molecular Biology
Transfer RNAs do the mapping
Image from
http://cropandsoil.oregonstate.edu/classes/css430/lecture%209-07/figure-09-10.JPG
CSCI 4314/5314, Algorithms for Molecular Biology
Translation: RNA to protein
• Messenger RNA is “read” by a ribosome
• Ribosome connects amino acids to build new
protein strand as it reads
Image from
http://www.bioedonline.org/slides/slide01.cfm?q=translation&dpg=8
CSCI 4314/5314, Algorithms for Molecular Biology
Ribosomes at work
Image from http://cellbio.utmb.edu/cellbio/ribosome.htm
CSCI 4314/5314, Algorithms for Molecular Biology
Overview: transcription/translation
Image from http://en.wikipedia.org/wiki/MRNA
CSCI 4314/5314, Algorithms for Molecular Biology
Protein structure
• Primary: amino acid sequence
• Secondary: short regions of protein form
• Alpha-helix
• Beta-sheet
• Tertiary: helices and sheets nestle together to
make a 3 dimensional shape
• Quaternary: 2 or more proteins associate
together
CSCI 4314/5314, Algorithms for Molecular Biology
Primary structure: amino acid sequence
Top image from http://en.wikipedia.org/wiki/Amino_acid
Bottom image from http://commons.wikimedia.org/wiki/Image:2-amino-acids.png
CSCI 4314/5314, Algorithms for Molecular Biology
Secondary structure: alpha-helix
Left image from http://commons.wikimedia.org/wiki/Image:AlphaHelixProtein_fr.jpg
Bottom image from http://www.srs.ac.uk/px/showcase/guide_files/helix4.jpg
CSCI 4314/5314, Algorithms for Molecular Biology
Secondary structure: beta-sheet
Left image from http://www.sciencecollege.co.uk/SC/biochemicals.html
Right image from http://cnx.org/content/m11614/latest/
CSCI 4314/5314, Algorithms for Molecular Biology
Tertiary structure: 3D shape
Image from http://www.colorado.edu/chem/people/wuttked.html
CSCI 4314/5314, Algorithms for Molecular Biology
Quaternary structure: assembly
Image from
http://www.man.poznan.pl/CBB/GIF/hcc-beta.jpg
CSCI 4314/5314, Algorithms for Molecular Biology
Structural proteins
Image from
http://www.wellesley.edu/Chemistry/chem227/structproteins/strctprt.htm
CSCI 4314/5314, Algorithms for Molecular Biology
DNA-binding proteins
• Recognize particular DNA sequences
• Regulate which genes are transcribed into
RNA
• Often act in pairs
Image from http://en.wikipedia.org/wiki/DNA
CSCI 4314/5314, Algorithms for Molecular Biology
Enzymatic proteins
• Catalyze chemical reactions
• Beta-lactamase enzyme inactivates penicillin
Image from http://www.nersc.gov/news/annual_reports/annrep97/bash.html
CSCI 4314/5314, Algorithms for Molecular Biology
Open problem: protein folding
• Amino acid sequence of protein determines
its shape
• Proteins seem to “fall” into correct shape
• In theory, we should be able to look at a
protein’s sequence and deduce its shape
• Unfortunately, this is not computationally possible
• In practice, we deduce shapes by similarity
• Proteins with similar amino acid sequences tend
to take similar shapes