Download ucla1 - WEHI Bioinformatics

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Metalloprotein wikipedia , lookup

Nucleosome wikipedia , lookup

Expression vector wikipedia , lookup

Gel electrophoresis of nucleic acids wikipedia , lookup

RNA wikipedia , lookup

Transformation (genetics) wikipedia , lookup

Gene regulatory network wikipedia , lookup

Promoter (genetics) wikipedia , lookup

Community fingerprinting wikipedia , lookup

Real-time polymerase chain reaction wikipedia , lookup

Protein–protein interaction wikipedia , lookup

Proteolysis wikipedia , lookup

Molecular cloning wikipedia , lookup

DNA supercoil wikipedia , lookup

RNA-Seq wikipedia , lookup

Genetic code wikipedia , lookup

Transcriptional regulation wikipedia , lookup

Genetic engineering wikipedia , lookup

Genomic library wikipedia , lookup

Gene wikipedia , lookup

Biochemistry wikipedia , lookup

Endogenous retrovirus wikipedia , lookup

Non-coding DNA wikipedia , lookup

Gene expression wikipedia , lookup

Two-hybrid screening wikipedia , lookup

Silencer (genetics) wikipedia , lookup

Biosynthesis wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Point mutation wikipedia , lookup

Deoxyribozyme wikipedia , lookup

Molecular evolution wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Nucleic acid analogue wikipedia , lookup

Transcript
Blueprint of Life
Based on Chapter 1 of
Post-genome Informatics
by Minoru Kanehisa,
Oxford University Press, 2000
Genome and gene
Entity
Genome
Definition
Unit of information transmission
Molecular Mechanisms
DNA replication
Gene
Unit of information expression
Transcription to RNA
Translation to protein
Nucleic acid and proteins
Macromolecule
Nucleic
acid
Protein
Backbone
Repeating unit
Length
Role
DNA
Phosphodiester bonds Deoxyribonu cleotides 103-10 8
(A, C, G, T)
RNA
Phosphodiester bonds Ribonucleo tides
(A, C, G, U)
103-10 5
103-10 4
102-10 3
Genome
Messenger
Gene product
Peptide bonds
102-10 3
Gene product
Amino acids
(A, C, D, E, F, G, H,
I, K, L, M, N, P, Q,
R, S, T, V, W, Y)
Genome
Nucleotide codes
A
Adenine
W
Weak (A or T)
G
Guanine
S
Strong (G or C)
C
Cytosine
M
Amino (A or C)
T
Thymine
K
Keto (G or T)
U
Uracil
B
Not A (G or C or T)
R
Purine ( A or G)
H
Not G (A or C or T)
Y
Pyrimidin e (C or T)
D
Not C (A or G or T)
N
Any nucleotide
V
Not T (A or G or C)
BASES
Adenine
Thymine
Guanine
Cytosine
Uracil
The Four Nucleosides of DNA
A nucleoside is a sugar, here deoxyribose, plus a base
dA = deoxyadenosine, etc.
dA
dG
PURINES
dC
dT
PYRIMIDINES
Base Pairing
The monomeric units
of nucleic acids are
called nucleotides.
A nucleotide is a phospate,
a sugar, and a purine or a
pyramidine base.
DNA:
B form
(7BNA)
Amino acid codes
Ala
Arg
Asn
Asp
Cys
Gln
Glu
Gly
His
Ile
Leu
Lys
Met
Phe
Pro
Ser
Thr
Trp
Tyr
Val
Asx
Glx
Sec
Unk
A
R
N
D
C
Q
E
G
H
I
L
K
M
F
P
S
T
W
Y
V
B
Z
U
X
Alanin e
Arginin e
Asparagin e
Aspartic acid
Cysteine
Glut amin e
Glut amic acid
Glycine
Histidin e
Isoleucine
Leucine
Lysine
Methionine
Phenylalanin e
Prolin e
Serin e
Threonine
Tryptophan
Tyrosine
Valin e
Asn or Asp
Gln or Glu
Selenocysteine
Unknown
Sperm whale myoglobin (1MBN)
Standard
Genetic
Code
Variation of genetic codes
T1
T2
T3
T4
T5
T6
T9
T10
T12
T13
T14
T15
CUU
CUC
CUA
CUG
Leu
Leu
Leu
Leu
-
Thr
Thr
Thr
Thr
-
-
-
-
-
Ser
-
-
-
AUU
AUC
AUA
AUG
Ile
Ile
Ile
Met
Met
-
Met
-
-
Met
-
-
-
-
-
Met
-
-
-
UAU
UAC
UAA
UAG
Tyr
Tyr
Stop
Stop
-
-
-
-
Gln
Gln
-
-
-
-
Tyr
-
Gln
AAU
AAC
AAA
AAG
Asn
Asn
Lys
Lys
-
-
-
-
-
Asn
-
-
-
-
Asn
-
-
UGU
UCG
UGA
UGG
Cys
Cys
Stop
Trp
Trp
-
Trp
-
Trp
-
Trp
-
-
Trp
-
Cys
-
-
Trp
-
Trp
-
-
AGU
AGC
AGA
AGG
Ser
Ser
Arg
Arg
Stop
Stop
-
-
Ser
Ser
-
Ser
Ser
-
-
Gly
Gly
Ser
Ser
-
T1: standard
T2: vert mt
T3: yeast mt
T4: other mt
T5: invert. mt
T6: cil. etc nuc.
T9: ech. mt
T10: eup. nuc.
T12:alt yeast nuc
T13: asc. mt
T14: flat. mt
T15: bleph. nuc.
The central dogma and its evolution
Transcription
Replication
DNA
Translation
RNA
Protein
Reversetranscription
(RNA World)
RNA
RNA
Schematic illustration of a eukaryotic cell
Schematic illustration of a plant cell
The dual flow of information in life:
the vertical flow by genetic information transmission, and
the horizontal flow by the genetic information expression
Development
Organism
Germ Cell Line
History of structure determination for nucleic acids and proteins
1950
Technology development
Structure determi nation
49 Edman degr adation
-heli x model
54 Isomorphou s replaceme nt
1960
53 DNA double heli x model
Insu li n p rim ary struc ture
60 Myog lobin tertiary structure
62 Restriction enzy me
65 tRNAAla prim ary struc ture
1970
72 DNA clon ing
73 tRNAPhe tertiary structure
75 DNA sequenc ing
77 X174 complete genom e
79 Z-DNA by s ingle crystal differentiation
1980
84 Puls e fi eld gel electrophoresis
85 Polymerase chain reaction
87 YAC vec tor
86 Protein structure by 2D NMR
88 Human Geno me Project
1990
93 DNA chip
95 H influenzae complete geno me
2000
DNA B-type (7BNA)
DNA A-type (140D)
DNA Z-type (2ZNA)
Polymorphic DNA tertiary structures
Yeast alanyl transfer RNA
Yeast phenylalanine tRNA (1TRA)
The information content of various species in terms of the number of nucleotides in the
genome. The complete genome sequences were determined in the years as designated.
The increase of the GenBank nucleotide sequence database is also shown together with
the release dates.
(Bit s)
232
109


(Nuc le otides)
Human
Mouse



Rice
Fruit F ly
Nematode (1998)


Arabidopsis
Budd ing yea st (1997)
224

Genbank 10/97

Gen Bank 9/92

Genbank 9/87

Gen Bank 10/82
107


Escherichia coli (1997)
Haemoph ilius influenzae (1995)

Cytomegalov ir us (1990)

phage (1982)

X174 phage (1977)
105
216
103
28
Human chromosomes: idiograms
Chromosomes
X-linked recessive disorder. The inheritance pattern is shown for
a recessive gene on the chromosome X, designated in bold.
Male
XY
(normal)
Male
XY
(normal)
Male
XY
(affected)
Female
XX
(normal)
Female
XX
(normal)
Female
XX
(normal)
Linkage analysis. The relative distance between the disease gene
(D) and a marker (M) can be estimated from the frequency
that both D and M are transmitted together to offspring.
Father
Mother
Marker M
Disease gene D
Children
M
D
M
M
D
D
Reductionistic and synthetic approaches in biology
Biological System
(Organism)
Reductionistic
Synthetic
Approach
Approach
(Experiments)
(Bioinformatics)
Building Blocks
(Genes/Molecules)
Basic principles in physics, chemistry and biology.
Principles Known?
Physics
Chemistry
Biology
Matter
Compound
Organism
Elementary
Particles
Elements
Genes
Yes
Yes
No
The Protein Folding Problem
Protein Folding Problem
(Sequence 3D Structure)
1 Protein folding is thermodynamically determined
(Anfinsen’s thermodynamic principle)
Protein + Environment
2. Protein folding is a reaction imvolving other
interacting molecules
(Principle of molecular interactions)
Protein + Chaperonins +….
Central Paradigm