Download Genomes 3/e

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Genomic imprinting wikipedia , lookup

Molecular cloning wikipedia , lookup

Mutation wikipedia , lookup

DNA sequencing wikipedia , lookup

Comparative genomic hybridization wikipedia , lookup

Deoxyribozyme wikipedia , lookup

Exome sequencing wikipedia , lookup

RNA-Seq wikipedia , lookup

Cre-Lox recombination wikipedia , lookup

Personalized medicine wikipedia , lookup

Molecular ecology wikipedia , lookup

Endogenous retrovirus wikipedia , lookup

Genetic engineering wikipedia , lookup

Non-coding DNA wikipedia , lookup

Community fingerprinting wikipedia , lookup

Whole genome sequencing wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Molecular evolution wikipedia , lookup

Genome evolution wikipedia , lookup

Transcript
Terry Brown
Genomes
Third Edition
Chapter 3:
Mapping Genomes
Copyright © Garland Science 2007
Chapter Objectives





Why mapping is important before sequencing of the
genome?
What are genetic and physical maps?
Learning the techniques to construct genetic map of a
genome
The role of linkage analysis in map construction
Learning the techniques to construct physical map of
a genome
Objectives of
Today’s Lecture

Importance of genome mapping before sequencing of
the genome?

Understanding the way genome are mapped

Genetic Mapping
• Linkage Analysis
Importance of genome
mapping before sequencing of
the genome?
Mapping Genomes






Genome sequencing methodology depends on
sequencing technology available.
Even the most sophisticated techniques available
now can sequence about 750bp in a single
experiment.
So we need to construct the sequence of long
DNA molecules from a series of shorter
sequences.
By breaking the molecule into fragments and then
determining the sequence of each one.
Then assemble the sequence by searching over
lap and then build up the master sequence.
This method is known as shotgun method.
Mapping Genomes


The Shotgun method works very well with the
small prokaryotic genomes.
The complexity of analysis increases
disproportionately when large genomes are
analyzed.
• The number of possible over laps increases when
number of fragments increases like:
•
2n2-2n for 2 fragments we there would be 4 possible
over laps while for 4 fragments where would be 24!
• The problems with reparative sequence
Mapping Genomes




So the shotgun sequencing technique can not be
applied solely in sequencing of larger eukaryotic
genomes.
Rather a genome map is generated first which provide
a road map for sequencing project and help in
assembly of genome sequence.
The genetic map provides gene position and other
distinctive features.
After having genome map the sequencing can proceed
in either of the two ways:
• Whole-genome shotgun method
• Clone contig method
Figure 3.3 Genomes 3 (© Garland Science 2007)
Understanding the way genome
are mapped
Genetic and Physical Maps





The Genetic Map:
Based on the use of genetic technique to construct maps
showing the position of genes and other sequences features
on a genome.
Obtained by linkage analysis using cross-breeding
experiments and family histories (pedigrees)
Physical Maps:
Uses molecular biology techniques to examine DNA
molecules directly in order to construct map showing the
positions of sequence features, including gene.
Genetic and Physical Maps

The Genetic Maping:

Based on Markers (Land marks)
• Important genes
• Biochemical markers
• DNA markers


Restriction Fragment Length Polymorphisms (RFLPs)

Simple Sequence Length Polymorphisms (SSLPs)

Single Nucleotide Polymorphisms (SNPs)
Linkage analysis of Markers (enable positioning of land
marks)
• Based on recombination frequencies of genes/markers
Genetic Maps





The genetic map shows the genetic markers as land map shows
distinctive features on landscape like rivers, road and building.
Genes were the first genetic markers
The early map constructed during early twentieth century were of
genes having two alternative forms i.e. alleles with different
phenotypes.
These genes were those with can be recognized by eye i.e. pea
pod color, height of plant, shape of wing of fruit fly. Etc
The limitation of observable characters and to map those
organisms which have few visible characters like microbes, The
need of other markers were soon realized:
• Biochemical markers
• ABO blood groups
• HLA typing system

(HLA-DRB1=290 alleles

HLA-B= >400 alleles)
DNA Markers for Genetic Mapping





Using gene as a marker is very useful but it has limitation.
Gene occupy very small portion/space of genome and are not
evenly distributed in the genome.
And also every gene not have allelic forms or can not
distinguishable easily.
Therefore the map based on gene is not detailed and
comprehensive.
So other features which were not a gene, were used as marker
and known as DNA markers
• Restriction fragment length polymorphisms (RFLPs)
• Simple sequence length polymorphisms (SSLPs)
• Single nucleotide polymorphisms (SNPs)
RFLP

Based on recognized restriction sequence length EcoRI (46= 4096)

Density : 105 RFLPs in a mammalian genome.

Simple Sequence Length
Polymorphisms (SSLPs)



SSLPS are array of repeat sequences that display length variations i.e.
different alleles containing different numbers of repeat units.
They can be multiallelic i.e. each SSLP can have a number of different
length variant.
They are Minisatellite or variable number of tandem repeats (VNTRs)
• Repeat unit is up to 25bp in length (not evenly distributed found at ends)

Microsatellite or simple tandem repeats (STRs)
• Repeat unit are 13bp or less (10-30 copies 6bp) (5x105 micro >6bp in human)
Single Nucleotide Polymorphisms
(SNPs)





The individual of a species have genome which differ at many
nucleotides positions. i.e. A in one person and G in other.
Some of these may give rise to RFLPs
There are about 4Millions SNPs in human genome (one SNP per 10kb of
eukaryotic genomes).
Theoretically each SNPs should have four alleles but most of SNPs are
biallelic ?????
The SNPs are scoured by
• oligonucleotide hybridization analysis

DNA chips

Solution hybridization techniques
Single Nucleotide Polymorphisms
(SNPs)
DNA microarrays and Chips


DNA microarrays: Target sequences are spotted onto a glass or nylon
membrane of 18x18mm into 80x80=6400 spots
DNA chips: DNA target sequence are synthesized by photolithography
onto a wafer of glass or silicon with high density about 300,000 oligos
per cm2
Hybridization with an
oligonucleotide with a terminal
mismatch
Oligonucleotide ligation assay (OLA)
& Amplification refractory mutation
system (ARMS)
Linkage analysis is the basis of
genetic mapping

Principles of Inheritance:
• Law of random segregation of alleles
• Law of independent segregation of pairs of alleles.

Genes on the same chromosomes should
inherent together

Linkage

Partial linkage
Figure 3.14 Genomes 3 (© Garland Science 2007)
Figure 3.15 Genomes 3 (© Garland Science 2007)
Figure 3.16 Genomes 3 (© Garland Science 2007)
From partial linkage to genetic
mapping



Morgan and his student Arthur Sturtevant proposed
the frequency of recombination is a measure of
distance between two genes.
This can be used to construct the order and map of
genes along the chromosome.
Limitations
• Recombination hot spot
• Double cross overs
Linkage analysis with different types
of organisms



Linkage analysis with species such as fruit flies and mice, WHERE
PLANNED BREEDING EXPRERIMENTS ARE POSSIBLE
Linkage analysis with humans, WHERE PLANNED BREEDING
EXPRIMENTS ARE NOT POSSIBLE
Linkage analysis with bacteria, WHERE MEIOSIS DOES NOT OCCURE.
Linkage analysis with planned
breeding experiments






First developed by Morgan and his colleague.
Based on recombination frequency calculation with the
breeding experiments. i.e. with fruit flies.
Possible for all eukaryotic systems including humans.
Ethical limitations narrows the scope of this technique for
humans???
Based on observable markers as well as DNA markers i.e.
RFLPs, SSLP and SNPs.
Use of DNA marker also makes possible the direct
observation of games i.e. sperms and ovum to calculate
recombination frequencies.
Linkage analysis by using human
pedigree




With humans its difficult to plan
breeding experiments therefore for
finding the recombination
frequencies we need to base on the
genotypes of available marriages
and their off springs in the
pedigree.
The direct observation of sperms is
also possible but is difficult.
Example of pedigree analysis of
association of gene and a maker M
with four alleles.
Association of linkage is
established with LOD score
(Logarithm of the odds)
Genetic analysis in Bacteria


Meiosis is not there in bacteria but they do exchange genetic material
which can recombine.
The recombination frequencies can be calculated for these
recombination if genetic difference (mutation) are associated with some
phenotypes.

Genetic exchange occurs by:

Conjugation
• Complete chromosome or part of it can be transferred by conjugation tube.
• Chromosomal DNA can be transferred by episome transfer integrated in
plasmid (up to 1MB)

Transduction
• By bacteriophage (50kb)

Transformation
• Direct from environment (less than 50kb)
Physical Mapping


Genetic map alone is rarely sufficient for directing the
sequencing phase of genome project.
Resolution:
• The resolution of genetic map depends on the number of crossovers
that have been scored.
• This is easy with bacteria and small eukaryotes which can be grown
in huge number so many crossovers can be observed enabling the
construction of highly detailed genetic maps.
• E. coli genome sequencing project in 1990s, the genetic map
contained 1400 markers (average 1maker per 3.3 kb).
• Saccharomyces cerevisiae project (1150 makers 1 per 10kb).
• But with humans large number of progeny can not be obtained so
few cross over can be studies.
• The genetic map is not finely resolved i.e. genes several kilobases
apart may appear at same position on the genetic map

Inaccuracy:
• Crossovers are not randome i.e. recombination hot spot etc
Physical Mapping


We need to SUPPLEMENT and RECHECK the Genetic
map with other techniques such as Physical mapping .
Physical mapping can be done with:
• Restriction mapping
• Fluorescent in situ hybridization (FISH)
• Sequence tagged sites (STS) mapping
Restriction mapping


We can utilize unique cutter for restriction
mapping.
We can map the restriction site in a DNA
up to 50 kb of size using
• double restriction
• and partial restriction.

The resolution limit can be enhanced by
using special Gel electrophoresis
techniques i.e.
• Orthogonal field alteration gel
electrophoresis (OFAGE).
Direct observation of DNA molecules
for restriction sites

We can directly observe the restriction site
on chromosomal DNA by:

Gel Stretching

Molecular combing
Fluorescent in situ hybridization
(FISH)




FISH enables the position of a marker
on chromosome or extended DNA to be
directly visualized.
By hybridization with fluorescent
probe.
The FISH can also be applied to
genome clone library, enabling the
mapping of clones to genome map.
FISH can be done with Mechanically
stretch chromosomes
• Centrifugation based 20X stretching
(resolution 200-300 kb)

Nonemetaphase chromosomes
• Interphase chromosomes (resolution 25kb)
• Fiber FISH (10kb) (Stretching of interphase
chromosome by gel stretching or
molecular combing)
Sequence tagged site mapping

To generate a detailed physical map of a large genome we need high
resolution and high throughput techniques.
• Restriction mapping can not be applied on large genomes
• FISH provide detailed mapping but takes much time and require huge
experimentations.




Presently the most powerful physical mapping technique of large genomes
is STS mapping.
STS is simply a short DNA sequence generally 100 to 500 bp and occurs
only once in the chromosome or genome.
STS mapping is performed by multiple STS or set of STS on
broken/fragmented chromosome/genome.
A collection of DNA fragments is madee by isolating a chromosome and
then breaking it into smaller pieces, so that in collection a single point can
be represented about five/six times.
Sequence tagged site mapping




The mapping is performed by amplification of STS unique sequence using
PCR and looking for the presence of two different STS on the same fragment
from the collection.
The frequency of having two STS on the same fragment depends how close
they are to each other.
Closer the STS to each other higher the chance to find them together on the
more fragments.
Or frequency at which breaks occur between two markers.
Any unique DNA sequence can be
used as an STS

The STS can be any sequence which:
• Have known sequence
• Should have unique position in chromosome/genome

The most common STS are:
• Expressed sequence tags (ESTs) (taken from cDNA projects: limited to genes only)
• Simple sequence length polymorphism (SSLPs) (mini and micro satellite )

Help in directly linking the genetic and physical map
• Random genomic sequences

Fragment of DNA for STS mapping



For STS mapping fragment of a chromosome or
genomic DNA are needed, which are known as
mapping reagents.
These can be produced in many ways:
Radiation hybrids
• The genome can fragmented by irradiation by X-ray
(3000-8000 rad) and then can be fused to make radiation
hybrids with hamster cells
• A single chromosome can be separated by technique like
flow cytometry and can be used to make radiation
hybrids

A genomic clone library
• This can be directly mapped with STS and provide a
direct linked with STS mapping and then can be
sequenced.