Download RECOMBINANT DNA TECHNOLOGY AND BIOTECHNOLOGY

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Mutation wikipedia , lookup

DNA methylation wikipedia , lookup

Epigenetics wikipedia , lookup

DNA barcoding wikipedia , lookup

Human genome wikipedia , lookup

DNA sequencing wikipedia , lookup

Mitochondrial DNA wikipedia , lookup

DNA wikipedia , lookup

Metagenomics wikipedia , lookup

Genetic engineering wikipedia , lookup

Comparative genomic hybridization wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Mutagen wikipedia , lookup

DNA repair wikipedia , lookup

Zinc finger nuclease wikipedia , lookup

Gene wikipedia , lookup

Plasmid wikipedia , lookup

DNA profiling wikipedia , lookup

Designer baby wikipedia , lookup

Cancer epigenetics wikipedia , lookup

Point mutation wikipedia , lookup

Nucleosome wikipedia , lookup

Microevolution wikipedia , lookup

Genealogical DNA test wikipedia , lookup

Genomics wikipedia , lookup

DNA damage theory of aging wikipedia , lookup

United Kingdom National DNA Database wikipedia , lookup

SNP genotyping wikipedia , lookup

DNA polymerase wikipedia , lookup

Primary transcript wikipedia , lookup

Gel electrophoresis of nucleic acids wikipedia , lookup

Non-coding DNA wikipedia , lookup

Genome editing wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Bisulfite sequencing wikipedia , lookup

Replisome wikipedia , lookup

Microsatellite wikipedia , lookup

DNA vaccination wikipedia , lookup

Nucleic acid analogue wikipedia , lookup

Epigenomics wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Cell-free fetal DNA wikipedia , lookup

Nucleic acid double helix wikipedia , lookup

No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

DNA supercoil wikipedia , lookup

Extrachromosomal DNA wikipedia , lookup

Genomic library wikipedia , lookup

Molecular cloning wikipedia , lookup

Helitron (biology) wikipedia , lookup

Cre-Lox recombination wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

History of genetic engineering wikipedia , lookup

Deoxyribozyme wikipedia , lookup

Transcript
RECOMBINANT DNA TECHNOLOGY AND
BIOTECHNOLOGY
Tools of Recombinant DNA Technology
Rakesh Bhatnagar
Professor
Centre for Biotechnology
Jawaharlal Nehru University
JNU New Campus
New Delhi 110067
27-Mar-2006 (Revised 24-Jul-2007)
CONTENTS
What is a Recombinant DNA
Goals of Recombinant DNA Technology
Basic Tools of Recombinant DNA Technology
Visualization of DNA: Agarose Gel Electrophoresis
Restriction Endonuclease – An overview
DNA Modifying Enzymes
Polymerase Chain Reaction (PCR)
Vectors
Plasmids
Bacteriophage vectors
Lambda-Replacement Vector
Insertion Vectors
Shuttle vectors
Plant Vectors
Ti plasmid
Cloning
PCR Mediated Gene Cloning
Methods for Selection and/or Screening of Desired Recombinants
Keywords
Recombinant DNA; Plasmids; Bacteriophages; Agarose Gel Electrophoresis; Restriction endonuclease;
Isochizomers; DNA ligase; DNA polymerase;Bacteriophage polymerase; Reverse Transriptase; Nucleases;
Alkaline phosphatase; Poymerase Chain Reaction (PCR); Plasmid vectors; Gene cloning; Bacteriophage vector;
Lambda replacement vector; Insertion vector; Shuttle vector; Plant vector; Ti plasmid.
What is a Recombinant DNA
DNA molecules constructed outside the living cells that is in vitro by joining natural or
synthetic DNA segments that can replicate in a living cell
Goals of Recombinant DNA Technology
a) To isolate and characterize a gene
b) To make desired alterations in one or more isolated genes
c) To return altered genes to living cells
Basic Tools of Recombinant DNA Technology
Nucleic Acid Enzymes
DNA and RNA polymerases, reverse transcriptase, DNA ligases, Restriction endonucleases
and many more.
Plasmids and Bacteriophages
a) Bacteriophages replicate via the lytic phase cycle and the phage genome is injected into
the cell, phage genes are expressed and phage proteins and DNA are made, progeny
phage are packaged, and the cell is lysed. Two genetically different phage that infect the
same host cell may recombine during the lytic cycle
b) Some PHAGE can also replicate via the Lysogenic cycle. The phage genome is integrated
into the host chromosome and is inherited into the chromosomes of all daughter bacteria.
This "prophage" can be induced to enter the lytic cycle and kill its host by a variety of
stresses like UV light
c) Plasmids: Circular DNAs that replicate autonomously
Visualization of DNA: Agarose Gel Electrophoresis
Electrophoresis is a technique used to separate and purify macromolecules like nucleic acids
that differ in size, charge or conformation. It is one of the most widely-used techniques in
biochemistry and molecular biology. When charged molecules are placed in an electric field,
they migrate toward either the positive or negative pole according to their charge. In contrast
to proteins, which can have either a net positive or net negative charge, nucleic acids have a
consistent negative charge imparted by their phosphate backbone, and migrate towards the
positive end of the electrode, anode.
The nucleic acids are electrophoresed on a matrix called as Gel. Most commonly, the gel is
cast in the shape of a thin slab, with wells for loading the sample. The gel is immersed within
an electrophoresis buffer that provides ions to carry a current and maintain the pH at a
relatively constant value. The gel is composed of agarose, a polysaccharide extracted from
seaweed. It is typically used at concentrations of 0.5 to 2%. The higher the agarose
concentration the gel is of less porosity. The agarose gels have a large range of separation,
but relatively low resolving power. By varying the concentration of agarose, fragments of
DNA from about 200 to 50,000 bp can be separated using standard electrophoretic
techniques.
2
The distance DNA has migrated in the gel can be judged by visually monitoring migration of
the tracking dyes. Bromophenol blue and xylene cyanol dyes are used usually and they
migrate through agarose gels at roughly the same rate as double-stranded DNA fragments of
300 and 4000 bp, respectively. When adequate migration has occured, DNA fragments are
visualized by staining with ethidium bromide, a fluorescent dye which intercalates between
the bases of DNA and RNA. Fragments of linear DNA migrate through agarose gels with a
mobility that is inversely proportional to the log10 of their molecular weight. However,
circular forms of DNA migrate in agarose distinctly differently from linear DNAs of the same
mass. Uncut plasmids will migrate more rapidly than the same plasmid when linearized and
most preparations of uncut plasmid contain two topologically-different forms of DNA,
corresponding to supercoiled forms and nicked circles. Several additional factors have
important effects on the mobility of DNA fragments in agarose gels like:
Agarose concentration: By using gels with different concentrations of agarose, we can
resolve different sizes of DNA fragments. Higher concentrations of agarose facilitate
separation of small DNAs, while low agarose concentrations allow resolution of larger DNAs
Voltage: As the voltage applied to a gel is increased, larger fragments migrate proportionally
faster than the small fragments. For that reason, the best resolution of fragments larger than
about 2 kb is attained by applying no more than 5 volts per cm to the gel where, cm value is
the distance between the two electrodes, not the length of the gel).
Electrophoresis buffer: Several different buffers have been recommended for
electrophoresis of DNA. The most commonly used for duplex DNA are TAE (Tris-acetateEDTA) and TBE (Tris-borate-EDTA). DNA fragments will migrate at somewhat different
rates in these two buffers due to differences in ionic strength. Buffers not only establish a pH,
but provide ions to support conductivity. If water is used instead of buffer there will be no
migration of DNA in the gel and conversely, if concentrated buffer is used like a 10X
solution instead of 1X, heat will be generated in the gel which is enough to melt it.
Ethidium bromide: Ethidium bromide is a fluorescent dye that intercalates between bases of
nucleic acids and allows very convenient detection of DNA fragments in gels. It is added to
the DNA sample before loading to enable visualization of the fragments within the gel or can
be added in the electrophoresis buffer. The binding of ethidium bromide to DNA alters its
mass and rigidity, and therefore its mobility.
Restriction Endonuclease – An overview
Phage growth restriction – Methylase & Nuclease concept
It has been observed that phage particles that grow well and efficiently infect one strain of
bacteria are often unable to grow well and infect other strains of the same bacterial species.
Studies carried out showed that the phage particles that efficiently grow and infect host cells
have DNA molecules which are chemically modified by the addition of –CH3 groups to some
of their A/C bases, while the DNA of phage particles, which poorly infect the bacterial strain,
does not show this pattern of chemical modification of methylation. To summarize, phage
particles with unmethylated DNA do not grow and infect efficiently. Therefore, their DNA
molecules gets cleaved and degraded by the enzyme of host cells, which recognize it as
foreign. In contrast, methylated DNA is recognized as self and is protected from hostmediated destruction. This phenomenon of degrading unmethylated DNA and protecting the
self methylated DNA is responsible for the phenomenon of phage growth restriction. In
3
addition to this particular methylase, individual bacterial strains also contain accompanying
specific endonuclease activity. In late 1960s, Stewart Linn and Werner Arber isolated two
types of enzymes responsible for this observed phage restriction phenomenon. Later on, in
1978, Smith, Nathans and Arber were awarded Nobel Prize for Physiology and Medicine for
the discovery of endonucleases. The first type of enzyme was called a methylase while other
was called a restriction nuclease. The endonuclease cleaves at or near the methylation
recognition site. Different strains have difference in residues which are methylated. Typical
sites of methylation include N6 position of adenine, N4 of cytosine or C5 of Cytosine. And,
only a fraction of bases are methylated, that is, not every A/C will be methylated. This
suggests that there are only few unique bases which are substrates of methylase. Thus, this
unique system was coined as Restriction Modification system in bacteria. It acts as a small
immune system for protection from infection by foreign DNA, example viral DNA, in
individual bacterial strains and provides a reason why the restriction endonuclease doesn’t
chew its own DNA.
Eg. ECOR I methylase, the sequence GAATTC will be internally methylated at adenine base
(shown in asterisks) by the enzyme. The ECOR I endonuclease within the host bacteria will
not cleave the methylated DNA. The foreign viral DNA, which is not methylated at this
particular sequence, GAATTC, will be recognized as foreign invasive DNA and will be
cleaved by ECOR I endonuclease. This cleavage of foreign DNA abandons any viral
infection, thus protecting the host from any fatal consequence. Such endonucleases are
termed as restriction endonucleases since they restrict the DNA within the cell to being
“self”.
If the attacking phage was previously grown on the same strain, which is infecting now, this
system will be rendered ineffective. In this case, the phage’s DNA will already be methylated
at appropriate sequence and will be recognized as self. Structural and biochemical studies
have indicated that for the type II R/M system, the methylase recognizes and methylates one
strand of DNA duplex, whereas restriction endonucleases recognize both the strands, that is,
both should be non-methylated for recognition. It is able to do this because it is a homodimer.
The methyl group protrudes into the major groove of DNA at the binding site and prevents
the restriction enzyme from acting upon it. Together, a Restriction enzyme and its cognate
modification enzyme, works in the bacteria to prevent it from infections, acting as miniature
microbial immune system. In some R-M systems, the restriction enzyme and the modification
enzyme(s) are separate proteins that act independent of each other while in some, the two
activities occur as separate subunits / domains of a larger, combined R-M enzyme.
Restriction enzyme or Endonuclease is an enzyme that cuts double stranded DNA. Precisely,
it cleaves the sugar-phosphate backbone of DNA without damaging the bases. A restriction
endonuclease works by scanning the length of DNA molecule and once it encounters its
particular specific recognition sequence; it will bind to the DNA and makes one incision in
each of the two sugar- phosphate backbones of the double helix. Thus, rather than cutting
DNA indiscriminately, a RE cuts only at a particular nucleotide sequence, termed as
Recognition sequence. Therefore, scientists could cut the DNA molecule in a predictable and
reproducible way. Restriction enzymes are traditionally classified into 3 types on the basis of
subunit composition, cleavage position, sequence specificity and requirement for cofactors.
4
Type I
Complex, multisubunit, cut DNA at random, far from their recognition sequences. They may
have biological significance but have a very little practical value since they do not produce
discrete restriction fragments.
Type II
Cut DNA at very precise and defined positions close to or within the recognition sequences.
It is the only class of restriction endonuclease used in the laboratory for DNA analysis and
gene cloning. They require only Magnesium for activity and corresponding model of
enzymes require only s-adenosyl methionine (SAM). They are smaller with subunits in the
range of 200-350 amino acids. The most common are the ones which cleave within
symmetric recognition sequence. e.g. Hha I, Hind III and Not I. Next common of Type II, the
ones which cleave outside their Recognition sequence to one side. They are intermediate in
size with 400-650 amino acids and recognize sequences that are continuous and asymmetric.
Third major type II Restriction enzymes are referred to as Type IV, are large, 850-1250
amino acids, where two enzymatic activities reside in the same protein chain, cleave outside
the recognition sequence and others those recognize discontinuous sequences and cleave on
both sides releasing a small fragment containing recognition sequence.
Type III
Large, cleave outside the recognition sequence and require two sequences in opposite
orientations within the same DNA molecule to accomplish cleavage. They rarely give
complete digests. They have basically no practical requirement and there is none so available
commercially.
Source and site of Action
Patterns of DNA cutting
Restriction Endonuclease recognizes short, specific palindromic sequences of DNA and
cleaves them. Some restriction enzymes symmetrically leave blunt ends without overhangs
like Sma I as shown below while others cleave DNA backbones in positions that are not
directly opposite to each other. e.g EcoR I. Arrows show the site of cleavage. Once the cuts
have been made, the resulting fragments are held together only by the relatively weak
hydrogen bonds holding complimentary bases with each other. These enzymes generate
either of two types of ends, 5‫ ׳‬or 3‫׳‬, called overhangs or sticky ends.
5
(Generates blunt ends)
(Generates 5’ overhangs)
5’
5’ CCCGGG 3’
3’
3‫ ׳‬GGGCCC 5’
GAATTC
3’
CTTAAG
5’
EcoR I
Sma I
5’ G
5’ CCC
GGG 3’
3’ CTTAA
3‫ ׳‬GGG
CCC 5’
5’ GGTACC 3’
5’ GGTAC
3’ C
AATTC
3’
G
5’
C 3’
CATGG 5’
3’ CCATGG 5’
Kpn I
(generates 3’ overhangs)
Nomenclature
Restriction endonucleases are named based on bacteria from which they are isolated in the
following manner
E
Co
R
I
Escherichia (Genus)
Coli (Species)
RY13 (Strain)
First Identified order (Id) in Bacteria
Name of RE
Organism from which isolated
Recognition Sequence
EcoR I
BamH I
E.coli
Bacillus
amyloliquefaciens
Haemophilus
influenzae
Arthobacter
luteus
Nocardia olitidiscaviarum
GAATCC
GGATCC
6bp cutter, leaves 5’ overhangs
6bp cutter, leaves 5’ overhangs
AAGCTT
6bp cutter, leaves 5’ overhangs
AGCT
4bp cutter, Leaves blunt end
Hind III
Alu I
Not I
GCGGCCGC 8bp cutter, 5‫ ׳‬overhangs
6
Recognition Sequence typically is only 4-12 nucleotides long. Thus probability that they are
found in chromosomes of any organism is quite possible and frequent. Thus, if there is 25%
probability for a specific base at any given site (because 4 bases, A/T/G/C), then the
frequency with which different RE site will occur in a gene will be 4n, where n is nucleotide
specificity. Eg. Alu I is a 4bp cutter, therefore, (4)4 = 256. Therefore, on an average, any DNA
will contain an Alu I site after every 0.25 kilobases, whereas Not I site occurs once about
every 65.5 kilo bases (48). Thus Not I generates bigger fragments while Alu I would be
expected to digest DNA into many small fragments. Thus, length of Recognition sequence
dictates how frequently the enzyme will cut in a random DNA sequence.
Isochizomers
Isochizomers are different Restriction endonucleases having same recognition site. In some
cases, they cut identically within their recognition site, but sometime they do not. They have
different optimum reaction conditions, stabilities and cost that give us an option of what to
purchase. Some Restriction endonucleases recognizes only one sequence but never other,
called as Ambiguous Recognition Sequence. Eg. BamH I recognize GGATCC, while Hinf I
recognizes a 5bp sequence, with an eligibility of sequence starting with GA and ending with
TC and having any base in between GANTC. Some REs recognition site has a site for
cleavage by other Restriction Endonuclease. e.g: BamH I site GGATCC have site recognized
and cleaved by Sau3A I GATC. Thus all BamH I sites can consequently be cut by Sau3A I.
Conditions and requirements that influence Enzyme Activity
One unit activity:- is defined as amount of enzyme required to digest 1µg of reference DNA
in 60 minutes at 370C. Usually reference DNA is λ phage DNA.
Buffer:- The most critical determinant of the enzyme activity is the ionic concentration
(NaCl content) of the buffer. There are commercially 3 buffer systems available for the
activity, low salt buffer (20mM), medium (100mM) or high (250mM) NaCl buffers. The
buffer is usually 10mM Tris-HCl, pH 8.0, supplemented with Magnesium salt (often 50mM
MgCl2), a reducing agent (usually 1mM DTT), and some enzymes require BSA (100µg/ml)
and salt (NaCl). The reaction does not require ATP.
Quality of DNA :- Preparation of DNA that has to be cleaved should be free of contaminants
such as phenol, CHCl3, alcohol, EDTA, detergents, excessive salts, protein, RNA, all of
which can pose a hindrance to the enzyme’s activity.
Stopping a Reaction:- To terminate Restriction enzymes activity, heat inactivation is a way
but for the enzymes which it does not work, phenol / CHCl3 extraction is another means of
inactivation of Restriction enzymes.
Incubation Temperature and Time: - Recommended temperature for most of the
Restriction enzymes is 370C except for the ones which are isolated from thermophilic
bacteria which require higher temperature ranging from 50-650C. While some restriction
enzymes have short ½ lives at 370C and for those lower temperatures are required. Incubation
time as per the unit definition is 1 hour but it can be manipulated. Example, it may be
shortened if more units of Restriction enzyme is added or longer times are used to allow a
reaction to proceed to completion with fewer enzymes units.
7
Cleavage near ends :- When Recognition sequences are located at or very close to the ends
of the linear fragments, most enzymes require few bases on either side of Recognition
sequence in order to bind and cleave. Thus there is some specificity on end requirement for a
variety of enzymes.
DNA methylation :- Almost all strains of E.coli used for propagating cloned DNA contain 2
site-specific DNA methylase:Dam Methylase
Adds a –CH3 group to A in separate GmATC
Dcm Methylase
Adds methyl group to internal cytosine CmC
(A/T)GG
Importance of this phenomenon lies in the fact that a number of Restriction enzymes will not
cleave methylated DNA. Eg. Mbo I and Sau3A I are isochizomers recognizing sequence
GATC. Dam methylase methylates it to GmATC, while Mbo I does not cleave it, Sau3A I is
unaffected by the sequence methylation.
Relaxed or Altered activity of Restriction enzymes - Star activity
If non standard conditions are used for DNA digestion, cleavage can occur at sites different
from canonical recognition sequence. This aberrant non specific DNA cleavage is called as
star activity. Some non standard conditions that contribute to this star activity are as follows:¾
¾
¾
¾
¾
¾
High pH (>8)
Low ionic strength of buffer (<25mM)
>5% glycerol concentration (Enzymes are stored in 50 % glycerol)
High enzyme concentration (>100U for 1µg DNA)
Presence of organic solvents in DNA preparation Eg. DMSO, Ethylene Glycol
Substitution of Mg+2 with other divalent cations like Mn+2, Cu+2, Co+2, Zn+2
Thus, to avoid star activity, fewer units should be used which not only prevents over
digestion but reduces final glycerol concentration in the reaction. High ionic strength should
be used provided enzyme is not inhibited by high salt. pH should be lowered accordingly.
Restriction Mapping:A restriction map is a profile of restriction endonuclease cleavage in a piece of DNA.
Generation of restriction map is the first step in characterizing an unknown DNA for the
presence of recognition sequences. Restriction Endonucleases which are infrequent cutters
are used for generating the maps. Fragments of varying lengths using various combinations of
enzymes are generated and then analyzed on agarose gel. Single digestions are used to
determine which fragments are in the unknown DNA, and double digest to predict the order
and orientation of fragments. Success in using this technique relies on obtaining complete
digestion of the DNA with each of the enzymes used since partial digest will yield false
fragments which will lead to confusion.
DNA Modifying Enzymes
1. DNA Ligase
Ligases are vital enzymes required for physiological cellular processes such as DNA
replication, repair of damaged DNA and recombination. This enzyme facilitates formation of
8
phosphodiester bonds between adjacent 3’OH and 5’ phosphate termini, thereby joining the
nicks in double strandeded DNA. There are 2 categories of ligase depending on the
requirement for cofactors. Eukaryotic and virally encoded enzymes are ATP dependant,
whereas most prokaryotic enzymes require NAD+ for their activity as a cofactor.
ATP dependant DNA ligase
Ligase isolated from bacteriophage T7 is a 41Kda monomer and is found to form a complex
with ATP. The structure involves an N terminal ATP binding domain and a C terminal
domain which is the DNA binding domain. The N-terminal DNA is itself an active ligase but
with the reduced ligase activity. Both domains are able to bind to single stranded DNA but
smaller C terminal domain is only able to bind double stranded DNA. The affinity for nicked
DNA comes only from reconstitution of these two domains. This enzyme also catalyses
covalent joining of two segments to one uninterrupted strand in a DNA duplex, provided that
no nucleotides are missing at the junction (repair reaction). For its catalytic activity, the
enzyme requires ATP and Mg+2. DNA which does not have phosphate residues can be
rendered capable of ligation by Phosphorylation with Kinase.
3’
5’ ACTT-OH
3’ TGAAGA
CTTCGTLigation of DNA with complementary cohesive
termini.
+
AGCA-5’
ATP/Mg2+ and ligase
51 - ACTTCTTCGT – 3’
31 – TGAAGAAGCA – 5’
Repair Reaction:3’
5’
C
G
P
5’
G T A
C A T
P OH P
P
3’
+ ATP and Mg2+ and ligase
P 3’
5’
C G T
G C A
5’ P P
A
T
P
P
P 3’
Bacteriophage T4 DNA ligase is a polypeptide with a Molecular weight of 68000 Daltons.
Mg+2 ions in 10mM concentration sulfhydryl agents (DTT, 2-mercaptoethanol) are also
required for its catalytic activity. High NaCl concentration can be inhibitory. For
intermolecular ligations, PEG (Polyethylene glycol) (1-10%) enhances the efficiency of
ligation reaction. Addition of PEG can also facilitate blunt end ligation, whose efficiency is
as such lower. Since there is no base pairing to hold fragments together temporarily,
concentration of DNA and ligase must be high. The reaction which is driven by ATP is often
carried out at 100C to lower the kinetic energy of molecules, and so reduce the chances of
9
base paired sticky ends parting before they have been fixed by ligase. But, then long reaction
times are needed to compensate for the lowered activity of DNA ligase at lower temperatures.
NAD+ dependent DNA ligase
Isolated from E. coli, is quite different from ATP dependent T4 DNA ligase. In this, there are
two domains with the larger N-terminal domain retaining full activity while the C terminal
domain having all the DNA binding activity. Thus, the scenario is different, in the context
that, there both the domains contribute to the enzyme activity.
Units of Ligase Activity
Weiss unit: - is defined as the amount of ligase that catalyzes the exchange of 1nanomole of
P32 from inorganic pyrophosphate to ATP in 20min at 370C.
2. DNA Polymerases
E.coli DNA polymerase I
E. coli DNA polymerase I is a single polypeptide chain of 109000Da, encoded by E. coli pol
A gene and requires magnesium as a cofactor. Polymerase I carry out three enzymatic
reactions that are performed by three distinct functional domains of the holoenzyme.
Proteolytic deletions can be generated to study the function of each domain in terms of the
activity performed. The domains of E. coli polymerase I are summarized below in a table
which mentions the function of each subunit separately.
Domain
Catalytic Activity
C-terminal (46 kDa)
5’-3’ DNA Polymerase
Central Domain (22 kDa)
3’-5’ exonuclease
-NH2 terminal
5’-3’ exonuclease
Characteristic Function
Addition of nucleotide residue to
3’ –OH termini of RNA/DNA
primes
Cleavage of nucleotide residue
from 3’ –OH, generating 3’ recess
Cleavage of nucleotide from base
paired 5’ termini
3’-5’ exonuclease domain
3’-5’ exonuclease domain
DNA polymerase I
Polymerase domain
Holoenzyme
Treated with Subtilisin
Large Klenow fragment
Polymerase domain
5’-3’ exonuclease domain
5’-3’ exonuclease domain
(5’-3’)
Small fragment
10
Polymerase I on mild cleavage with subtilisin releases two fragments, the larger one, known
as Klenow fragment, carries both DNA polymerase and 3’-5’ exonuclease while the smaller
one possesses 5’-3’ exonuclease activity of the holoenzyme. The exonuclease activity
performs a proof reading function by rejecting DNA, which is mismatched.
Units of DNA polymerase
One unit of DNA polymerase is the amount of enzyme required to catalyze the conversion of
10 nanomoles of total dNTPs to an acid-insoluble form in 30minutes at 37 0C using poly d
[A-T] as the template primer. Mg+2 ions are preferred for accurate replication, whereas Mn+2
if present, increases the frequency of errors and mismatches.
Application of Klenow fragment in Molecular Biology
1.
Synthesis of double stranded DNA from single stranded template
The primary function of DNA polymerase is to synthesize complimentary strands during
DNA replication. DNA polymerase requires a primer to provide 3’–OH group to which newer
nucleotides can be added. The primers used are generally 6-20 bases in length, termed as
oligonucleotides, which are complimentary to a specific region of template DNA. Shown
below is the display of the catalytic activity carried out by the enzyme.
GCTAC
AGGC
AAGTCCGATGCCAATTGCGGATCCGATT
Klenow fragment
dNTPs of each kind
GCTACGGTTAACGCCTAGGCTAA
AAGTCCGATGCCAATTGCGGATCCGATT
2.
Filling in recessed 3’ends of DNA fragments
Klenow fragment is also used to create blunt ends on fragments created by restriction
enzymes that leave 5’ overhangs.
5’AGGCAG3’
3’TCCGTCGAACT5’
Klenow and dNTPs of
each kind
5’AGGCAGCTTGA3’
3’TCCGTCGAACT5’
3.
Digestion of Protruding 3’ overhangs
This is another way of producing blunt ends on a DNA, which is created by restriction
enzymes that produce 3’ overhangs. Removal of nucleotides from 3’ ends will continue, but in
the presence of nucleotides, the polymerase activity balances the exonuclease activity,
yielding blunt ends.
4.
Generating novel cohesive ends
The DNA digested with restriction enzymes generates cohesive ends that can be end-filled.
The end-filling reaction can be controlled by omitting one, two or three of the four dNTPs
from the reaction and thereby generate partially filled termini.
11
3. T4 Bacteriophage polymerase
T4 Bacteriophage polymerase is similar to E. coli DNA polymerase I larger subunit, Klenow
fragment. Thus, it has 5’-3’ DNA polymerase, 3’-5’exonuclease activities but 5’-3’
exonuclease activity is missing. It is also used for similar type of work like creating blunt
ends in a DNA with 5’ or 3’ overhangs. The functional difference between the two enzymes
which have practical significance is as follows:1.
3’-5’ exonuclease of T4 is 200 times more potent than that of Klenow which is a
sluggish enzyme, making it a preferred enzyme for generating blunt ends in a DNA
with 3’ overhangs.
2.
Like Klenow displace downstream oligonucleotides with 5’-3’ exonuclease activity as
it extends the chain, T4 polymerase does not. This makes it a preferred choice for
oligonucleotides directed mutaqenesis.
Use of Polymerase I- Nick Translation
Labelling of DNA by nick translation
Enzyme binds to a nick or gap in duplex DNA. The 5’-3’ exonuclease activity of Polymerase
I remove nucleotide from one strand and create a template for the growing chain. The original
nick is therefore translated along the DNA molecule by the combined action of 5’-3’
exonuclease and 5’-3’ polymerase. This is the oldest method of nucleic acid labeling and is
still used commonly. The DNA preparation to be labeled is treated with DNase I for a very
short time; this induces nicks in the DNA molecule at random positions. A nick is a point in a
DNA duplex where the phosphodiester bond is broken and has a free 3’ OH group. The
nicked DNA is subjected to E. coli DNA polymerase I which successively adds new
nucleotides to the free 3’OH group. As the new bases are synthesized, the existing strand is
displaced progressively and is digested away by the enzyme 5’-3’ exonuclease activity of the
polymerase enzyme. In this reaction, usually one of the deoxynucleoside triphosphates is
radiolabel led or even all the four can be labeled, thus incorporating labeled nucleotides into
the newly synthesized segment. Since this segment is synthesized by complimentary basepairing, it has the same sequence as the strand it replaces. This process is called ‘nick
translation’ because there is a movement (translation) of the nick along the DNA duplex due
to the activities of DNA polymerase I.
5’
31
3’
5’
3’
51
dNTPs / E.coli
DNA Polymerase
5’
3’
3’
5’
Polymerization
5’
3’
Nick translation is usually carried out at
lower temperature to reduce the synthesis of
snap back DNA, which is produced when 3’OH terminus of growing strand loops back
on itself
3’
5’
12
4. T7 DNA Polymerase
The T7 DNA polymerase from T7 bacteriophage has 3’-5’ exonuclease and DNA polymerase
activity but lacks 5’-3’ exonuclease domain, which is similar to T4 DNA polymerase. The
processivity of this enzyme is quite good that is, the average length of DNA synthesized
before the enzyme dissociates from the template, is considerably greater than for other
enzymes. Thus, the average length of DNA synthesized by a single molecule of
bacteriophage T7 polymerase is much greater than that of DNAs synthesized by other DNA
polymerases. The binding and polymerization domain is occupied by the carboxy terminus
while the potent 3’-5’exonuclease activity resides on the amino terminus.
The exonuclease activity is completed inactivated by incubating the enzyme with a reducing
agent, molecular oxygen, and low concentrations of ferrous ions, for several days. Over 99%
of the exonuclease activity is abolished without affecting the polymerization activity by these
agents, which cause mutations and site specific modifications. The resulting chemically
modified enzyme is marketed under the trade name Sequenase is ideal for determining the
sequence of long tracts of DNA by the dideoxy mediated chain termination method.
5. Thermostable DNA polymerase
Taq DNA polymerase:Taq DNA polymerase is a DNA dependent DNA polymerase, first isolated from the hot
spring bacterium, Thermus aquatics in 1976 and in 1989. Due to its wide use in molecular
biology (primarily PCR), it is termed as ‘Molecule of the Year’. This thermophilic DNA
polymerase encodes an 832-amino acid, 94 kDa protein, which consists of two domains.
1. -NH2 domain: Similar to 5’-3’ exonuclease domain of members of polymerase I
family of DNA polymerase
2. -C terminal domain contains a catalytically inactive 3’-5’ exonuclease and a
polymerase sub domain, similar to klenow of DNA polymerase I.
The thermal stability of Taq DNA polymerase is attributed to its hydrophobic core and stable
electrostatic interactions and high density of proline residues on the surface of the enzyme.
The optimal activity is at 75-800C temperature and at 600C, the activity is reduced by a factor
of 2 and at 370C, its activity is reduced to only 10%. To initiate DNA synthesis, like other
DNA polymerases, it also requires a primer that is annealed to the template strand and caries
an extensible 3’-OH group. Taq DNA polymerase requires Mg2+ for its optimal activity.
Phosphate buffers inhibit Taq DNA polymerase and therefore should be avoided. The
reaction is usually carried out in the presence of Tris buffer at pH 8.3. Because of the lack of
a proofreading function, the rate of misincorporation of dNTPs is high in PCR reactions
which are catalyzed by Taq polymerase (or any other DNA polymerase that does not have
editing domains). Several mutant forms of native polymerases, are also available like Pfu
And vent. Both of them have proofreading activities contributed by 3’-5’ exonuclease. Pfu
polymerase is therefore known to generate lowest errors while vent is probably intermediate
between Taq and Pfu.
Pfu polymerase is isolated from Pyrococus furiosus
Vent is isolated from Thermococcus litoralis (also known as Tli polymerase)
13
6. Reverse Transcriptase (RT)
Reverse Transcriptase is a RNA dependent DNA polymerase, encoded by reteroviruses,
where there is a requisite of converting viral RNA genome into DNA, prior to its integration
into the host cells. Reverse transcriptase has two independent functional activities in two
separate domains.
1. DNA polymerase activity: As its native function, Reverse Transcriptase copies only
RNA, but in vitro, it can transcribe both single stranded RNA and single stranded
DNA templates with equivalent efficiency. And in both cases, RNA or DNA primer is
required to initiate synthesis.
2. RNaseH activity: Ribonuclease that degrades RNA from RNA-DNA hybrids. The
products generated includes ribonucleotides which are 4-20 nucleotides in length and
contain 5’phosphates and 3’ OH terminus
Reverse transcriptase is used chiefly to transcribe mRNA into double stranded cDNA that can
be inserted into prokaryotic vectors. Reverse transcriptase lacks 3’-5’ exonuclease activity
which acts as an editing function in E. coli DNA polymerase I, and is therefore more prone to
errors
Usually Reverse Transcriptase for laboratory purposes is isolated from one of the following
reteroviruses, either by purification from virus i.e the native host or from heterologous hosts
as a recombinant protein.
a.) Moloney Murine leukemia virus - Single polypeptide
b.) Avian myelobalstosis virus- Two polypeptide chains
Both of these differ from one another in a number of aspects. Mo-MLV Reverse
Transcriptase contains two non overlapping domains, larger –NH2 domain with DNA
polymerase activity, while C-terminal domain encompasses a comparatively weak RNaseH
activity which has an advantaqe over the avian form, when synthesizing cDNAs
complimentary to long mRNAs. Because at the beginning of cDNA synthesis, there is a
competition between degradation of template mRNA and initiation of DNA synthesis. The
presence of RNaseH activity leads to premature termination of RNA-DNA duplex. Therefore,
certain mutants are made that retains DNA polymerase activity but lacked RNase H activity.
Serendipitously, it has been found that due to these mutations, the enzyme can work at
elevated temperatures, as high as 500C, whereas RNase H deficient mutant forms of Avian
enzyme can be used at 600C while the murine enzyme is rapidly degraded at these
temperatures. These mutated thermostable forms of Reverse Transcriptase facilitated their
use in RT PCR which itself has myriad applications in Molecular Biology. The avian enzyme
works more efficiently at pH 8.3 than at pH 7.6, at which the murine enzyme works.
14
7. Nucleases
Enzyme
Characteristic Feature (Activity catalyzed)
Nuclease S I
Heat stable, isolated from fungi Asperigillus oryzae ,works optimally at
high ionic strength, low pH (~4.2) in the presence of Zn+2, degrades
single stranded DNA / RNA, chief product is 5’ mononucleotide
Exonuclease VII Single strand specific DNase that hydrolyses denatured DNA, Single
stranded regions extending from the termini of duplex DNA, chief
product is oligonucleotide (2-25 nucleotides), does not require divalent
ions, has several applications
Mung bean
To eliminate primers from completed PCRs, isolated from mung bean
Nuclease
sprouts, degrades Single stranded DNA / RNA generating 5’ mono and
oligonucleotides, very specific for single stranded DNA but if used in
high concentration or with long incubation time, can cleave double
stranded DNA also. Used to convert protruding termini on double
stranded DNAs to blunt ends. It is easier to control than SI nuclease. SI
will cleave the DNA strand opposite to a nick while mung bean nuclease
will attack the nick only when it is enlarged to a gap of several
nucleotides, prefers to remove nucleotide from 5’ end of the substrates
and therefore not useful when DNA carries 3’ overhangs
BAL 31
Isolated from marine bacterium Alteromonas espejiana , Ca+2 dependent
exonuclease, has 3’-5’ exonucleolytic activity that removes
mononucleotides from double stranded DNA, a 5’-3’ exonuclease
activity for single stranded DNA. The combination of both exonuclease
causes both stands of double stranded DNA to be shortened from both
the ends. It is used to generate uni and bi-directional deletions from the
duplex DNA cloned in any vector.
Exonuclease III Isolated from E.coli removes 5’ mononucleotides from 3’- OH of duplex
DNA, leaving protruding 5’ termini. Preferred substrate is DNA with
blunt or 5’ protruding ends. It has several applications :a) Generating single stranded templates for sequencing
b) To produce unidirectional sets of progressive deletions in
populations of linear DNA molecules with one resistant (3’)
terminus and one susceptible (blunt or 5’) terminus, inactive on
single stranded DNA.
Ribonuclease T1 Isolated from Aspergillus, endonuclease that cleave RNAs at 3’–(p) of G
residues, producing oligo nucleotide terminal guanosine 3’ phosphates. It
is used to remove unannealed regions of RNA from RNA-DNA hybrids.
Ribonuclease A Endoribonuclease that cleave single stranded RNA at 3’ pyrimidine
residues, generating 3’ phosphorylated mononucleotides and
oligonucleotides.
Deoxyribonuclease Endonuclease which cleaves double stranded or single stranded DNA
preferentially adjacent to pyrimidine (c or T) residues generating 5’phosphorylated di, tri and tetra nucleotides, works optimally in the
presence of Mg2+ cleaving duplex DNA independently, while in the
presence of Mn+2, the enzyme cleaves both the strands of the DNA at
approximately same site, producing blunt ends or fragments with 1-2
base overhangs. It doesn’t work on RNA.
15
8. Alkaline Phosphatase (AP)
Alkaline Phosphatase is an important tool in molecular biological processes like cloning. It
removes 3’- phosphate groups from a variety of substrates. Although in laboratory, it is used
to catalyze the removal of terminal 5’-(P), residues from single stranded or double stranded
DNA and RNA. The resulting 5’-OH termini can no longer take part in ligation reactions,
thus prevents self religation of vectors, reducing the background of transformed bacterial
colonies that carry empty plasmids. This enzyme works optimally at alkaline pH (range of 89 in the presence of low Zn+2 concentrations) and hence derived the name.
Alkaline Phosphatase is isolated from various sources:a) Bacterial Alkaline phosphatase
Secreted in monomeric form into the Periplasmic space of E.coli, where it form dimers and
gets catalytically activated. It’s a remarkably stable enzyme and is resistant to inactivation by
heat and detergent. Thus, bacterial alkaline phosphatase is the most difficult to destroy in the
reaction mix.
b) Calf Intestinal Phosphatase
Calf intestinal phosphatase is a dimeric glycoprotein isolated from bovine intestine. This has
much more practical significance than bacterial alkaline phosphatase, since it can be readily
inactivated from the reaction mixture using proteinase K or by heating at 650C for 30 minutes
or 750C for 15 minutes in the presence of 10mM EGTA.
c) Shrimp alkaline phosphatase
Extracted from cold water shrimp, can be inactivated readily by heating at 650C for 15 min.
9. Terminal Deoxy Nucleotidyl Transferase
Terminal transferase is an unusual DNA polymerase found only in prelymphocytes and in
early staqes of lymphoid differentiation. Synthesis of single stranded tails at the 3’ ends of
either single stranded DNA or double stranded DNA with protruding 3’ termini, by the
enzyme Terminal Deoxy Nucleotidyl Transferase, is called tailing, can be used to generate
protruding ends of defined sequence to facilitate cloning of fragments. It can be used to
generate protruding ends of defined sequence, e.g poly A tails on the 3’ ends of the DNA
insert and poly T tails on 3’ ends of the vector. Thus, the protruding ends of the DNA insert
and vector will base pair under appropriate annealing conditions. Mg2+ cation is preferred
when the nucleotide to be added is a purine while Co2+ is preferred for the addition of
pyrimidines. The enzyme strongly prefers DNA with protruding 3’ terminus, although blunt
or recessed 3’ termini are also used, but less efficiently, in buffers of low ionic strength with
Co2+, Mg2+ or Mn2+ as bivalent cations. As many as thousands of deoxynucleotides can be
incorporated using this enzyme on a template of DNA. Single nucleotide can be added to the
3’ termini of DNA if modified bases like dideoxynucleotides or cordycepin triphosphates are
used instead of natural deoxynucleotide triphosphates.
Polymerase Chain Reaction (PCR)
PCR is a robust, speedy and flexible method, conceived by and put in practice by Kary Mullis
in 1985. It is used to generate microgram quantities of DNA (up to billions of copies) of the
desired DNA (or RNA), from a single copy in few hours. The PCR process has been
16
completely automated and compact thermal cyclers are available in markets. The
fundamental of PCR reaction with its components are as follows:
1.
A thermostable DNA polymerase to catalyze template dependent synthesis of DNA
For routine purpose, Taq polymerase is the enzyme of choice. However, when great
fidelity is required, when working with larger lengths of target amplicons, mutant forms
of Taq, having proof reading exonuclease activity are used to generate faithful copies of
a gene or else cocktail of both enzymes are used to compensate for the low processivity
of proof reading enzymes. eg. A mixture of Taq & Pfu polymerase has desirable features
of both high processivity (a feature of Taq polymerase) and fidelity which is contributed
by Pfu in one Reaction mixture, one can generate high yield of accurate amplified
products up to 35 kb.
2.
A pair of synthetic oligonucleotides to prime DNA synthesis.
Appropriate designing of primers is essential for accurate. Appropriate designing of
primers is essential for accurate amplification avoiding unwanted sequence
accumulation. The concentration of primer is also crucial. Lower concentrations can
limit the reaction while high concentrations favor mispriming & non-specific
amplification.
3.
dNTPs
Equimolar amounts of all the four nucleotides, dATP/ dTTP/ dCTP/ dGTP are required
to carry out a PCR Reaction.
4.
Divalent cations
Divalent cations are required by all thermostable polymerases for their optimal activity.
The most routinely used Taq Polymerase required Mg+2.
Since dNTPs &
oligonucleotides are known to sequester Mg+2 ions, the molar concentrations of
phosphate groups of dNTPs & primers should be standardized. Thus optimal
concentration of ions for each primers combination & template is crucial.
5.
Monovalent cations
Standard PCR buffers are supplemented with 50 mM KCl.
6.
Buffer
Tris-HCl of pH 8.3 and 8.8 is included in standard PCR reaction at a ionic strength of 10
mM. When incubated at 72oC, a temperature at which Taq Polymerase carries out
polymerisation reaction, the pH of Reaction mixture drops by more than 1 unit,
producing a buffer of pH 7.2.
7.
Template DNA
A template is the DNA containing target sequence to be amplified. The preferred
substrate for DNA amplification is linear DNA rather than closed circular DNA
template. The size is not an important parameter but working with large size DNA may
pose some obstruction. Ideally, one single copy is enough to initiate a PCR reaction but
several thousand copies are seeded while putting a PCR Reaction. The recommended
amounts of template DNA in case of yeast DNA, bacterial DNA and plasmid DNA that
should be used per reaction are 10ng, 1ng & 1 pg respectively.
17
PCR Iterative programming
A PCR cycle comprises of basically three repeated reaction including denaturation of the
template DNA followed by primer annealing to target sequence & extension of the primers
by DNA polymerase.
DNA denaturation
The optimum temperature for DNA denaturation is determined by the % GC content of each
particular template DNA. The more is the proportion of GC, higher temperatures are
required to denature the parental duplex strands. At lower temperature, only A-T regions
start melting. Longer DNA templates require longer denaturation time to separate them
completely. Routinely 950C is the temperature used for denaturation, since it is the maximum
temperature at which Taq Polymerase can exhibit its activity. For GC rich DNA,
thermostable polymerase isolated from Archae that can sustain even much higher
temperature, without any deleterious effects, are preferred.
Annealing
The temperature which is required to anneal primers to the template DNA is critical. It
should not be too high to avoid poor annealing and even not too low which facilitate
nonspecific annealing, resulting in accumulation of unwanted sequences. As such there is no
proper formula to calculate annealing temperature. Conditions are optimized by performing
several hit & trial PCRs at temperature ranging from 2-100C below the melting temperature,
calculated from the oligonucleotide primer sequence. The most commonly used formula to
determine the Tm from primer sequence is: 4(G+C) + 2(A+T), where G/C/A/T denotes,
number of times these bases are present. When the temperature is lowered to carry out
annealing Reaction, there are chances of renaturation of template DNA.
Extension
Extension reaction is usually carried out at temperature favorable to the themostable
polymerase used in the Reaction. For Taq polymerase 72-780C is the optimum temperature.
In 1st two cycles, polymerase extends beyond the sequence complementary to the binding site
of the primer but after the 3rd cycle, only sequence that is defined & limited by the binding of
primers are accumulated in a geometrical progression. As a thumb rule, for every 1000 base
pair, 1 minute is used.
No. of cycles
Number of cycles that are to be used to amplify a particular region from the target DNA
depends on the initial concentration of template present and the efficiency of primer
extension and amplification.
Variations from the conventional PCR
Hot start PCR
It a general modification in the protocol followed for setting a conventional PCR to maximize
the yield of desired amplified product and simultaneously to suppress any non specific
amplification. This is accomplished by withholding an essential component that plays a
critical role in the reaction e.g DNA polymerase. It is not added until the temperature is raised
high enough that inhibits non specific priming or oligomerisation of primers. Then to the
preheated mixture, Taq DNA polymerase is supplemented to facilitate the extension reaction.
Initial methods also involve use of monoclonal antibodies against polymerase. As the
temperature rises, the antibody dissociates from the enzyme & renders it active. Another
18
method is the use of enzyme ‘Ampli Taq Gold’ which requires heating to 930C for 10
minutes to become fully active.
Figure 1: Because both strands are copied during PCR, there is an exponential increase of
the number of copies of the gene. Suppose there is only one copy of the desired gene before
the cycling starts, after one cycle, there will be 2 copies, after two cycles, there will be 4
copies, and three cycles will result in 8 copies and so on. Thus, Polymerase chain reaction
(PCR), is a common method of creating copies of specific fragments of DNA. PCR rapidly
amplifies a single DNA molecule into many billions of molecules. In one application of the
technology, small samples of DNA, such as those found in a strand of hair at a crime scene,
can produce sufficient copies to carry out forensic tests.
19
The exponential amplification of the desired gene in a PCR reaction
Touch down PCR
It is another variation of conventional PCR that is followed to optimize yields of specific
amplified DNA when the Tm of hybrids between oligonucleotides (primers) and their target
sequences is not known with certainty. In this, a range of annealing temperature is utilized
in PCR. The annealing temperature during the 1st two cycles of amplification is set to 30C
above the calculated Tm of a perfect hybrid between most GC rich oligonucleotide in the
Reaction & its target DNA. Then the annealing temperature is reduced by 10C each cycle.
When appropriate temperature is encountered, amplification of target sequence begins. But
lowering of temperature continues till the temperature at which nonspecific priming can also
occur, but till that time, the desired product will be in hegemony.
Nested PCR
It is used to amplify specific sequences of DNA from a large complex DNA. It utilizes two
pairs of PCR primers for amplification of a single locus. The first pair amplifies the locus as
done by any PCR. The second pair of primers called as nested primers bind within the 1st
PCR product & produces a second PCR product that will be shorter than the first one. The
advantage of using nested PCR is, if wrong locus were amplified by mistake, the probability
of inaccurate second amplification will be low.
Anchored PCR
When sequence of only one end of the desired segment or gene is known, the primer
complimentary to the 3’ end is used to produce several copies of only one strand of the gene.
Now a poly G (or any other homopolymer) tail is added to the 3’ ends (the end for which the
primer is not available) of the single stranded DNA copies produced by PCR. This allows the
use of complimentary homopolymer poly C, to be used as primer for copying the DNA
generated by PCR, and give rise to the complete DNA duplex that can be amplified.
Asymmetric PCR
It is used to generate single stranded copies of DNA sequence which can be directly used for
DNA sequencing. To achieve this, the quantities of two primers for the 3’ end are so adjusted
20
in the reaction mixture that one of them is exhausted before the termination of PCR reaction.
As a result, in the last cycles, only a single strand of DNA is copied which are the ideal
starting materials for DNA sequencing.
Inverse PCR
Inverse PCR is used to clone sequences flanking a known sequence. Flanking sequences are
digested and ligated to make a circular DNA. PCR primers pointing away from the known
sequences are used to amplify the flanking sequences. Inverse PCR (IPCR), described by
Ochman et al in 1988, is a method for the rapid in vitro amplification of DNA sequences that
flank a region of known sequence. The method uses the polymerase chain reaction (PCR), but
it has the primers oriented in the reverse direction of the usual orientation. Inverse PCR has
many applications in molecular genetics, for example, the amplification and identification of
sequences flanking transposable elements.
Degenerate PCR
Degenerate PCR is in most respects identical to ordinary PCR, but with one major difference.
Instead of using specific PCR primers with a given sequence, you use mixed PCR primers.
That is, if you do not know exactly the sequence of the gene you are going to amplify, you
insert "wobbles" in the PCR primers where there is more than one possibility. Degenerate
PCR primers used to amplify unknown DNA sequences based on similar known sequences.
Primers which have a number of options at several positions in the sequence so as to allow
annealing and amplification of a variety of related sequences.
Introduction of restriction endonuclease sites in the PCR product
In another variation, the recognition sequence for a given restriction endonuclease are added
to the 5’ ends of both the primers used in the PCR amplification of the desired DNA segment.
The recognition sequence will remain as single stranded tails when the primers are annealed
to the original template DNA. This results in amplification of desired segments of DNA
along with inbuilt endonuclease sites which are used during subsequent cloning of the
amplified segment in different vectors.
PCR to introduce mutations at specific sites- Site Directed Mutagenesis
PCR can be used to induce desired mutations at specified sites in any gene. The sequence of
the site where mutations has to be created is used to make oligonucleotide primers. The
mutations in the desired gene segment are introduced by designing primers with desired
changes in their base sequence. These modified primers are used in PCR amplification of the
desired gene. Thus, the new copies of the gene so generated, carry the appropriate mutations
at specific sites, which are their in the base sequence of primers.
Vectors
Plasmids
The word “plasmid” was introduced by ‘Joshua Lederberg’ in 1952 and is defined as an
extrachromosamal circular DNA molecule which are not part of the bacterial genome. A
plasmid is an independent, circular, self replicating DNA molecule that carries few genes
only. A plasmid is usually not essential for the host cellexcept under some specific
conditions. The numbers of plasmids within a cell generally remains constant from
generation to generation and have evolved mechanisms to partition plasmid molecules
accurately to daughter cells during replication. Plasmids are usually dependent on host
synthesized enzymes and proteins for their replication and transcription. Their size ranges
21
from 1-2000 kilo base pairs and they carry additional functions that are advantageous to the
host such as they produce enzymes which degrade antibiotics or heavy metals or complex
organic compounds or produce colicins, enterotoxins etc or produce restriction and modifying
enzymes. Replication of plasmid DNA is coupled with the host replication machinery in two
different ways:
Stringent control: in which one or two plasmids are made during every round of host DNA
replication
Relaxed control: where 10-200 copies of the plasmid is made during each round of bacterial
replication.
The three most widely studied types of plasmids are:
1. F plasmids: responsible for conjugation
2. R plasmids: carry genes for resistance to antibiotics
3. Col plasmids: code for colicins, the proteins that kill sensitive E.coli cells and also
carry genes that provide immunity to the particular colicin
Plasmid incompatibility
Plasmid incompatibility arises when two different plasmids share same replication
machinery. Then the two plasmids compete with each other during replication and
subsequent step of portioning into daughter cells. Thus, plasmids carrying the same replicon
belong to the same incompatibility group. These plasmids have to be maintained by using
antibiotics as selection pressure.
Plasmids as cloning vector
A vector is any DNA that can take foreign DNA by ligation using complementary site in the
plasmid. These sites are generated by digesting the DNA & vector with the same Restriction
enzymes. It should be ensured that the restriction enzymes used for cloning purpose should
be present only once in the plasmid so that a single linear molecular can be generated and
sites should not be present in the foreign DNA to be inserted. Vectors contain genetic
markers that confer strong growth advantages under selective conditions. The plasmid DNA
containing foreign DNA as insert is usually introduced artificially by transformation.
Selection is then carried out by screening for the colonies those have acquired the plasmid
and hence grow on Antibiotic containing plates. Routinely used Antibiotics include
Ampicillin, Kanamycin, chloramphenicol, carbenicillin & tetracycline.
Most versatile cloning vectors should have small size, high copy number and are able to
accept fragments of foreign DNA generated by cleavage with a wide variety of Restriction
enzymes. Smaller size of plasmid facilitates better stability and higher efficiency of
transformation. Earlier used plasmids in cloning, either replicate poorly or carried unsuitable
selectable markers and very few restriction sites. The first plasmid with all desirable features
was pBR322 that has relaxed mode of replication with high copy number, ampicillin &
tetracycline as selectable markers with important restriction enzyme sites. Pst I and Pvu I are
located within ampicillin resistance gene and BamH I and Sal I are located within tetracycline
resistance gene. The presence of restriction sites within the antibiotic markers tetracycline
and ampicillin permits an easy selection for cells transformed with pBR322. insertion of
foreign DNA fragment into the plasmid using restriction enzymes Pst I or Pvu I, places the
DNA insert within ampicillin resistance gene and makes it non functional. Bacterial cells
22
containing such a recombinant will be unable to grow in the presence of ampicillin but grow
well on tetracycline containing plates. Similarly when BamH I or Sal I is used, the DNA
insert is placed within the tetracycline resistance gene making it non functional. Such
recombinants grow well on ampicillin but not on tetracycline. Thus it allows an easy way of
selection of a single bacterial cell having recombinant pBR322 from 108 other wild type
pBR322 containing cells. The name PBR denotes the following: p signifies plasmid, B is
from Boliver and R is from Rodriguez, the two scientists who developed pBR322. Nowadays,
it is the most widely used cloning vehicle and many descendant forms are extracted from it.
Another series of vectors, pUC vectors (derived its name from the place where it was
developed, University of California) revolutionized the cloning strategies. The number of
restriction enzyme sites was expanded. Moreover, it is the first plasmid to contain a series of
synthetic cloning sites, termed as polylinkers, MCS (Multiple Cloning Sites) or polycloning
sites eg. pUC19 has an array of 13 restriction enzyme sites which are very unique and can be
used efficiently for cloning.
Advantages of pBR322 over pUC series of vector
pBR322 carried two different selectable markers, tetracycline and ampicillin. Insertion of
foreign DNA sequences into one of these sites inactivates either of the two markers. The
recombinant plasmids can therefore be distinguished from the empty parental vector by virtue
of its ability to grow only one Antibiotic. This phenomenon is called as Insertional
inactivation which is not possible in pUC vectors which carry only one Antibiotic resistance
gene, usually for Ampicillin and a clustered set of MCS. But pUC series of vector
demonstrate a phenomenon called as α complementation. The recombinant plasmids can
readily be distinguished from parental pUC plasmid by visualizing the color of bacterial
colonies. PUC vectors and their derivatives carry a short segment of E.coli DNA that has
regulatory sequences of lacz gene and code for –NH2 termini of β-galactosidase (called as α
fragment). The fragment expressed by pUC vector (α), when introduced in E.coli strains that
contain inactive C- terminal fragment (ω-fragment), complements for the β-galactosidase
activity. But if a segment of foreign DNA is cloned into the site of pUC vector, carrying α
fragment, the coding region gets disrupted and α complementation is abolished. Thus,
bacterial colonies containing recombinant plasmids are therefore ampicillin resistant but
contain no β –galactosidase activity. Contrary, to this, the parental plasmid carrying colonies,
will also be ampicillin resistant but are able to hydrolyze chromogenic substrate X-gal (5bromo-4-chloro-3-indole- β –D-galactoside) by cleavage of β –galactoside bond. Thus, the
plate will contain two visually distinct colonies: The parental non recombinant deep blue
colonies and recombinant white colonies.
Plasmid vectors with origin of replication derived from single stranded Bacteriophages
M13 or f1 are single stranded filamentous bacteriophage. Many routinely used vectors
contain origin of DNA replication derived from these bacteriophages. Such vectors are often
called phagemids. They confer advantage of two separate modes of replication, one as
conventional double stranded DNA and other as a template to produce single stranded copies
of one of the phagemid strand. Thus single strands of DNA can be packed into the phage
particles. The production of single stranded DNA is induced when bacteria carrying
phagemid are infected with a helper bacteriophage that can supplement for the genes required
for production of virions in trans. The orientation of the origin determines which of the two
strands will be encapsulated in the virus particles. There are many plasmid vectors that carry
promoters derived from bacteriophages like T3, T7 or SP6, adjacent to the MCS. Hence, the
23
foreign DNA so inserted can be transcribed in vitro in the presence of specific RNA
polymerase and Ribonuclease precursors. Presence of bacteriophage promoters on both sides
of MCS allows either of the stands to be transcribed.
Expression plasmid vectors
There is an array of plasmid vectors that can generate large amounts of mRNA in vivo from
cloned foreign genes. The requisite features of a plasmid to serve as an expression vector
are:
a)
Minimal level of basal expression under repressed condition.
b)
Shine Dalgarno sequence upstream of ATG initiation codon.
c)
Presence of tags which are recognized by specific antibodies. Epitope tagged
proteins thus can be visualized by probing with antibodies directed against tags
and can be purified from rest of the host protein using affinity chromatography.
d)
An appropriate system for fast induction of expression of cloned gene.
Cloning
PCR Mediated Gene Cloning
There are three strategies for cloning PCR products1) T/A cloning is the easiest cloning method. T/A cloning takes advantage of the terminal
transferase activity of Taq polymerase and other non-proofreading DNA polymerases
which adds a single 3'-A overhang to each end of the PCR product. The resulting PCR
product is then ligated into a linear vector with a 3´ terminal 'T' or 'U' at both ends.
2)
Directional cloning. A restriction enzyme target site is introduced into each of the PCR
primers. The resulting PCR product and cloning vector are digested with the restriction
enzymes to generate complementary ends at the PCR product and the vector which are
then ligated.
3)
Blunt-end cloning. Blunt-end PCR product generated by proof-reading polymerase
such as the Pfu DNA Polymerase can also be cloned into a blunt-end vector.
The cloning of PCR-amplified fragments into a linear vector is typically a rapid and efficient
process. However, not all PCR fragments will clone with the same efficiency into the same
vector. These differences may be due to fragment size, insert toxicity, and the complexity of
the insert. The size of the fragment being cloned is a primary contributor to the overall
cloning efficiency. Large fragments of DNA (≥ 5 kb) are amenable to cloning in high-copy
number vectors, yet at a much lower efficiency. Optimization of molar concentration ratios of
the vector to insert is critical to ensure efficient cloning. Successful cloning ratios may range
from 1:1 to 1:10. For example, if the vector is 3 kb and the insert is 1 kb, one-third the
amount of insert needs to be added to attain a 1:1 molar ratio.
Cloning using plasmid vectors
Cloning in brief is cleaving closed circular plasmid DNA with one or more Restriction
enzymes and ligating it in vitro to foreign DNA bearing compatible terminus. This is then
transformed into an appropriate E.coli strain and the transformants are then screened for
24
recombinants by either PCR using gene specific primers or restriction enzyme digestion to
check for the presence of desired gene fragment in the vector.
Cloning of DNA with overhangs
Restriction enzymes are known to cleave the DNA within their recognition sequences but
asymmetrically and thus generate 1-6 base pair long overhangs or protruding ends. When the
ends protruding from foreign DNA fragment and the vector are compatible, they can anneal
by hydrogen bonding and form a circular recombinant plasmid that can be easily transformed.
Annealing reaction involves bringing of 5’ phosphate and 3’ –OH residues together and
finally formation of a phosphodiester bond catalyzed by the enzyme DNA ligase.
Directional cloning
1. Both ends cohesive (protruding) and separately matched
This situation arises when the foreign DNA fragment is produced by digestion with two
restriction enzymes having different Recognition sequences. Hence the foreign DNA
fragment will be non complementary and unable to ligate to each other. But it will ligate to
the plasmid vector which is cleaved with the same two restriction enzymes. This will
generate circular recombinants containing a single insert in a predefined orientation. Also in
this situation, only the recombinant vector, containing the insert, will circularize since the
ends of the vectors are non compatible to ligate to each other, as they are generated by cutting
with two different restriction enzymes. The scheme of directional cloning is shown below in
Figure2.
2. Both ends cohesive and compatible
This situation arises when both vector and insert are cleaved using the same restriction
enzyme. Both the vector DNA and the foreign DNA are mixed under annealing conditions
which allow pairing between cohesive ends of the vector and the insert DNA. The DNA
insert can go in either of the two orientations. The orientation of the inserted DNA is
significant when the expression of the insert is required. In addition to the formation of
recombinant vector few undesirable products will also be formed like: the cohesive ends of
the vector will itself pair to form unaltered circular vector. Similarly, the two ends of the
insert DNA can also join to form concatemers, which may not pose a problem since it lacks
an origin of replication and will therefore dilute out. The generation of re-circularized vector
DNA can be prevented by treating the digested vector with alkaline phosphatase which
removes 5’ phosphate present at the ends of the vector.
Cloning blunt ends
Ligation of foreign DNA to a linearized plasmid, both carrying blunt ends, is comparatively
an inefficient reaction. Only at high concentrations of vector, insert and enzyme, blunt ended
molecules can be ligated and phosphodiester bonds can be made between these residues. The
DNA insert will get ligated in either of the orientation. Usually PEG 8000 at the
concentration of 5 % is used to increase macromolecular crowding.
25
EcoR I
BamH I
GAATTC
CTTAAG
GGATCC
CCTAGG
Plasmid
Cut with EcoR I and BamH I restriction enzymes at the respective sites shown, arrows indicate the base at
which the enzyme cuts the DNA molecule
5’ (P)
G
CTTAA 3’OH
3’ OH
AATTC----G
G---CCTAG
GATCC
5’ P G
Plasmid (vector digested with BamH I and EcoR I)
Ligated to
EcoR I
G GAATC
CTTAA G
GGATCC
CCTAGG
BamH I
Cut foreign DNA with BamH I and EcoR I
Figure2
Methods for Selection and/or Screening of Desired Recombinants
Selection refers to applying conditions that allow the desired cells or phages (containing
vector or vector and insert) to replicate while preventing others from replicating. Typical
selections include:
ƒ antibiotic sensitivity and resistance;
ƒ nutrient requirements;
ƒ plaque formation.
Screening allows all cells to grow, but tests the resulting clones for particular properties.
Often these are indicative of the presence of an insert in the vector. Screening properties used
include:
ƒ antibiotic sensitivity and resistance;
ƒ nutrient requirements;
ƒ plaque type;
ƒ blue-white selection ( -galactosidase activity);
ƒ specific (by nucleic acid hybridization, PCR with gene specific primers or
antibodies).
26
Screening of Clones
By PCR
This is the fastest way to screen bacterial colonies. A primer pair is chosen which has a
sequence derived from vector (flanking primer) and a gene specific primer. This allows going
to the replica agar plate on the same day and setting up miniprep cultures of the likely
candidate colonies.
By Hybridization with radiolabeled probe
1.
2.
3.
Lysis of Colonies and Binding of DNA to Nitrocellulose Filters
The colonies are lifted from agar plates onto nitrocellulose membranes, lysed and
neutralized and baked with the bacterial DNA fixed to the membranes.
Labeling of DNA-Probe with 32P
Using DNA polymerase, templates, and primers in combination with radiolabeled
nucleotide (either [a-32P]dATP or [a-32P]dCTP), a radioactive probe is synthesized. The
procedure works with either exact size templates or any (random or specific) primers or
with templates within a vector and specific primers.
Hybridization to Nitrocellulose Filters Containing Replicas of Bacterial Colonies
After the DNA has been fixed to the filters by baking, wash step removes bacterial
debris off the filter surface. Filters then are prehybridized to block binding sites, and the
radiolabeled probe added thereafter will compete for specific binding sites. After
washing, the air died filters are exposed to an X-ray film which shows positive colonies.
Host vector relationship in cloning of the genes
Any DNA cloning procedure consists of:
Generation of DNA fragment for cloning: the source can be genomic DNA, cDNA generated
from mRNA or chemically synthesized gene fragment.
Modification of the termini of DNA and vector to make them compatible with each other
Ligation of the DNA fragment and vector
Introduction of the recombinant DNA into a suitable host
Selection and screening of the recombinant transformants
Cloning can be viewed from two different aspects:
Homologous gene cloning: cloning of a gene into the host from which the genetic
information was derived
Heterologous gene cloning: cloning of a gene into a new host strain
Following are the hosts that are widely used for cloning of genes:
- Bacteria (E. coli)
- Yeast (S. cerevisiae)
- Insects (D. melanogaster)
- Plants (Arabadopsis thaliana)
- Mammals (Mus musculus)
A cloning host must:
- be easy to handle and propagate
- have a defined genotype
- accept a range of vectors
27
DNA cloning requires a bacterial host strain with high transformation efficiency, which does
not cleave the recombinant DNA, provides good yields of high quality plasmid or phage
DNA, and consists of components for various vector screening systems. An expression strain,
on the other hand, must have the appropriate regulatory proteins for expression of cloned
genes—a requirement that may not be compatible with attributes of cloning strains. To
improve yields of expressed proteins from the cloned genes, expression strains are often
deficient in some proteases. For the pET system, expression hosts are λDE3 lysogens, which
have the inducible gene for T7 RNA polymerase.
E. coli is often used as the host cell for expression of foreign genes because the control of
gene expression has been studied most extensively and the organism is most easy to grow and
manipulate. The inclusion of regulatable promoters in expression vectors allows the
expression of foreign gene products that are even toxic to the cells/host. The expression of the
eukaryotic genes in E. coli requires that the coding region should be placed downstream from
a strong E. coli promoter since eukaryotic promoters are not easily or weakly recognized by
the E. coli RNA polymerase. Few strong promoters which are used generally for the
expression of foreign genes in E. coli are bacteriophage lambda PL, bacteriophage lambda PR,
lac, trp, tac (trp-lac hybrid). Few limitations of using E. coli for the expression of foreign
genes are:
No enzyme to process eukaryotic introns in prokaryotes and therefore only cDNAs
and intronless genes can be expressed in bacterial cells
No post translational modification like glycosylation and phosphorylation and any
protein which requires these modifications for its biological activity will be virtually
inactive.
Bacterial cells lack proteolytic enzymes found in eukaryotic cells that are able to
process the precursor peptides and release mature form of the proteins.
A number of features have been incorporated into E. coli DNA cloning strains to facilitate
transformation of the initial ligation reactions, screening for recombinants, stability of the
recombinant plasmid, and high quality plasmid DNA preparations.
Genetic features to look for are:
1. lacZDM15 encodes an enzymatically inactive b-galactosidase lacking amino acids 1141. Some cloning vectors encode the deleted b-galactosidase amino acids (lacZa or
lacZ´), which, when expressed, can associate with the inactive protein in the host to
generate active enzyme. The combination of host strain and vector allow blue/white
screening for recombinant plasmids by a-complementation.
2. recA encodes a protein essential for homologous recombination in E. coli. Plasmids in
host strains deficient in recA are less likely to incur deletions or other rearrangements.
Plasmid DNA prepared from these strains contains monomer circles and lack the
multimers common in preparations from RecA+ strains.
3. endA encodes a thermostable DNase that can degrade plasmid DNA during
purification. Host strains deficient for endA allow higher yields of plasmid DNA.
4. hsdRMS encodes the enzymes responsible for the Eco restriction-modification system.
For transformation with unmodified recombinant plasmid DNA, strains must be
restriction minus.
5. mcrAB encode enzymes that restrict foreign DNA.
28
Bacteriophage vectors
A phage (also called bacteriophage) is a small virus that infects only bacteria . Like viruses that
infect eukaryotes , phages consist of an outer protein coat and the enclosed genetic material
(which consists of double-stranded DNA in 95% of the phages known and few have RNA as
the genetic material) of 5-650 kbp (kilo base pairs ) with a length of 24-200 nm . The vast
majority of phages (95%) have a tail which has the ability to bind to specific molecules on
the surface of their target bacteria and thus inject their genetic material into the host. Phages
were discovered independently by Frederick Twort in 1915 and by Felix d’ Herelle in 1917.
Phages infect only specific bacteria. Some phages are virulent, that is upon infecting a
bacterial cell, they immediately start reproducing, and within a short time, lyse (destroy) the
bacterial cell, releasing new phage particles. These young phages burst from the host cell
(killing it) and infect more bacteria. These types of phages are called as Lytic phages. Some
phages (so-called temperate phages) can instead enter a relatively harmless state, either
integrating their genetic material into the chromosomal DNA of the host bacterium (much
like endogenous retrovirus in animals) or establishing themselves as plasmids. These
endogenous phages, referred to as prophages, are then copied with every cell division
together with the DNA of the host cell. They do not kill the cell, but monitor (via some
proteins they code for) the status of their host. When the host cell shows signs of stress, the
endogenous phages become active again and start their reproductive cycle, resulting in the
lysis of the host cell. An example is phage λ of E. coli . Sometimes, prophages even provide
benefit to the host bacterium while they are dormant, by adding new functions to the bacterial
genome , a phenomenon called Lysogenic Conversion . A famous example is the harmless
Vibrio bacteria strain, which is turned into Vibrio cholerae by a phage, causing cholera.
Phages play an important role in molecular biology as cloning vectors to insert DNA into
bacteria.
Lambda-Replacement Vector
Bacteriophage λ with its large and complex genome, for use as a vector, would seem to be
impossible. Wild type DNA has multiple sites for most useful restriction enzymes but they
are located in the genomic DNA essential for lytic growth of virus. Initially it was thought,
that since bacteriophage λ particles will not accommodate DNA larger than the viral genome,
therefore suitable only for small foreign fragments. Methods have developed large number of
vectors that can accept and propagate fragments of foreign DNA generated by a variety of
different Restriction enzymes. The λ genome remains linear inside the phage head but
circularizes as soon as bacteriophage DNA goes inside the host E.coli. The single stranded
protruding ends of 12 bases, called as cohesive site, (Cos site), anneal to form a closed
circular molecule necessary for replication. The sealed cohesive ends are the sites of cleavage
during packaging of mature DNA into phage heads. The λ DNA must be greater than 38kb
and smaller than 52kb to be packaged into phage head. The genes for lysogeny are located in
this segment which is removed to make high capacity vectors which can accommodate larger
size of foreign DNA.
Vectors that have a pair of cloning sites flanking a segment of non-essential DNA are known
as replacement vectors. Removal of the central non essential stuffer fragment reduces the size
of vector to a size that cannot be packaged into bacteriophage particles. Particles can be
packed only by ligating a segment of foreign DNA. The range of DNA that can be packaged
lies between 8kb–24 kb. If one wants to clone longer DNA fragments, deletion of essential
bacteriophage genes, is required, which then have to be supplied in trans. Cloning in
bacteriophage vectors is advantageous since for successful packaging, there should be enough
29
DNA. No infectious particles are obtained that lack an insert or contain an insert that is below
the lower size limit for a particular vector. Inserts with DNA too large are also not
successfully cloned. Therefore packaging into phage heads of the vector DNA after DNA
insertion offers an efficient selection strategy for recombinant vectors. Thus, the size of the
vector and insert DNA should be carefully adjusted for an effective selection. Few examples
of vectors that employ this strategy are bacteriophage λ vectors EMBL3 and EMBL4.
Cloning in bacteriophage vectors
The vector contains a restriction site (e.g.,an EcoRI site) for insertion of cloned DNA. In addition, cos sites
which are required for packaging DNA into phage particles, are present on both ends of the vector DNA. Insert
DNA is ligated to the vector, and the recombinant molecules are packaged into phage particles when mixed
with phage proteins. The recombinant phages are then used to infect E. coli. Each recombinant phage, which
carries a unique insert of cloned DNA, forms a single plaque in the infected bacterial culture. Progeny phage
carrying unique DNA inserts can then be isolated from individual plaques and grown in large quantity.
This type of vector has a loading capacity of 10-23 kb and consists of a full length lambdamolecule with two identical restriction sites flanking a non-essential region called stuffer
fragment. The idea behind it is that the stuffer fragment is deleted and replaced by foreign
DNA. Again a selection system is required to differentiate between wild type and
recombinant phage.
30
This is done by placing relevant genes onto the stuffer fragment the loss of which gives rise
to a detectable phenotypic signal. Two examples are given:
1. LacZ inactivation: This is done by inserting the lacZ gene onto the stuffer fragment. Loss
of the fragment and therefore lacZ can be monitored by the blue/white colour
discrimination.
2. Spi-phenotype: In this case the stuffer fragment carries the phage genes red and gam
necessary for lambda-DNA replication. Whereas the wild type lambda is not capable of
infecting a E.coli P2 lysogen due to the presence of red and gam (Spi phenotype in short
for sensitive towards P2 prophage inhibition), loss of red and gam in all recombinant
phage particles results in plaque formation on a E.coli P2 lysogen; hence all plaques
should contain recombinant phage molecules! The most popular lambda-replacement
vector for library construction is EMBL3A with Sal I, BamH I and EcoR I sites flanking the
stuffer fragment.
Insertion Vectors
Vectors that have a single target site for insertion of foreign DNA are known as insertion
vectors. 20% DNA that is not required for lytic growth is removed and therefore insertion of
foreign DNA resumes the size back to something like its full length and can be packaged in
vitro. Maximum size of DNA that can be accommodated varies from 9- 11 kb.
For DNA of larger sizes, high capacity vectors are designed like:Vector
Capacity
Origin of replication
Host
Cosmid
BAC
YAC
30-45 kb
120-300
250-400
Col E1 Replicon
Foreign
ARS
E. coli
E. coli
Yeast
Cosmids
Smallest capacity for foreign DNA, contain one or two copies of small region of
bacteriophage λ that is the cohesive end site (Cos), containing all the cis-acting elements
required for packaging of viral DNA into virions. The vector also contains sequences
required for terminase so that it can bind and cleave and under appropriate conditions the
DNA is packaged into empty λ phage particles. It should have unique restriction sites, a
replication origin and selectable markers. In vectors such as supercos-1, two cos sites are
there within Xba I site. The cosmid vectors are opened up by appropriate restriction enzymes
and then mixed with the DNA inserts prepared by using same enzymes under annealing
conditions. Long concatemers so generated are products for packaging into the λ heads. This
selects the DNA to be inserted since for packaging in λ particles, the distance between two
cos sites must be between 38 to 52 kb. Cosmids can accommodate up to 45 kbp long DNA
inserts. Cosmids infect the host cells like λ particles but once inside the host, it will replicate
and propagate as a plasmid. They have been developed in the late 1970s and have been
improved significantly since. They are predominantly plasmids with a bacterial origin, an
antibiotic selection marker and a cloning site, but they carry one, or more recently two cos
sites derived from bacteriophage lambda. The loading capacity of cosmids varies depending
on the size of the vector itself but usually lies around 40-45 kb. The cloning procedure
involves the generation of two vector arms which are then joined to the foreign DNA.
31
Selection against wild type Cosmid DNA is simply done via size exclusion. The strategy used
for cloning in Cosmid vectors is shown below:
Cloning using Cosmid vectors
The cosmid is cut at a Bgl II site next to the cos site. Donor genomic DNA is cut using
Sau3A, which gives sticky ends compatible with Bgl II. A tandem array of donor and vector
DNA results from mixing. Phage is packaged in vitro by cutting at the cos site. The cosmid
with inserts recircularizes once it is inside the bacterial cell.
Bacteriophage Artificial Chromosomes (BACs)
These plasmids are circular DNA molecules carrying conventional antibiotic resistance
marker, origin of replication derived from the F factor of E.coli, an ATP driven helicase
(repE) to facilitate DNA replication and three loci (parA, parB and parC) for proper
partitioning of the plasmid to daughter cells. BAC vectors have no packing constraints and
there is no fixed limit to the size of genomic DNA that they accept. Usually the size of DNA
is approximately 120-kilo base pairs.
32
Yeast Artificial Chromosome (YACs)
These are linear DNA molecules similar to yeast chromosome. Recombinant YACs are made
by ligating large fragments of genomic DNA and then the resultant plasmid is introduced into
yeast by transformation. The vector carries selection marker, DNA sequences called as
telomere, so that the product can be stabilized inside the yeast cell, an origin of replication
called autonomous replication origin, ARS. Large size of DNA can be inserted into YAC
vectors, usually between 250kilobases to 400kilobasepairs. Large size of mammalian
genomic libraries is also made with approximately 1 megabasepairs of foreign inserts.
Insertion of foreign DNA into the cloning site inactivates a mutant expressed in vector DNA
and formation of red rather than white colonies by yeast strain is observed. Thus
transformants are identified as red colonies, which grow in yeast that is mutant for TRP1 and
URA3, which ensure that the cell has received an artificial chromosome and with both the
telomeres since it is complimented for both the mutations. And the colony also contains
foreign DNA because it is red in color.
TEL
TEL
TRP1
ARS1
SUP4
CEN4
URA3
EcoR I
Digest with ECOR I and treated with phosphatase and ligation with restriction digested genomic DNA
TEL
EcoR I
TRP1
ARS1
EcoR I
50-2000kb fragment
TEL
SUP4
URA3
Shuttle vectors
Cloning of foreign DNA is usually carried out primarily in E.coli since the organism is most
thoroughly studied. But subsequent work often requires the foreign segment to be delivered
to different host cells like eukaryotes. A number of vectors are devised to satisfy this
requirement. These vectors are termed as shuttle vectors. These vectors have origins of
replication of various hosts. The also contain fragments of eukaryotic viruses to facilitate
entry into the cell or expression or integration in the cell itself. Thus shuttle vectors allow
DNA to be transferred between two different species where it can be propagated by utilizing
both the origins of replication. Usually the origins of replication are derived from bacterial
and eukaryotic systems. Shuttle vectors also carry antibiotic resistance genes, which are
functional in eukaryotes e.g. Neomycin (G418), Hygromycin, Methotrexate etc. All the DNA
manipulation and characterization are done in prokaryotic system and then the manipulated
DNA is introduced into the eukaryotic systems for protein expression and functional analysis.
Eukaryotic host systems are better for expression of protein for few reasons:
1. Proper folding of the protein to attain functional activity
33
2. Posttranslational modification of proteins for which prokaryotes does not possess any
machinery. The most conventional and convenient model system for expression of
eukaryotic proteins is yeast, Pichia pastoris, which is both genetically and
physiologically well characterized.
Plant Vectors
A vector is a circular DNA molecule capable of independent existence and replication within
a host cell. In case of plants, Ti and Ri are the two most commonly used plasmids which are
used as vectors. Plant cells as such do not possess any endogenous plasmids. But two
plasmids called pTi and pRi, are present naturally in the bacteria, Agrobacterium tumefaciens
and Agrobacterium rhizogenes, respectively. These plasmids provide a naturally occurring
transformation system. A part of the plasmid DNA, called as T-DNA, is transferred into the
genomes of most dicot and some monocot plants. pTi stands for tumor inducing plasmid and
pRi stands for root inducing plasmid. The infection of Agrobacterium tumefaciens is
mediated by transfer of a segment of pTi called as T-DNA into the plant cell. Various
bacterial chromosomal genes, such as chvB, exo genes, cell genes, are concerned with the
biosynthesis of cell attachment polysaccharides due to which the bacterial cells adhere firmly
to the plant cells. While two chromosomal genes are expressed constitutively in bacterial
cells that is expressed at all the times inside a cell, which are responsible for virulence
associated aspects, Agrobacterium tumefaciens Ti plasmid produces tumor like growth from
which roots / shots may sometimes be produced. The infected cells are able to grow in culture
on a medium devoid of any growth regulator while uninfected normal plant cells need
exogenous auxin or cytokinin. These plasmids also carry genes for IAA (Indole Acetic Acid auxin) and cytokinin production which is the reason for indefinite growth on a growth
regulator free culture medium. When pTi is introduced into Rhizobium trifolii, it gains the
ability to produce galls and to utilize opines. The crown gall root cells also synthesize unique
nitrogenous compounds called opines, which are not produced by normal plant cells, which
are not infected, nor are they utilized. The infected cells use opines as their carbon and
nitrogen source. The type of opine produced depends on the bacterial strain. Agrobacterium
tunefaciens strain produces either octopine or nopaline which the Agrobacterium rhizogenes
produce either agropine or mannopine.
Ti plasmid
Ti plasmid is a large mega plasmid conjugative plasmid of ~200kb. pTi is lost when
Agrobacterium is grown above 28oC, such cured bacteria do not induce crown galls that is,
they become avirulent. pTi and pRi, although do not share sequence homology but are unique
in following respects:a) They contain some genes, which are located within their T-DNA which has regulatory
sequences recognized by plant cells, while their remaining genes have prokaryotic
regulatory sequences. As a result, the former are expressed only in plant cells but not
in the Agrobacterium, while the latter are only expressed in the bacterium.
b) These plasmids naturally transfer a part of their DNA, called as T-DNA, into host
plant cells. The T-DNA usually contains following important functional regions.
1. T-DNA contains oncogenes and opine synthesis genes and is transferred into
host plant.
2. Vir region which regulates the transfer of T-DNA
3. Opine catabolism genes for utilization of opines.
34
4. Origin of replication for propagation in Agrobacterium.
The T-DNA contains a 24bp direct repeat border sequence and contains the genes necessary
for tumor / possess gene for auxin and cytokinin biosynthesis. All the genes present in TDNA have eukaryotic regulatory sequences. As a result, these genes are expressed only in
plant cells but never express in Agrobacterium. The vir region mediates the transfer of TDNA into plant genomes and hence is essential for virulence. The genes of vir region are not
transferred but induce the transfer of T-DNA. Also, the genes present in T-DNA are not
responsible for its transfer, but the 24 bp direct repeat at both the left and right ends of TDNA is essential for the transfer. The exact mechanism of transfer of T-DNA is not known
clearly known but is brought by the vir region. The phenols produced by wounded plant
tissue initiates the transfer process. The T-DNA is transferred into the plant cells as single
stranded DNA, which increases the efficiency of its transformation. But, as soon as it enters
into the plant cell, it is immediately converted into a double stranded form. This form
integrates at random sites in the host plant genome by a phenomenon called illegitimate
recombination, which are due to sequence of homology in short segments of the host DNA.
This integration is usually in low copy numbers. Few vectors are derived from pTi (wild
type) due to some problems posed by wild type plasmid eg. The presence of oncogenes
causes a disorganized growth, their large size and lack of cloning sites within the T-DNA,
which are needed for the insertion of DNA segments that has to be cloned.
Disarming
Disarming is the removal of oncogenes from T-DNA and it has resolved many problems.
This disarmed plasmid can still transfer its T-DNA into plant cells. But the cells containing
this modified T-DNA will be non tumorous, produce opines and generate plantlets. Only the
border sequences are necessary for the transfer of any DNA insert placed between them. Thus
pTi and pRi which are disarmed are more in use for gene transfer experiments. But in some
plants, the efficiency of transformation by disarmed pTi is much lower than the wild type
pTi.
Introduction of DNA into plant cells without the involvement of a biological agent like
Agrobacterium leading to a stable transformation is known as Direct Gene Transfer. The
spontaneous uptake of DNA is quite low. The various methods those are utilized for direct
gene transfer are:
1. Chemical methods like PEG, Calcium phosphate
2. Electroporation
3. Particle gun delivery
4. Lipofection
5. Microinjection and Macroinjection
35