Download Review Molecular Biology in Arteriosclerosis Research

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Epigenetics of human development wikipedia , lookup

Messenger RNA wikipedia , lookup

Epigenetics of diabetes Type 2 wikipedia , lookup

Cancer epigenetics wikipedia , lookup

Zinc finger nuclease wikipedia , lookup

Extrachromosomal DNA wikipedia , lookup

Gene therapy wikipedia , lookup

Cell-free fetal DNA wikipedia , lookup

No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup

Protein moonlighting wikipedia , lookup

Genetic code wikipedia , lookup

Epigenomics wikipedia , lookup

Metagenomics wikipedia , lookup

Human genome wikipedia , lookup

Gene nomenclature wikipedia , lookup

Deoxyribozyme wikipedia , lookup

Epigenetics of neurodegenerative diseases wikipedia , lookup

Genetic engineering wikipedia , lookup

DNA vaccination wikipedia , lookup

Epitranscriptome wikipedia , lookup

Non-coding DNA wikipedia , lookup

Genome (book) wikipedia , lookup

Gene therapy of the human retina wikipedia , lookup

Cre-Lox recombination wikipedia , lookup

Molecular cloning wikipedia , lookup

Nucleic acid analogue wikipedia , lookup

Gene expression profiling wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Genomic library wikipedia , lookup

Primary transcript wikipedia , lookup

Gene wikipedia , lookup

Genomics wikipedia , lookup

NEDD9 wikipedia , lookup

History of genetic engineering wikipedia , lookup

Genome editing wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

RNA-Seq wikipedia , lookup

Microevolution wikipedia , lookup

Designer baby wikipedia , lookup

Point mutation wikipedia , lookup

Helitron (biology) wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Transcript
Review
Molecular Biology in
Arteriosclerosis Research
David L. Williams
Downloaded from http://atvb.ahajournals.org/ by guest on June 18, 2017
The topics discussed In this article illustrate how molecular biology will have a
dramatic Impact on arteriosclerosis research. DNA clones for a small number of
relevant proteins have been isolated, and studies are underway in numerous laboratories to extend these initial studies. The techniques of molecular biology will provide
major advances in our understanding of numerous proteins directly or indirectly
involved in the atherogenlc process. Cloning technology will solve the primary structures of many proteins that can not be purified In quantities sufficient for classical
methods of analysis. Studies of regulation will benefit from the availability of DNA
probes, the ability to generate site-directed antibodies, and the use of reverse genetics to Identify nucleic acid sequences involved in the regulation of gene expression.
Studies of gene structure and genetic polymorphisms will unravel the genetic basis
for defects In llpld and lipoprotein metabolism and should provide valuable reagents
for clinical screening and diagnosis. The reverse genetics approach will permit the
systematic analysis of structure-function relationships at the protein level In a manner
not previously possible. Each of these will contribute to our understanding of the
atherogenic process and should provide insight into ways of preventing and treating
arteriosclerosis. (Arteriosclerosis 5:213-227, May/June 1985)
he purpose of this article is to explore how molecT
ular biology can have an impact on research in
the broad field of arteriosclerosis. The past few years
in this area. In the very near future, clones for many
more proteins of importance to arteriosclerosis will
be isolated. In addition to the initial information from
cloning experiments regarding protein, mRNA, and
gene structure, these studies will form the foundation
for the analysis of important macromolecules at a
level of understanding not previously available. This
is true in terms of structure and structure-function
relationships as well as in terms of regulation. Furthermore, it is likely that molecular biology will contribute significantly to the clinical side of arteriosclerosis in diagnostics and ultimately in therapeutics.
The subsequent discussion aims to illustrate the
above points by proceeding from present knowledge
to the fantasy of what may be in the not-too-distant
future. First, as a general introduction, strategies for
the isolation of genes will be presented. Second, the
most immediate consequence that molecular biology will have on arteriosclerosis research, that is, the
unraveling of primary sequences of key proteins, will
be considered. Third, molecular genetics, including
aspects of gene structure, regulation, genetic polymorphisms, and reverse genetics will be discussed.
These topics represent the potential that arteriosclerosis research will realize from the application of the
techniques of molecular biology.
have seen the techniques of molecular biology applied to an increasingly wider range of problems directly or indirectly related to arteriosclerosis. The first
focus of research in this area has been the isolation
and characterization of complementary DNA (cDNA)
clones for the major apolipoproteins of the plasma
lipoproteins. The apolipoproteins were prime targets
for initial studies because they are abundant products of lipoprotein-producing tissues and the amino
acid sequences of many of these proteins are
known. While the current literature seems filled with
cloning papers, we only have scratched the surface
From the Department of Pharmacological Sciences, Health Sciences Center, State University of New York at Stony Brook, Stony
Brook, New York.
This work was supported by Grants AM 18171 and HL 32868
from the National Institutes of Health.
Address for reprints: David L. Williams, Department of Pharmacological Sciences, Hearth Sciences Center, State University of
New York at Stony Brook, Stony Brook, New York 11794.
Received November 2, 1984; revision accepted January 22,
1985.
213
ARTERIOSCLEROSIS
214
VOL
5, No 3,
Downloaded from http://atvb.ahajournals.org/ by guest on June 18, 2017
Introns
kbp
mRNA
oligo dT
PDGF
RFLP
poly A
T
v-sis
1985
Isolation and Characterization of Genes
Abbreviations
A
arg
C
cDNA
c-myc
cys
dC
dG
DHFR
(ds)
EcoRI
ELH
exons
Q
HMG CoA reductase
MAY/JUNE
As an introduction, it is instructive to consider typical methods for the isolation and characterization of
genes.1 In most cases, the isolation of a gene first
requires the isolation of cDNA clones for the messenger RNA (mRNA) of interest. The isolation of a
specific cDNA clone involves the construction of an
appropriate cDNA library and the identification of the
specific cDNA within that library. The mRNA is first
purified from a tissue that makes the protein of interest. In the case of the plasma apolipoproteins, liver is
an appropriate source of the mRNA.
Figure 1 shows a typical scheme2 for library
construction in which cDNA is prepared by priming
the reverse transcription of mRNA with oligo deoxythymidylic acid (oligo dT) hybridized to the polyadenylic acid (poly A) at the 3' end of most mRNA
molecules. After first-strand synthesis and removal
of the template mRNA by alkaline hydrolysis, a second strand of cDNA is synthesized with DNA polymerase. In the example shown, second-strand synthesis is primed by a transient hairpin structure at the
3' end of the cDNA. The double-stranded (ds)cDNA
is treated with S1 nuclease to remove the hairpin
loop and single-stranded regions at the 5' end of the
adenlne
arginlne
cytoslne
complementary DNA
an oncogene
cystelne
deoxycytidyllc acid
deoxyguanyllc acid
dlhdrofolate reductase
double stranded
restriction endonuclease
egg-laying hormone
coding DNA segments
guanlne
3-hydroxy-3-methyl-glutaryl
coenzyme A reductase
noncodlng DNA segments
kllobase pairs
messenger RNA
ollgo deoxythymldylic acid
platelet-derived growth factor
restriction fragment length
polymorphism
polyadenyllc acid
thymlne
an oncogene
5'-
mRNA
- A A A B 3'
CMKJOWT)
AAA B
TTTn
( I ) Reverse tronscnptose
(2)Alkdi
T T T . 5'
DNA
Polymerase + 4 dNTPs.
5'
3'
G
3' ACGTC
GGjGACGTC
CC n C
r
G
CCjC
-CTGCAGCG
Select
Tc*. A p s
-GtaCGTCC"C-CTGCAIGGJS
cDNA
GGnGACGTCCCnC
G"
Transformation
host repairs gaps,
reconstruct* Pstl
vies
GGGLACGTC
CC.CTGCA
Figure 1. Construction of an oligo dT-primed cDNA library in pBR
322 with the G-C tailing procedure. Reprinted with permission from
Old and Primrose.2
MOLECULAR BIOLOGY AND ARTERIOSCLEROSIS
Downloaded from http://atvb.ahajournals.org/ by guest on June 18, 2017
first strand. The resultant blunt-ended, doublestranded cDNA is tailed with deoxycytidylic acid (dC)
residues at the 3' ends via reaction with terminal
transferase and dCTP. The dC-tailed cDNA is annealed with vector pBR 322 DNA that has been linearized by restriction endonuclease digestion at the
unique Pst I site and tailed with deoxyguanylic (dG)
acid residues.
As illustrated, the cDNA molecules are incorporated into the vector DNA via base pairing between the
oligo dC and the oligo dG tails. After transformation
of the host bacterium with this hybrid DNA molecule,
the DNA repair systems fill in the gaps and regenerate the Pst I restriction site; this permits researchers
to recover the inserted DNA from the vector by Pst I
digestion.3 Another feature of this scheme is that the
Pst I site lies within the vector gene encoding resistance to ampicillin. Thus, bacteria transformed with
vector molecules containing cDNA inserts will be resistant to tetracycline by virtue of the vector TcR gene
but sensitive to ampicillin, since the cDNA insert has
disrupted the ApR gene.3
A cDNA library formed as illustrated (Figure 1) or
with a variety of alternate strategies generally consists of 103—10s recombinant DNA molecules, each
present in a different bacterium. The frequency with
which a specific cDNA clone occurs in the library is
approximately proportional to the abundance of the
specific mRNA within the mRNA population. Since a
typical mammalian cell may contain 2 to 5 x 105
mRNA molecules, a very large library must be constructed to ensure the inclusion of low abundance
mRNAs that are present in only a few copies per cell.
However, a small cDNA library may suffice for the
isolation of cDNA clones for high abundance
mRNAs.
Once the library is constructed, the specific cDNA
clone of interest must be identified. Table 1 lists a
215
Williams
number of cDNA clones recently isolated and the
strategies used to identify the particular clones. The
avian very low density lipoprotein (VLDL) apoprotein
(apo) II cDNA clone was the first apolipoprotein
clone isolated.45 In this case, the cDNA was prepared from highly purified apo II mRNA, thereby simplifying the identification of the cDNA clone. The apo
II cDNA clone was also identified in unfractionated
liver cDNA libraries by hybridization with a kinetically
fractionated cDNA probe6 and by a plus-minus
screening procedure.7 In the latter case, the library
was screened in parallel with radiolabeled cDNA prepared to mRNA from livers of estrogen-stimulated
chickens (plus probe) and with cDNA prepared to
mRNA from livers of nonstimulated chickens (minus
probe). Since apo II mRNA is only synthesized in
response to estrogen, the double screening with plus
and minus probes readily identified cDNA clones
corresponding to estrogen-induced mRNAs. Nucleotide sequence analysis confirmed the identity of the
apo II clone isolated in this manner.7
Screening of a cDNA library with a kinetically fractionated cDNA probe was also used to identify putative clones to rat apo A IV.19 The identity of the apo A
IV clone was confirmed by a hybrid selection translation procedure in which the cDNA clone was used to
purify the mRNA via hybridization. The purified
mRNA was then translated in vitro, and the translation product was identified as apo A IV. Similar
schemes involving partial mRNA purification and hybrid selection translation have been used to identify
cDNA clones to mouse apo A-l,12 mouse apo E,18
and rat apo E16 mRNAs (Table 1). Human apo E
cDNA clones were identified by screening a cDNA
library with a probe prepared from the rat apo E
cDNA.'5
An approach which has proved to be widely applicable is the use of oligonucleotide probes to identify
Table 1. cDNA Clones
Protein
Apo II
Apo II
Apo II
Apo A-l
Apo A-l
ApoE
ApoE
ApoE
ApoE
ApoE
Apo A-IV
Apo A-ll
Apo C-ll
ApoC-l
Apo C-lll
HMG CoA reductase
LDL receptor
Species
Chicken
Chicken
Chicken
Human
Mouse
Human
Human
Rat
Rat
Mouse
Rat
Human
Human
Human
Human
Chinese hamster
Cow
Method
mRNA purification
cDNA enrichment
Plus/minus screening
Oligonucleotide probe
Hybrid selection translation
Oligonucleotide probe
Homology with rat probe
Hybrid selection translation
Expression library
Hybrid selection translation
cDNA enrichment-hybrid selection translation
Oligonucleotide probe
Oligonucleotide probe
Oligonucleotide probe
Via linkage to apo A-l gene
Hybrid selection translation
Oligonucleotide probe
Reference
4,5
6
7
8-11
12
13,14
15
16
17
18
19
20
20-22
23
20,24
25
26
ARTERIOSCLEROSIS
216
Table 2.
VOL
5, No 3,
MAY/JUNE
1985
Apo A-l Oligonucleotide Probe
Protein or nucleic acid
Sequence
Amino acid sequence
105
gin
106
lys
107
lys
108
trp
109
gin
Potential mRNA sequence
5'
CAG
A
AAG
A
AAG
A
UGG
CAG
A
Mixed oligonucleotide probe
3'
TC
T
TTC
T
TTC
T
ACC
GTC
T
Actual sequence of apo A-l mRNA
5'
CAG
AAG
AAG
UGG
CAG
3'
The table shows the amino acid sequence of apo A-l for residues 105-109, the potential codons that
could code for each amino acid, the mixed oligonucleotide probe prepared by Breslow et al.,9 and the
actual sequence of apo A-l mRNA as determined by Breslow et al. from the apo A-l cDNA.
Downloaded from http://atvb.ahajournals.org/ by guest on June 18, 2017
specific cDNAs. When the complete or partial amino
acid sequence of a protein is known, sequence regions encoded with minimal redundancy in the genetic code can often be identified, and an appropriate oligonucleotide probe to this region can be
chemically synthesized. Table 2 shows an example
of this approach used by Breslow et al.9 to identify
apo A-l cDNA clones. The amino acid sequence of
reidues 105-109 is encoded by a nucleotide sequence that is redundant at only four positions. A
mixed oligonucleotide probe that included both C
and T at the redundant positions was prepared, radiolabeled, and used to screen a human liver cDNA
library. Although only one of the 16 probes in the
mixture is a perfect match to the apo A-l cDNA sequence (Table 2), the hybridization was sufficiently
specific to identify putative apo A-l clones that were
confirmed by nucleotide sequence analysis.9
As shown in Table 1, this approach has been used
to identify cDNA clones to a variety of apolipoproteins for which the complete amino acid sequences
were known. This approach has also been used to
obtain cDNA clones to the bovine LDL receptor.26 In
this case, oligonucleotide probes were prepared on
the basis of a very limited amount of protein sequence data obtained by microsequencing of CNBr
fragments. The bovine cDNA subsequently was
used to identify a human LDL receptor genomic
clone, which in turn was used to isolate the cDNA for
the human LDL receptor.27 The full length cDNA
clone defines the entire primary amino acid structure
of the human LDL receptor.27 Limited microsequencing and oligonucleotide synthesis will clearly play a
major role in the isolation of cDNA clones to a variety
of low abundance proteins which are difficult or impossible to purify in amounts sufficient for classical
methods of analysis.
Table 1 also lists the identification of a rat apo E
clone via immunological screening of an expression
library.1728 In this case, cDNA fragments were
cloned into a bacteriophage lambda vector, and
protein fragments encoded by the cDNA were expressed under control of a regulated lambda gene.
The expressed apo E determinants were detected
with an antibody to apo E. Similar schemes for the
construction of expression libraries in plasmid vectors have been described.29 Expression libraries
should prove useful for the isolation of cDNA clones
to proteins for which high quality polyclonal antibodies are available.
A cDNA clone can serve as a reagent for the identification of DNA sequences within a genomic DNA
library. Since the mammalian genome is very large
(approximately 3 x 109 base pairs), genomic libraries are constructed with large fragments of DNA so
that the entire genome is represented in a library of
manageable size. This usually involves the use of
bacteriophage lambda as a cloning vector since very
large DNA fragments can be propagated in this
vector.
Figure 2 shows a typical scheme30 in which genomic DNA is partially digested with restriction endonucleases to produce fragments of approximately
20 kilobase pairs (kbp). After methylation to block
Eco Rl sites in the DNA and ligation to synthetic Eco
Rl linker molecules, the DNA fragments are annealed and ligated to a lambda vector which has
cohesive Eco Rl ends. The recombinant DNA molecules are packaged into phage particles and amplified on the appropriate bacterial host.
Since the DNA fragments are 20 kbp and the typical mammalian genome contains 2 to 3 x 106 kbp,
approximately 3 x 10s phage particles must be
screened to detect a gene present in one copy per
genome. For this purpose, phage and host bacteria
are plated as a lawn on an agar plate, and the plate is
incubated to permit plaques of lysed cells to develop.
The phage DNA is transferred to a nitrocellulose filter
by placing the filter on the surface of the plate. This
filter is then hybridized with a radiolabeled DNA
probe prepared from the appropriate cDNA clone,
and the radioactive signal is detected by autoradiography. The region of the original plate containing the
hybridizing DNA is recovered, and the phage carrying the desired DNA is purified by additional rounds
of plating and screening with the cDNA probe.
The initial cDNA clone usually provides the probe
for the detection of the gene in a homologous genomic library. However, nucleotide sequence homologies among species are often sufficient to permit
MOLECULAR BIOLOGY AND ARTERIOSCLEROSIS
Williams
217
X Charon 4A DNA
(replacement vector)
High molecular weight eukaryotic
DNA(>IOOkb)
Cleave with a
muture ofMwm and Mi
(very partial digest)
Sue froctionote
njrz/Fu. -20 kb
EcoRl methylose to
block EcoRl sites
Me
Me
Me
Me
Blunt end ligolion with
firoRI linker molecules
Me
Me
i
I
Downloaded from http://atvb.ahajournals.org/ by guest on June 18, 2017
Me
Me
internal fragments
Me
size
fractionate
to remove
internal
fragments
Me
i
Me
Me
anneal fcoRI cohesive
ends, ligote
Me
Me
Me
Me
Me
Me
Me
Me
m vitro packaging
phage particle
1
amplification and screening
Figure 2. Construction of a genomic DNA library in bacteriophage lambda Charon 4A. Reprinted with permission from Old and Primrose.2
the isolation of genes via cross-species hybridization. For example, we have recently isolated the apo
E gene from an African green monkey library by
using the human apo E cDNA clone prepared by
Breslow et al.13 (S Mungal, GS Shelness, T Newman, and DL Williams, unpublished data). When the
protein in question has been highly conserved during
evolution, genes may be identified with heterologous
probes from species much more divergent than this
example.
The isolated gene provides structural information
as well as specific probes and reagents for additional
studies. Generally, the first order of business is the
structural analysis of the gene by restriction endonuclease mapping and nucleotide sequencing. For the
list in Table 1, genes now have been isolated for
avian apo II, 3 1 3 2 human apo A-l 3 3 3 4 human apo C-
III,24 human apo E,35 human apo C-ll,21 human LDL
receptor,27 and Chinese hamster HMG Co A reductase.36 These genes are at various stages of analysis
in a number of laboratories. Nucleotide sequences
for the avian apo II gene37 and the human apo A-l
gene33-M have been reported. Gene organization will
be considered below.
Protein Structure
Primary Sequence
The immediate consequence of the isolation of a
cDNA clone is the availability of the amino acid sequence deduced from the nucleotide sequence of
the cDNA. Table 3 shows a partial list of proteins that
have been or are likely to be the initial targets for
cloning studies by investigators interested in arterio-
218
ARTERIOSCLEROSIS
VOL
Downloaded from http://atvb.ahajournals.org/ by guest on June 18, 2017
sclerosis. The proteins marked with an asterisk have
been sequenced by classical methods of protein
chemistry. In these cases, the cDNA clones provide
confirmatory data and serve to correct errors in the
protein data. In addition, the cDNA will yield the sequence of the protein presegment and the prosegment if these have not been determined by protein
sequencing of cell-free translation products. For
most of the proteins listed in Table 3, however, amino acid sequence data is not available, and cDNA
clones will be the most direct way to obtain this information when the protein can not easily be purified in
amounts sufficient for classical analysis.
Apo B is a good example of an important protein
that has baffled and frustrated protein chemists for
more than 15 years. Although apo B is present at
high concentrations in the blood, is relatively easy to
purify, and has been the subject of intensive investigation, very little is known about the structure or sequence of this protein. Much of the difficulty lies in the
insolubility of apo B, its tendency to aggregate, and
Table 3.
Initial Targets for Cloning
Category
Cloning target
Apolipoproteins
Apo B 100
Apo B48
Apo A-l*
Apo A-ll*
Apo A-IV
Apo C-l*
Apo C-M*
Apo C-lll*
Apo D
Apo E*
Apo F
Lpa determinant
Enzymes
Lecithin-cholesterol acyltransferase
(LCAT)
Fatty acyl-CoA:cholesterol acyltransferase (ACAT)
Peripheral lipoprotein lipases
Hepatic triglyceride lipase
HMG CoA reductase
HMG CoA synthase
Cholesterol ester hydrolases
Fatty acid and phospholipid synthetic enzymes
Cholesterol 7-alpha hydroxylase
Transport and
transfer proteins
Cholesterol ester transfer protein
Phospholipid transfer proteins
Retinol binding protein
Receptors for
Apo B/E (LDL)
Beta VLDL
Acetyl LDL
Apo E
HDL
Platelet-derived growth factor
Other factors regulating arterial
cell growth
*An asterisk indicates that the protein has been
sequenced by classical methods of protein chemistry.
5, No 3,
MAY/JUNE
1985
its susceptibility to proteolysis.38 As a result, even
such a fundamental property as the molecular weight
of apo B has remained controversial. Several research groups are now attempting to isolate cDNA
clones for apo B. Limited amino acid sequence data
obtained by one group have been used to construct
oligonucleotide probes to identify apo B clones.39
Several putative apo B clones were identified. An
alternative approach is the use of expression libraries that may permit apo B clones to be identified with
immunological techniques. In either case, we can
anticipate that the structure of this difficult protein will
be elucidated by these techniques in the not too distant future.
An example of an important protein for which the
amino acid sequence was deduced from the nucleotide sequence is the enzyme 3-hydroxy-3-methylglutaryl coenzyme A (HMG CoA) reductase. HMG
CoA reductase has been the subject of considerable
investigation because of its key role in cholesterol
biosynthesis and its dramatic regulation.40 However,
HMG CoA reductase is a very low abundance enzyme that has been difficult to purify and characterize. Chin et al.25 isolated a cDNA clone for HMG CoA
reductase from a cDNA library prepared from UT-1
cells, a clone of Chinese hamster ovary cells that
overproduces the enzyme and its mRNA. The initial
cDNA clone was used to identify additional cDNA41
and genomic36 clones that define the entire protein
and mRNA sequence and the gene sequence in the
region of transcription initiation. Analysis of the derived protein sequence has permitted the formulation
of an experimentally testable structural model for the
insertion of HMG CoA reductase in the endoplasmic
reticulum membrane / 2
Analyses of the mRNA and 5' flanking sequences
of the gene have identified sequences potentially
important for transcription initiation and the control of
gene expression.36 Thus, considerable new information about the structure and regulation of HMG CoA
reductase at both the protein and nucleic acid level
has been obtained primarily with the use of recombinant DNA techniques. Considerable progress also
has been made in the isolation and analysis of cDNA
clones for the bovine and human LDL receptor.2627
The rapid progress made in elucidating the structures of the LDL receptor and HMG CoA reductase
illustrates the potential of molecular biology for the
analysis of proteins relevant to arteriosclerosis.
Protein Variants
The availability of cDNA clones permits the analysis of protein variants at a level of understanding not
possible with biochemical techniques. Genetic variants have been recognized for apo E, apo A-l, apo
A-IV, apo B, and apo C-ll by an analysis of protein
charge or immunological properties.35 Elucidation of
the nucleotide sequences of the wild type and variant
alleles should explain the basis for the variants and
may contribute to our understanding of structurefunction relationships in these proteins.
MOLECULAR BIOLOGY AND ARTERIOSCLEROSIS
Downloaded from http://atvb.ahajournals.org/ by guest on June 18, 2017
Apo E provides an interesting example in which
variants have been studied at both the protein and
nucleic acid levels. It has long been known that apo E
exhibits charge heterogeneity upon isoelectricfocusing.43 High resolution, two-dimensional gel analysis
demonstrated six human apo E phenotypes that possibly resulted from three common alleles (e4, e3,
and e2) at a single genetic locus.44 The e3 allele was
designated as the wild type because it occurs with
the highest frequency. As shown in Table 4, the three
alleles differ by arginine (arg)-cysteine (cys) interchanges at residues 112 (e3 cys) and 158 (e3
arg).45^7 Subsequent analysis of the nucleotide sequence of the e3 allele showed that the codons for
112 (e3 TGC) and 158 (e3 CGC) could undergo
single base changes to account for the amino acid
differences in the e2 and e4 alleles.13 This also was
the case with an e2* variant that has cysteine instead
of arginine at position 145. Nucleotide sequencing of
the e4 allele has confirmed this interpretation.48
Thus, the charge differences between the alleles
have been explained both at the level of the amino
acid sequence and at the level of the nucleotide substitutions in the gene. This example of allelic variation also provides structure-function information
about the protein since the e2 allele binds poorly to
apo B/E receptors or hepatic apo E receptors.4649
Arginine 158 appears to reside within the receptor
binding domain or to influence receptor interactions
at another site on the apo E molecule. The physiological significance is that the e2 allele is strongly
associated with Type III hyperlipoproteinemia in
which patients show a reduced clearance of remnant
particles presumably due to the inefficient interaction
of the altered apo E with hepatic receptors.50"52
Nucleotide sequence analysis of other apo E
cDNA clones has provided evidence for additional
sites of variation in the gene and in the protein.35 In
some cases, base changes in the DNA are silent,
i.e., amino acid substitutions do not result. In other
Table 4.
Apo E Alleles and Nucleotide Substitutions
Amino acid
Amino acid
112
codon
158
codon
e3
CYS
TGC
ARG
CGC
e2
CYS
TGC
CYS
TGC
e4
ARG
CGC
ARG
CGC
Allele
The table shows the amino acids at positions 112 and
158 of the three common apo E alleles as determined by
amino acid sequence analysis (see references 45-47).
The codons specifying these amino acids are given in
terms of the DNA bases. The codons for the e3 and E4
alleles have been determined by nucleotide sequence
analysis of cDNA or genomic clones (see references 13,
15,35,48), which shows that the amino acid substitution is
due to a point mutation. The indicated nucleotide substitution in the E2 allele could account for the arg-cys interchange at residue 158, but the nucleotide sequence of the
E2 DNA has not yet been determined.
Williams
219
cases, there are amino acid substitutions that do not
involve charged residues and would not be detected
by isoelectric focusing. The analysis at the nucleotide sequence level is clearly more informative with
regard to genetic variation in apo E than is analysis at
the protein level. In general, this will be the case for
most proteins. In addition, nucleotide sequence
analysis is easier and faster than amino acid sequence analysis.
Antibodies to Synthetic Peptldes
Another consequence of the availability of the nucleotide sequence of a cDNA and the derived amino
acid sequence is the potential for generating antibodies to defined regions of the protein. It is now well
recognized that synthetic peptides can elicit antibodies that react with high frequency with the intact
native protein.53"55 Both polyclonal and monoclonal
antipeptide antibodies have been studied. As a result, it is possible to engineer antipeptide antibodies
as probes for specific regions of the protein. This
approach generally involves a hydrophilicity analysis
of the protein sequence to select regions most likely
to be antigenic.56 Appropriate oligopeptides are synthesized by established methods and used to elicit
antibodies by conventional procedures. Antibodies
to a variety of viral antigens have been prepared in
this fashion.53"55 Antipeptide antibodies to the LDL
receptor have been used to determine the orientation of the receptor in the plasma membrane.57 Antipeptide antibodies also have been used to analyze
the domain structure of HMG CoA reductase.42 This
approach should be applicable to almost any protein
for which the sequence is known or can be derived
from the cDNA.
Molecular Genetics
Gene Structure and Organization
The DNA sequences encoding most eucaryotic
proteins occur in multiple segments or exons which
are separated by noncoding DNA segments called
introns or intervening sequence DNA. Several strategies may be used to determine the arrangement of
exons and introns in the DNA and the organization of
exon sequences in the mature messenger RNA
(mRNA). The ultimate goal is to obtain the sequence
of the gene and the mRNA in order to define the
organization at the nucleotide level.
Figure 3 shows the organization of the apo II gene
as an example. Apo II is a small apolipoprotein of 82
amino acid residues that is synthesized in the avian
liver in response to estrogenic hormones.58 Apo II is
secreted from the liver on VLDL particles and functions in the transport of VLDL to the oocyte for egg
yolk formation. Mature apo II as found in VLDL is
illustrated at the bottom of Figure 3, while the preapo II polypeptide which includes the 24 amino acid
signal peptide is diagrammed above mature apo
II 37.59.60 A p o n m RNA of 659 nucleotides includes
ARTERIOSCLEROSIS
220
VOL
5, No 3,
MAY/JUNE
1985
abundance
5%
15%
80%
CCCCTCAC|TATATTA|QTTCTQ|CATAAATlQCCAOTOTCTCAQATOAQCATCA^CCTCAQCTTC»QCCTOQa
-50
-40
-SO
-20
-11
1
7
transcription
start
sites
2907
basepalrs
apo II
mRNA
669
bases
5' M Gppp-l
80 f
T
318
codo
for
NH2 terminus
of mature apo II
start
NH2-C
24
amlno
aclda
signal peptlda
261
stop
D-COOH
mature
plasma
apo II-VLDL
82 amlno acids
NHj-C
82 amlno acids
Organization of the avian apo II gene. Details are given in the text.
the 318 nucleotides that code for pre-apo II, a 5'
noncoding region of 80 nucleotides, and a 3' noncoding region of 261 nucleotides.3761
Also illustrated are the 3' polyadenylic acid track
and the 5'cap structure which are not encoded in the
DNA. As shown above apo II mRNA, the apo II gene
is composed of four exons (I-IV) and three introns
(A,B, and C) of the indicated sizes.37 After synthesis
of a primary transcript that includes all exons and
introns, the precursor mRNA is processed to remove
the intron sequences and splice the exon sequences
as illustrated. Note that exon I represents 5' untranslated sequence in the mRNA. Exon II encodes additional 5' untranslated sequence, the start codon for
translation, and most of the signal peptide for secretion of apo II. Exon III encodes the remainder of the
signal peptide and the amino terminal region of the
mature protein. Exon IV encodes the carboxyl region
of the protein and the entire 3' noncoding region of
the mRNA. The alpha helical regions of apo II believed to be important for lipid interactions5962 are
encoded mainly in exon IV.
It is striking that the overall organization of the
avian apo II gene is very similar to the organization of
the human apo A-l, apo C-III, and apo E genes.35
Each gene has a small 5' exon representing 5' untranslated sequence. In each case, the start codon
for translation and most of the signal peptide are
encoded by exon II, the amino terminal region of the
mature protein is encoded by exon III, and the carboxyl region and 3' untranslated region are encoded
by exon IV. Similarly, the amino acid sequence regions, which can form an amphipathic helix, are encoded by exon IV. These similarities may indicate a
common ancestral gene for avian apo II and the
mammalian apolipoproteins.
Gene Regulation
The entire nucleotide sequence of the apo II gene
and apo II cDNA have been determined.37 This information enables researchers to approach problems
of gene regulation at both the transcriptional and
translational levels. As an example, Figure 4 shows
the accumulation of apo II mRNA in the avian liver
after the administration of estradiol. Apo II mRNA
was measured with a DNA-excess solution hybridization assay employing a single-stranded probe.63
ell
Figure 3.
o
16,000
or
E 12,000
o
Q.
8,000
cn
riolecu
Downloaded from http://atvb.ahajournals.org/ by guest on June 18, 2017
>CO0H
primary
translation
product
preapo II
106
•mlno acids
4,000
A
12
24
36
48
60
72
Time After Estradiol (hours)
Figure 4. Estrogen-stimulated accumulation of apo II
mRNA. White leghorn roosters (0.4 kg to 0.6 kg) were
treated with estradiol (50 mg/kg). Liver RNA was prepared
at the indicated times. Apo II mRNA was measured with a
DNA-excess solution hybridization assay. Bracket indicates one SEM for measurements with three animals at
each point.
MOLECULAR BIOLOGY AND ARTERIOSCLEROSIS
Downloaded from http://atvb.ahajournals.org/ by guest on June 18, 2017
Apo II mRNA accumulates to a level of 15,000 to
20,000 molecules/cell at which point it represents
about 10% of the total liver mRNA. This massive
accumulation of apo II mRNA results from estrogenstimulated transcription of the gene and hormone
action that stabilizes the mRNA against cytoplasmic
degradation.4"6 The stabilization of apo II mRNA presumably involves mRNA sequences in the 5' or 3'
untranslated regions. The availability of the nucleotide sequence and the appropriate cDNA clones permits researchers to ask which mRNA sequences are
necessary for stabilization. Similarly, at the DNA level they can ask which sequences are required for
transcription and hormone regulation.
Recent studies37-61 indicate that transcription is initiated at three sites on the apo II gene with different
frequencies as illustrated at the top of Figure 3. The
significance of multiple initiation sites and the reason
why each site is used with a different frequency are
not known. The nucleotide sequence adjacent to the
initiation sites includes two potential control sequences (boxed sequences) that are believed to be
important as promotor sequences in other genes.64
Studies are in progress in several laboratories to
define the function of these sequences and to identify gene sequences that interact with hormone receptors and are necessary for hormone-regulated
expression.
Similar studies with mammalian apolipoprotein
genes and a variety of genes relevant to arteriosclerosis are clearly of interest and will be pursued as
these genes become available. Problems of interest
are likely to include the effects of hormones and
metabolic intermediates on the expression of apolipoprotein genes, lipoprotein receptor genes, enzymes such as HMG CoA reductase, and a variety of
other enzymes involved in lipid metabolism. Similarly, researchers can view endothelial and smooth
muscle cell growth regulation in the arterial wall as
problems in gene regulation that can be approached
in a similar way. It is thought that platelet-derived
growth factor (PDGF) plays a significant role in stimulating smooth muscle cell growth in the atherosclerotic lesion.65 Recent studies indicate that PDGF is
highly homologous to the v-sis oncogene carried by
the simian sarcoma virus 6667 and can act in concert
with other factors to elicit transformation of fibroblasts.68 PDGF stimulation of cultured cells also results in the enhanced expression of another oncogene, c-myc.69 These data suggest that growth
stimulation in the arterial lesion may share common
pathways and cellular mediators with one type of
oncogenesis. One consequence of these findings is
a merging of the interests of investigators who work
in arteriosclerosis and those who do cancer research. An understanding of key steps in growth regulation at the molecular level will probably reveal
potential targets for pharmacological intervention in
arteriosclerosis as well as in neoplasia.
Another aspect of gene regulation is the tissuespecific expression of genes that is characteristic of
Williams
221
differentiated cells. What factors, for example, determine that apolipoprotein genes will be expressed in
the liver and small intestine but not in peripheral tissues? Similarly, the recent finding that apo E is synthesized in peripheral tissues70 and reticuloendothelial cells71 as well as in the liver raises the question of
how this gene is regulated in different cell types. In
recent studies, we have determined that the quantitative expression of the apo E gene in various tissues
of the nonhuman primate ranges over at least two
orders of magnitude.72 It is of interest to learn the
functional significance of these differences and the
mechanisms through which they are established.
Similar questions may be asked about a variety of
genes that show tissue-specific expression and are
relevant to arteriosclerosis.
In addition to the analysis of gene expression in
tissues or cell culture, DNA probes may be used as
histological reagents to examine in vivo gene expression at the cellular level. Figure 5 shows an example from work by McAllister et al.73 in which a
radiolabeled cDNA for the egg-laying hormone
(ELH) of the marine snail Aplysiawas used to identify
the neurons that synthesize this hormone. Figure 5 A
shows a section through the abdominal ganglion
which has been hybridized with 1 2 5 I-CDNA to ELH
and exposed by autoradiography. The dark staining
of the cell clusters in the upper left and right of the
ganglion represents intense hybridization of the
probe to ELH mRNA in the bag cell neurons. Figures
5 B and C are higher power magnifications of sections showing that the hybridization is restricted to
the bag cells. In Figure 5 C, the localization of grains
to the cell cytoplasm is evident. Figure 5 D shows
localization of the ELH peptide to the bag cells via
immunocytochemical staining with anti-ELH antibody. Note that the immunocytochemical technique
identified the protein product of the ELH gene while
the in situ hybridization technique identified the
mRNA for ELH. Thus, the immunocytochemical approach localized cells that contain the protein, while
the hybridization procedure localized cells that synthesize the protein. This distinction may be particularly important when attempting to identify cell types
that synthesize proteins normally present in the extracellular fluid or those taken up by cells, as is the
case with lipoproteins. Similar approaches could be
used to identify, within tissues, cell types that synthesize proteins such as apo E, lipoprotein lipase, or
LCAT.
Restriction Fragment Length Polymorphisms
A restriction fragment length polymorphism
(RFLP) refers to an altered pattern of gene fragments observed when genomic DNA from different
individuals is digested with a restriction endonuclease. Since a restriction endonuclease has a
specificity for a defined recognition sequence in the
DNA, an altered pattern of cleavage could be due to
as simple a change as a point mutation or to a com-
222
ARTERIOSCLEROSIS
VOL
A
5, No 3,
MAY/JUNE
1985
B
D
Downloaded from http://atvb.ahajournals.org/ by guest on June 18, 2017
Figure 5. In situ hybridization and immunocytochemistry to localize cells expressing the egg laying
hormone (ELH) in Aplysia. A. The abdominal ganglion from an adult animal was dissected, fixed with
Bouin's stain, sectioned, and hybridized with an 125l-labeled DNA probe for ELH mRNA. The bag cells
comprise the topmost rounded clusters of neurons on both sides of the ganglion. The photomicrograph shows hybridization to cells in both bag-cell clusters as well as to a single isolated neuron in the
upper left center of the ganglion. The hybridization is seen as the darkened areas due to the intense
concentration of silver grains on the autoradiograph. x 15. B. Hybridization to a section through a
single bag-cell cluster. Neurons are stained with methylene blue, x 58. C. A higher magnification (x
368) of bag-cell neurons after in situ hybridization. The deposition of silver grains over the cytoplasm of
nerve cell bodies can be seen. D. Immunohistochemical staining with anti-ELH antibody followed by
a second antibody coupled to peroxidase. The ELH-containing cells in the bag-cell cluster show darker
staining, x 230. Reprinted with permission from reference 73 (copyright 1983 by the American
Association for the Advancement of Science).
plex change such as a deletion, insertion, or rearrangement in the DNA.74 A typical analysis for
RFLP involves digestion of isolated DNA with a specific restriction enzyme, electrophoresis of the DNA
fragments on an agarose gel, blotting of the DNA
fragments in the gel to nitrocellulose paper by the
Southern transfer method,75 hybridization of the blot
with a radiolabeled cDNA or genomic DNA probe,
and autoradiography of the blot to detect the gene
fragments recognized by the probe. Genetic polymorphisms are useful markers for the classification
of specific phenotypes or for genetic linkage studies.
In some instances, a polymorphism can be instructive as to the basis for a metabolic disorder.
Figure 6 shows an example of an apo A-l gene
analysis carried out by Karathanasis et al.76 using
DNA from two patients with a familial deficiency of
plasma apo A-l and apo C-lll. The autoradiograph
shows that digestion of DNA from a normal individual
with the enzyme Eco Rl yields a 13 kbp fragment
detected with an apo A-l cDNA probe (Lane A). In
contrast, the patients have a 6.5 kbp Eco Rl fragment
instead of the 13 kbp fragment, indicating an alteration in the DNA in or near the apo A-l gene (Lanes F
and G). Since the patients have only the 6.5 kbp
fragment, they must be homozygous for this altered
apo A-l allele. Note that the mother (Lane D), father
(Lane C), and brother (Lane E) of the patients have
both fragments, indicating that they are heterozygous for the normal and mutant alleles. The patients
also showed a 0.95 kbp fragment in comparison to
the wild type, 2.2 kbp fragment when the DNA was
digested with the enzyme Pst I.76 These data suggested that the altered apo A-l gene might be the
basis for the lack of plasma apo A-l, but additional
studies were needed to define the mutant gene.
Figure 7 shows schematic representations of the
normal and mutant apo A-l genes and the cDNA
MOLECULAR BIOLOGY AND ARTERIOSCLEROSIS
A
B ' C D E
F
223
Williams
G H
I
J K
23.1
9.4
6.5
4.3
Downloaded from http://atvb.ahajournals.org/ by guest on June 18, 2017
2.3
2.0
Figure 6. Restriction length polymorphism of the human apo A-l gene. The gel shows hybridization of an apo A-l cDNA probe to genomic DNA which has been digested with Eco Rl. The
DNA samples are from a normal individual (Lane A) and the apo A-l-deficient probands (Lanes F
and G). Other samples are from the maternal grandfather (Lane B), father (Lane C), mother
(Lane D), and brother (Lane E) of the probands. In addition, the son (Lane H) and daughter
(Lane I) of Proband G, and the son (Lane J) and daughter (Lane K) of Proband F are shown.
Reprinted with permission from Nature 301, 718-720 (1983); copyright 1983 Macmillan Journals Ltd.
Pi
Ri
I
1
I
II
IV
III
P2
P2
I
I
normal
apo A1
gene
apo A1 cONA probe
Ri
mutant
apo A1
gene
>-//
* Inserted DNA
Rs -1 L P 3
SIZE
(Kllobase Pairs)
GENE
ENZYME
Normal
Eco R1
R1-R2
Normal
Pst 1
P1-P2
Mutant
Eco R1
R3-R2
6.5
Ps-P2
0.95
Mutant
Pst 1
Figure 7.
FRAGMENT
13
2.2
Structure of the normal and mutant apo A-l gene. See text for details.
224
ARTERIOSCLEROSIS
VOL
Downloaded from http://atvb.ahajournals.org/ by guest on June 18, 2017
probe that were used for the blot in Figure 6. The
mutant apo A-l gene is now known to contain a DNA
insertion of 7.5 kbp within exon IV that disrupts production of the apo A-l protein. 35 '" Whereas the normal gene yields an Eco Rl fragment of 13 kbp (fragment R1-R2), the mutant gene yields a fragment of
6.5 kbp (fragment R3-R2) due to the presence of an
Eco Rl site in the inserted DNA. Note that the R1-R3
fragment was not detected (Figure 6, Lanes F and G)
because the cDNA probe lies completely to one side
of the inserted DNA (Figure 7). The normal and mutant Pst I fragments that were detected with the
cDNA probe are also shown (Figure 7).
In this example, the altered phenotype is clearly
due to the DNA rearrangement detected as a rare
RFLP. This is an excellent example of how molecular
biology has been used to link a specific genetic lesion with premature coronary artery disease.7677
Further studies showed that the DNA inserted in the
apo A-l gene was derived from the apo C-lll gene
which lies adjacent to the apo A-l gene.35
A common RFLP has been identified in the human
apo C-ll gene after digestion with the enzyme Taq
I.78'79 This RFLP was used to show linkage between
the apo C-ll and apo E genes78'79 and to demonstrate
linkage between the apo C-ll gene and apo C-ll deficiency.80 Rees et al.81 also have detected a polymorphism in the flanking region of the apo A-l gene that
was subsequently localized to a single base change
in the 3' noncoding region of the adjacent apo C-lll
gene.24 This polymorphism was present at higher
frequency in hypertriglyceridemic patients although
additional studies are required to confirm this finding.
These initial studies of RFLP illustrate the potential
of such analysis with genes coding for apolipoproteins and enzymes of lipid metabolism. Large patient
populations characterized for disorders of lipid and
lipoprotein metabolism are available through various
lipid clinics in the United States. The analysis of DNA
from these patients will probably yield new information relating specific changes in the DNA to phenotypic changes in lipid metabolism. In some cases this
approach may lead to the molecular basis for the
defect. In other cases RFLP are likely to provide
further subcategorization of lipid disorders on the
basis of DNA alterations. Such information may be
useful in diagnosis and will provide leads for further
research.
RFLP has proven useful in the diagnosis of sickle
cell anemia because a specific restriction site polymorphism occurs in the 3' flanking region of the human beta globin gene.82 Prenatal diagnosis of beta
thalassaemias also takes advantage of polymorphisms in the globin gene locus.83 Recent studies
indicate that RFLP may be used for prenatal detection of an altered phenylalanine hydroxylase gene
associated with phenylketonuria.84 In addition to the
use of polymorphisms occurring within a restriction
enzyme recognition sequence, knowledge of the nucleotide sequence may permit researchers to design
specific DNA probes to detect an altered gene. For
5, No 3,
MAY/JUNE
1985
example, Conner et al.85 have designed specific oligonucleotide probes that can distinguish the sickle
cell beta S allele from the normal beta A allele. One
synthetic 19 nucleotide probe formed a perfect hybrid with the sickle cell DNA but did not hybridize at
all to the beta A DNA due to a single base mismatch.
Thus, it is possible to design probes for the detection
of point mutations. One could also design probes to
screen for specific insertions or deletions in the DNA.
While many of these procedures are now performed
in the research laboratory, it is not difficult to envision
the extension of such procedures to the clinical testing situation. This potential for arteriosclerosis research should develop rapidly as relevant genes are
characterized and as RFLP are identified and related
to specific metabolic disorders.
Reverse Genetics
Reverse genetics refers to the process through
which a specific DNA sequence is altered in vitro and
subsequently expressed in vivo to evaluate the functional signifiance of the alteration. The evaluation of
function might be at the nucleic acid level or at the
level of the protein product depending upon the particular question being asked. The ability to express a
gene that has been mutated at specific sites enables
the investigtor to approach structure-function relationships with a level of discrimination not previously
possible. The typical approach in such studies is to
carry out site-directed mutagenesis on an isolated
gene to produce a point mutation, deletion, or insertion in the DNA. The altered DNA is constructed into
an expression vector and introduced into an appropriate host cell for expression. The evaluation of the
engineered mutation might be performed by monitoring function within the host cell or might require purification of the mutated gene product. Reverse genetics has been used primarily to identify the DNA
sequences necessary for transcription, the processing of primary mRNA transcripts, and the regulation
of gene expression. However, recent studies have
shown the potential of reverse genetics for evaluating structure-function relationships at the protein
level.
This approach has been used by Villafranca et
al.86 to examine the role of specific amino acid residues in the catalytic activity of dihydrofolate reductase (DHFR) from Escherichia coli. Site-directed mutagenesis was carried out to systematically change
three specific amino acid residues within the 159
residue DHFR polypeptide. The specific residues
mutated were selected on the basis of the X-ray crystal structure of DHFR to provide information about
the mechanism of catalysis and about the folding of
the polypeptide chain. After in vitro mutagenesis, the
altered DNA was constructed into a plasmid vector
and expressed to high levels in E. coli. Subsequently, DHFR was purified from the E. coli and evaluated
for activity. With an efficient plasmid expression vector, up to 8 mg of purified DHFR was obtained from a
MOLECULAR BIOLOGY AND ARTERIOSCLEROSIS
Downloaded from http://atvb.ahajournals.org/ by guest on June 18, 2017
1 liter culture, thereby providing sufficient enzyme for
studies of catalytic activity and physical properties.
These initial experiments clearly illustrate the potential of this strategy for structure-function studies at
the protein level. In the example cited above, the
crystal structure of DHFR and a wealth of information
about the mechanism of catalysis provided the
groundwork for designing mutations and asking very
directed questions about the enzyme.
Application of a similar strategy to a far less defined situation has also been highly informative in
evaluating the biochemical activities of the human
transforming protein, ras p21. In these studies, constructs were made to express the normal cellular ras
protein or the oncogenic variant which has a valine
substituted for glycine at residue 12 of the polypeptide.87 Expression of these constructs in E. coli permitted the investigators to compare the biochemical
properties of the normal and oncogenic ras proteins
in crude extracts of the bacteria. These comparisons
showed no differences in the binding of GTP or the
autophosphorylation activity of the ras proteins.
However, the experiments revealed a previously unknown GTPase activity in the normal ras p21 that
was significantly reduced in the oncogenic ras p21, 87
For the proteins listed in Table 1, similar approaches could be used to define amino acid residues or sequence domains required for fundamental
activities such as apolipoprotein interactions with lipoproteins, enzyme substrate interactions, or the association of ligands such as apo E or growth factors
with their receptors. In the case of apo E, analyses of
variant proteins and chemically modified proteins
have identified a polypeptide domain involved in
binding to the apo B/E and the hepatic apo E receptor.45'464988 Site-directed mutagenesis should prove
useful in extending the analysis of this domain as
well as in evaluating other regions of the apo E
polypeptide.
Another application of reverse genetics at the protein level is in studies of intracellular protein metabolism. By expressing a mutant protein in a mammalian
host cell, one could address questions relating to
protein biosynthesis, intracellular protein topogenesis, protein turnover, or the assembly of macromolecular complexes. For example, apo A-1 is synthesized with a 6 amino acid terminal prosegment that is
not cleaved until after the protein is secreted.89"91
Expression experiments with mutant apo A-1 constructs in liver cell cultures could be carried out to
determine whether the prosegment plays a role in
the intracellular transport or secretion of apo A-1.
Similarly, expression experiments could be used
to examine the polypeptide regions responsible for
the plasma membrane localization of lipoprotein receptors or the endoplasmic reticulum localization of
HMG CoA reductase. Previously, problems of this
nature could only be approached by correlating biochemical changes with amino acid sequence alterations in naturally occurring variants. The development of procedures for site-directed mutagenesis
Williams
225
and in vivo expression systems now permits researchers to ask directly how specific mutations influence biochemical function. This is a powerful experimental tool that will permit investigators to
examine structure-function relationships at the protein level in a systematic way.
References
1. Manlatls T, Frltsch EF, Sambrook J. Molecular cloning. A
laboratory manual. Cold Spring Harbor Laboratory, 1982:
1-545
2. Old RW, Primrose SB. Principles of gene manipulation.
Berkeley: University of California Press, 1981:1-250
3. Bolivar F, Backman K. Plasmids of Eschenchia coli as cloning vectors. Methods Enzymol 1979;68:245-267
4. Wlerlnga B, Roskam W, Arnberg A, van der Zwaag-Gerrltsen J, Ab G, Gruber M. Purification of the mRNA for chicken
very low density lipoprotein II and molecular cloning of its fulllength double stranded cDNA. Nucleic Acids Res 1979;
7:2147-2163
5. Chan L, Dugalczyk A, Means AR. Molecular cloning of the
gene sequences of a major apoprotein in avian very low density lipoprotein. Biochemistry 1980;19:5631-5637
6. Wlskocll R, Bensky P, Dower W, Goldberger RF, Gordon
Jl, Deeley RG. Coordinate regulation of two estrogen-dependent genes in avian liver. Proc Natl Acad Sci USA 1980;
77:4474-^478
7. Protter AA, Wang S-Y, Shelness GS, Ostapchuk P, Williams DL. Isolation and characterization of a cDNA clone
specific for avian vitellogenin II. Nucleic Acids Res 1982;
10:4935-4950
8. Shoulders CC, Baralle FE. Isolation of the human HDL apeprotein Al gene. Nucleic Acids Res 1982;10:4873—^882
9. Breslow JL, Ross D, McPherson J, et al: Isolation and
characterization of cDNA clones for human apolipoprotein AI. Proc Natl Acad Sci USA 1982;79:6861-6865
10. Cheung P, Chan L. Nucleotide sequence of cloned cDNA of
human apolipoprotein A-l. Nucleic Acids Res 1983;11:37033715
11. Law SW, Gray G, Brewer HB Jr. cDNA cloning of human
apo A-1: amino add sequence of preproapo Al. Biochem
Biophys Res Commun 1983;112:257-264
12. Miller JCE, Barth RK, Shaw PH, Elliott R, Hastle ND. Identification of a cDNA clone for mouse apoprotein A-l (apoA-l)
and its use in characterization of apo Al mRNA expression in
liver and small intestine. Proc Natl Acad Sci USA 1983;
80:1511-1515
13. Breslow JL, McPherson J, Karathanasls SK, Zannls VI.
Identification and DNA sequence of a human apolipoprotein
E cDNA clone. J Biol Chem 1982257:14639-14641
14. Wallls SC, Rogne S, Gill L, et al. The isolation of cDNA
clones for human apolipoprotein E and the detection of apo E
RNA in hepatic and extra-hepatic tissues. EMBO J 1983;
:2369-2373
McLean JW, Elshourbagy NA, Chang DJ, Mahley RW,
Taylor JM. Human apolipoprotein E mRNA. cDNA cloning
and nucleotide sequencing of a new varient. J Biol Chem
1984;259:6498-6504
16. McLean JW, Fukazawa C, Taylor JM. Rat apolipoprotein E
mRNA. Cloning and sequencing of double-stranded cDNA. J
Biol Chem 1983;258:8993-9000
17. de Jong FA, Howlett G, Aldred AR, Fldge N, Schrelber G.
Synthesis of rat apolipoprotein E by Eschenchia coli infected
with recombinant bacteriophage. Biochem Biophys Res
Commun 1984;119:657-662
18. Reue KL, Quon DH, O'Donnell KA, Dlzlkes GJ, Fa reed
GC, Lusls AJ. Cloning and regulation of messenger RNA for
mouse apolipoprotein E. J Biol Chem 1984;259:2100-2107
226
ARTERIOSCLEROSIS
VOL
Downloaded from http://atvb.ahajournals.org/ by guest on June 18, 2017
19. Gordon Jl, Smith DP, Alpers DH, Strauss AW. Cloning of a
complementary deoxyribonucleic acid encoding a portion of
rat intestinal preapolipoprotein AIV messenger ribonucleic
acid. Biochemistry 1982;21:5424-5431
20. Sharpe CR, Sldoll A, Shelley CS, Lucero MA, Shoulders
CC, Baralle FE. Human apolipoproteins Al, All, CM and CHI.
cDNA sequences and mRNA abundance. Nucleic Adds Res
1984;12:3917-3932
21. Jackson CL, Bruns GAP, Breslow JL. Isolation and sequence of a human apolipoprotein CM cDNA clone and its use
to isolate and map to human chromosome 19 the gene for
apolipoprotein CM. Proc Natl Acad Sci USA 1984;81:29452949
22. Myklebost O, Williamson B, Markham AF, et al. The isolation and characterization of cDNA clones for human apolipoprotein Cll. J Biol Chem 1984;259:4401-4404
23. Knott TJ, Robertson ME, Priestley LM, Urdea M, Wallls S,
Scott J. Characterization of mRNAs encoding the precursor
for human apolipoprotein Cl. Nucleic Acids Res 1984;12:
3909-3915
24. Karathanasls SK, McPherson J, Zannls VI, Breslow JL.
Linkage of human apolipoproteins Al and C-lll genes. Nature
1983:304:371-373
25. Chin DJ, Luskey KL, Faust JR, MacDonald RJ, Brown MS,
Goldstein JL. Molecular cloning of 3-hydroxy-3-methylglutaryl coenzyme A reductase and evidence for regulation of its
mRNA. Proc Natl Acad Sci USA 1982:79:7704-7708
26. Russell DW, Yamamoto T, Schneider WJ, Slaughter CJ,
Brown MS, Goldstein JL. cDNA cloning of the bovine low
density lipoprotein receptor: feedback regulation of a receptor
mRNA. Proc Natl Acad Sci USA 1983:80:7501-7505
27. Yamamoto T, Davis CG, Brown MS, et al. The human LDL
receptor: A cysteine-rich protein with multiple ALU sequences in its mRNA. Cell 1984:39:27-38
28. Young RA, Davis RW. Efficient isolation of genes by using
antibody probes. Proc Natl Acad Sci USA 1983:8011941198
29 Helfman DM, Feramlsco JR, Flddes JC, Thomas GP,
Hughes SH. Identification of clones that encode chicken tropomyosin by direct immunological screening of a cDNA expression library. Proc Natl Acad Sci USA 1983:80.31-35
30. Manlatis T, Hardlson RC, Lacy C, et al. The isolation of
structural genes from libraries of eucaryotic DNA. Cell 1978;
15 687-701
31. Meljllnk FCPW, van het Schip AD, Arnberg AC, Wlerlnga
B, Ab G, Gruber M. Structure of the chicken apo very low
density lipoprotein II gene. J Biol Chem 1981,256:9668-9671
32. Wiskocil R, Goldman P, Deeley RG. Cloning and structural
characterization of an estrogen-dependent apolipoprotein
gene. J Biol Chem 1981 ;256:9662-9667
33. Karathanasis SK, Zannis VI, Breslow JL. Isolation and
characterization of the human apolipoprotein A-l gene. Proc
Natl Acad Sci USA 1983:80:6147-6151
34. Shoulders CC, Kornbllhtt AR, Munro BS, Baralle FE.
Gene structure of human apolipoprotein Al. Nucleic Acids
Res 1983:11:2827-2837
35. Breslow JL. Human apolipoprotein molecular biology and
genetic variation. Ann Rev Biochem 1985,54:699-727
36. Reynolds GA, Basu SK, Osborne TF, et al. HMG CoA
reductase: a negatively regulated gene with unusual promotor and 5' untranslated regions. Cell 1984:38:275-285
37. van het Schlp AD, Mei|llnk FCPW, Strijker R, et al. The
nucleotide sequence of the chicken apo very low density
lipoprotein II gene. Nucleic Acids Res 1983:11:2529-2540
38. Kane JP. Apolipoprotein B: structural and metabolic heterogeneity. Ann Rev Physiol 1983:45:637-650
39. LeBoeuf RC, Miller C, Shively JE, Schumaker VN, Balla
MA, Lusis AJ. Human apolipoprotein B: partial amino acid
sequence. FEBS Lett 1984;170:105-108
40. Brown MS, Goldstein JL. Multivalent feedback regulation of
HMG CoA reductase, a control mechanism coordinating isoprenoid synthesis and cell growth. J LJpid Res 1980;21:
505-517
41. Chin DJ, Gil G, Russell DW, et al. Nucleotide sequence of 3hydroxy-3 methyl-glutaryl coenzyme A reductase, a glycopro-
5, No 3,
MAY/JUNE
1985
tein of endoplasmic reticulum. Nature 1984;308:613-617
43. Llscum L, Finer-Moore J, Stroud RM, Luskey KL, Brown
MS, Goldstein JL. Domain structure of 3-hydroxy-3-methylglutaryl coenzyme A reductase, a glycoprotein of the endoplasmic reticulum. J Biol Chem 1985:260:522-530
43 Utermann G, Vogelberg KH, Stelnmetz A, et al. Polymorphism of apolipoprotein E. Genetics of hyperiipoproteinemia
type III. Clin Genet 1979;15:37-62
44. Zannls VI, Breslow JL. Human very low density lipoprotein
apolipoprotein E isoprotein polymorphism is explained by genetic variation and post-translational modification. Biochemistry 1981:20:1033-1041
45. Welsgraber KH, Rail SC Jr, Mahley RW. Human E apoprotein heterogeneity. Cysteine-arginine interchanges in the
ammo acid sequence of the apo-E isoforms. J Biol Chem
1981:256:9077-9083
46. Rail SC Jr, Weisgraber KH, Innerarlty TL, Mahley RW.
Structural basis for receptor binding heterogeneity of apolipoprotein E from type III hyperiipoproteinemic subjects. Proc
Natl Acad Sci USA 1982;79:4696-4700
47. Rail SC Jr, Welsgraber KH, Mahley RW. Human apolipoprotein E. The complete ammo acid sequence. J Biol Chem
1982:257:4171-4178
48. Das HK, McPherson J, Bruns GAP, Karathanasls SK,
Breslow JL. Isolation, characterization, and mapping to
chromosome 19 of the human apolipoprotein E gene. J Biol
Chem 1985 (in press)
49 Hul DY, Innerarlty TL, Mahley RW. Defective hepatic lipoprotein receptor binding of beta-very low density lipoproteins
from Type III hyperiipoproteinemic patients. Importance of
apolipoprotein E. J Biol Chem 1984:259:860-869
50 Utermann G, Jaeschke M, Menzel J. Familial hyperiipoproteinemia Type III: deficiency of a specific apolipoprotein (apo
E-lll) in the very low density lipoproteins. FEBS Lett 1975;
56.352-355
51. Pagnan A, Havel RJ, Kane JP, Kotlte L. Characterization of
human very low density lipoproteins containing two electrcphoretic populations: double prebeta lipoproteinemia and primary dysbetalipoproteinemia. J LJpid Res 1977;18:613-622
52 Zannls VI, Breslow JL. Characterization of a unique human
apolipoprotein E variant associated with type III hyperiipoproteinemia J Biol Chem 1980:255:1759-1762
53. Niman HL, Houghten RA, Walker LE, et al. Generation of
protein reactive antibodies by short peptides is an event of
high frequency: Implications for the structural basis of immune recognition Proc Natl Acad Sci USA 1983;80:49494953
54 Cohen GH, Dietzschold B, Ponce de Leon M, et al. Localization and synthesis of an antigenic determinant of Herpes
simplex virus glycoprotein D that stimulates the production of
neutralizing antibody. J Virol 1984;49-102-108
55. Emlni EA, Jameson BA, Wlmmer E. Priming for and induction of antipoliovirus neutralizing antibodies by synthetic peptides. Nature 1983:304:699-703
56 Hopp TP, Woods KR. Prediction of protein antigenic determinants from ammo acid sequences. Proc Natl Acad Sci USA
1981;78:3824-3828
57. Russell DW, Schneider WJ, Yamamoto T, Luskey KL,
Brown MS, Goldstein JL. Domain map of the LDL receptorsequence homology with the epidermal growth factor precursor. Cell 1984:37:577-585
58. Chan L, Jackson RL, O'Malley BW, Means AR. Synthesis
of very low density lipoproteins in the cockerel. Effects of
estrogen. J Clin Invest 1976:58:368-379
59. Chan L, Bradley WA, Means AR. Amino acid sequence of
the signal peptide of apo VLDL-II, a major apoprotem in avian
very low density lipoproteins. J Biol Chem 1980;255:
10060-10063
60. Dugaiczyk A, Inglis AS, Strike PM, Burley RW, Beattle
WG, Chan L. Comparison of the nucleotide sequence of
cloned DNA coding for an apolipoprotein (apo VLDL-II) from
avian blood and the amino acid sequence of an egg-yolk
protein (apovitellenin I): equivalence of the two sequences.
Gene 1981 ;14:175-182
61. Shelne88 GS, Williams DL. Apolipoprotein II messenger
MOLECULAR BIOLOGY AND ARTERIOSCLEROSIS
62.
63.
64.
65.
66.
67.
Downloaded from http://atvb.ahajournals.org/ by guest on June 18, 2017
68.
69.
70.
71.
72.
73.
74.
75.
76.
77.
RNA. Transcriptional and splicing heterogeneity yields six 5'untranslated leader sequences. J Biol Chem 1984;259:99299935
Jackson RL, Chan L, Snow LD, Means AR. Hormonal regulation of lipoprotein synthesis. In: Dietschy JM, Gotto AM Jr,
Ontko JA. Disturbances in lipid and lipoprotein metabolism.
Bethesda: American Physiological Society, 1978; 139-154
Elbrecht A, Lazier CB, Protter AA, Williams DL. Independent developmental programs for two estrogen-regulated
genes. Science 1984;225:639-641
McKnight SL, Kingsbury R. Transciptional control signals of
a eucaryotic protein-coding gene. Science 1982;217:316324
Ross R. The arterial wall and atherosclerosis. Annu Rev Med
1979;30:1-15
Doolittle RF, Hunkapiller MW, Hood LE, et al. Simian sarcoma virus one gene, v-sis, is derived from the gene (or
genes) encoding a platelet-derived growth factor. Science
1983:221:275-277
Waterfield MD, Scrace GT, Whittle N, et al. Platelet-derived
growth factor is structurally related to the putative transforming protein p28sis of simian sarcoma virus. Nature 1983;
304:35-39
Assoian RK, Grotendorst GR, Miller DM, Sporn MB. Cellular transformation by coordinated action of three peptide
growth factors from human platelets. Nature 1984;309:804806
Kelley K, Cochran BH, Stiles CD, Leder P. Cell-specific
regulation of the c-myc gene by lymphocyte mitogens and
platelet-derived growth factor. Cell 1983;35:603-610
Blue M-L, Williams DL, Zucker S, Khan SA, Blum CB.
Apolipoprotein E synthesis in human kidney, adrenal gland,
and liver. Proc Natl Acad Sci USA 1983;80:283-287
Basu SK, Brown MS, Ho YK, Havel RJ, Goldstein JL.
Mouse macrophages synthesize and secrete a protein resembling apolipoprotein E. Proc Natl Acad Sci USA 1981;
78:7545-7549
Newman TC, Dawson PA, Rudel LL, Williams DL. Quantitation of apolipoprotein E mRNA in the liver and peripheral
tissues of nonhuman primates. J Biol Chem 1985;260:
2452-2457
McAllister LB, Scheller RH, Kandel ER, Axel R. In situ
hybridization to study the origin and fate of identified neurons.
Science 1983;222:800-808
Jeffreys AJ. DNA sequence variants in the G y-, A y-, 8-, and
13-globin genes of man. Cell 1979;18:1-10
Southern EM. Detection of specific sequences among DNA
fragments separated by gel electrophoresis. J Mol Biol 1975;
98:503-517
Karathanasis SK, Norum RA, Zannis VI, Breslow JL. An
inherited polymorphism in the human apolipoprotein A-l gene
locus related to the development of atherosclerosis. Nature
1983;301:718-720
Karathanasis SK, Zannis VI, Breslow JL. A DNA insertion
Index Terms: arteriosclerosis
• atherosclerosis
78.
79.
80.
81.
82.
83.
84.
85.
86.
87.
88.
89.
90.
91.
• cDNA
Williams
227
in the apolipoprotein A-l gene of patients with premature
atherosclerosis. Nature 1983:305:823-825
Myklebost O, Rogne S, Olaisen B, Gedde-Dahl T, Prydz H.
The locus for apolipoprotein Cll is closely linked to the apolipoprotein E locus on chromosome 19 in man. Hum Genet
1984:67:309-312
Humphries SE, Berg K, Gill L, et al. The gene for apolipoprotein C-ll is closely linked to the gene for apolipoprotein E
on chromosome 19. Clin Genet 1984:26:389-396
Humphries SE, Williams L, Myklebost O, et al. Familial
apolipoprotein Cll deficiency: A preliminary analysis of the
gene defect in two independent families. Hum Genet 1984;
67:151-155
Rees A, Shoulders CC, Stocks J, Galton DJ, Baralle FE.
DNA polymorphism adjacent to human apoprotein A-l gene:
relation to hypertriglyceridemia. Lancet 1983; 1:444—446
Kan YW, Dozy AM. Polymorphism of DNA sequence adjacent to human beta-globin structural gene: relationship to
sickle mutation. Proc Natl Acad Sci USA 1978;75:5631-5635
Orkin SH, Kazazian HH Jr, Antonarakis SE, et al. Linkage
of beta-thalassaemia mutations and beta-globin gene polymorphisms with DNA polymorphisms in human beta-globin
gene cluster. Nature 1982;296:627-631
Woo SLC, Lidsky AS, Guttler F, Chandra T, Robson KJH.
Cloned human phenylalanine hydroxylase gene allows prenatal diagnosis and carrier detection of classical phenylketonuria. Nature 1983;306:151-155
Conner BJ, Reyes AA, Morin C, Itakura I, Teplitz RL, Wallace RB. Detection of sickle cell betas-globin allele by hybridization with synthetic oligonucleotides. Proc Natl Acad Sci
USA 1983;80:;278-282
Villafranca JE, Howell EE, Voet DH, et al. Directed mutagenesis of dihydrofolate reductase. Science 1983:222:782788
McGrath JP, Capon DJ, Goeddel DV, Levinson AD. Comparative biochemical properties of normal and activated human ras p21 protein. Nature 1984;310:644-649
Weisgraber KH, Innerarity TL, Mahley RW. Abnormal lipoprotein receptor-binding activity of the human E apoprotein
due to cysteine-arginine interchange at a single site. J Biol
Chem 1982;257:2518-2521
Gordon Jl, Smith DP, Andy R, Alpers DH, Schonfeld G,
Strauss AW. The primary translation product of rat intestinal
apolipoprotein Al mRNA is an unusual preproprotein. J Biol
Chem 1982;257:971-978
Gordon Jl, Sims HF, Lentz SR, Edelstein C, Scanu AM,
Strauss AW. Proteolytic processing of human preproapolipoprotein A-l. A proposed defect in the conversion of pro A-l to
A-l in Tangier's disease. J Biol Chem 1983;258:4037-4044
Zannis VI, Karathanasis SK, Kreutmann H, Goldberger G,
Breslow JL. Intracellular and extracellular processing of human apolipoprotein A-l: secreted apolipoprotein A-l isoprotein 2 is a propeptide. Proc Natl Acad Sci USA 1983;
80:2574-2578
• gene
• molecular biology
• apolipoprotein
Molecular biology in arteriosclerosis research.
D L Williams
Downloaded from http://atvb.ahajournals.org/ by guest on June 18, 2017
Arterioscler Thromb Vasc Biol. 1985;5:213-227
doi: 10.1161/01.ATV.5.3.213
Arteriosclerosis, Thrombosis, and Vascular Biology is published by the American Heart Association, 7272 Greenville
Avenue, Dallas, TX 75231
Copyright © 1985 American Heart Association, Inc. All rights reserved.
Print ISSN: 1079-5642. Online ISSN: 1524-4636
The online version of this article, along with updated information and services, is located on the
World Wide Web at:
http://atvb.ahajournals.org/content/5/3/213
Permissions: Requests for permissions to reproduce figures, tables, or portions of articles originally published in
Arteriosclerosis, Thrombosis, and Vascular Biology can be obtained via RightsLink, a service of the Copyright
Clearance Center, not the Editorial Office. Once the online version of the published article for which permission
is being requested is located, click Request Permissions in the middle column of the Web page under Services.
Further information about this process is available in the Permissions and Rights Question and Answerdocument.
Reprints: Information about reprints can be found online at:
http://www.lww.com/reprints
Subscriptions: Information about subscribing to Arteriosclerosis, Thrombosis, and Vascular Biology is online
at:
http://atvb.ahajournals.org//subscriptions/