Download Gene Expression and Regulation

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Deoxyribozyme wikipedia , lookup

Messenger RNA wikipedia , lookup

Secreted frizzled-related protein 1 wikipedia , lookup

Genome evolution wikipedia , lookup

Protein moonlighting wikipedia , lookup

Molecular evolution wikipedia , lookup

Non-coding RNA wikipedia , lookup

Gene desert wikipedia , lookup

Non-coding DNA wikipedia , lookup

Community fingerprinting wikipedia , lookup

Epitranscriptome wikipedia , lookup

Transcription factor wikipedia , lookup

Histone acetylation and deacetylation wikipedia , lookup

Two-hybrid screening wikipedia , lookup

Point mutation wikipedia , lookup

Gene expression profiling wikipedia , lookup

RNA-Seq wikipedia , lookup

RNA polymerase II holoenzyme wikipedia , lookup

Eukaryotic transcription wikipedia , lookup

Gene wikipedia , lookup

List of types of proteins wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Endogenous retrovirus wikipedia , lookup

Lac operon wikipedia , lookup

Promoter (genetics) wikipedia , lookup

Gene expression wikipedia , lookup

Gene regulatory network wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Transcriptional regulation wikipedia , lookup

Silencer (genetics) wikipedia , lookup

Transcript
Gene Expression and Regulation - 1
We have been discussing the molecular structure of DNA and its function in DNA
replication and in transcription. Earlier we discussed how genes interact in
transmission genetics, based on Mendelian principles. We will now address how gene
expression is regulated, primarily at the level of transcription and translation and
some of the ways in which we can use our current knowledge of molecular genetics
via DNA technology to modify and alter genetic molecules.
Gene expression and regulation is one of the most active areas of genetic research.
Developmental biology, the biology of aging, genetic diseases research and cancer
research all look at how genes are expressed and controlled. We will look at models of
gene expression in both prokaryotic cells and in eukaryotic cells.
It is important for cells to be able to control gene activity. We have genetic
information for thousands of proteins. We do not want to synthesize enzymes that
are not needed, nor do we want to synthesize molecules in greater quantity than
needed. The bacteria of our oral cavity for example, secrete a slime sheath when
sucrose is in their diet. When no sucrose is present, they do not secrete the slime,
nor do they synthesize the enzymes that would process the sucrose.
In a similar fashion, intestinal bacteria do not synthesize tryptophan if their host's diet
is rich in tryptophan and they can absorb it from their intestinal environment. If there
is not tryptophan in the intestine, the bacterium will activate the metabolic pathway
needed to synthesize its own tryptophan.
Prokaryotes, in general, control genes for rapid response to their environment. By
selectively activating or inhibiting gene activity, bacterial cells can take advantage of
changing conditions.
For eukaryotes, gene regulation is tied to maintaining homeostasis – a consistent
internal environment in the face of ever-changing external conditions. Multicellular
organisms require different genes at different times of growth and development in
different tissues. We have more complex controls of gene expression to ensure that
genes function selectively in our different tissues.
In addition, many genes in multicellular organisms are activated only at one stage of
development, do their job, and function no more. The effects of these genes are not
reversible. One of the current research interests is that of stem cells – the cell lines
that lead to the development of precise tissue types, such as skin, immune system or
blood cells. At some point in development, stem cells are "programmed", do their
job, and, as a part of their programming, may lead to programmed cell death. The
chapter in your text that looks at the genetics of development goes into this topic.
Gene Expression and Regulation - 2
Regulating Gene Expression
Gene expression starts with transcription and “ends” with an enzyme catalyzing a
particular chemical reaction, or with a structural/metabolic protein. In brief, we can
control the expression of a gene at any of the levels of gene activity.
• Making the DNA readable
• Transcription
• mRNA Processing
• Translation
• Protein Modification
• Enzyme Activity
Before we go too far, we need to do some preparation work (naturally). Recall that
the typical gene codes for a polypeptide that is used to help the cell function in some
way, or is a structural protein. A gene that codes for such proteins is a structural
gene.
Other genes control how much of a polypeptide gets formed and when it gets formed.
Such genes are regulatory genes.
• Some regulatory genes code for small polypeptides that control how other genes
get expressed. These polypeptides are called transcription factors.
• Another type of regulatory gene is a piece of DNA that a transcription factor binds
to. These regulatory sites of DNA do not actually code for any protein.
Gene Expression and Regulation - 3
Gene control is exerted chemically in two general ways: affecting molecules that
interact with DNA, RNA and/or the polypeptide chains, or controlling the synthesis of
an enzyme or the activity of an enzyme in the cell. In addition to transcription factors
and regulatory genes, both hormones and enzymes can affect gene expression and
enzyme activity.
Binding to the DNA to Read the DNA Molecule
A first step in regulation is accessing the DNA molecule to get to its instructions. To
see how this happens we need to revisit the DNA molecule structure and look at how
a gene control molecule might attach. It is the ability of certain binding proteins to
attach to the DNA molecule at precise locations that provides for gene regulation.
How do these binding proteins recognize the appropriate binding site?
Recall in our discussion of the DNA double helix, that, as it coils, it automatically forms
grooves – one deeper, the major groove, and one shallow, the minor groove. The
nucleotides within the major groove have hydrophobic methyl groups, hydrogen
atoms, and hydrogen bond donor and acceptor sites that protrude out into the
groove. These groups create a chemical pattern specific for each of the four possible
nitrogen base pair combinations. This provides a mechanism for reading the
nucleotide sequences by transcription factor proteins.
Transcription factors (the gene regulators) are characterized by specific binding
sites that bind to the DNA major groove within the double helix. Although more
than 30 regulatory protein structures are known, the actual binding sites on the
regulatory proteins fall into a set of DNA-binding motifs (or domains).
Gene Expression and Regulation - 4
There are four common binding protein domains.
Helix-turn-helix
•
Homeodomain
Leucine Zipper
Zinc Finger
The helix-turn-helix motif is a protein helix with a bend. The helical region of
the motif fits into the DNA major groove.
• The helix-turn-helix motif is formed from a region of the polypeptide that has
two α helices with a non-helix "turn" between them. The two helices are held at
right angles to each other.
• When binding to DNA, one of the helices (the recognition helix) fits into the
major groove and the second rests against the outside of the DNA to stabilize
and position the recognition helix.
• Regulatory proteins generally have pairs of helix-turn-helix motifs distanced 3.4
nm (one DNA coil) apart. Pairing of the motifs adds more strength.
Tryptophan binding site with paired helix-turn-helix
The leucine zipper motif has a coiled double a helix (two protein subunits), each
with a hydrophobic, mostly leucine "zipper" at the end.
• The two leucine ends interact to hold the two subunits together while the rest
of the two subunits are apart. This results in a "Y" shape, in which the two
arms of the "Y" fit into the major groove.
• The leucine zipper motif can provide for flexibility because the two helical
regions of the two arms of the "Y" can be very different.
• In the zinc motif zinc atoms link an α helix to a β sheet pleated protein so that
the α helix fits into the major groove.
•
The homeodomain motif, a variant of the helix-turn-helix, is found in
developmental switch box genes, called homeotic genes. Homeotic genes
determine body-part identity and pattern development. (Chapter 17. pp. 346-348.)
•
Mutant Antennapedia Homeobox Gene
Gene Expression and Regulation - 5
Gene Regulation Mechanisms
We can now turn to some of the ways in which genes specifically are regulated. In
general, it is easier to study bacterial gene regulation than eukaryotic gene regulation
because the DNA is part of the cytoplasm and direct chemical contact is more readily
seen in the prokaryotic cell. Much of the research in gene regulation has been
accomplished with Escherichia coli. Let's first look at some examples of prokaryote
gene regulation.
Gene Regulation in Bacteria - The Operon of the Prokaryotic Cell
An active gene (or group of genes) includes the DNA that will be transcribed
along with a promoter and an operator. This complex is known as the operon and
was first described in 1961 by Francois Jacob and Jacques Monod.
An operon also has an additional regulatory gene that activates (turns on) or
represses (turns off) the operon.
The Operon Complex
1. Promoter
Recognized by RNA polymerase as the place to start transcription
2. Operator
Controls RNA polymerase's access to the promoter, and is usually located within
the promoter or between the promoter and the transcribable gene
3. Structural (Transcribable) Gene
Codes for the needed protein
4. Regulatory Gene
• The regulatory gene usually codes for a repressor protein. The regulatory
gene is located apart from the operon.
• Repressors work with controller molecules.
• A repressor can be active when attached to its controller molecule or
deactivated when attached to its controller molecule. A controller
molecule is typically a substance in the cell.
• A controller may function to deactivate the repressor that is blocking
the gene by attaching to the repressor (hence stopping its repression)
This is called an inducible operon.
• A controller may be a substance that is normally attached to the
repressor, and when removed, allows the gene to be activated. This is
a repressible operon.
Gene Expression and Regulation - 6
Inducible Operon
Repressor active – Gene o f f
Repressible Operon
Repressor active with controller
(Tryptophan) attached – Gene o f f
Removing controller removes
repressor from operator t o
activate gene – Gene on
Controller (Lactose) removes
repressor to activate gene
Some Examples of Prokaryotic Gene Regulation
Product Inhibition - The Tryptophan Operon
A cellular product (the end result of gene activity) can function to inhibit
transcription, and is an example of the feedback inhibition common in cell
homeostasis. This has been shown with tryptophan, an amino acid in E. coli. High
concentration of tryptophan stops the transcription of the set of enzymes needed to
manufacture tryptophan in the bacterial cell. Tryptophan does so by binding to an
allosteric (non-active) site on the tryptophan repressor. This alters the shape of the
repressor protein so that the operator blocks the attachment of RNA polymerase to
the promoter. The tryptophan operon is an example of gene regulation of
repressible enzymes, because the presence of the product of the metabolic
pathway represses (or stops) the synthesis of the enzyme(s) needed to synthesize it.
Technically, tryptophan is called a corepressor because it works with the repressor
protein to block transcription, and the tryptophan operon is a repressible operon.
Repressible Operon
Repressor off (not attached) when controller
is not on the repressor. RNA is synthesized
Repressor active with controller
(corepressor) on. No RNA is made
Gene Expression and Regulation - 7
The Lactose Operon – An Inducible Operon
Jacob and Monod studied the operon that controls lactase synthesis in E. coli, the
Lactose operon. In the Lactose operon, the substrate, allolactose (an isomer of
lactose), attaches to a repressor protein that normally sits on the operator region of
the gene, inhibiting transcription by blocking RNA polymerase from attaching to the
promoter. When allolactose (the controller) binds to the repressor molecule, the
repressor is removed from the gene so RNA polymerase can attach to the promoter
and the genes that code for the three enzymes to digest lactose can be transcribed.
When the enzymes are synthesized, the lactose is degraded, including the allolactose
molecules attached to the repressor. When allolactose is no longer available to bind
to the repressor protein, the repressor shuts down the promoter (by sitting on the
operator), which stops transcription.
The lactose operon is an example of gene regulation of inducible enzymes,
because the presence of the substrate of the metabolic pathway can induce the
synthesis of the enzyme. Because allolactose induces transcription, the lactose
operon is called an inducible operon (or in some sense, a derepressable operon
because lactose stops the repressor from its activity).
Inducible Operon
Repressor active until controller
removes repressor; no RNA synthesized
Controller (inducer) removes
repressor from operator; RNA synthesized
It should be noted that both the lactose operon and the tryptophan operon exhibit
blocking control of gene activity. The presence of tryptophan actively blocks the
operator. With lactose, the repressor normally blocks the operator and lactose
deactivates the negative repressor. It is common for anabolic processes to have
repressible operons and catabolic pathways to have inducible operons.
Combinations of Gene Controls
Not all genes are controlled just by repressors that function to turn genes off. Some
genes are also controlled by molecules that function to turn a gene on. These
molecules are called activators. Activators work to enhance the promoter's
receptivity to RNA polymerase. Without the activator molecule present, the gene is
poorly transcribed, if at all.
Gene Expression and Regulation - 8
The lactose inducible operon works in tandem with an activator, the catabolite
activator protein (CAP) that monitors the amount of glucose in the cell. Since
glucose is the preferred fuel molecule in cell respiration, high levels of glucose in the
cell will stop any lac operon activity no matter how much lactose is present. There is
no need to degrade lactose for fuel if there is plenty of fuel available, and the lac
operon will be inhibited.
When glucose levels are low in the cell, however, cAMP (discussed previously as a
secondary messenger in cell communications) accumulates. cAMP binds to the
allosteric site of CAP making CAP active.
Specifically, cAMP induces a conformational change in CAP that allows the CAP's helixturn-helix binding motifs to bind to the DNA promoter. When active, CAP binds to a
site next to the lac operon promoter and makes it easier for RNA polymerase to bind
to the promoter region enhancing transcription of the lactase enzymes. (CAP is also
called CRP for cAMP receptor protein.)
Gene Expression and Regulation - 9
Active CAP attached to Lac promoter
Activation
of the Lac operon relative
to available fuels
CAP functions in many operons dealing with catabolism in E coli, not just the lac
operon, just as cAMP is a secondary messenger for many transduction pathways.
Gene Expression and Regulation - 1 0
Eukaryotic Gene Regulation
Given some examples of how genes are regulated in bacteria, we can now turn our
attention to regulating gene activity in eukaryotic cells. Although regulating DNA
polymerase's ability to fit into the DNA to start transcription works for prokaryotes,
the need for many genes to interact in eukaryote cells suggests a need for more
complex controls as well. Most genes in eukaryotes are controlled by a
transcription factor complex, not one or two repressor molecules. However,
controls exist at virtually all levels of gene activity as mentioned earlier, not just
transcription. We can now look at some examples of how genes are controlled at
each of these levels.
• Making the DNA readable
• Transcription
• mRNA Processing
• Translation
• Protein Modification
• Enzyme Activity
Gene Expression and Regulation - 1 1
Control at the DNA level
DNA Methylation
Inactive DNA contains nucleotides (especially cytosine) that have methyl groups
attached. (The Barr body is an example of a chromosome that is highly methylated.)
Most methylated DNA will remain inactive during differentiation and during the
repeated cell divisions of that cell line. Methylation probably is most important in
preventing the transcription of the genes intended to be permanently turned off.
Special enzymes during DNA replication methylate the "daughter" DNA strands. The
imprinting of DNA in gametes, discussed briefly during our discussion of inheritance, is
an example of methylation. At least some developmental abnormalities are related to
a mutation that prevents methylation. Removing methyl groups from nucleotides has
been shown to activate a gene.
Histone Alteration
Recall that nucleosomes are formed when a set of histone proteins are wrapped in
DNA when chromosomes condense. A histone covering a promoter region can block
the assembly of transcription factor complexes needed for transcription.
Nucleosomes do not block activators or RNA polymerase, however.
On the other hand, adding an acetyl group (-COCH3 ) to the histone proteins
associated with DNA facilitates transcription. Histones with acetyl groups bind more
loosely to DNA . Transcription factors may act with enzymes that add acetyl to
histone proteins.
Gene Expression and Regulation - 1 2
Gene Regulation at the Level of Transcription
Any Eukaryotic gene has a number of regions
1.
Control elements that form the Transcription Initiation
2.
The codable gene including introns and exons
3.
Termination Signals which end transcription
Complex
The Control Elements of the Transcription Initiation Complex consists of:
• A specific promoter region within the control elements that indicates the starting
point for transcription (and contains the TATA sequence recognized by one of the
transcription factors)
•
An enhancer region that stimulates the binding of RNA polymerase to the
promoter region. The enhancer region has both proximal and distal control
elements. The enhancer region is comprised of non-coding DNA that binds to the
transcription factors. A transcription factor that binds to the enhancer and
activates transcription is called an activator. Activators fold the DNA so that
the enhancers are brought to the promoter region of the gene where they bind to
transcription factors
Gene Expression and Regulation - 1 3
There are duplications of nucleotide sequences in the enhancer region of genes;
only about a dozen small sequences are known. It is believed that the combination
of control elements is what is important in gene regulation rather than the code of
a specific enhancer.
•
Silencers are control elements which can inhibit transcription. A transcription
factor that binds to a silencer control element and blocks transcription is called a
repressor.
Transcription Initiation Complex
For transcription to occur, regulatory proteins, or transcription factors, bind to the
enhancer where activator proteins have attached. Hundreds of transcription factors
have been identified in eukaryotes, and most likely are the direct control of
transcription. You will recall that the result of many signal transduction pathways is
the synthesis of a transcription factor.
The binding of activators to the enhancer folds the DNA molecule to bring activator
proteins closer to the promoter region. This attracts more transcription factors to
facilitate the job of RNA polymerase by forming a transcription initiation
complex at the promoter.
When everything comes together, we get transcription. If a repressor has attached to
the silencer region near the enhancers, activators are prevented from binding to the
enhancers, and transcription is repressed.
Gene Expression and Regulation - 1 4
Hormones as Chemical Regulators of Transcription
Hormones can function as chemical regulators of gene expression in their target cells.
A steroid hormone can bind to a receptor protein in the cytosol that then acts as a
transcription activator. The activator will be recognized by all genes that respond to
that particular hormone. For example:
• Bird egg albumin synthesis is promoted by an estrogen-protein complex that binds
near the enhancer region of the albumin gene.
• The molting gene in insect larvae is controlled by a hormone that activates a
regulatory protein on the gene.
• The androgen receptor in human males is essential for testosterone to function.
Non-steroid hormones and other signal molecules also trigger protein receptors in the
cell membrane that activate signal transduction pathways sending transcription
factors into the nucleus.
Feedback Transcription Repression
Just as the product can repress transcription in prokaryotes (the tryptophan
repressor), a product can activate enzymes that block transcription in eukaryotes.
For example:
• Microtubules are synthesized from two polypeptides, α -tubulin and β-tubulin. When
a cell has adequate amounts of the polypeptides for microtubule synthesis, it
triggers the synthesis of an enzyme that blocks transcription of the tubulin
polypeptides. When the level of the tubulins decreases, the enzyme is deactivated
and tubulin polypeptides are produced again.
Gene Expression and Regulation - 1 5
Gene Regulation at the level of mRNA Processing
We saw earlier that when we do mRNA processing we can get different functional
mRNA transcripts depending on which part of the primary transcript is determined to
be introns and which exons. These differences can be accomplished via alternative
splicing by the spliceosomes, thereby controlling gene activity.
For example, a common gene is used for the synthesis of calcitonin, a hormone, or the
synthesis of CGRP, a polypeptide messenger molecule. In the thyroid gland, the CGRP
section is an intron, and the gene codes for the synthesis of calcitonin. In the
hypothalamus, the gene codes for CGRP, and the calcitonin information is an intron.
Two different active transport protein binding sites can be transcribed by switching
exons in a common pre-mRNA transcript. If the final mRNA contains the potassium
exon, the protein will bind and transport potassium. If a calcium ion exon is coded,
the protein will bind and transport calcium.
Gene Expression and Regulation - 1 6
Post Transcription Controls of Gene Activity
Duration of the mRNA Transcript
The length of time a mRNA transcript can be read prior to degradation will affect the
amount of final product, which will affect cell activity. Some mRNA lasts for weeks;
some for hours. In general, mRNA degradation starts with the breakdown of its poly-A
tail, followed by the degradation of the cap. Once the mRNA cap is degraded, the
rest of the molecule rapidly degrades.
Special A-U nucleotide sequences in the mRNA near the 3' poly-A tail typically
determine the degradation. Some nucleotide sequences near the 3' end are also
recognition sites for endonucleases that promote rapid degradation of the transcript.
Translation Control
Blocking mRNA attachment to Ribosomes
Translation can be controlled by regulatory proteins that block mRNA attachment to
the small subunit of the ribosome. In humans, translation of the protein ferritin that
binds to iron for iron storage is blocked by the enzyme aconitase unless iron is
present in the cell. Aconitase binds to a nucleotide sequence on the mRNA for ferritin
preventing attachment of the mRNA to the ribosome.
There are some regulatory proteins that are capable of blocking all translation within
the cell. This is important in egg cell cytoplasm that is primed for protein synthesis
after fertilization, but does not want to start protein synthesis too early. Special
translation factors will initiate translation after fertilization. A similar translation block
occurs in some algae that are only metabolically active when there is light.
Gene Expression and Regulation - 1 7
Post Translation Processing
Once a polypeptide is synthesized it can be altered by processing.
• Many metabolic proteins are non-functional until activated by other molecules.
Hydrolytic enzymes, in particular, are synthesized in inactive forms.
• Membrane recognition proteins must have additional molecules attached to them
before they function.
• Regulation can also take place when proteins are moved from one part of a cell to
another, or when exported from the cell.
Proteins targeted for degradation are tagged by a tiny protein, ubiquitin, which is
recognized by huge protease enzymes called proteasomes. The tagged protein is
readily degraded within the proteasome. In cystic fibrosis, the chloride ion channel
protein gets tagged and degraded before reaching the plasma membrane.
Proteasomes
Although just a few examples of how genes can be controlled have been illustrated,
we can see that gene control is a complex interaction of internal and external cell
signals and feedback responses.