Download Document

Document related concepts

X-inactivation wikipedia , lookup

Short interspersed nuclear elements (SINEs) wikipedia , lookup

Cancer epigenetics wikipedia , lookup

Gene desert wikipedia , lookup

Cre-Lox recombination wikipedia , lookup

Genetic engineering wikipedia , lookup

Polycomb Group Proteins and Cancer wikipedia , lookup

Oncogenomics wikipedia , lookup

No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup

Gene expression programming wikipedia , lookup

Primary transcript wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

Extrachromosomal DNA wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Ridge (biology) wikipedia , lookup

Genomic imprinting wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Transposable element wikipedia , lookup

Pathogenomics wikipedia , lookup

Genomic library wikipedia , lookup

Point mutation wikipedia , lookup

Metagenomics wikipedia , lookup

Gene expression profiling wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Genome (book) wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Human genome wikipedia , lookup

RNA-Seq wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Genomics wikipedia , lookup

Genome editing wikipedia , lookup

Gene wikipedia , lookup

Minimal genome wikipedia , lookup

Designer baby wikipedia , lookup

History of genetic engineering wikipedia , lookup

Non-coding DNA wikipedia , lookup

Microevolution wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Helitron (biology) wikipedia , lookup

Genome evolution wikipedia , lookup

Transcript
CAMPBELL
BIOLOGY
TENTH
EDITION
Reece • Urry • Cain • Wasserman • Minorsky • Jackson
21
Genomes and
Their Evolution
Lecture Presentation by
Nicole Tunbridge and
Kathleen Fitzpatrick
© 2014 Pearson Education, Inc.
Reading the Leaves from the Tree of Life
 Complete genome sequences exist for a human,
chimpanzee, E. coli, brewer’s yeast, corn, fruit fly,
house mouse, rhesus macaque, and many other
organisms
 Comparisons of genomes among organisms
provide insights into evolution and other biological
processes
© 2014 Pearson Education, Inc.
 Genomics is the study of whole sets of genes and
their interactions
 Bioinformatics is the application of computational
methods to the storage and analysis of biological
data
© 2014 Pearson Education, Inc.
Figure 21.1
© 2014 Pearson Education, Inc.
Figure 21.1a
House mouse (Mus musculus)
© 2014 Pearson Education, Inc.
Concept 21.1: The Human Genome Project
fostered development of faster, less expensive
sequencing techniques
 Officially begun as the Human Genome Project in
1990, the sequencing of the human genome was
largely completed by 2003
 The genome was completed using sequencing
machines and the dideoxy chain termination
method
 A major thrust of the project was development of
technology for faster sequencing
© 2014 Pearson Education, Inc.
 Two approaches complemented each other in
obtaining the complete sequence
 The initial approach built on an earlier storehouse
of human genetic information
 Then J. Craig Venter set up a company to
sequence the entire genome using an alternative
whole-genome shotgun approach
 This used cloning and sequencing of fragments of
randomly cut DNA followed by assembly into a
single continuous sequence
© 2014 Pearson Education, Inc.
Figure 21.2-1
1 Cut the DNA into
overlapping fragments
short enough for
sequencing.
2 Clone the fragments
in plasmid or other
vectors.
© 2014 Pearson Education, Inc.
Figure 21.2-2
1 Cut the DNA into
overlapping fragments
short enough for
sequencing.
2 Clone the fragments
in plasmid or other
vectors.
3 Sequence each
fragment.
© 2014 Pearson Education, Inc.
CGCCATCAGT AGTCCGCTATACGA
ACGATACTGGT
Figure 21.2-3
1 Cut the DNA into
overlapping fragments
short enough for
sequencing.
2 Clone the fragments
in plasmid or other
vectors.
3 Sequence each
fragment.
CGCCATCAGT AGTCCGCTATACGA
CGCCATCAGT
ACGATACTGGT
ACGATACTGGT
4 Order the sequences
into one overall
sequence with
computer software.
AGTCCGCTATACGA
⋯CGCCATCAGTCCGCTATACGATACTGGT⋯
© 2014 Pearson Education, Inc.
 Today the whole-genome shotgun approach is
widely used, though newer techniques are
contributing to the faster pace and lowered cost of
genome sequencing
 These newer techniques do not require a cloning
step
 These techniques have also facilitated a
metagenomics approach in which DNA from a
group of species in an environmental sample is
sequenced
© 2014 Pearson Education, Inc.
Concept 21.2: Scientists use bioinformatics to
analyze genomes and their functions
 The Human Genome Project established
databases and refined analytical software to make
data available on the Internet
 This has accelerated progress in DNA sequence
analysis
© 2014 Pearson Education, Inc.
Centralized Resources for Analyzing Genome
Sequences
 Bioinformatics resources are provided by a
number of sources
 National Library of Medicine and the National
Institutes of Health (NIH) created the National
Center for Biotechnology Information (NCBI)
 European Molecular Biology Laboratory
 DNA Data Bank of Japan
 BGI in Shenzhen, China
© 2014 Pearson Education, Inc.
 Genbank, the NCBI database of sequences,
doubles its data approximately every 18 months
 Software is available that allows online visitors to
search Genbank for matches to
 A specific DNA sequence
 A predicted protein sequence
 Common stretches of amino acids in a protein
 The NCBI website also provides 3-D views of all
protein structures that have been determined
© 2014 Pearson Education, Inc.
Figure 21.3
WD40 - Sequence Alignment Viewer
WD40 - Cn3D 4.1
CDD Descriptive Items
Name: WD40
WD40 domain, found in a number
of eukaryotic proteins that cover
a wide variety of functions
including adaptor/regulatory
modules in signal transduction,
pre-mRNA processing and
cytoskeleton assembly; typically
contains a GH dipeptide 11-24
residues from its N-terminus and
the WD dipeptide at its
C-terminus and is 40 residues
long, hence the name WD40;
© 2014 Pearson Education, Inc.
Identifying Protein-Coding Genes and
Understanding Their Functions
 Using available DNA sequences, geneticists can
study genes directly
 The identification of protein coding genes within
DNA sequences in a database is called gene
annotation
© 2014 Pearson Education, Inc.
 Gene annotation is largely an automated process
 Comparison of sequences of previously unknown
genes with those of known genes in other species
may help provide clues about their function
© 2014 Pearson Education, Inc.
Understanding Genes and Gene Expression at
the Systems Level
 Proteomics is the systematic study of full protein
sets encoded by a genome
 Proteins, not genes, carry out most of the activities
of the cell
© 2014 Pearson Education, Inc.
How Systems Are Studied: An Example
 A systems biology approach can be applied to define
gene circuits and protein interaction networks
 Researchers working on the yeast Saccharomyces
cerevisiae used sophisticated techniques to disable
pairs of genes one pair at a time, creating double
mutants
 Computer software then mapped genes to produce a
network-like “functional map” of their interactions
 The systems biology approach is possible because of
advances in bioinformatics
© 2014 Pearson Education, Inc.
Figure 21.4
Translation and
ribosomal
functions
Mitochondrial
functions
Peroxisomal
functions
RNA processing
Transcription and
chromatin-related
functions
Metabolism
and
amino acid
biosynthesis
Nuclearcytoplasmic
transport
Secretion
and vesicle
transport
Nuclear
migration
and protein
degradation
Mitosis
DNA replication
and repair
© 2014 Pearson Education, Inc.
Glutamate
biosynthesis
Cell polarity and
morphogenesis
Protein folding and
glycosylation;
cell wall biosynthesis
Vesicle
fusion
Serinerelated
biosynthesis
Amino acid
permease pathway
Figure 21.4a
Translation and
ribosomal
functions
Mitochondrial
functions
Peroxisomal
functions
RNA processing
Transcription and
chromatin-related
functions
Metabolism
and
amino acid
biosynthesis
Nuclearcytoplasmic
transport
Secretion
and vesicle
transport
Nuclear
migration
and protein
degradation
Mitosis
DNA replication
and repair
© 2014 Pearson Education, Inc.
Cell polarity and
morphogenesis
Protein folding and
glycosylation;
cell wall biosynthesis
Figure 21.4b
Glutamate
biosynthesis
Vesicle
fusion
Serinerelated
biosynthesis
Amino acid
permease pathway
Metabolism
and
amino acid
biosynthesis
© 2014 Pearson Education, Inc.
Application of Systems Biology to Medicine
 The Cancer Genome Atlas project, started in
2010, looked for all the common mutations in three
types of cancer by comparing gene sequences
and expression in cancer versus normal cells
 This was so fruitful, it has been extended to ten
other common cancers
 Silicon and glass “chips” have been produced that
hold a microarray of most known human genes
 These are used to study gene expression patterns
in patients suffering from various cancers or other
diseases
© 2014 Pearson Education, Inc.
Figure 21.5
© 2014 Pearson Education, Inc.
Concept 21.3: Genomes vary in size, number of
genes, and gene density
 By early 2013, over 4,300 genomes were
completely sequenced, including 4,000 bacteria,
186 archaea, and 183 eukaryotes
 Sequencing of over 9,600 genomes and over 370
metagenomes is currently in progress
© 2014 Pearson Education, Inc.
Genome Size
 Genomes of most bacteria and archaea range
from 1 to 6 million base pairs (Mb); genomes of
eukaryotes are usually larger
 Most plants and animals have genomes greater
than 100 Mb; humans have 3,000 Mb
 Within each domain there is no systematic
relationship between genome size and phenotype
© 2014 Pearson Education, Inc.
Table 21.1
© 2014 Pearson Education, Inc.
Number of Genes
 Free-living bacteria and archaea have 1,500 to
7,500 genes
 Unicellular fungi have from about 5,000 genes and
multicellular eukaryotes up to at least 40,000
genes
© 2014 Pearson Education, Inc.
 Number of genes is not correlated to genome size
 For example, it is estimated that the nematode
C. elegans has 100 Mb and 20,100 genes, while
Drosophila has 165 Mb and 14,000 genes
 Researchers predicted the human genome would
contain about 50,000 to 100,000 genes; however
the number is around 21,000
 Vertebrate genomes can produce more than one
polypeptide per gene because of alternative
splicing of RNA transcripts
© 2014 Pearson Education, Inc.
Gene Density and Noncoding DNA
 Humans and other mammals have the lowest
gene density, or number of genes, in a given
length of DNA
 Multicellular eukaryotes have many introns within
genes and a large amount of noncoding DNA
between genes
© 2014 Pearson Education, Inc.
Concept 21.4: Multicellular eukaryotes have
much noncoding DNA and many multigene
families
 Sequencing of the human genome reveals that
98.5% does not code for proteins, rRNAs, or
tRNAs
 About a quarter of the human genome codes for
introns and gene-related regulatory sequences
© 2014 Pearson Education, Inc.
 Intergenic DNA is noncoding DNA found between
genes
 Pseudogenes are former genes that have
accumulated mutations and are nonfunctional
 Repetitive DNA is present in multiple copies in the
genome
 About three-fourths of repetitive DNA is made up
of transposable elements and sequences related
to them
© 2014 Pearson Education, Inc.
Figure 21.6
Regulatory
sequences (5%)
Exons (1.5%)
L1
sequences
(17%)
Introns
(∼20%)
Repetitive
DNA that
includes
transposable
elements
and related
sequences
(44%)
Unique
noncoding
DNA (15%)
Repetitive
DNA
unrelated to
transposable
elements (14%)
Alu elements
(10%)
Simple sequence
DNA (3%)
© 2014 Pearson Education, Inc.
Large-segment
duplications (5–6%)
 Much evidence indicates that noncoding DNA
(previously called “junk DNA”) plays important
roles in the cell
 For example, genomes of humans, rats, and mice
show high sequence conservation for about 500
noncoding regions
© 2014 Pearson Education, Inc.
Transposable Elements and Related Sequences
 The first evidence for mobile DNA segments came
from geneticist Barbara McClintock’s breeding
experiments with Indian corn
 McClintock identified changes in the color of corn
kernels that made sense only if some genetic
elements move from other genome locations into
the genes for kernel color
 These transposable elements move from one
site to another in a cell’s DNA; they are present in
both prokaryotes and eukaryotes
© 2014 Pearson Education, Inc.
Figure 21.7
© 2014 Pearson Education, Inc.
Figure 21.7a
© 2014 Pearson Education, Inc.
Figure 21.7b
© 2014 Pearson Education, Inc.
Movement of Transposons and
Retrotransposons
 Eukaryotic transposable elements are of two types
 Transposons, which move by means of a DNA
intermediate and require a transposase enzyme
 Retrotransposons, which move by means of an
RNA intermediate, using a reverse transcriptase
© 2014 Pearson Education, Inc.
Figure 21.8
Transposon
DNA of
genome
Transposon
is copied
Mobile copy of transposon
© 2014 Pearson Education, Inc.
New copy of
transposon
Insertion
Figure 21.9
Retrotransposon
New copy of
retrotransposon
Synthesis of a
single-stranded
RNA intermediate
RNA
Insertion
Reverse
transcriptase
DNA
strand
Mobile copy of retrotransposon
© 2014 Pearson Education, Inc.
Sequences Related to Transposable Elements
 Multiple copies of transposable elements and
related sequences are scattered throughout
eukaryotic genomes
 In primates, a large portion of transposable
element–related DNA consists of a family of
similar sequences called Alu elements
 Many Alu elements are transcribed into RNA
molecules; some are thought to help regulate
gene expression
© 2014 Pearson Education, Inc.
 The human genome also contains many
sequences of a type of retrotransposon called
LINE-1 (L1)
 L1 sequences have a low rate of transposition and
may have effects on gene expression
 L1 transposons may play roles in the diversity of
neuronal cell types
© 2014 Pearson Education, Inc.
Other Repetitive DNA, Including Simple
Sequence DNA
 About 15% of the human genome consists of
duplication of long sequences of DNA from one
location to another
 In contrast, simple sequence DNA contains many
copies of tandemly repeated short sequences
© 2014 Pearson Education, Inc.
 A series of repeating units of 2 to 5 nucleotides is
called a short tandem repeat (STR)
 The repeat number for STRs can vary among sites
(within a genome) or individuals
 Simple sequence DNA is common in centromeres
and telomeres, where it probably plays structural
roles in the chromosome
© 2014 Pearson Education, Inc.
Genes and Multigene Families
 Many eukaryotic genes are present in one copy
per haploid set of chromosomes
 The rest of the genes occur in multigene families,
collections of identical or very similar genes
 Some multigene families consist of identical DNA
sequences, usually clustered tandemly, such as
those that code for rRNA products
© 2014 Pearson Education, Inc.
Figure 21.10a
DNA Direction of transcription
RNA transcripts
Nontranscribed
spacer
Transcription unit
DNA
rRNA
18S
28S
5.8S
28S
5.8S
18S
(a) Part of the ribosomal RNA gene family
© 2014 Pearson Education, Inc.
 The classic examples of multigene families of
nonidentical genes are two related families of
genes that encode globins
 -globins and -globins are polypeptides of
hemoglobin and are coded by genes on different
human chromosomes and are expressed at
different times in development
© 2014 Pearson Education, Inc.
Figure 21.10b
β-Globin
α-Globin
α-Globin
β-Globin
α-Globin gene family
Chromosome 16
ζ
ζ α α α2 α1 θ
1
2
Embryo
Heme
β-Globin gene family
Chromosome 11
ϵ
G A β
Fetus
and adult Embryo Fetus

Adult
(b) The human α-globin and β-globin gene
families
© 2014 Pearson Education, Inc.
β
Concept 21.5: Duplication, rearrangement, and
mutation of DNA contribute to genome
evolution
 The basis of change at the genomic level is
mutation, which underlies much of genome
evolution
 The earliest forms of life likely had only those
genes necessary for survival and reproduction
 The size of genomes has increased over
evolutionary time, with the extra genetic material
providing raw material for gene diversification
© 2014 Pearson Education, Inc.
Duplication of Entire Chromosome Sets
 Accidents in meiosis can lead to one or more extra
sets of chromosomes, a condition known as
polyploidy
 The genes in one or more of the extra sets can
diverge by accumulating mutations; these
variations may persist if the organism carrying
them survives and reproduces
 In this way genes with novel functions can evolve
© 2014 Pearson Education, Inc.
Alterations of Chromosome Structure
 Humans have 23 pairs of chromosomes, while
chimpanzees have 24 pairs
 Following the divergence of humans and
chimpanzees from a common ancestor, two
ancestral chromosomes fused in the human line
 Duplications and inversions result from mistakes
during meiotic recombination
 Comparative analysis between chromosomes of
humans and seven mammalian species paints a
hypothetical chromosomal evolutionary history
© 2014 Pearson Education, Inc.
Figure 21.11
Human
chromosome
Chimpanzee
chromosomes
Telomere
sequences
Centromere
sequences
Telomere-like
sequences
12
Centromere-like
sequences
2
© 2014 Pearson Education, Inc.
13
Figure 21.12
Human chromosome
16
© 2014 Pearson Education, Inc.
Mouse chromosomes
7
8
16
17
 The rate of duplications and inversions seems to
have accelerated about 100 million years ago
 This coincides with when large dinosaurs went
extinct and mammals diversified
 Chromosomal rearrangements are thought to
contribute to the generation of new species
© 2014 Pearson Education, Inc.
Duplication and Divergence of Gene-Sized
Regions of DNA
 Unequal crossing over during prophase I of
meiosis can result in one chromosome with a
deletion and another with a duplication of a
particular region
 Transposable elements can provide sites for
crossover between nonsister chromatids
© 2014 Pearson Education, Inc.
Figure 21.13
Nonsister
Gene
chromatids
Incorrect pairing
of two homologs
during meiosis
Crossover
point
and
© 2014 Pearson Education, Inc.
Transposable
element
Evolution of Genes with Related Functions: The
Human Globin Genes
 The genes encoding the various globin proteins
evolved from one common ancestral globin gene,
which duplicated and diverged about 450–500
million years ago
 After the duplication events, differences between
the genes in the globin family arose from the
accumulation of mutations
© 2014 Pearson Education, Inc.
Figure 21.14
Ancestral globin gene
Evolutionary time
Duplication of
ancestral gene
Mutation in
both copies
α
Transposition to
different chromosomes
Further duplications
and mutations
α
β
α
ζ
ζ
ζ α α 1 α2 α1 yθ
2
α-Globin gene family
on chromosome 16
© 2014 Pearson Education, Inc.
β
ϵ
β

ϵ
G A
β

β
β-Globin gene family
on chromosome 11
 Subsequent duplications of these genes and
random mutations gave rise to the present globin
genes, which code for oxygen-binding proteins
 The similarity in the amino acid sequences of the
various globin proteins supports this model of
gene duplication and mutation
© 2014 Pearson Education, Inc.
Evolution of Genes with Novel Functions
 The copies of some duplicated genes have
diverged so much in evolution that the functions of
their encoded proteins are now very different
 For example the lysozyme gene was duplicated
and evolved into the gene that encodes
-lactalbumin in mammals
 Lysozyme is an enzyme that helps protect animals
against bacterial infection
 -lactalbumin is a nonenzymatic protein that plays
a role in milk production in mammals
© 2014 Pearson Education, Inc.
Figure 21.15
(a) Lysozyme
Lysozyme
1
α–lactalbumin
1
Lysozyme
51
α–lactalbumin
51
Lysozyme
(b) α–lactalbumin
101
α–lactalbumin 101
(c) Amino acid sequence alignments of lysozyme and α–lactalbumin
© 2014 Pearson Education, Inc.
Rearrangements of Parts of Genes: Exon
Duplication and Exon Shuffling
 The duplication or repositioning of exons has
contributed to genome evolution
 Errors in meiosis can result in an exon being
duplicated on one chromosome and deleted from
the homologous chromosome
 In exon shuffling, errors in meiotic recombination
lead to some mixing and matching of exons, either
within a gene or between two nonallelic genes
© 2014 Pearson Education, Inc.
Figure 21.16
EGF
EGF
EGF
EGF
Epidermal growth
factor gene with multiple
EGF exons
F
F
F
Exon
shuffling
Exon
duplication
F
Fibronectin gene with multiple
“finger” exons
F
EGF
K
K
K
Plasminogen gene with a
“kringle” exon
Portions of ancestral genes
© 2014 Pearson Education, Inc.
Exon
shuffling
TPA gene as it exists today
How Transposable Elements Contribute to
Genome Evolution
 Multiple copies of similar transposable elements
may facilitate recombination, or crossing over,
between different chromosomes
 Insertion of transposable elements within a
protein-coding sequence may block protein
production
 Insertion of transposable elements within a
regulatory sequence may increase or decrease
protein production
© 2014 Pearson Education, Inc.
 Transposable elements may carry a gene or
groups of genes to a new position
 Transposable elements may also create new sites
for alternative splicing in an RNA transcript
 In all cases, changes are usually detrimental but
may on occasion prove advantageous to an
organism
© 2014 Pearson Education, Inc.
Concept 21.6: Comparing genome sequences
provides clues to evolution and development
 Comparisons of genome sequences from different
species reveal much about the evolutionary history
of life
 Comparative studies of embryonic development
are beginning to clarify the mechanisms that
generated the diversity of life-forms present today
© 2014 Pearson Education, Inc.
Comparing Genomes
 Genome comparisons of closely related species
help us understand recent evolutionary events
 Relationships among species can be represented
by a tree-shaped diagram
© 2014 Pearson Education, Inc.
Figure 21.17
Bacteria
Most recent
common
ancestor
of all living
things
Eukarya
Archaea
4
3
2
Billions of years ago
1
0
Chimpanzee
Human
Mouse
70
60
50
40
30
20
Millions of years ago
© 2014 Pearson Education, Inc.
10
0
Comparing Distantly Related Species
 Highly conserved genes have changed very little
over time
 These help clarify relationships among species
that diverged from each other long ago
 Bacteria, archaea, and eukaryotes diverged from
each other between 2 and 4 billion years ago
 Highly conserved genes can be studied in one
model organism, and the results applied to other
organisms
© 2014 Pearson Education, Inc.
Comparing Closely Related Species
 Genomes of closely related species are likely to
be organized similarly
 For example, using the human genome sequence
as a guide, researchers were quickly able to
sequence the chimpanzee genome
 Analysis of the human and chimpanzee genomes
reveals some general differences that underlie the
differences between the two organisms
© 2014 Pearson Education, Inc.
 Human and chimpanzee genomes differ by 1.2%
at single base-pairs, and by 2.7% because of
insertions and deletions
 Sequencing of the bonobo genome in 2012
reveals that in some regions there is greater
similarity between human and bonobo or
chimpanzee sequences than between chimpanzee
and bonobo
© 2014 Pearson Education, Inc.
 A number of genes are apparently evolving faster
in the human than in the chimpanzee or mouse
 Among them are genes involved in defense
against malaria and tuberculosis and one that
regulates brain size
© 2014 Pearson Education, Inc.
 Humans and chimpanzees differ in the expression
of the FOXP2 gene, whose product turns on
genes involved in vocalization
 Differences in the FOXP2 gene may explain why
humans but not chimpanzees communicate by
speech
 The FOXP2 gene of Neanderthals is identical to
that of humans, suggesting they may have been
capable of speech
© 2014 Pearson Education, Inc.
Figure 21.18
Experiment
Heterozygote: one
copy of FOXP2
disrupted
Homozygote: both
copies of FOXP2
disrupted
Experiment 1: Researchers cut thin sections of brain and stained
them with reagents that allow visualization of brain anatomy in a
UV fluorescence microscope.
Results
Experiment 1
Wild type
Experiment 2: Researchers
separated each newborn pup
from its mother and recorded
the number of ultrasonic
whistles produced by the pup.
Experiment 2
Heterozygote
Homozygote
Number of whistles
Wild type: two
normal copies of
FOXP2
400
300
200
100
0
(No
whistles)
Wild Hetero- Homotype zygote zygote
© 2014 Pearson Education, Inc.
Figure 21.18a
Experiment
Wild type: two
normal copies of
FOXP2
Heterozygote: one
copy of FOXP2
disrupted
Homozygote: both
copies of FOXP2
disrupted
Experiment 1: Researchers cut thin sections of brain and stained
them with reagents that allow visualization of brain anatomy in a
UV fluorescence microscope.
Results
Experiment 1
Wild type
© 2014 Pearson Education, Inc.
Heterozygote
Homozygote
Figure 21.18aa
Wild type: two
normal copies of
FOXP2
© 2014 Pearson Education, Inc.
Figure 21.18ab
Heterozygote: one
copy of FOXP2
disrupted
© 2014 Pearson Education, Inc.
Figure 21.18ac
Homozygote: both
copies of FOXP2
disrupted
© 2014 Pearson Education, Inc.
Figure 21.18b
Experiment
Wild type: two
normal copies of
FOXP2
Heterozygote: one
copy of FOXP2
disrupted
Homozygote: both
copies of FOXP2
disrupted
Experiment 2: Researchers separated each newborn pup from
its mother and recorded the number of ultrasonic whistles
produced by the pup.
Number of whistles
Results
Experiment 2
400
300
200
100
0
(No
whistles)
Wild Hetero- Homotype zygote zygote
© 2014 Pearson Education, Inc.
Figure 21.18ba
© 2014 Pearson Education, Inc.
Comparing Genomes Within a Species
 As a species, humans have only been around
about 200,000 years and have low within-species
genetic variation
 Variation within humans is due to single nucleotide
polymorphisms, inversions, deletions, and
duplications
 Most surprising is the large number of copynumber variants
 These variations are useful for studying human
evolution and human health
© 2014 Pearson Education, Inc.
Widespread Conservation of Developmental
Genes Among Animals
 Evolutionary developmental biology, or evo-devo,
is the study of the evolution of developmental
processes in multicellular organisms
 Genomic information shows that minor differences
in gene sequence or regulation can result in
striking differences in form
© 2014 Pearson Education, Inc.
 Molecular analysis of the homeotic genes in
Drosophila has shown that they all include a
sequence called a homeobox
 An identical or very similar nucleotide sequence
has been discovered in the homeotic genes of
both vertebrates and invertebrates
 Homeobox genes code for a domain that allows a
protein to bind to DNA and to function as a
transcription regulator
 Homeotic genes in animals are called Hox genes
© 2014 Pearson Education, Inc.
Figure 21.19
Adult
fruit fly
Fruit fly embryo
(10 hours)
Fruit fly
chromosome
Mouse
chromosomes
Mouse embryo
(12 days)
Adult mouse
© 2014 Pearson Education, Inc.
 Related homeobox sequences have been found in
regulatory genes of yeasts, plants, and even
prokaryotes
 In addition to homeotic genes, many other
developmental genes are highly conserved from
species to species
© 2014 Pearson Education, Inc.
 Sometimes small changes in regulatory
sequences of certain genes lead to major changes
in body form
 For example, variation in Hox gene expression
controls variation in leg-bearing segments of
crustaceans and insects
 In other cases, genes with conserved sequences
play different roles in different species
© 2014 Pearson Education, Inc.
Figure 21.20
Genital
Thorax segments
Abdomen
(a) Expression of four Hox genes in the brine
shrimp Artemia
Thorax
Abdomen
(b) Expression of the grasshopper versions of
the same four Hox genes
© 2014 Pearson Education, Inc.
Figure 21.10
DNA Direction of transcription
RNA transcripts
β-Globin
α-Globin
Nontranscribed
spacer
α-Globin
Transcription unit
β-Globin
DNA
rRNA
18S
28S
5.8S
28S
5.8S
18S
(a) Part of the ribosomal RNA gene family
© 2014 Pearson Education, Inc.
α-Globin gene family
Chromosome 16
ζ
 ζ  α  α α2 α1  θ
Embryo
2
1
Heme
β-Globin gene family
Chromosome 11
ϵ
G A
Fetus
and adult Embryo Fetus
β

β
Adult
(b) The human α-globin and β-globin gene
families
Figure 21.10c
DNA Direction of transcription
RNA transcripts
Nontranscribed
spacer
© 2014 Pearson Education, Inc.
Transcription unit
Figure 21.UN01a
Globin
Alignment of Globin Amino Acid Sequences
α1
ζ
1 MVLSPADKTNVKAAWGKVGAHAGEYGAEAL
1 MSL T KTER T I I VSMWAK I S TQADT I G TE T L
α1
ζ
31 ERMFLSF P T TKTYFPHFDLSH – GSAQVKGH
31 ERLFLSHPQTKTYF P HFDL –HPGSAQLRAH
α1
ζ
61 GKKVADALT NAVAHVDDMPNALSALSDLHA
61 GSKVVAAVGDAVKS I DD I GGALSKLSELHA
α1
ζ
91 HKLRVDPVNFKLLSHCL LV T L AAHL PA E FT
91 Y I LRVDPVNFKLLSHCL LV TLAARFPAD F T
α1
ζ
© 2014 Pearson Education, Inc.
121 PAV HASLDKF L ASVST V LT SKYR
121 AEAHAAWDKFLSVVSSVLT EKYR
Figure 21.UN01b
Amino Acid Identity Table
β Family
α Family
α Family
α1
α2
ζ
β Family
β

ϵ
A
G
© 2014 Pearson Education, Inc.
α1
(alpha 1)
α2
(alpha 2)
ζ
(zeta)
β
(beta)

(delta)
ϵ
(epsilon)
-----
100
61
45
44
39
42
42
-----
61
45
44
39
42
42
-----
38
40
41
41
41
-----
93
76
73
73
-----
73
71
72
-----
80
80
-----
99
A
G
(gamma A) (gamma G)
-----
Figure 21.UN01c
β
α
α
β
Hemoglobin
© 2014 Pearson Education, Inc.
Figure 21.UN02
© 2014 Pearson Education, Inc.
Figure 21.UN03
Archaea
Bacteria
Genome
size
Number of
genes
Gene
density
Introns
Other
noncoding
DNA
© 2014 Pearson Education, Inc.
Most are 1–6 Mb
1,500–7,500
Higher than in eukaryotes
None in
protein-coding
genes
Present in
some genes
Very little
Eukarya
Most are 10–4,000 Mb, but a
few are much larger
5,000–40,000
Lower than in prokaryotes
(Within eukaryotes, lower
density is correlated with larger
genomes.)
Present in most genes of
multicellular eukaryotes, but
only in some genes of
unicellular eukaryotes
Can exist in large amounts;
generally more repetitive
noncoding DNA in
multicellular eukaryotes
Figure 21.UN04
β-Globin gene family
α-Globin gene family
Chromosome 11
Chromosome 16
ζ
ζ α α
© 2014 Pearson Education, Inc.
2
1
α2 α1 θ
ϵ
G
A
β

β
Figure 21.UN05
© 2014 Pearson Education, Inc.
Figure 21.UN06
© 2014 Pearson Education, Inc.