* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Document
X-inactivation wikipedia , lookup
Short interspersed nuclear elements (SINEs) wikipedia , lookup
Cancer epigenetics wikipedia , lookup
Gene desert wikipedia , lookup
Cre-Lox recombination wikipedia , lookup
Genetic engineering wikipedia , lookup
Polycomb Group Proteins and Cancer wikipedia , lookup
Oncogenomics wikipedia , lookup
No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup
Gene expression programming wikipedia , lookup
Primary transcript wikipedia , lookup
Biology and consumer behaviour wikipedia , lookup
Extrachromosomal DNA wikipedia , lookup
Nutriepigenomics wikipedia , lookup
Ridge (biology) wikipedia , lookup
Genomic imprinting wikipedia , lookup
Vectors in gene therapy wikipedia , lookup
Transposable element wikipedia , lookup
Pathogenomics wikipedia , lookup
Genomic library wikipedia , lookup
Point mutation wikipedia , lookup
Metagenomics wikipedia , lookup
Gene expression profiling wikipedia , lookup
Epigenetics of human development wikipedia , lookup
Genome (book) wikipedia , lookup
Therapeutic gene modulation wikipedia , lookup
Human genome wikipedia , lookup
Site-specific recombinase technology wikipedia , lookup
Genome editing wikipedia , lookup
Minimal genome wikipedia , lookup
Designer baby wikipedia , lookup
History of genetic engineering wikipedia , lookup
Non-coding DNA wikipedia , lookup
Microevolution wikipedia , lookup
Artificial gene synthesis wikipedia , lookup
CAMPBELL BIOLOGY TENTH EDITION Reece • Urry • Cain • Wasserman • Minorsky • Jackson 21 Genomes and Their Evolution Lecture Presentation by Nicole Tunbridge and Kathleen Fitzpatrick © 2014 Pearson Education, Inc. Reading the Leaves from the Tree of Life Complete genome sequences exist for a human, chimpanzee, E. coli, brewer’s yeast, corn, fruit fly, house mouse, rhesus macaque, and many other organisms Comparisons of genomes among organisms provide insights into evolution and other biological processes © 2014 Pearson Education, Inc. Genomics is the study of whole sets of genes and their interactions Bioinformatics is the application of computational methods to the storage and analysis of biological data © 2014 Pearson Education, Inc. Figure 21.1 © 2014 Pearson Education, Inc. Figure 21.1a House mouse (Mus musculus) © 2014 Pearson Education, Inc. Concept 21.1: The Human Genome Project fostered development of faster, less expensive sequencing techniques Officially begun as the Human Genome Project in 1990, the sequencing of the human genome was largely completed by 2003 The genome was completed using sequencing machines and the dideoxy chain termination method A major thrust of the project was development of technology for faster sequencing © 2014 Pearson Education, Inc. Two approaches complemented each other in obtaining the complete sequence The initial approach built on an earlier storehouse of human genetic information Then J. Craig Venter set up a company to sequence the entire genome using an alternative whole-genome shotgun approach This used cloning and sequencing of fragments of randomly cut DNA followed by assembly into a single continuous sequence © 2014 Pearson Education, Inc. Figure 21.2-1 1 Cut the DNA into overlapping fragments short enough for sequencing. 2 Clone the fragments in plasmid or other vectors. © 2014 Pearson Education, Inc. Figure 21.2-2 1 Cut the DNA into overlapping fragments short enough for sequencing. 2 Clone the fragments in plasmid or other vectors. 3 Sequence each fragment. © 2014 Pearson Education, Inc. CGCCATCAGT AGTCCGCTATACGA ACGATACTGGT Figure 21.2-3 1 Cut the DNA into overlapping fragments short enough for sequencing. 2 Clone the fragments in plasmid or other vectors. 3 Sequence each fragment. CGCCATCAGT AGTCCGCTATACGA CGCCATCAGT ACGATACTGGT ACGATACTGGT 4 Order the sequences into one overall sequence with computer software. AGTCCGCTATACGA ⋯CGCCATCAGTCCGCTATACGATACTGGT⋯ © 2014 Pearson Education, Inc. Today the whole-genome shotgun approach is widely used, though newer techniques are contributing to the faster pace and lowered cost of genome sequencing These newer techniques do not require a cloning step These techniques have also facilitated a metagenomics approach in which DNA from a group of species in an environmental sample is sequenced © 2014 Pearson Education, Inc. Concept 21.2: Scientists use bioinformatics to analyze genomes and their functions The Human Genome Project established databases and refined analytical software to make data available on the Internet This has accelerated progress in DNA sequence analysis © 2014 Pearson Education, Inc. Centralized Resources for Analyzing Genome Sequences Bioinformatics resources are provided by a number of sources National Library of Medicine and the National Institutes of Health (NIH) created the National Center for Biotechnology Information (NCBI) European Molecular Biology Laboratory DNA Data Bank of Japan BGI in Shenzhen, China © 2014 Pearson Education, Inc. Genbank, the NCBI database of sequences, doubles its data approximately every 18 months Software is available that allows online visitors to search Genbank for matches to A specific DNA sequence A predicted protein sequence Common stretches of amino acids in a protein The NCBI website also provides 3-D views of all protein structures that have been determined © 2014 Pearson Education, Inc. Figure 21.3 WD40 - Sequence Alignment Viewer WD40 - Cn3D 4.1 CDD Descriptive Items Name: WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; © 2014 Pearson Education, Inc. Identifying Protein-Coding Genes and Understanding Their Functions Using available DNA sequences, geneticists can study genes directly The identification of protein coding genes within DNA sequences in a database is called gene annotation © 2014 Pearson Education, Inc. Gene annotation is largely an automated process Comparison of sequences of previously unknown genes with those of known genes in other species may help provide clues about their function © 2014 Pearson Education, Inc. Understanding Genes and Gene Expression at the Systems Level Proteomics is the systematic study of full protein sets encoded by a genome Proteins, not genes, carry out most of the activities of the cell © 2014 Pearson Education, Inc. How Systems Are Studied: An Example A systems biology approach can be applied to define gene circuits and protein interaction networks Researchers working on the yeast Saccharomyces cerevisiae used sophisticated techniques to disable pairs of genes one pair at a time, creating double mutants Computer software then mapped genes to produce a network-like “functional map” of their interactions The systems biology approach is possible because of advances in bioinformatics © 2014 Pearson Education, Inc. Figure 21.4 Translation and ribosomal functions Mitochondrial functions Peroxisomal functions RNA processing Transcription and chromatin-related functions Metabolism and amino acid biosynthesis Nuclearcytoplasmic transport Secretion and vesicle transport Nuclear migration and protein degradation Mitosis DNA replication and repair © 2014 Pearson Education, Inc. Glutamate biosynthesis Cell polarity and morphogenesis Protein folding and glycosylation; cell wall biosynthesis Vesicle fusion Serinerelated biosynthesis Amino acid permease pathway Figure 21.4a Translation and ribosomal functions Mitochondrial functions Peroxisomal functions RNA processing Transcription and chromatin-related functions Metabolism and amino acid biosynthesis Nuclearcytoplasmic transport Secretion and vesicle transport Nuclear migration and protein degradation Mitosis DNA replication and repair © 2014 Pearson Education, Inc. Cell polarity and morphogenesis Protein folding and glycosylation; cell wall biosynthesis Figure 21.4b Glutamate biosynthesis Vesicle fusion Serinerelated biosynthesis Amino acid permease pathway Metabolism and amino acid biosynthesis © 2014 Pearson Education, Inc. Application of Systems Biology to Medicine The Cancer Genome Atlas project, started in 2010, looked for all the common mutations in three types of cancer by comparing gene sequences and expression in cancer versus normal cells This was so fruitful, it has been extended to ten other common cancers Silicon and glass “chips” have been produced that hold a microarray of most known human genes These are used to study gene expression patterns in patients suffering from various cancers or other diseases © 2014 Pearson Education, Inc. Figure 21.5 © 2014 Pearson Education, Inc. Concept 21.3: Genomes vary in size, number of genes, and gene density By early 2013, over 4,300 genomes were completely sequenced, including 4,000 bacteria, 186 archaea, and 183 eukaryotes Sequencing of over 9,600 genomes and over 370 metagenomes is currently in progress © 2014 Pearson Education, Inc. Genome Size Genomes of most bacteria and archaea range from 1 to 6 million base pairs (Mb); genomes of eukaryotes are usually larger Most plants and animals have genomes greater than 100 Mb; humans have 3,000 Mb Within each domain there is no systematic relationship between genome size and phenotype © 2014 Pearson Education, Inc. Table 21.1 © 2014 Pearson Education, Inc. Number of Genes Free-living bacteria and archaea have 1,500 to 7,500 genes Unicellular fungi have from about 5,000 genes and multicellular eukaryotes up to at least 40,000 genes © 2014 Pearson Education, Inc. Number of genes is not correlated to genome size For example, it is estimated that the nematode C. elegans has 100 Mb and 20,100 genes, while Drosophila has 165 Mb and 14,000 genes Researchers predicted the human genome would contain about 50,000 to 100,000 genes; however the number is around 21,000 Vertebrate genomes can produce more than one polypeptide per gene because of alternative splicing of RNA transcripts © 2014 Pearson Education, Inc. Gene Density and Noncoding DNA Humans and other mammals have the lowest gene density, or number of genes, in a given length of DNA Multicellular eukaryotes have many introns within genes and a large amount of noncoding DNA between genes © 2014 Pearson Education, Inc. Concept 21.4: Multicellular eukaryotes have much noncoding DNA and many multigene families Sequencing of the human genome reveals that 98.5% does not code for proteins, rRNAs, or tRNAs About a quarter of the human genome codes for introns and gene-related regulatory sequences © 2014 Pearson Education, Inc. Intergenic DNA is noncoding DNA found between genes Pseudogenes are former genes that have accumulated mutations and are nonfunctional Repetitive DNA is present in multiple copies in the genome About three-fourths of repetitive DNA is made up of transposable elements and sequences related to them © 2014 Pearson Education, Inc. Figure 21.6 Regulatory sequences (5%) Exons (1.5%) L1 sequences (17%) Introns (∼20%) Repetitive DNA that includes transposable elements and related sequences (44%) Unique noncoding DNA (15%) Repetitive DNA unrelated to transposable elements (14%) Alu elements (10%) Simple sequence DNA (3%) © 2014 Pearson Education, Inc. Large-segment duplications (5–6%) Much evidence indicates that noncoding DNA (previously called “junk DNA”) plays important roles in the cell For example, genomes of humans, rats, and mice show high sequence conservation for about 500 noncoding regions © 2014 Pearson Education, Inc. Transposable Elements and Related Sequences The first evidence for mobile DNA segments came from geneticist Barbara McClintock’s breeding experiments with Indian corn McClintock identified changes in the color of corn kernels that made sense only if some genetic elements move from other genome locations into the genes for kernel color These transposable elements move from one site to another in a cell’s DNA; they are present in both prokaryotes and eukaryotes © 2014 Pearson Education, Inc. Figure 21.7 © 2014 Pearson Education, Inc. Figure 21.7a © 2014 Pearson Education, Inc. Figure 21.7b © 2014 Pearson Education, Inc. Movement of Transposons and Retrotransposons Eukaryotic transposable elements are of two types Transposons, which move by means of a DNA intermediate and require a transposase enzyme Retrotransposons, which move by means of an RNA intermediate, using a reverse transcriptase © 2014 Pearson Education, Inc. Figure 21.8 Transposon DNA of genome Transposon is copied Mobile copy of transposon © 2014 Pearson Education, Inc. New copy of transposon Insertion Figure 21.9 Retrotransposon New copy of retrotransposon Synthesis of a single-stranded RNA intermediate RNA Insertion Reverse transcriptase DNA strand Mobile copy of retrotransposon © 2014 Pearson Education, Inc. Sequences Related to Transposable Elements Multiple copies of transposable elements and related sequences are scattered throughout eukaryotic genomes In primates, a large portion of transposable element–related DNA consists of a family of similar sequences called Alu elements Many Alu elements are transcribed into RNA molecules; some are thought to help regulate gene expression © 2014 Pearson Education, Inc. The human genome also contains many sequences of a type of retrotransposon called LINE-1 (L1) L1 sequences have a low rate of transposition and may have effects on gene expression L1 transposons may play roles in the diversity of neuronal cell types © 2014 Pearson Education, Inc. Other Repetitive DNA, Including Simple Sequence DNA About 15% of the human genome consists of duplication of long sequences of DNA from one location to another In contrast, simple sequence DNA contains many copies of tandemly repeated short sequences © 2014 Pearson Education, Inc. A series of repeating units of 2 to 5 nucleotides is called a short tandem repeat (STR) The repeat number for STRs can vary among sites (within a genome) or individuals Simple sequence DNA is common in centromeres and telomeres, where it probably plays structural roles in the chromosome © 2014 Pearson Education, Inc. Genes and Multigene Families Many eukaryotic genes are present in one copy per haploid set of chromosomes The rest of the genes occur in multigene families, collections of identical or very similar genes Some multigene families consist of identical DNA sequences, usually clustered tandemly, such as those that code for rRNA products © 2014 Pearson Education, Inc. Figure 21.10a DNA Direction of transcription RNA transcripts Nontranscribed spacer Transcription unit DNA rRNA 18S 28S 5.8S 28S 5.8S 18S (a) Part of the ribosomal RNA gene family © 2014 Pearson Education, Inc. The classic examples of multigene families of nonidentical genes are two related families of genes that encode globins -globins and -globins are polypeptides of hemoglobin and are coded by genes on different human chromosomes and are expressed at different times in development © 2014 Pearson Education, Inc. Figure 21.10b β-Globin α-Globin α-Globin β-Globin α-Globin gene family Chromosome 16 ζ ζ α α α2 α1 θ 1 2 Embryo Heme β-Globin gene family Chromosome 11 ϵ G A β Fetus and adult Embryo Fetus Adult (b) The human α-globin and β-globin gene families © 2014 Pearson Education, Inc. β Concept 21.5: Duplication, rearrangement, and mutation of DNA contribute to genome evolution The basis of change at the genomic level is mutation, which underlies much of genome evolution The earliest forms of life likely had only those genes necessary for survival and reproduction The size of genomes has increased over evolutionary time, with the extra genetic material providing raw material for gene diversification © 2014 Pearson Education, Inc. Duplication of Entire Chromosome Sets Accidents in meiosis can lead to one or more extra sets of chromosomes, a condition known as polyploidy The genes in one or more of the extra sets can diverge by accumulating mutations; these variations may persist if the organism carrying them survives and reproduces In this way genes with novel functions can evolve © 2014 Pearson Education, Inc. Alterations of Chromosome Structure Humans have 23 pairs of chromosomes, while chimpanzees have 24 pairs Following the divergence of humans and chimpanzees from a common ancestor, two ancestral chromosomes fused in the human line Duplications and inversions result from mistakes during meiotic recombination Comparative analysis between chromosomes of humans and seven mammalian species paints a hypothetical chromosomal evolutionary history © 2014 Pearson Education, Inc. Figure 21.11 Human chromosome Chimpanzee chromosomes Telomere sequences Centromere sequences Telomere-like sequences 12 Centromere-like sequences 2 © 2014 Pearson Education, Inc. 13 Figure 21.12 Human chromosome 16 © 2014 Pearson Education, Inc. Mouse chromosomes 7 8 16 17 The rate of duplications and inversions seems to have accelerated about 100 million years ago This coincides with when large dinosaurs went extinct and mammals diversified Chromosomal rearrangements are thought to contribute to the generation of new species © 2014 Pearson Education, Inc. Duplication and Divergence of Gene-Sized Regions of DNA Unequal crossing over during prophase I of meiosis can result in one chromosome with a deletion and another with a duplication of a particular region Transposable elements can provide sites for crossover between nonsister chromatids © 2014 Pearson Education, Inc. Figure 21.13 Nonsister Gene chromatids Incorrect pairing of two homologs during meiosis Crossover point and © 2014 Pearson Education, Inc. Transposable element Evolution of Genes with Related Functions: The Human Globin Genes The genes encoding the various globin proteins evolved from one common ancestral globin gene, which duplicated and diverged about 450–500 million years ago After the duplication events, differences between the genes in the globin family arose from the accumulation of mutations © 2014 Pearson Education, Inc. Figure 21.14 Ancestral globin gene Evolutionary time Duplication of ancestral gene Mutation in both copies α Transposition to different chromosomes Further duplications and mutations α β α ζ ζ ζ α α 1 α2 α1 yθ 2 α-Globin gene family on chromosome 16 © 2014 Pearson Education, Inc. β ϵ β ϵ G A β β β-Globin gene family on chromosome 11 Subsequent duplications of these genes and random mutations gave rise to the present globin genes, which code for oxygen-binding proteins The similarity in the amino acid sequences of the various globin proteins supports this model of gene duplication and mutation © 2014 Pearson Education, Inc. Evolution of Genes with Novel Functions The copies of some duplicated genes have diverged so much in evolution that the functions of their encoded proteins are now very different For example the lysozyme gene was duplicated and evolved into the gene that encodes -lactalbumin in mammals Lysozyme is an enzyme that helps protect animals against bacterial infection -lactalbumin is a nonenzymatic protein that plays a role in milk production in mammals © 2014 Pearson Education, Inc. Figure 21.15 (a) Lysozyme Lysozyme 1 α–lactalbumin 1 Lysozyme 51 α–lactalbumin 51 Lysozyme (b) α–lactalbumin 101 α–lactalbumin 101 (c) Amino acid sequence alignments of lysozyme and α–lactalbumin © 2014 Pearson Education, Inc. Rearrangements of Parts of Genes: Exon Duplication and Exon Shuffling The duplication or repositioning of exons has contributed to genome evolution Errors in meiosis can result in an exon being duplicated on one chromosome and deleted from the homologous chromosome In exon shuffling, errors in meiotic recombination lead to some mixing and matching of exons, either within a gene or between two nonallelic genes © 2014 Pearson Education, Inc. Figure 21.16 EGF EGF EGF EGF Epidermal growth factor gene with multiple EGF exons F F F Exon shuffling Exon duplication F Fibronectin gene with multiple “finger” exons F EGF K K K Plasminogen gene with a “kringle” exon Portions of ancestral genes © 2014 Pearson Education, Inc. Exon shuffling TPA gene as it exists today How Transposable Elements Contribute to Genome Evolution Multiple copies of similar transposable elements may facilitate recombination, or crossing over, between different chromosomes Insertion of transposable elements within a protein-coding sequence may block protein production Insertion of transposable elements within a regulatory sequence may increase or decrease protein production © 2014 Pearson Education, Inc. Transposable elements may carry a gene or groups of genes to a new position Transposable elements may also create new sites for alternative splicing in an RNA transcript In all cases, changes are usually detrimental but may on occasion prove advantageous to an organism © 2014 Pearson Education, Inc. Concept 21.6: Comparing genome sequences provides clues to evolution and development Comparisons of genome sequences from different species reveal much about the evolutionary history of life Comparative studies of embryonic development are beginning to clarify the mechanisms that generated the diversity of life-forms present today © 2014 Pearson Education, Inc. Comparing Genomes Genome comparisons of closely related species help us understand recent evolutionary events Relationships among species can be represented by a tree-shaped diagram © 2014 Pearson Education, Inc. Figure 21.17 Bacteria Most recent common ancestor of all living things Eukarya Archaea 4 3 2 Billions of years ago 1 0 Chimpanzee Human Mouse 70 60 50 40 30 20 Millions of years ago © 2014 Pearson Education, Inc. 10 0 Comparing Distantly Related Species Highly conserved genes have changed very little over time These help clarify relationships among species that diverged from each other long ago Bacteria, archaea, and eukaryotes diverged from each other between 2 and 4 billion years ago Highly conserved genes can be studied in one model organism, and the results applied to other organisms © 2014 Pearson Education, Inc. Comparing Closely Related Species Genomes of closely related species are likely to be organized similarly For example, using the human genome sequence as a guide, researchers were quickly able to sequence the chimpanzee genome Analysis of the human and chimpanzee genomes reveals some general differences that underlie the differences between the two organisms © 2014 Pearson Education, Inc. Human and chimpanzee genomes differ by 1.2% at single base-pairs, and by 2.7% because of insertions and deletions Sequencing of the bonobo genome in 2012 reveals that in some regions there is greater similarity between human and bonobo or chimpanzee sequences than between chimpanzee and bonobo © 2014 Pearson Education, Inc. A number of genes are apparently evolving faster in the human than in the chimpanzee or mouse Among them are genes involved in defense against malaria and tuberculosis and one that regulates brain size © 2014 Pearson Education, Inc. Humans and chimpanzees differ in the expression of the FOXP2 gene, whose product turns on genes involved in vocalization Differences in the FOXP2 gene may explain why humans but not chimpanzees communicate by speech The FOXP2 gene of Neanderthals is identical to that of humans, suggesting they may have been capable of speech © 2014 Pearson Education, Inc. Figure 21.18 Experiment Heterozygote: one copy of FOXP2 disrupted Homozygote: both copies of FOXP2 disrupted Experiment 1: Researchers cut thin sections of brain and stained them with reagents that allow visualization of brain anatomy in a UV fluorescence microscope. Results Experiment 1 Wild type Experiment 2: Researchers separated each newborn pup from its mother and recorded the number of ultrasonic whistles produced by the pup. Experiment 2 Heterozygote Homozygote Number of whistles Wild type: two normal copies of FOXP2 400 300 200 100 0 (No whistles) Wild Hetero- Homotype zygote zygote © 2014 Pearson Education, Inc. Figure 21.18a Experiment Wild type: two normal copies of FOXP2 Heterozygote: one copy of FOXP2 disrupted Homozygote: both copies of FOXP2 disrupted Experiment 1: Researchers cut thin sections of brain and stained them with reagents that allow visualization of brain anatomy in a UV fluorescence microscope. Results Experiment 1 Wild type © 2014 Pearson Education, Inc. Heterozygote Homozygote Figure 21.18aa Wild type: two normal copies of FOXP2 © 2014 Pearson Education, Inc. Figure 21.18ab Heterozygote: one copy of FOXP2 disrupted © 2014 Pearson Education, Inc. Figure 21.18ac Homozygote: both copies of FOXP2 disrupted © 2014 Pearson Education, Inc. Figure 21.18b Experiment Wild type: two normal copies of FOXP2 Heterozygote: one copy of FOXP2 disrupted Homozygote: both copies of FOXP2 disrupted Experiment 2: Researchers separated each newborn pup from its mother and recorded the number of ultrasonic whistles produced by the pup. Number of whistles Results Experiment 2 400 300 200 100 0 (No whistles) Wild Hetero- Homotype zygote zygote © 2014 Pearson Education, Inc. Figure 21.18ba © 2014 Pearson Education, Inc. Comparing Genomes Within a Species As a species, humans have only been around about 200,000 years and have low within-species genetic variation Variation within humans is due to single nucleotide polymorphisms, inversions, deletions, and duplications Most surprising is the large number of copynumber variants These variations are useful for studying human evolution and human health © 2014 Pearson Education, Inc. Widespread Conservation of Developmental Genes Among Animals Evolutionary developmental biology, or evo-devo, is the study of the evolution of developmental processes in multicellular organisms Genomic information shows that minor differences in gene sequence or regulation can result in striking differences in form © 2014 Pearson Education, Inc. Molecular analysis of the homeotic genes in Drosophila has shown that they all include a sequence called a homeobox An identical or very similar nucleotide sequence has been discovered in the homeotic genes of both vertebrates and invertebrates Homeobox genes code for a domain that allows a protein to bind to DNA and to function as a transcription regulator Homeotic genes in animals are called Hox genes © 2014 Pearson Education, Inc. Figure 21.19 Adult fruit fly Fruit fly embryo (10 hours) Fruit fly chromosome Mouse chromosomes Mouse embryo (12 days) Adult mouse © 2014 Pearson Education, Inc. Related homeobox sequences have been found in regulatory genes of yeasts, plants, and even prokaryotes In addition to homeotic genes, many other developmental genes are highly conserved from species to species © 2014 Pearson Education, Inc. Sometimes small changes in regulatory sequences of certain genes lead to major changes in body form For example, variation in Hox gene expression controls variation in leg-bearing segments of crustaceans and insects In other cases, genes with conserved sequences play different roles in different species © 2014 Pearson Education, Inc. Figure 21.20 Genital Thorax segments Abdomen (a) Expression of four Hox genes in the brine shrimp Artemia Thorax Abdomen (b) Expression of the grasshopper versions of the same four Hox genes © 2014 Pearson Education, Inc. Figure 21.10 DNA Direction of transcription RNA transcripts β-Globin α-Globin Nontranscribed spacer α-Globin Transcription unit β-Globin DNA rRNA 18S 28S 5.8S 28S 5.8S 18S (a) Part of the ribosomal RNA gene family © 2014 Pearson Education, Inc. α-Globin gene family Chromosome 16 ζ ζ α α α2 α1 θ Embryo 2 1 Heme β-Globin gene family Chromosome 11 ϵ G A Fetus and adult Embryo Fetus β β Adult (b) The human α-globin and β-globin gene families Figure 21.10c DNA Direction of transcription RNA transcripts Nontranscribed spacer © 2014 Pearson Education, Inc. Transcription unit Figure 21.UN01a Globin Alignment of Globin Amino Acid Sequences α1 ζ 1 MVLSPADKTNVKAAWGKVGAHAGEYGAEAL 1 MSL T KTER T I I VSMWAK I S TQADT I G TE T L α1 ζ 31 ERMFLSF P T TKTYFPHFDLSH – GSAQVKGH 31 ERLFLSHPQTKTYF P HFDL –HPGSAQLRAH α1 ζ 61 GKKVADALT NAVAHVDDMPNALSALSDLHA 61 GSKVVAAVGDAVKS I DD I GGALSKLSELHA α1 ζ 91 HKLRVDPVNFKLLSHCL LV T L AAHL PA E FT 91 Y I LRVDPVNFKLLSHCL LV TLAARFPAD F T α1 ζ © 2014 Pearson Education, Inc. 121 PAV HASLDKF L ASVST V LT SKYR 121 AEAHAAWDKFLSVVSSVLT EKYR Figure 21.UN01b Amino Acid Identity Table β Family α Family α Family α1 α2 ζ β Family β ϵ A G © 2014 Pearson Education, Inc. α1 (alpha 1) α2 (alpha 2) ζ (zeta) β (beta) (delta) ϵ (epsilon) ----- 100 61 45 44 39 42 42 ----- 61 45 44 39 42 42 ----- 38 40 41 41 41 ----- 93 76 73 73 ----- 73 71 72 ----- 80 80 ----- 99 A G (gamma A) (gamma G) ----- Figure 21.UN01c β α α β Hemoglobin © 2014 Pearson Education, Inc. Figure 21.UN02 © 2014 Pearson Education, Inc. Figure 21.UN03 Archaea Bacteria Genome size Number of genes Gene density Introns Other noncoding DNA © 2014 Pearson Education, Inc. Most are 1–6 Mb 1,500–7,500 Higher than in eukaryotes None in protein-coding genes Present in some genes Very little Eukarya Most are 10–4,000 Mb, but a few are much larger 5,000–40,000 Lower than in prokaryotes (Within eukaryotes, lower density is correlated with larger genomes.) Present in most genes of multicellular eukaryotes, but only in some genes of unicellular eukaryotes Can exist in large amounts; generally more repetitive noncoding DNA in multicellular eukaryotes Figure 21.UN04 β-Globin gene family α-Globin gene family Chromosome 11 Chromosome 16 ζ ζ α α © 2014 Pearson Education, Inc. 2 1 α2 α1 θ ϵ G A β β Figure 21.UN05 © 2014 Pearson Education, Inc. Figure 21.UN06 © 2014 Pearson Education, Inc.