Download 3. Inheritance and hereditary

Document related concepts

Ploidy wikipedia , lookup

Chromosome wikipedia , lookup

Polymorphism (biology) wikipedia , lookup

Gene therapy of the human retina wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Karyotype wikipedia , lookup

Human genetic variation wikipedia , lookup

Heritability of IQ wikipedia , lookup

Inbreeding wikipedia , lookup

Twin study wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Gene expression programming wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Polyploid wikipedia , lookup

Genetic engineering wikipedia , lookup

Pharmacogenomics wikipedia , lookup

NEDD9 wikipedia , lookup

Gene wikipedia , lookup

X-inactivation wikipedia , lookup

Genomic imprinting wikipedia , lookup

Genome editing wikipedia , lookup

History of genetic engineering wikipedia , lookup

Transgenerational epigenetic inheritance wikipedia , lookup

Medical genetics wikipedia , lookup

Designer baby wikipedia , lookup

Genetic drift wikipedia , lookup

Behavioural genetics wikipedia , lookup

Genome (book) wikipedia , lookup

Epistasis wikipedia , lookup

Population genetics wikipedia , lookup

Hardy–Weinberg principle wikipedia , lookup

Microevolution wikipedia , lookup

Quantitative trait locus wikipedia , lookup

Dominance (genetics) wikipedia , lookup

Transcript
 3. Inheritance and hereditary
How do you determine how a phenotype is inherited? A specific phenotype is an
observable characteristic of an organism, and is encoded by the genotype, or genetic makeup,
of an organism. Specific phenotypes are usually studied with the goal of understanding how the
genes function within the organism to create its phenotype. For example, suppose you were
examining a population of nematodes and found a worm that was short and stout (dumpy)
instead of its usual body shape. Why are these worms dumpy? To start to understand this
question, you might begin by examining the inheritance pattern for this phenotype.
Understanding how this phenotype is inherited would allow you to learn things like how many
genes are responsible for the phenotype and how heritable this phenotype is. Once a gene is
pinpointed as being required for the inheritance of a phenotype, further studies on a gene can
be carried out. These types of studies include examining the function of that gene’s product
through genetic analysis (Chapter 4), investigating how that gene’s product interacts with other
gene products to determine phenotype (Chapter 5) and determining the physical location in the
genome (Chapter 9), which can tell you about the gene product encoded by the locus of
interest,
In this chapter, we discuss how to understand and analyze the pattern of inheritance of a
particular phenotype, starting with how to examine and understand the phenotype of interest.
We then discuss various modes of inheritance and the types of conclusions that can be made
as the inheritance pattern of phenotypes is observed. Next, we will discuss the mechanisms by
which inheritance patterns are governed, including the more traditional Mendelian inheritance
and the complications for genetic analysis from effects such as penetrance, expressivity, and
parent of origin effects. Then, we will discuss non-Mendelian inheritance, which includes
situations in which the observed phenotype is not directly predicted from the nuclear genotype.
Throughout the chapter, we will incorporate examples from key model organisms with
exemplary genetic characteristics to illustrate general principles that also apply to other
organisms, including new model organisms for which new technologies now or soon will allow
some aspects of genetic manipulation and study.
The general nomenclature used in this book is described in Box 3-1.
Understanding heritability of phenotypes
Phenotypes are observable characteristics or traits. Some phenotypes can be easily
visualized, such as a person’s eye color. Some phenotypes can require sophisticated research
tools to see, such as the localization of a particular protein within the cell (Fig. 1). The types of
phenotypes that can be studied include those that may be naturally occurring at easily
observable frequencies in populations, or those obtained in experimental organisms within the
lab (Chapter 7). Although the concept of a phenotype may seem deceptively simple, the proper
analysis of phenotypes can be complex.
Phenotypes can be defined by discrete or continuous variation
Before studying the heritability of a phenotype, exactly what is being measured as a
phenotype must be carefully defined. The classical examples of phenotypic analysis usually
involve discrete phenotypic variation, where an all-or-none trait is being discussed. This type
of variation can also be considered qualitative variation. Discrete phenotypic variation
comprises categories such as big versus small, blue eyes versus brown eyes, or alive versus
dead. On the other hand, a phenotype may involve continuous phenotypic variation, and is
usually described in a quantitative fashion. Continuous phenotypic variation is described as a
quantitative variable and involves measurements such as height, lifespan, the velocity of
Systems Genetics, Chapter 3 v1, 4/1/13
1
locomotion, or the resistance to stress (Fig. 2). These categories must be defined so that
phenotypes can be scored appropriately.
Phenotypes should be classified by a method that accurately and simply describes
variation
What determines whether a phenotype is treated as discrete or continuous? The
distinction depends on how the variation in a particular trait can be represented accurately, yet
as simply as possible. At one extreme, a trait whose variation is difficult to assign a numerical
value (for example, wrinkled versus smooth) is treated as discrete. On the other hand, a trait
readily assigned a numerical value (for example, height) can be treated as either discrete or
continuous depending on its pattern of variation. If the trait can be represented accurately as a
discrete trait, then the variation can be treated as a qualitative trait, which would simplify its
analysis.
For example, consider height as a phenotype you wish to study (Figure 3 [SGF-1025]). If
the heights of an organism cluster into discrete classes (in the simplest case, tall and short, with
no lengths in between), such variation can be treated as a qualitative, discrete phenotype,
simplifying its analysis. In this case, the organisms would be classified as either short or tall.
However, if the heights observed vary continuously from tall to short, then this variation is
probably best represented by describing individual phenotypes by their numerical values rather
than by partitioning them into categories, or bins.
For any qualitative phenotype, the description used to describe the discrete phenotype
implies an imposition of some threshold. Sometimes this threshold is intuitively obvious. For
example, the color of an organism can be scored as a qualitative phenotype. Whether an
organism is green or yellow is simply distinguished by the human eye, as we perceive colors
from different parts of the light spectrum as discrete colors. However, the color of an organism
can also be represented qualitatively by the wavelength of light reflected from the organism.
Although our eye perceives green and yellow as discrete, the distinction between green and
yellow represents two parts of the light spectrum (510 nm versus 570 nm, respectively) that are
not that far apart. The binning of these two parts of the light spectrum is essentially what our
eyes are doing when we decide whether an organism is either green or yellow.
Although our eye readily bins colors into discrete phenotypes, there are situations when
we might want to represent color as a continuous phenotype. For example, if your organism
displays phenotypes of “yellowish-green” or “greenish-yellow”, which are colors that lie in
between 510 and 570 nm of the light spectrum. Perhaps you suspect that multiple loci
contribute to the determination of the color of an organism, and an organism of the color from
535 nm is significantly distinct from an organism the color from 570 nm (Figure 4 [SGF-1199]). In
this case, the representation of color by the terms yellow and green would oversimplify the
phenotypes. Ultimately, this phenotypic oversimplification would be discovered when the
genetic models do not accurately predict inheritance patterns. In this chapter, we will focus on
methods used for phenotypes that show discrete phenotypic variation. The genetic dissection
of phenotypes showing continuous phenotypic variation will be discussed further in Chapter 6,
when we discuss the analysis of quantitative traits.
Determining patterns of inheritance for a phenotype
To understand how a phenotype is inherited, you will need to examine how a phenotype
behaves as it passes from the parents to its offspring. Often, the inheritance pattern over
several generations will need to be observed in order to understand the appropriate model of
inheritance.
Breeding crosses can be used to determine the pattern of inheritance
Systems Genetics, Chapter 3 v1, 4/1/13
2
Patterns of inheritance can be observed and determine by setting up specific breeding
crosses and examining the segregation of the phenotypes in the resulting progeny. This
strategy can be used to understand inheritance in model organisms, livestock, agricultural
organisms, or companion animals.
There are two main modes of breeding crosses: cross-fertilization and selffertilization. During cross-fertilization, organisms need to be crossed to another in order to
reproduce (i.e., Drosophila, humans, and most other common animal species) (Fig 5A [SGF1005]). Organisms that can self-fertilize include those that reproduce without being crossed to
another (Fig. 5B), such as C. elegans and some plant species. For the analysis of inheritance
pattern, microbes that undergo vegetative growth and create a new cell that is a copy of itself by
cell division (i.e., most bacteria like E. coli, as well as fungi such as the haploid S. cerevisiae)
are analyzed like self-fertilizing organisms. When these organisms grow vegetatively, no
introduction of genetic material is necessary to produce a new cell, much how a self-fertilizing
organisms propagates itself. Furthermore, when haploid microbes are mated to construct a
diploid (Fig. 5C [SGF-1042]), the creation of haploids from diploids occurs without introduction of
genetic material, thus behaving more like the self-fertilization process, as no additional crosses
are needed.
Pedigree analysis can be used to determine patterns of inheritance when breeding
crosses are not possible
In cases where crosses cannot be experimentally arranged (as in humans), pedigree
analysis is carried out to examine the pattern of inheritance. Pedigree analysis involves
examining different generations of individuals for the phenotype of interest, and using that
information to determine how that phenotype is expressed. Included in a pedigree is the
relation of each individual to the others within the pedigree. In human pedigrees, the sex of the
individual is usually stated as well. A simple example is shown in Fig. 6 [SGF-1002], where an
affected female mates with an unaffected male and generates two affected and two nonaffected progeny. Many fewer individuals are usually studied when pedigrees are used
compared to breeding crosses. Because of this, it can be difficult to narrow down a model of
inheritance for a particular phenotype. The more extensive a pedigree, the more likely you will
be able to narrow down the models of inheritance that fit that particular pedigree.
Simple versus complex models of inheritance
By examining inheritance patterns, we can construct models to explain how the
phenotype is inherited. When assessing patterns of inheritance, you usually approach the
problem by starting from one of two extreme alternative positions. At one extreme, the genetic
properties can be assumed to be simple (i.e., strict Mendelian inheritance, see below), with
more complicated hypotheses considered only when experimental observations conflict with the
simple model. At the other extreme, high complexity is anticipated from the outset.
Whether simple or complex models are considered depends on the particulars of the
analysis being conducted. If the alleles under study are present in a population that is otherwise
rather genetically homogeneous, the assumption that genetic properties are simple is most
expeditious. This is typically the case when studying laboratory-induced mutations in model
organisms, where the mutations are usually induced into a defined genetic background. In this
case, if it turns out that the genetic properties are not simple, additional genetic inter-breeding
experiments can be performed to dissect the complexity. However, for phenotypic variations
found in populations in which many genes are likely to vary, an approach involving few or no
assumptions about the nature of the underlying genetic properties is often employed. This is
typically the case for the examination of naturally occurring variations, like those studied in
humans. This approach recognizes the higher likelihood that variations in multiple genes
Systems Genetics, Chapter 3 v1, 4/1/13
3
underlie phenotypic variation within genetically heterogenous populations. Furthermore, in
many of these cases, the ability to test specific hypotheses through subsequent rounds of
interbreeding is limited, and thus it is better to make fewer assumptions. In these studies, data
are collected, and hypotheses subsequently formed to fit these data (e.g., Box 3-2).
Analyses involving simple models of inheritance usually involve discrete phenotypic
variation. For a simple model of inheritance, it is assumed that a single genetic locus is
responsible for the phenotypic effect and that all organisms of the appropriate genotype will
express the phenotype (i.e., Mendelian Inheritance, see below). These phenotypes are usually
assumed to be recessive to the allele found in the phenotypically wild-type strain. In addition,
the expression of the phenotype is assumed to be free of parental effect and not particularly
sensitive to environmental conditions.
However, there are many ways that inheritance does not follow the rules governing
simple inheritance, and a phenotype may display complex inheritance in many ways (Table 1).
Instead of the phenotype being due to the action of one locus, it is possible that multiple loci are
required for the phenotype. The phenotype may not be recessive to the allele found in wild type
strains. Furthermore, it is also possible that organisms that are genotypically similar may not
display the same phenotype, a phenomenon referred to as partial penetrance. The phenotype
may display differences depending on which parent it was inherited from (parent of origin effect)
and there may also be extracellular material transferred from the parent (i.e., maternal
inheritance) that may be involved. Finally, expression of the phenotype may be affected by
perturbations such that genotypically similar individuals may express the phenotype differently
depending on their environment. These more complex situations will be discussed later in this
chapter.
Like begetting like suggests genetic homogeneity
A true breeding phenotype is one where the organism and all its descendents display the
phenotype of interest (Fig. 7 [SGF-1044]). This phenomenon is observed at a population level.
For organisms that breed by crossing, a true breeding phenotype would be a phenotype that
exists in all members of a particular population where there is free crossing within that particular
population. For organisms that self-fertilize or reproduce by cell division, a true breeding
phenotype is one where an individual generates progeny of the same phenotype and each of
these progeny will generate descendents of that same phenotype.
Single cell colonies can display true breeding
True breeding phenotypes can be most simply observed in single celled organisms
growing vegetatively. A single cell that generates only cells of the same phenotype is inferred to
be true-breeding. Suppose that while studying a strain of yeast that normally forms creamcolored colonies, you isolate a colony containing a dark-pigmented sector (Fig. 8 [SGF-1014]). To
determine whether the dark pigmented phenotype is true breeding, you pick cells from the dark
colored part of the colony and use these cells to create many additional colonies, each founded
by a single dark-colored cell. Consider two different types of results you might obtain. First, the
simplest result you might obtain would be if cells that derive from the dark-colored colony all
give colonies that are dark colored. In this case, the dark-colored variant would appear to be
true breeding. A second type of result would be if cells that derive from the dark-colored colony
produce a mixture of dark and cream-colored colonies. This would suggest that the darkcolored variant was not true-breeding, as the dark-colored cells produce cells of varied
phenotype (Table 301).
Alleles represent different variants of a gene
The different variants of a given gene are referred to as alleles of that gene (see Box 3-1
for nomenclature conventions used in the text). For example, one diploid individual might
Systems Genetics, Chapter 3 v1, 4/1/13
4
contain two different alleles of gene A, namely a1 and a2. A diploid organism can be
homozygous for an allele, where the alleles at the gene are the same on both copies of the
chromosomes (i.e., for gene A, the organism is a1/a1). A heterozygous organism will have two
different alleles at the gene of interest (i.e., a1/a2) (Fig. 9 [SGF-1200]).
Sometimes diploid organisms are hemizygous at a particular locus and only have a
single copy of a gene. For example, if a gene were on a sex chromosome, the heterogametic
sex would be hemizygous. If the A locus were on the human X chromosome, a human male
would be hemizygous for the A locus and would be genotypically a1/O.
Mendelian Inheritance
Gregor Mendel demonstrated that variation within a single gene can underlie discrete
phenotypic variation. Thus, traits due to variation within a single gene are referred to as
Mendelian traits.
Simple Mendelian inheritance involves clear dominant-recessive relationships between
alleles
The simplest cases of Mendelian inheritance are those in which the multiple alleles of a
gene exhibit clear dominant-recessive relationships. In such a situation, a diploid animal will
express the phenotype associated with the dominant allele whenever at least one dominant
allele is present, while animals will express the phenotype associated with the recessive allele
only if the dominant allele is not present (Figure 10 [SGF-1201]). Thus, animals homozygous,
hemizygous or heterozygous for a dominant allele will show the dominant phenotype, while only
animals hemizygous or homozygous for recessive alleles will exhibit the recessive phenotype.
Often, though not always, wild-type phenotypes are associated with dominant alleles and
mutant phenotypes are associated with recessive alleles. The terms dominant and recessive
refer to the relationships of the various alleles within a genetic locus. More complicated
variations to the simple dominant-recessive relationship are discussed below under
complications to Mendelian inheritance.
Mendel’s laws describe simple patterns of inheritance
The two main genetic tenets from Mendel are the Law of Segregation and the Law of
Independent Assortment. The Law of Segregation is now understood to mean that when a
diploid organism produces gametes by meiosis, each allele is segregated into different gametes
(Fig. 11 [SGF-1003]). The Law of Independent Assortment says that this segregation of alleles
happens independently of other alleles (Figures 12 and13 [SGF-1004, 1013]). We now know this is
true for genetic loci that are not closely associated along a piece of DNA. These independently
assorting loci also known as unlinked loci, while those that that are closely associated along a
piece of DNA are considered linked (Chapter 9).
Together, Mendel’s two laws ensure that the segregation of genetic traits is intrinsically
probabilistic in nature: knowing the genotypes of the parents, you can predict the probability that
the offspring will have a particular genotype and phenotype. For a phenotype inherited in a
Mendelian fashion within a true-breeding population, the gene involved in the phenotype would
be homozygous. However, having a true-breeding strain does not always imply a homozygous
state. Sometimes, genetic situations can exist to maintain true-breeding phenotypes in a
heterozygous state (see Box 3-8); these types of situations can be useful for genetic analyses,
as we will see in later Chapters.
Mendelian traits display typical segregation patterns
One of the key features of Mendelian inheritance is the segregation of phenotypes in
precise ratios from a heterozygote, as this segregation pattern stems from Mendel’s laws (Box
Systems Genetics, Chapter 3 v1, 4/1/13
5
3-2). Whether a single gene determines a trait can be determined by examining the segregants
from a cross.
First consider the case in a diploid organism where we have two parents that are both
heterozygous at one locus, of genotype a1/a2. In this case, consider the a2 allele recessive to
the a1 allele such that only an organism homozygous for allele a2 displays the non-A phenotype.
For this type of cross, 1/4 of the progeny will homozygous for the a1 allele, 1/2 will be
heterozygous and have the genotype a1/a2, and 1/4 will be homozygous for the a2 allele (Fig. 14
[SGF-1055]). The ratio of progeny from this cross that will display the phenotype A to phenotype
non-A will therefore be 3:1.
As Mendel was studying peas, Mendel’s laws most simply apply to diploid organisms.
However, we sometimes speak of Mendelian inheritance in organisms that go through haploid
states in cases where inheritance follows similar rules as if the organism were always diploid.
One such example is the budding yeast S. cerevisiae, which can be grown as either a diploid or
a haploid. Rather than examining the segregation of a given trait by examining the phenotypes
of the diploid progeny created by crossing two diploid organisms, patterns of inheritance in
yeast are commonly studied by mating two haploid yeast cells to create a diploid yeast cells.
The diploid cell is then allowed to undergo meiosis through the process of sporulation, and the
meiotic products are scored (Fig. 15 [SGF-1202]). Because of this, if you were to start with a
diploid yeast cell that is heterozygous and of genotype a1/a2, the segregation pattern seen from
a yeast meiosis will be two haploid cells of phenotype A and two haploid cells of phenotype nonA.
What if more than a single locus is responsible for the phenotype of interest? This can
usually be determined from a simple cross. For a diploid organism, in the cross above, we see
that a 3:1 ratio would be expected, resulting in 3/4 of the offspring displaying the dominant
phenotype and 1/4 displaying the recessive phenotype. However, if two recessive loci were
required to produce the recessive phenotype, only 1 in 16 would display the recessive
phenotype given that there is only a 1/16 chance that both recessive alleles would be
homozygous in the offspring (Fig. 16 [SGF-1035]). For a number n genes, the expected proportion
of offspring displaying the recessive phenotype would be 1/4n, as each additional recessive
gene necessary for the phenotype contributes a factor of ¼.
Complications to Mendelian Inheritance
Traits that display simple Mendelian inheritance are relatively easy to analyze. These
traits are due to a single recessive allele that always displays a specific phenotype when the
recessive allele is homozygous. Many traits display this simple mode of inheritance. Others
that require more than one locus to display a phenotype can also display a relatively simple
mode of inheritance in the case where the phenotype is displayed only when all alleles involved
are homozygous, yielding offspring in an expected ratio (Fig. 16 [SGF-1035]). However, additional
factors can complicate the scoring of even a single gene trait including incomplete penetrance,
variable expressivity, incomplete dominance, parent of origin effect, and sex-linked inheritance.
In this next section, we will discuss these situations can that complicate the analysis of
inheritance patterns and will also provide examples of mechanisms that may underlie these
phenomena.
Incomplete penetrance complicates observing simple patterns
One factor that can complicate the scoring of a single gene trait is that not all organisms
of a given genotype will necessarily exhibit the phenotype, a phenomenon called incomplete
penetrance. Penetrance reflects the fraction of individual organisms of a given genotype that
display the phenotype (Fig. 17A [SGF-1015]).
Systems Genetics, Chapter 3 v1, 4/1/13
6
For example, consider a gene, B, important for viability. 100% of individuals who are
homozygous for b1 live. In this case, the allele b1 would be considered completely penetrant, as
all individuals of this genotype display the same phenotype. However, allele b2 is important for
viability, but behaves slightly differently from allele b1. Only 90% of individuals homozygous for
allele b2 are viable, while 10% of individuals homozygous for allele b2 are dead. The b2 allele is
considered incompletely penetrant (see Table 7 in Box 3-4).
When simply examining an individual for viability, it would be difficult to assess whether
an individual carries the b1 or b2 allele. To distinguish these alleles, you would need to examine
multiple members of a true breeding population, meaning, multiple members of a population
homozygous for the b1 allele and multiple member of a population homozygous for the b2 allele
(Figure 17A). If 90% of individuals in the population live, this would suggest the presence of
allele b2. On the other hand, if 100% live, this would suggest the presence of allele b1.
Examining multiple individuals of the same genotype will determine phenotype
penetrance
If you suspect the trait you are examining might have an incompletely penetrant
phenotype, the most important observation to make (if possible) is to examine the penetrance of
a homozygous individual. Since incomplete penetrance implies that individuals of the same
genotype can have distinct phenotypes, this also means that individuals with distinct
phenotypes can have the same genotype. Thus, analysis of multiple individuals with an
identical genotype will reveal incomplete penetrance: regardless of phenotype class, an
individual of the same genotype will segregate the same spectrum or proportion of phenotypes.
This is most readily seen in self-fertilizing organisms or haploid organisms (Figure 17B
[SGF-1015]). Consider allele c1, an incompletely penetrant allele of gene C where organisms who
are c1/ c1 will produce offspring that are 20% non-C phenotype and 80% C phenotype. In a
population of homozygous c1 individuals, an individual displaying either the non-C or C
phenotype will have offspring that are also 20% non-C phenotype and 80% C phenotype. Since
individuals from both phenotypic classes of the parents’ offspring generate descendents that
show the same spectrum of phenotypes as their parents’ and grandparents’ generations, we
can infer that the starting individual is homozygous for the c1 allele.
An example of incomplete penetrance can be seen when examining the C. elegans lin-3
gene (Figure 17C). When homozygous, the C. elegans lin-3(e1417) alelle results in 89% egglaying defective (Egl) hermaphrodites and 11% non-egg-laying defective (wild-type)
hermaphrodites. Thus, each homozygous lin-3(e1417) Egl hermaphrodite generates the same
distribution of Egl and wild-type progeny: 89% of the offspring will be Egl and 11% will be nonEgl, whether the hermaphrodite was of Egl or non-Egl phenotype. The lin-3 gene encodes the
major inductive signal necessary for induction of the vulva, which is required for egg-laying. At
the molecular level, the lin-3(e1417) allele appears to reduce the level of lin-3 activity to such a
degree that the probability that a vulva will form declines from 100% in wild-type to 11% in the
mutant.
Distinguishing incomplete penetrance from the effect of dominance involves examination
of the segregation pattern from crosses
How might you distinguish between the effects of incomplete penetrance or the effects of
dominance? Consider the example where you have two genetically identical individuals for
whom 75% of their offspring display phenotype and 25% display phenotype non-D (Fig. 18 [SGF1016]). Two hypotheses that could explain this segregation pattern are:
Hypothesis 1: The parents are heterozygous and both are genotypically d1/d2, where d1 is a
dominant allele. In this case, you would expect a 3:1 segregation of D phenotype: non-D
phenotype.
Systems Genetics, Chapter 3 v1, 4/1/13
7
Hypothesis 2: The parents are homozygous, and of genotype d1/d1, and the d1 allele is
incompletely penetrant such that only 75% organisms which are d1/d1 will display the D
phenotype.
One method to distinguish between these two possibilities is to breed two of the offspring
displaying the non-D phenotype together. If the non-D phenotype is due to a dominant, but
completely penetrant allele, 100% of the offspring should display the non-D phenotype.
However, if the non-D phenotype is due to incomplete penetrance, 25% of the offspring from
this cross will display the non-D phenotype.
Variable expressivity complicates observing simple patterns
The expressivity of a phenotype refers to the range of phenotypic severity that is
displayed in genetically identical organisms (Figure 20). For example, for a particular allele e1,
some individuals carrying this allele may have light blue eyes while other individuals have dark
blue eyes. The phenotypic range can be expressed in a discrete (i.e., light blue versus dark
blue) or in continuous (i.e., shades of blue, ranging from light to dark) terms.
A simple example of a phenotype with variable expressivity are temperature-sensitive
phenotypes. Geneticists have often isolated alleles of genes that show differing phenotypes at
different temperatures. These types of alleles have been particularly useful for genes that
otherwise would cause lethality when the gene function is completely removed. These
temperature sensitive alleles are also examples of alleles that display conditional inheritance,
which are alleles that display phenotypes only under particular conditions (e.g., Table 2).
The relationship of incomplete penetrance and variable expressivity is analogous to that
between continuous and discrete traits, as the two properties can be viewed as extremes along
a continuum. For incomplete penetrance, phenotypic variation is characterized as resulting in
multiple discrete classes of phenotype, while for variable expressivity, phenotypic variation
occurs over a more continuous spectrum with many gradations. Penetrance and expressivity
are both examples of genetically identical individuals that differ in phenotype. Box 3-4
discusses some general mechanisms how this might occurs.
Recessive and Dominant traits are not always absolute
In the simplest cases, the mutant allele is recessive to the wild-type allele. In this case,
the wild type allele is considered dominant, as in the heterozygote, the effect of the wild type
allele predominates (Fig. 21 [SGF-1204]). However, this is not always the case. Sometimes when
the heterozygous organism displays incomplete penetrance of the dominant phenotype, that
allele is said to be semi-dominant or partially dominant. Sometimes when effects of both alleles
are observed, the alleles are considered co-dominant (Table 3).
Dominant inheritance is most easily seen when two true-breeding strains that are each
homozygous for different alleles of a single gene are crossed to create a heterozygous
individual carrying two different alleles (Fig. 21 [SGF-1204]). If the alleles have a complete
dominant/recessive relationship, the allele whose phenotype is displayed would be the dominant
allele. When starting with a heterozygote, we can also determine which of the alleles it carries is
dominant by crossing it to a homozygous strain of the opposite phenotype (Fig. 22 [SGF-1205]).
What makes an allele dominant or recessive? Most recessive mutations disrupt some or
all of the function of a gene. Because the amount of gene product is either compensated due to
feedback within the cell or because the missing amount of gene product does not detectably
disrupt function, there is often no obvious immediate consequence of having only one functional
copy of a gene. The feedback control within cells is exquisite and takes care of a lot of the
noise inherent in any biological system. For example, quality control systems exist to get rid of
excess amounts of unbound subunits in multi-protein complexes such that proteins in
Systems Genetics, Chapter 3 v1, 4/1/13
8
complexes are stable and unbound subunits are marked for degradation. This is one kind of
mechanism that can compensate for the lack of two functional alleles, which is why alleles that
completely lack gene function can be recessive. The amount of gene product is in general,
proportional to the number of functional copies of the gene. So, a cell carrying a wild-type allele
and a complete loss-of-function allele would produce about 50% of the gene product compared
to cells that have two copies of the wild-type allele. In practice, this can sometimes exert a
biological effect. This phenomenon will be discussed further in Chapter 4.
For example, consider the case where gene H encodes an enzyme important for
production of pigment. Allele h1 specifies a functional enzyme while allele h2 specifies a nonfunctional enzyme. Allele h1 (which produces a functional enzyme) will likely be dominant. If
the exact level of enzyme is critical, then allele h1 may be semi-dominant. If allele h1 results in a
pigment being expressed in the tail and h2 results in a pigment being expressed in the belly,
they would be codominant, as now there would be functional allele in the tail as well as the belly
(Figure 23 [SGF-1206]).
Phenotypes can take time to develop [delete unless there are other mechanisms]
Sometimes there can be delay in the manifestation of a phenotype for a given genotype;
this is termed phenotypic lag. A phenotypic lag due to a stable protein being passed onto a
cell’s progeny is most often seen in single-celled organisms. The parent of origin effect is a
similar effect is seen in multicellular organisms and is described in the next section. One
mechanism for phenotypic lag involves genes that encode particularly stable proteins. In this
case, phenotypic lag is seen because while the progeny of that cell may inherit a gene that no
longer produces the gene product, the cell still has the gene product inherited from the mother
cell, and thus the phenotype of a cell lacking that gene product is not displayed. In this way, the
phenotype is not simply initially correlated with the genotype (Fig. 24 [SGF-1020]). If the protein is
present in sufficient quantities, particularly stable and also active in small quantities, there will
need to be enough cell divisions to occur until cells no longer inherit sufficient amounts of
protein so that the phenotype is displayed.
A clear-cut example of phenotypic lag involves the problems seen with telomere loss.
As discussed in Chapter 2 Box 2-1, telomeres have repeats that allow replication of the ends of
chromosomes. These repeats are lost each cycle of replication, but copies of these repeats are
added by the enzyme telomerase. In the absence of the telomerase enzyme, telomeres will
gradually shorten; however, cells often have kilobases of telomeric repeats and only a few
hundred bases will be lost each round of cell division. If an animal is missing the gene encoding
for the telomerase enzyme, the chromosome instability and other consequences of telomere
loss will not be manifested until the telomeric repeats are sufficiently shorted, leading to a
phenotypic lag. Indeed, in C. elegans, mutant lacking the telomerase enzyme will reproduce as
do wild type animals for several generations before becoming sterile. In Chapter 4, we will
discuss genetic mosaics in which perdurance of gene activity can be observed: an allele is lost
from a daughter cell but the effect of the gene continues for several cell generations.
One of the first examples of phenotypic lag involves the pep4 protease mutants in S.
cerevisiae. These mutants are defective in a set of vacuolar proteases (the yeast vacuole is
equivalent to the lysosome in higher eukaryotes). Proteinase A (the product of the PEP4 locus)
is necessary to cleave inactive pro-proteases to their active form. Mutant cells defective in pep4
gene nonetheless inherit enough active proteinase A and other proteinases (such as as
proteinase B), by cytoplasmic segregation even though they are unable to synthesize new
enzyme. After enough generations, the Pep4 protein has been diluted out. The yeast pep4-3
mutant is unable to cleave a carboxypeptidase Y substrate (N-acetylphenylalanine P-naphthyl
ester; APE) that can be in agar overlaid onto yeast colonies. The cleavage product, P-naphthol,
is treated with a chemical that results in an insoluble red dye deposited where there is enzyme
activity. Thus, colonies producing CPY stain red; those colonies lacking CPY activity are yellow.
Systems Genetics, Chapter 3 v1, 4/1/13
9
When a diploid heterozygous for a pep4-3 mutation undergoes meiosis and sporulation, two of
the spores are PEP4 (wild type) and two pep4-3. The PEP4 spores generate colonies that stain
red. The two pep4-3 spores generate mixed colonies, which stain with red and yellow regions.
Cells from yellow regions only generate yellow cells. The cells that are genotypically pep4-3 in
the red sectors are displaying phenotypic lag.
During each cell division, about half of the protein is lost from the cell. For this particular
protease, it can take tens of generations to manifest the phenotype! This is more the
exceptional case, as the stability of proteins are often regulated, including through the use of the
proteosome system of regulated protein destruction. Protein stability is highly regulated and
also differs among proteins.
Parent of origin effects arise from multiple causes
Parental effects are a causal effect of the parental genotype on the phenotype of the
offspring. Whenever parent of origin effects are seen, the genotype of the parent determines
the phenotype of the offspring, essentially overriding the genotype of the offspring. An effect
due to the genotype of the mother is called a maternal effect (Figure 25 [SGF-1207]). If the effect
is due to the genotype of the father, it is called a paternal effect.
Different types of maternal effects can be manifested. Sometimes a wild type copy of an
allele from a mother can lead to a situation where a recessive phenotype is not expressed in a
homozygous animal. Sometimes a mutant copy of an allele can lead to a situation where the
heterozygous individual from a homozygous mother will express a phenotype even though they
are heterozygous for a usually recessive allele (Figure 26 [SGF-1041]).
The situation with a homozygous mother’s genotype affecting the phenotype of her
offspring is illustrated in Figure 27 [SGF-1012]. In this case, allele s1 is recessive to the wild-type
allele (+). Allele s1 is a recessive allele, but also displays a maternal effect. Because allele s1 is
recessive, individuals that are s1/+ display the non-S phenotype. In generation II, a s1/+ mother
is crossed to an s1/+ father (generation I). The s1/ s1 progeny in generation II do not display the
S phenotype. However, when these s1/s1 females from generation II now mate to either a wild
type (+/+) male, all progeny in generation III now will display the S phenotype, regardless of
genotype. This would also be true if the s1/s1 females from generation II mated to a male that
was heterozygous (s1/+). However, if the s1/ s1males from generation II now are mated to a wild
type or heterozygous female, all progeny in generation III will be wild type, since the female was
genotypically wild type.
Maternal effects are more common than paternal effects because in many organisms,
the females produce oocytes, which contain much more than just the maternal DNA
contribution. In addition to the DNA, oocytes can contain large amounts of RNA and protein.
Thus, the mother’s genes, which are responsible for the contents of the egg, can have profound
effects on the newly formed organism.
Genomic imprinting can underlie some observed parental effects
Genomic imprinting is a heritable change that is not due to a change in DNA
sequence, but instead due to epigenetic marks on the DNA or chromatin. Genomic imprinting is
due to the parental genotype affecting the expression of particular alleles in the offspring and is
thus due to interplay between the parental and zygotic genomes.
In mammals, genomic imprinting can be caused by DNA methylation, where a methyl
group is covalently attached to a DNA base (Fig. 28A [SGF1053]). In this case, the methylation of
a particular region of DNA in one parent will decrease the expression of a particular allele in its
offspring, often leading to the lack of expression of the allele that is imprinted (Fig. 28B [SGF1209]). When cells divide in the offspring, the state DNA methylation can be inherited. Because
of this, the state of an imprinted allele can be stably maintained throughout the organism’s
development and its life (Fig. 28C [SGF-1210]). Because imprinting by DNA methylation results in
Systems Genetics, Chapter 3 v1, 4/1/13
10
the lack of expression of the allele, the offspring with the imprinted allele is in practice
hemizygous (Fig. 29 [SGF-1212]). The silencing of an allele by imprinting can lead to difficulties in
determining whether or not an allele behaves in a dominant or recessive fashion, as the
hemizygous state created by the imprinted allele means that the effect of an otherwise
recessive allele can now be exerted. Since only one active allele is inherited, imprinting is
referred to as a case of monoallelic inheritance. The inheritance of imprinting patterns can be
complex. In mice, during meiosis, the parental imprinting is removed and then reestablished in
the gamete according to the sexual genotype of the male or female.
Mouse Igf2 locus exemplifies imprinting
Genomic studies, discussed in Chapter 11, have revealed potentially hundreds of
imprinted loci in the mouse. For an individual locus, imprinting is perhaps best understood in
the mouse Igf2/H19 locus (Fig. 30 [SGF-1211]). igf2 encodes an insulin-like growth factor that is
necessary for growth of the embryo, and H19 is necessary to limit growth. Paternal imprinting
of the locus leads to increased expression of igf2 and decreased expression of H19. By
contrast, maternal imprinting of the locus leads to decreased expression of igf2 and increased
expression of H19.
The apparent dominance of either igf2 or H19 will depend upon the parent of origin. By
setting up reciprocal crosses, one can distinguish imprinting from dominance. In particular (Fig.
31 [SGF-1024]) if a3/a3 males are crossed with a1/a2 females and all progeny are of the a3associated phenotype, a3 appears dominant. However, if the cross is the reciprocal, with a3/a3
females crossed with a1/a2 males, then no progeny are of the a3-associated phenotype
Imprinting-like phenomena can also occur at a chromosomal level. Mammals have
evolved a mechanism to normalize the expression from the X chromosomes in males, who have
a single X chromosome, and females, who have two X chromosomes. This mechanism is called
dosage compensation. Mammalian dosage compensation occurs by preventing gene
expression from one of the two X chromosomes; this is also called X-inactivation. X-inactivation
occurs at random, and is stably maintained through somatic cell divisions. One consequence of
X-inactivation is that in a heterozygous individual (like all humans!), only one allele will be
expressed in a clone that has inactivated the other allele.
Phenotypic sex can affect the phenotypic expression
Another effect on phenotypic expression is an organism’s sexual phenotype. Some traits
are linked to sexual phenotype or limited to one sex (Fig. 32 [SGF-1032]). The simplest types of
sex-limited traits are secondary sex characteristics that are naturally sex limited, such as milk
production in females. These effects could be indirect, for example, deriving from secondary
sexual characteristics such as size. Some traits are genetically linked to sexual phenotype.
These are usually attributable to the mechanisms of sex determination.
An organism’s sex can be a striking phenotype (for sexually dimorphic species), and
thus it is often noted as it is a simple phenotype to observe in a pedigree or cross. Phenotypes
that co-segregate (are linked) to sexual phenotype are referred to as sex-linked. The nature of
the linkage depends upon the genetic determinants of sex.
Understanding chromosomal sex determination is necessary to understand the effect of
sex on phenotypic expression
To understand the segregation patterns of genes on sex chromosomes, you must first
know how chromosomal sex determination occurs in your organism of interest. Many organisms
specify sex based on chromosomal constitution such that there is often a heterogametic sex
that has two distinct sex chromosomes. The homogametic sex would have a pair of a single
sex chromosome. For example, in humans, males are the heterogametic sex and have a single
X and a single Y chromosome while females have two X chromosomes and no Y chromosome.
Systems Genetics, Chapter 3 v1, 4/1/13
11
In birds, the female is the heterogametic sex with one W chromosome and one Z chromosome
while the male has two W chromosomes. Sometimes the relative number of sex chromosomes
relative to can determine sex. In C. elegans, males have a single X chromosome while
hermaphrodites have two X chromosomes. In Drosophila melanogaster, sex determination also
depends on the ratio of X chromosomes to autosomes although there also is a Y chromosome.
The sex that has two distinct sex chromosomes is called the heterogametic sex (males
in humans and females in birds). Because the heterogametic sex has only a single of each
chromosome, a recessive allele can be expressed in the heterogametic sex because it is
hemizygous.
Chromosomal sex determination impacts the segregation patterns of genes on sex
chromosomes because recessive alleles can be expressed in the heterogametic sex since it is
hemizygous. This can affect the segregation patterns of genes. Instead of a 3:1 segregation for
a recessive allele, all males (or all females in birds) could display the recessive phenotype.
For example, in lovebirds (Agapornis roseicollis; small parrots), the standard color is
green and there is an autosomal recessive blue allele that confers blue feathers (Figure 33 [SGF1018]). In a blue/blue background, a Z linked lutino recessive allele confers yellow feathers. A
yellow male is homozygous for the blue allele and is homozygous for the W-linked recessive
lutino allele.
There are a wide variety of sex determination systems (Figure 34 [SGF-1036]). In animals,
there can be sex chromosomes such as the familiar XX-XY system of mammals, the WZ-WW
system of birds, or multiple sex chromosomes. Other less familiar sex determination systems
include maternal sex determination as in the carrion fly, or environmental sex determination in
which the sex of an individual depends on factors such as nutrition, population density or social
structure. In bacteria, either a cell has an F factor or not. In fungi there are one or more mating
type loci (Figure 35 [SGF-1208]). In a simple case, that of baker’s yeast, there is one mating type
locus and two sexes, a and α (alpha).
Non-Mendelian inheritance
Mendel’s great discoveries of segregation and independent assortment underlie the
most common modes of genetic inheritance. However, there are inherited characteristics that do
not obey Mendel’s laws and are thus considered non-Mendelian. While genes located on the
chromosomes in the nuclei of diploid cells underlie Mendelian inheritance, genes controlling
traits inherited in a non-Mendelian fashion reside on cytoplasmic organelles, cytoplasmic
elements (other than organelles), and plasmids. We need to understand non-Mendelian modes
of inheritance not only because it is crucial in its own right, but also because we need to rule out
other patterns of inheritance to infer Mendelian inheritance. The key characteristic of nonMendelian inheritance in eukaryotes is that the origin of the cytoplasm (and not the nuclear
DNA) determines the phenotype. Cytoplasmic inheritance can be due to organelles or any
other particles that can be passed on to the next cell generation and affect its phenotype (Fig.
36 [SGF-1017]).
Cytoplasmic inheritance can arise from variant organelle genomes
Eukaryotic cells have organelles, which include mitochondria and chloroplasts.
Mitochondria provide energy for the cell as well as carrying out some key biochemical reactions.
Chloroplasts carry out photosynthesis. Most importantly for genetic analysis, mitochondria and
chloroplasts have their own genomes. As these organelles and their genomes are partitioned at
mitosis and passed on to the next generation, they will have distinct patterns of inheritance (Fig.
36 [SGF-1017]). In most cases, one parent will provide the organelles for the offspring. This
parent is usually the female, as the oocyte contains the materials necessary in the cytoplasm for
the development of the zygote. There are diseases in humans that are due to mitochondrial
Systems Genetics, Chapter 3 v1, 4/1/13
12
DNA mutations. Typically in these cases, the transmission of the diseases states occurs only
via the female.
An example of a disease inherited through organellar inheritance is encephalomyopathy
(myoclonic epilepsy with ragged red fibers; abbreviated MERRF). MERRF results from a defect
in the ability of mutant mitochondria to synthesize proteins. This inability results from a mutation
of a Lysine-inserting tRNA gene encoded by the mitochondrial genome. A pedigree of a family
affected by MERRF is shown in Fig. 37 [SGF-1037]. All four progeny of the key individual studied in
a particular pedigree (the proband), individual I-1 display MERRF. The male II-4, who was
affected, had three normal children because these children received their mitochondria from
their unaffected mother. Mitochondrial inheritance is another mechanism by which a parental
effect can occur. However, unlike the cases described above, this type of inheritance would
involve a maternal effect of a non-Mendelian trait. Cytoplasmic inheritance can be similar to
dominant maternal effect inheritance, but in practice these can be distinguished (Box 3-7).
A typical cell also has many copies of the organellar genome, as there are many copies
of the mitochondria (and chloroplast, in cells that have them) within the cell. Having multiple
copies of the genome can affect the inheritance patterns and genome function. For example, a
cell can have a heterogeneous organelle set, with some organelles having a defective genome
that are carried along with other organelles containing a fully functional genome. The
organelles with the defective genome can function because of the presence of the other
organelles with a functional genome.
Cytoplasmic inheritance can also arise from other nucleic acid based entities
Similar to how organellar genomes can affect phenotype, any other genetically-based
entity that is carried within a cell or organism and is passed onto the organism’s progeny can
affect inheritance. Pathogens, symbionts, and cytoplasmically inherited viruses are examples of
these types of situations. These situations can be difficult to distinguish from an environmentally
based effect, but are important to keep in mind when complex inheritance patterns are seen.
For example, a double-stranded RNA virus in yeast called “killer element” causes yeast
cells that harbor this virus to secrete a toxin that kills cells that lack the killer element. This killer
element is cytoplasmically inherited, and thus displays properties similar to other
mitochondrially-inherited traits (Fig. 38 [SGF-1215]).
The popcorn mutant found in Drosophila is another example of this type of inheritance.
Flies that carry the popcorn mutant undergo brain degeneration (Fig. 39). However, the
popcorn gene was not found on any Drosophila chromosome. Instead, this mutation was found
on the genome of a mutant form of Wolbachia, a Rickettsial family microbe that is normally a
symbiont of insects.
A more complicated case of inheritance due to other genetic entities is the maternal
transfer of an infectious agent. The mouse mammary tumor virus (MMTV) is a retrovirus whose
genome can be integrated into the host genome and thus passed along to its offspring through
transmission of the genome. The presence of MMTV will predispose mice to mammary tumors.
However, passage through the genome is not always the mode of inheritance, as this virus can
also be transmitted from an infected mother to its offspring through its milk. Ingestion of cells
carrying this virus leads to the virus passing through the digestive system into the organism’s
immune cells, where it can infect some cells of the immune system and ultimately infect
mammary gland cells where it affects their proliferation. This unusual mode of transmission was
seen because adopted offspring (who are genetically dissimilar to the mother) were also
displaying a predisposition to mammary tumors when nursed by mothers infected with MMTV,
as the virus was being transmitted in the milk of the infected host to adoptive offspring through
nursing. HIV also passes through mother’s milk.
Box 3-9 describes a way to distinguish nuclear from cytoplasmic inheritance.
Systems Genetics, Chapter 3 v1, 4/1/13
13
Cytoplasmic inheritance can arise from protein states- prions
Prions describe a cytoplasmic inheritance agent that does not involve nucleic acids.
Instead, prions involve an altered conformation of a protein (designated as PrP*) that has the
ability to convert other normal conformations of that protein (designated as PrP) into a prion
state (Fig. 40 [SGF-1039, 1214]). This prion-forming ability is usually due to specific sequences
within the protein that allow for the protein in the prion state to form and create self-propagating
filaments from the normal protein. Prions include pathogenic agents for a variety of disease
such as human Cruetzfeldt-Jacob disease, bovine spongiform encephalopathy [a.k.a., mad cow
disease] and sheep scrapie as well as altered forms of a handful of yeast genes such as
SUP35, RNQ1, and URE3. Because the altered PrP* can convert the normally folded PrP
protein to the PrP* prion state, the transmission of the protein in a PrP* prion state can occur in a
cytoplasmic manner as cells divide. Once a cell inherits a protein in a prion state, this prion
protein can convert the normally folded proteins into the cell into a prion state. If the presence
of the prion now causes a phenotype, this phenotype has now been inherited in a cytoplasmic
fashion in a manner not involving nucleic acids. Prions apparently infect immune cells in the gut
and then spread into the brain.
Determining inheritance: the case of the Dumpy phenotype
How do you determine the inheritance pattern of a dumpy worm? First, you might want
to examine what the progeny of the dumpy worms are like: do they produce other dumpy
worms? Assuming that the dumpy phenotype is heritable, you might want to examine the
phenotype more carefully and see whether dumpy versus non- dumpy is a discrete phenotype
or a more continuous phenotype with different gradations of dumpiness. Since this is a
nematode, you can examine what happens when you cross worms from a dumpy culture with
worms from a non-dumpy culture: what phenotypes will segregate from the progeny of this
cross? Does this differ when you cross two dumpy strains together? Do your results suggest
simple Mendelian inheritance? Is it due to the inheritance of several loci that are required for
the phenotype to be expressed? If not, you will then have to consider the factors that can
complicate Mendelian inheritance and also non-Mendelian inheritance.
How would you examine a newly isolated phenotype: Let us consider an example from
C. elegans. Like Sydney Brenner, you pick a variant C. elegans that has a Dumpy (Dpy)
phenotype (Fig. 41): this animal appears shorter and squatter than the typical C. elegans worm.
First, you want to test whether this phenotype is heritable (Fig. 42 [SGF-1021-1]). You let
the worm you pick self-fertilize and find that the Dumpy phenotype breeds true: all its progeny
were similarly Dumpy. Since you are studying C. elegans, let’s assume this variation is due to
a simple model of inheritance; we will consider otherwise if the data suggests.
You can ask if this variant allele is recessive to the wild-type allele. You cross the Dpy
strain to a non-Dpy strain and all the F1 animals are non-Dpy (Fig. 43 [SGF-1021-2]). This
observation implies that the variant allele is recessive to the wild-type allele. Since you have
now created a heterozygous F1 hermaphrodite, you let an individual F1 hermaphrodite selffertilize, and find that about one-quarter of its progeny are Dpy, the remaining three-quarters
being non-Dumpy (Fig. 44 [SGF-1022]). These observations are consistent with the variant being
a single locus.
To test your hypothesis of the Dumpy phenotype being due to a single locus, you let
some of the F2 individuals self-fertilize. The F2 Dpy gives only Dpy progeny, consistent with
your original observation of it being recessive. You also observe that 2/3 of the non-Dpy F2
segregate both Dpy and non-Dpy progeny, while 1/3 of the non-Dpy F2 segregate only nonDpy progeny (Fig. 44 [SGF-1022]).
Why is there a 1:2 ratio of non-Dpy segregating F2 to individuals that segregate both
Dpy and non-Dpy progeny? Consider the genotype of the F1 used to generate these F2
individuals. The F1 is dpy/+ (non-Dpy) and produces ¼ dpy/ dpy (Dpy): ½ dpy /+ (non-Dpy): ¼
Systems Genetics, Chapter 3 v1, 4/1/13
14
+/+ non-Dpy progeny. Half of the non-Dpy progeny (which are ¾ of the total progeny are
heterozygous, and thus ½ / ¾ =2/3. In other words, we only look at the 2:1 part of the 1:2:1
ratio.
Now, suppose the Dumpy variant is dominant to the wild-type allele. You might first
notice this when you cross your original Dpy individual to the wild type animal and see that all
F1 individuals are Dpy (Fig. 45 [SGF-1023]). The segregation patterns from individual F1 Dpy
heterozygotes (dpy /+) will then yield one-quarter non-Dpy progeny and ¾ Dpy progeny. Of the
F2 Dpy individuals, 1/3 will be homozygous (dpy/dpy) and will only have Dpy progeny while 2/3
will be heterozygous (dpy /+) and will yield ¼ non-Dpy progeny and ¾ Dpy progeny.
Understanding how to analyze the inheritance patterns of a phenotype allows for more
mechanistic understanding of how the causative agent (usually a gene) affects the organism.
More generally, although most genes in most organisms are inherited in patterns consistent with
Mendel’s Laws, a fraction of the organism’s genes may display complicating factors such as
parent-of-origin effects and incomplete penetrance. Furthermore, phenotypes that are due to
non-Mendelian inheritance are ultimately also part of an organism. Thus, the understanding of
an individual comprises the organism and its associated symbionts and parasites. Because of
this, the phenotypic properties of an individual ultimately arises from the individual’s zygotic
genome, it’s maternal and paternal material, as well as the genotype of its associated
organisms.
Essential Concepts
Ø Phenotypic variation can discrete or continuous, either of which can allow
analysis of heredity
Ø True breeding phenotypes indicate genetic homogeneity
Ø Wild-type alleles are usually dominant to mutant alleles
Ø Single Mendelian inheritance is single locus segregation, clear dominance,
complete penetrance and no parental effects
Ø Models of inheritance are fit to data, statistically-tested and verified if possible
Ø Incomplete penetrance suggests an underlying stochastic process
Ø Traits can be co-dominant
Ø Phenotypes can take many cell generations to manifest
Ø Sex limited traits suggest sex linkage
Ø Sex determination is often based on chromosomal constitution
Ø Parent of origin effects have many and interesting causes
Ø Cytoplasmic inheritance has multiple causes including organelles, viruses, and
prions
References and recommended reading
Collinge, J. (2010). Prion strain mutation and selection. Science 328: 1111-1112.
SW Leibman and F Sherman Cytoplasmic Inheritance, In Landmark papers in yeast biology. P.
Linder, D. Shore and MN Hall, Eds. (2006).
Wrinkled Peas & White-Eyed Fruit Flies: The Molecular Basis of Two Classical Genetic Traits
Patrick Guilfoile The American Biology Teacher, Vol. 59, No. 2 (Feb., 1997), pp. 92-95 (article
consists of 4 pages) Published by: University of California Press on behalf of the National
Association of Biology Teachers Stable URL: http://www.jstor.org/pss/4450256
Systems Genetics, Chapter 3 v1, 4/1/13
15
East, E. M. (1935). Genetic reactions in Nicotinia. III. Dominance. Genetics 20: 443 S
Horvitz HR, Sulston JE. Isolation and genetic characterization of cell-lineage mutants of the
nematode Caenorhabditis elegans. Genetics. 1980 Oct;96(2):435-54. PubMed PMID: 7262539;
PubMed Central PMCID: PMC1214309.
Shoffner JM, Lott MT, Lezza AM, Seibel P, Ballinger SW, Wallace DC. Myoclonic epilepsy and
ragged-red fiber disease (MERRF) is associated with a mitochondrial DNA tRNA(Lys) mutation.
Cell. 1990 Jun 15;61(6):931-7. PubMed PMID: 2112427. Example of a mitochondrial mutation
Systems Genetics, Chapter 3 v1, 4/1/13
16
Systems Genetics Chapter 3 Figures and Tables, version 1
Figure 1. Pretty pictures of phenotypes.
Whole organism
Plant, animal
Cell
Behavioral
Figure 2. Examples of discrete vs continuous traits. Normal distributed phenotype,
real examples A. Discrete. B. continuous.
(blue vs brown eyes, different flower colors, agamous vs wild type flowers, etc.) ––
people of all different heights?
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
1
Figure 3. [SGF-1025]. Continuous versus discrete traits.
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
2
Figure 4. Colors as discrete versus continuous traits.
[SGF-1199]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
3
Figure 5. Typical mating systems. A. Cross-Fertilization. [SGF-1005 =A, B] B. Selffertilization. C. Yeast mating system [SGF-1042].
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
4
Figure 6. Pedigrees. Black circles indicate females that display the particular trait of
interest (affected). Generations are indicated by roman numerals. Horizontal lines
between females and males indicate mating [SGF-1002]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
5
Table 1. Simple versus complex modes of inheritance
SIMPLE
COMPLEX
Discrete (as opposed to continuous)
Combination of additive and epistatic
variation
effects
One locus with all the effect
Multiple loci with a spectrum of effects
Complete penetrance
Partial penetrance
Recessive to the wild-type strain
Partial dominance
Phenotype immediately discernible
Delayed appearance of phenotype
Zygotic expression
Parental effects
Robust to environmental perturbations
Sensitivity to environment
Independent of paternal origin
Parent of origin effects (maternal effect)
Nuclear, chromosomal inheritance
Cytoplasmic inheritance
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
6
Figure 7. True Breeding population [SGF-1044] Figure showing a true breeding
population for A) a crossing strain and B) a selfing strain and C) a microbial strain. C. A
single cell of genotype A1 gives rise to two progeny cells also of genotype A1. By
separating out the microbes to single cells and letting them grow, a colony derived from
a single parent cell is obtained.
D Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
7
Figure 8. Sectored colonies. [SGF-1014] Schematic of origin of sectored colonies.
Table 301: Results from the yeast sectoring experiment
Type of
result
I.
II.
Dark gives:
III.
Dark with
cream
sectors
Dark only
Dark
Cream gives: Dark true
breeding?
Cream only
Yes
Cream with
Yes
dark sectors
Cream with
No
dark sectors
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
Cream true
breeding?
Yes
No
No
8
Fig. 9. Definition of homozygous, heterozygous and hemizygous. [SGF-1200]
Fig. 10. Definition of dominant and recessive. [SGF-1201]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
9
Figure 11. Mendel’s law of Segregation. [SGF-1003] Segregation of chromosomes at
meiosis into individual gametes.
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
10
Figure 12. Mendel’s Law of Independent Assortment. [SGF-1004] Each pair of
homologous chromosomes independently undergoes meiosis (replication and
disjunction). With two chromosomes each heterozygous, meiosis generates four
gamete types.
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
11
Figure 13 Punnett square of two locus segregation. [SGF-1013] Each female
gamete, or male gamete can have either allele at either of the two loci. There are thus
four types of gametes and nine genotypes possible.
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
12
Figure 14. Diploid segregation patterns for Mendelian traits [SGF-1055].
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
13
Figure 15. Yeast-like alternation of haploid and diploid. [SGF-1202]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
14
Figure 16. For each independent locus necessary to confer the phenotype, there is a
factor of ¼, because of Mendel’s first Law. [SGF-1035]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
15
Figure 17. Penetrance. A. Incomplete penetrance: by contrast to wild-type
(penetrance v = 0), and a fully penetrant (v=1), incompletely penetrant phenotypes have
some wild-type and some mutant, for example, live worms or dead eggs. B. From a
cross of potentially true-breeding individuals, some of the progeny (proportion v) are of
Phenotypically R while others are not (proportion 1-v). Breeding of Phenotype R
individuals yields both R and non-R, and breeding of non-R also yields R and non-R
progeny in proportion v : 1-v. [SGF-1015]. C. lin-3(e1417) is incompletely penetrant.
[SGF-1226]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
16
C. C. elegans lin-3 example of incomplete penetrance. add Photographs.
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
17
Figure 18. Incomplete penetrance versus dominant phenotype. Two Phenotype
D individuals are crossed, and among their progeny 75% are Phenotype D and 25% are
phenotype non-D. Progeny of phenotype D are crossed together and their progeny
examined. There are two simple possibilities. If again 75% are of phenotype D and
25% non-D, we infer that the D individuals are true breeding but of incomplete
penetrance (75% penetrance). On the other hand, if all the progeny are phenotype D,
then we infer that the original D parents were heterozygous for a dominant allele
conferring the D phenotype. Only 1/16 of the progeny of all the D F2 animals will be
non- D, they must be from two heterozygous parents ( ½ x ½ ), and inherit the + allele
from both ( ½ x ½ ) for 1/16. [SGF-1016]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
18
Fig. 19. [SGF-1230] Calculation for dominant crosses.
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
19
Figure 20. Expressivity. A. C. elegans Multivulva phenotypes. top, wild type, nonMultivulva hermaphrodite; middle, partial Multivulva phenotype; bottom, strong
Mulitvulva phenotype. Blue arrow indicates functional vulva; red arrow indicateds
pseudo-vulva. Left-lateral views of L4 stage hermaphrodite larvae. Modified from Zhong
and Sternberg Science 2006. B. temperature sensitive yeast growth. The expressivity
of the ts mutant depends on temperature
INCLUDE WHOLE ANIMAL PICTURES
A.
B.
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
20
Table 2. Conditional alleles
genotype f+ f+ f1 f1 f2 f2 f3
f3
type of allele
wild-­‐type wild-­‐type non-­‐conditional non-­‐conditional ts ts cs cs temperature 16 30 16 30 16 30 16 30 Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
phenotype alive alive dead dead alive dead dead alive 21
Figure 21. Using two true-breeding strains to determine allele relationships. Cross
of two homozygotes. [SGF-1204]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
22
Table 3: Dominance.
Complete
dominance
Co-dominance
Semi-dominance
Genotype
a1/a2
Phenotype
All organisms display A phenotype.
a1/a2
All organisms display an intermediate
phenotype compared to organisms that are
a1/a1 and a2/a2 or there is reduced penetrance
relative to the homozygotes
Only some organisms display the A
phenotype, a fraction will display the non-A
phenotype.
a1/a2
Table 3a: Numerical example of semi-dominance
Genotype Phenotype
a1/a1
100% display A phenotype
a1/a2
10% display A phenotype.
a2/a2
0% display the A phenotype,
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
23
Figure 22. [SGF-1205] Determining dominance starting with a heterozygote.
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
24
Figure 23 [SGF-1206]. Codominance. h1 allele specifies tail pigmentation; h2 alleles
specifies belly pigmentation.
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
25
Figure 24. Phenotypic lag. The a1 mutant allele becomes homozygous without
phenotypic effect for several generations, and then is stable. There is thus a lag in
expression of the R phenotype. In the pictorial example, the R phenotype is lack of
telomeres, which get shorter each generation in the absence of the a+ allele [SGF1020]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
26
Figure 25. Maternal versus Paternal Effects. [SGF-1207] Crosses that demonstrate
maternal effects or paternal effects. Alleles k1 and p1 are recessive (not shown) and
confer the K and P phenotypes, respectively. The progeny have different phenotypes
depending on which parent carries the allele.
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
27
Figure 26. Maternal rescue: [SGF-1041] A. Maternal Rescue. Homozygous offspring
are rescued to a wild-type phenotype by the presence of the + allele in mother. B.
Dominant maternal effect. Genotypically wild-type offspring display the mutant
phenotype due to the dominant maternal effect.
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
28
Figure 27. Pedigree displaying a recessive Maternal Effect trait. A. Pedigree. B.
Inferred genetic model. [SGF-1012]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
29
Figure 28. Methylation of DNA. A. [SGF-1053] Methylation of the base cytosine at the 5
position (5-methyl cytosine) is an important modification of DNA. Specific enzymes catalyze this
methylation reaction. B. Methylation can interfere with transcription [SGF-1209]. After DNA
replication, a maintenance DNA methylase recognizes the hemi-methylated DNA and
methylates the CpG.[SGF-1210]
A
B.
C.
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
30
Figure 29. Imprinting affects transcriptional regulation; the simple view. A. An
enhancer and insulator are located 5’ to a coding regions. B. A female (non-imprinted
allele) has an active insulator, which blocks activation of transcription. C. A paternallyimprinted insulator element does not prevent transcriptional activation. [SGF-1212]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
31
Figure 30. [SGF-1211] Imprinting at the IGf2-H19 locus of the mouse. A. The Igf2
and H19 coding regions are nearby. The transcriptional enhancers for both promoters
are located 3’ to H19. B. Alternative looping allows the enhancers to interact with etiher
H19 (left) or Igf2 (right). C. A female (non-imprinted allele) binds CCTF complex and
favors looping to H19. D. A paternally-imprinted insulator element does not bind the
CCTF complex and allows the distal enhancer to activate transcription of Igf2.
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
32
Figure 31. Female imprinting. Asterisk indicates an imprinted and thus not
expressed allele. In this case the maternal alleles are imprinted. [SGF-1024]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
33
Figure 32. Human pedigree demonstrating sex linkage. This pedigree demonstrates
sex-linkage in which only male grandchildren (III-3 and III-6) of an affected grandfather
(I-2) are affected, and then only half of those grandchildren. This pedigree is fully
consistent with X-linked recessive inheritance. [SGF-1032]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
34
Figure 3. Sex linkage [SGF-1018 ]. A. Sex linkage in lovebirds B. [Add pictures of
birds if available]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
35
Figure 34. Sex determination [SGF-1036] Many familiar sex determination systems
are based on asymmetry in the types of chromosomes (top) or the numbers of
chromosomes.
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
36
Figure 35. Chromosomal sex determination in fungi. Cells with the MATa locus are
a-type mating cells; cells with the MATα locus are α cells. [SGF-1208]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
37
Fig. 36. Patterns seen with mitochondrial inheritance [SGF-1017]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
38
Figure 37. Simplified pedigree of mitochondrial disease MERRF. Data described in
Cell 1990. [SFG-1037]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
39
Figure 38 [SGF-1215] Killer element. A. One killer element has a double-spranded
RNA in a virus like particle. B. L RNA replicates and the RNA and particle segregate
at mitosis.
Figure 39. Flies that carry the popcorn mutant undergo brain degeneration.
photo
¿Add MMTV here – as a pedigree or picture showing inheritance?
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
40
Figure 40. Prions. [SGF-1039]. A. Abnormal Prion conformation converts normal
conformation. B. [SGF-1214] At cell division, the cytoplasmic PrP* is segregated
to both daughters, and converts newly synthesized PrP to PrP*.
B.
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
41
Figure 41 Picture of a Dpy worm
Figure 42 C. elegans genetics: recovering a mutant in the F2 of a mutagenized
grandparent. [SGF-1021-1]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
42
Figure 43. Demonstrating recessivity. [SGF-1021-2]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
43
Figure 44. [SGF-1022]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
44
Figure 45. Demonstrating dominance. [SGF-1023]
Systems Genetics, Chapter 3 Figures & Tables v1, 4/1/13
45
Systems Genetics Chapter 3: BOXES
Box 3-1.
[SGB-321]
Nomenclature.
Genes are referred to by capital letters, italicized, e.g., A.
Alleles will be lower case, either just letters or letters plus numbers if more than one
allele is being discussed, e.g., a, a1, a2, a3. If the a1 allele referred to is the wild-type
allele, this allele sometimes represented with a “+”.
Phenotypes will be referred to by bold, capital letters, such as A. A wild-type
phenotype will be referred to as non-A, or sometimes wild type.
A protein product will be capital non-italicized. Thus, the protein product encoded by
gene A will be protein A.
When we discuss specific examples, we will use the approved nomenclature for that
organism. Organism nomenclature is described in the Boxes introducing the genetics of
that organism (see Boxes __ in Chapter 1).
The absence of a chromosome in a diploid animal will be indicated by an “O”. This
nomenclature is mostly relevant for discussion of sex determination. For example, XX
indicates and animal with two X chromosomes while XO would refer to an animal with
only one X chromosome.
Systems Genetics, Chapter 3 Boxes V1, 4/1/13 1
Box 3-2 [SGB-322] Real data do not always yield clear results: using statistics to
determine significance
Sometimes your data do not yield a clear-cut interpretation. For example, if you
wanted to determine whether a single genetic locus is involved in a phenotype, you
could examine the segregation of progeny from heterozygous parents. If this single
locus involved an allele that behaved in a recessive fashion, you would predict a 3:1
segregation pattern of wild type to mutant, as predicted by Mendel’s laws. However,
sometimes you obtain data that does not obviously segregate as 3:1. One option is to
collect more data, as a small sample size is more subject to sampling error. (Imagine
that you flip a coin 4 times and it comes up heads each time; is it a rigged coin? No,
since four heads will come up once every 16 times.) Sometimes collecting more data is
not an option, and statistics can be used to determine the likelihood that your phenotype
is segregating as a single genetic locus.
Let’s consider the situation where you take a true breeding variant, with
phenotype A, and cross it to a wild-type strain. If we assume a simple Mendelian
model, the F1 organisms created by this cross are presumably heterozygous at the A
locus. Now, if you cross the F1 organisms to each other, you can examine the
phenotypes of the grandprogeny in the F2 generation. When you examine 100 grandprogeny, you see that 82 are phenotypically wild type (non-A) while 18 have the A
phenotype.
Is your observation consistent with the A phenotype segregating as a simple
Mendelian trait? The Chi-squared test for significance is used to determine whether the
data obtained are consistent with a hypothesis. This test will return the probability that
the observed values deviate from the expected values simply by chance. In other
words, this test will tell you what are the chances you will get the observed values
purely by the luck of the draw. Convention is that a hypothesis is rejected if there is less
than 1 in 20 (0.05) chance that the observed data fit the expectation. Note that for
performing a chi-squared analysis, the data are analyzed as the actual observation, and
are not converted into proportions or percentages. Because of this, the significance
value obtained does take into account whether few or many observations were made.
To calculate chi-squared:
χ 2 = Σover all categories of observation (Observed Number– Expected Number) 2
Expected Number
(The math symbol for sum is the capital Greek letter S: Σ.)
Systems Genetics, Chapter 3 Boxes V1, 4/1/13 2
Figure 1/Box3-2 [SGF-1216]
Phenotype
A
non-A
Observed
18
82
Expected (hypothesis of 3:1)
25
75
Obs-Exp
7
7
(O-E)2 /e
49/25
49/75
We calculate χ2 = 2.613, P=0.11
For the example above, χ2 = (18-25)2 /25 + (82-75) 2 /75 = 49/25 + 49/75 = 2.613.
We look up in Table 6 the P value for one degree of freedom and find P=0.11. The
degrees of freedom are the number of independent variables, typically one less than the
number of classes. If you have two phenotypes with proportion f and q, then f + q = 1
since an individual can be either of the two phenotypes. Thus q = 1 - f, and f is the only
variable (degree of freedom). If a parameter is estimated from the data, then it does
not count as an independent variable. For this analysis, the degrees of freedom are
one less than the number of classes of phenotypes (or genotypes).
This result means that there is an 11% chance these observed data fit the
hypothesis of a 3:1 segregation (indicating a Mendelian trait) were to account for the
phenotype. From this result, we conclude that our observed data are consistent with
this phenotype being caused by a single gene.
Now, consider the same experiment described above. However, what if we
instead observed 15 individuals of phenotype A and 85 individuals of phenotype non-A?
Are these ratios consistent with the phenotype being caused by a single locus (giving a
3:1 segregation) or two loci (giving a 15:1 segregation)? First, we need to calculate the
expected ratios for each hypothesis and calculate the chi-squared values.
Phenotype
A
Non-A
Observed
15
85
Expected (hypothesis of 3:1)
25
75
χ2 = 5.3, P=0.02
Phenotype
A
Non-A
Observed
15
85
Expected (hypothesis of 15:1)
6.25
93.75 χ2 = 13.067, P=0.0003
From these observations, the chi-squared values indicate that our results are
consistent with neither hypothesis. This does not necessarily mean that both
hypotheses are incorrect. Instead, data inconsistent with neither hypothesis may be due
Systems Genetics, Chapter 3 Boxes V1, 4/1/13 3
to an insufficient number of segregants observed, giving the chance that these ratio of
observed phenotypes do not fit the expected values.
How might you resolve this inconclusive situation? One solution might be to
examine more segregants from the heterozygote, to increase your numbers and see if
more data points make it so that you will now be consistent with one of these
observations.
Perhaps a better way to go is to generate a new hypothesis that explains the 5:1
ratio. See BOX 3-3.
A better statistical test for small numbers (and any numbers if you have sufficient
computer power) is Fisher’s Exact Test as the chi-squared is an approximation that
matches the real values more closely with large numbers than with small numbers (see
Appendix).
Systems Genetics, Chapter 3 Boxes V1, 4/1/13 4
BOX 3-3. [SGB-323] What happens when your data do not fit simple Mendelian
rules: the 5:1 ratio.
Gregor Mendel was lucky when he picked the alleles of the traits he studied, as
they all behaved like 100% penetrant recessive alleles. If the alleles you are interested
do not behave in a simple Mendelian fashion, how do you proceed.
For example, let’s consider the case where you think a single genetic locus is
required for the trait in which you are interested. However, you do not see the 3:1
segregation pattern you might expect from Mendel’s laws, and instead observe a 5:1
ratio. How might you explain a 5:1 ratio? First, you need to make a hypothesis to test.
For example, one model that might explain a 5:1 ratio is that there is a single locus, with
incomplete penetrance.
Hypothesis: one-locus, incompletely penetrant recessive
genotype
proportion
penetrance fraction
from cross (f)
with
phenotype
+/+
¼
0
0
a1/+
½
0
0
a1/a1
¼
0.6
0.15
TOTAL
1
0.15
To obtain 0.15 of the progeny with the phenotype, when only the a1/a1 individuals
contribute with penetrance f, we need ¼ f = 0.15. This implies that f = 0.6.
If this is an experimentally manipuable organism, we can test this hypothesis
experimentally. To do so, we can verify whether this is the case by examining
organisms of a1/a1 genotype (a fraction of which will display the A phenotype). In
practice, you could take individuals of A phenotype, which are presumable homozygous
a1/a1, and examine their progeny. If this hypothesis is valid, you would expect 60% of
their progeny to display the A phenotype and 40% to display the non-A phenotype.
Let’s examine a more complicated hypothesis: two loci are involved, where one
allele is recessive and one allele is dominant.
Hypothesis: two loci, a1 is recessive; b1 is dominant
genotype
proportion from
penetrance (f)
fraction with A
cross
phenotype
a/a b/b
1/16
1.0
0.0625
a/a b/+
2/16
0.7
0.0875
a/a +/+
1/16
0
0
+/+ b/b
1/16
0
0
a/+ b/b
2/16
0
0
a/+ b/+
4/16
0
0
a/+ +/+
2/16
0
0
+/+ b/+
2/16
0
0
+/+ +/+
1/16
0
0
TOTAL
1
0.15
Systems Genetics, Chapter 3 Boxes V1, 4/1/13 5
If an allele is segregating according to this model, we want 15% of the offspring
to display the A phenotype. The double homozygotes (a1/a1 b1/b1) contribute 0.0625, so
we need .0875 (0.15 – 0.0625 = 0.0875) with penetrance f, where 2/16 (0.125) of the
progeny are of this genotype. We thus have
0.125 f = 0.0875, and thus
f = 0.0875/0.125 = 0.7.
If we took phenotypically A animals and crossed each to genotypically a1/a1
b1/b1true-breeding parents, we would expect that
58% (0.0875/0.15) would be a1/a1 b1/+ x a1/a1 b1/b1 and thus give
0.5 a1/a1 b1/b1 of which all are A
0.5
0.5 a1/a1 1b/+ of which 0.7 are A
0.35
total
0.85 are of A phenotype
and 42% (0.0625/0.15) of the crosses would be a1/a1 b1/b1 x a1/a1 b1/b1 and thus give all
A progeny.
We could then compare our expectations by a statistical test). For example, if we
observed 12 crosses, with 6 crosses giving ~0.85 A progeny and 6 crosses giving all A
progeny, we could use Fisher’s Exact Test (see Appendix/FET) to compare 6:6 to our
expected 58:42 ratio, and find that it is not significantly different from expectation.
Table 6. Values of χ 2. chi-squared table (NIST)
See: http://www.itl.nist.gov/div898/handbook/eda/section3/eda3674.htm
Upper critical values of chi-square distribution with degrees of freedom
Probability of exceeding the critical value
0.10
0.05
0.025
0.01
0.001
1
2.706
3.841
5.024
6.635
10.828
2
4.605
5.991
7.378
9.210
13.816
3
6.251
7.815
9.348
11.345
16.266
The chi-squared test cannot be used if the number of individual observations in any
class is <5b because it is a poor approximation to the real values of the distribution. For
small numbers it is easy to use Fisher’s Exact Test (see Appendix/FET or use
http://www.graphpad.com/quickcalcs/contingency1.cfm).
Systems Genetics, Chapter 3 Boxes V1, 4/1/13 6
Box 3-4. [SGB-­‐324] Stochastic effects can play a role in the expression
of phenotypes
How might you get differences in penetrance and expressivity for a particular
allele for two individuals that carry an identical allele? One source of variability is
genetic: that is, there can be unknown genetic modifiers segregating in the population.
In this case, it can be argued that then the phenotype is not be due to a single genetic
locus, as the modifier is also involved in the phenotype. Such cases of multiple loci
determining phenotypes will be discussed in Chapters 5 and 6.
Let’s consider the simplest case where two genetically identical individuals show
differing penetrance and or expressivity for a gene. Although this can be a difficult
situation to obtain in populations without controlled breeding, this situation is usually the
case when working with genetically tractable model organisms.
The environment can be one source of variability. For example, for phenotypes
such as animal body weight and height, nutrition can affect how these phenotypes are
expressed. Similarly, temperature, humidity, and light can affect how genetically
identical plant might grow.
Stochastic, or random, effects also play a role in penetrance and expressivity.
These random effects can be due to many sources, which may include effects due to
the probabilities of basic biochemical reactions or effects due to the distribution of
cellular components during the cytoplasmic and membrane partitioning seen during cell
division processes.
One way to envision stochastic effects due to partitioning is to consider a case
where there are a limited number of particles. An example of this type of particle might
be an organelle such as a mitochondrion. In the extreme, when there is only one
particle, this single particle can segregate at random to either cell (Figure 1). If this
particular particle is required for viability, then one daughter cell at each cell division will
die.
Figure 1 [SGF-1007] illustrates the random segregation of a small number of
particles at a cell division. If there is one particle, either of the daughters will have the
particle, each receiving it 50% of the time (Figure 1A). A population would have equal
numbers (1:1 ratio) of cells with zero or one particle. If there are two particles, then ¼ of
the cells will have zero particles, ½ will have one particle and ¼ will have two particles
(Figure 1B). The one particle class is twice as frequent because there are two ways a
cell might have one particle. This can be seen if we arbitrarily color the particles
(purple and red). If there are three particles, then 1/8 of the cells will have none, 3/8 will
have one, 3/8 will have two and 1/8 will have three (Figure 1C). We can see how these
frequencies arise if we label the particles purple, red and and green. If there are four
particles, then 1/16 will have none, ¼ will have one, 3/8 will have two, ¼ will have three
and 1/16 will have all four (Figure 1D). Again, this can be seen if we color the particles.
Systems Genetics, Chapter 3 Boxes V1, 4/1/13 7
Figure 1 [SGF-1007]. Binomial sampling.
If we now consider two particles that segregate randomly and independently,
there are three possible outcomes:
1. The left daughter has no particles while the one on the right has two.
2. Each daughter cell has a single particle. This particular case will
happen twice as often, because the left and right daughter cells look
equivalent in this situation compared to situations 1 and 3.
3. The left daughter cell has two particles while the one on the right has
none.
In this example, if you were to examine each daughter cell for these three possible
outcomes, you can see that there are two daughter cells that have 0 particles, four
daughter cells will have 1 particle, and two daughter cells will have 2 particles, giving a
ratio of 1:2:1 for cells with 0, 1, or 2 particles. If the particle were required for viability,
you would then end up with 1/4th (or 25%) of cells dying.
If we now consider three particles that segregate randomly and independently,
the ratio now will be 1:3:3:1 for cells that have 0, 1, 2, and 3 particles. In this case, 1/8th
(or 12.5%) of the cells will die. In general, 1 / 2N daughters dies.
To think about stochastic effects having to do with a biochemical function,
consider cases where gene activity is dependent on a threshold level. (Consider an
example of incompletely penetrant allele of a gene required for viability. This allele has
Systems Genetics, Chapter 3 Boxes V1, 4/1/13 8
partial loss of gene activity, due to a mutation in the gene that causes most of the
proteins produced to fold incorrectly and be biochemically inactive; however, a small
fraction of the proteins will fold correctly and provide enzymatic activity. Furthermore,
only 1/5th of the activity of the enzyme (compared to wild type cells) is necessary in
order for cells to live. Thus, even though only a fraction of the enzymes produced by the
cell are active, 90% of the cells have enough enzyme activity to live and thus do not die.
If the allele were a complete loss of function, complete penetrance would be seen and
100% of the cells would be dead (Table 7).
Table 7. Incomplete penetrance due to reduction of gene activity
Activity of enzyme
viability
wild-type
100%
partial loss of function
90%
complete loss of function
0%
A threshold (shown in orange) splits a distribution of gene activities (blue) into part
above threshold (green) and a part below threshold (shown in red). As the threshold is
centered to the left of the peak of the distribution of gene activity, a greater proportion of
individuals are over threshold than are under. The interpretation of the threshold curve
is that the probability of an individual being green is proportional to the height of the
curve. The penetrance would be the ratio of the area under the red curve to the area
under the blue curve. (Figure 2 [SGF-1009-2])
Figure 2 Incomplete penetrance can arise from a threshold effect.
An alternative possibility for creating an incompletely penetrant phenotype can
involve loss of function alleles in distinct pathways of gene action that have the same
final result In this case, a complete loss of function of a gene can still create an
incompletely penetrant phenotype. If you consider a case where pathway A contributes
80% of the effect of a phenotype while pathway B contributes 20% of the effect (Figure
3 [SGF-112]). If a cell were to completely lose function in pathway A, there would only be
an 80% defect, as pathway B was still intact. Similarly, a complete loss of pathway B
would result in a less severe 20% defect, as pathway A is still providing 80% of the
activity. In particular, one pathway (pathway A) makes product Z from substance X.
Another pathway (pathway B) makes the same product Z from substance Y. A.
Pathway A makes 0.8 of the total product Z, while pathway B makes 0.2 of the total
product Z. B. Loss of pathway A results in only 0.2 of product Z begin produced (by
Systems Genetics, Chapter 3 Boxes V1, 4/1/13 9
pathway B). C. Loss of pathway B results in only 0.8 of product Z begin produced (by
pathway A). D. Loss of both pathway A and pathway B results in complete lack of
product Z. If the phenotype is simply the amount of Z produced, then pathway A mutant
would display 80% penetrance as it lacks 80% of the wild-type activity.
Figure 3. [SGF-112]. Two contributing pathways creating incompletely penetrant
phenotypes.
Systems Genetics, Chapter 3 Boxes V1, 4/1/13 10
Box 3-5 [SGB-325 ] Using pedigrees to analyze modes of
inheritance
Figure 1.
When controlled crosses are not possible, pedigree analysis can be used to
analyze modes of inheritance. Because for most pedigree analysis the sample size is
small, chi-squared analysis does not apply. Instead, you can calculate the probability
that the segregation pattern you see would fit your model by chance.
Consider the two pedigrees in Figure 1, both of which show hereditary patterns
consistent with the segregation of dominant alleles or alleles that display incomplete
penetrance.
The pedigree in A has two of four children of an affected mother displaying the
phenotype. There are several simple models with which this pedigree is consistent.
These data are consistent with dominant allele conferring the phenotype, with the
mother being heterozygous for the dominant allele. These data are also consistent with
her being homozygous for the dominant allele and the trait being about 50% penetrant.
Finally, these data are consistent with a recessive allele that is homozygous in the
mother and heterozygous in the father.
The pedigree shown in B has all four children of an affected mother displaying
the phenotype. One obvious explanation is that the phenotype is due to a dominant
allele inherited from the mother. However, is it possible that this segregation pattern is
due to a recessive allele that is homozygous in the mother and heterozygous in the
father? If this were the case, we would expect a 1:1 ratio in the progeny of affected
individuals to non-affected individuals.
Given the small sample size in this situation (4 progeny), what is the probability
of observing 4 affected individuals if the expected situation is 2 affected individuals and
2 non-affected individuals? To calculate this probability you would calculate 1/2N where
N is the number of affected individuals. For N=4, 1/24 is 1/16. In other words, there is
a 1/16 chance that you would obtain this result by chance. Because conventionally, an
hypothesis is rejected if there is a 1/20 probability of obtaining the data by chance, we
cannot reject the hypothesis here and conclude that the difference is not statistically
Systems Genetics, Chapter 3 Boxes V1, 4/1/13 11
significant. Therefore, either hypothesis fits the data because the sample size is too
small.
On the other hand, if there were 5 progeny that all were affected (and thus) N=5,
the probability of obtaining this result by chance 1/32. In this case, since the probability
is less than 0.05 and thus significant, we would reject the hypothesis of the
heterozygous father. Similarly, with 7 progeny, all affected, the probability would be
1/128, which is <0.01 and thus highly significant.
How can one distinguish these possibilities? One way is to obtain a larger
sample. This larger sample could come from including additional generations in the
pedigree, or from additional families with the trait. If additional families are included,
you have to assume that it has the same genetic cause.
Systems Genetics, Chapter 3 Boxes V1, 4/1/13 12
Box 3-7. [SGB-327 ] Can a dominant maternal effect mutation be distinguished
from cytoplasmic inheritance associated with mitochondria?
How can you distinguish cytoplasmic inheritance from a dominant, strict maternal
effect? Most simply, the mapping of the locus to a chromosome would indicate that it is
not due to cytoplasmic inheritance. Methods associated with the traditional mapping of
genetic loci will be discussed in Chapter 9.
Sometimes, mapping is not simple. Another method to distinguish a maternal
effect mutation from an effect due to mitochondrial inheritance is to set up crosses and
analyze their segregants. For example, if you were to take an affected female and
cross to an affected male, then take the females from this cross (who are all affected)
and cross it to an unaffected male, you can then look at the F2 female progeny and
examine their offspring. If the effect were due to a strict maternal effect, only ½ of the
F2 females will give rise to affected progeny. However, if the effect were due to a
mitochondrial marker (assuming a homogenous mitochondrial population), all F3
progeny of the F2 females will produce affected progeny. If the mitochondria were to
exist as a genetically diverse population, there may be a mixture of non-affected and
affected females that can be seen segregating in all generations.
Figure 1. A. dominant maternal effect. B. cytoplasmic inheritance. [SGF-1033].
Systems Genetics, Chapter 3 Boxes V1, 4/1/13 13
Box 3-6
[SGB-326]
Balanced lethals and balancer chromosomes.
True-breeding strains do not necessarily have to be homozygous, as there can be
stable strains that are maintained in heterozygous state (Figure 1 [SGF-1019]). For
example, from a heterozygote + a1 lethal2/ lethal1 a2 +, one-half of the progeny will also
be this heterozygote, and ½ the progeny will die from either of the recessive lethals.
Such a situation is called balanced lethals because the recessive lethal mutations keep
each other in an otherwise precarious situation (see Chapter 13).
As balanced lethals allow for the maintenance of the heterozygous state of the A gene,
they can also be useful for strain maintenance and strain construction, particularly if the
homozygous state of one of the A alleles leads to a suboptimal condition such as
sterility, death, or slow growth. This concept was extended to the construction of
specialized balancer chromosomes used in model organisms to maintain a
heterozygous state. Balancer chromosomes are placed in trans to the chromosome
carrying your allele of interest, and will either suppress the recombination from the other
chromosome or allow you to detect that a recombination event occurred (see Chapter 9)
More generally, one can use marked chromosomes to keep track of alleles for
maintenance of strains, strain construction or mutagenesis (see Chapter 7). Figure 2
shows a simple generic balancer chromosome. This contains a dominant marker, for
example a GFP expressing transgene, to distinguish any individual that has the
balancer chromosome. A recessive lethal kills off the homozygous balancer, which
would also express GFP. An aberration such as a large inversion or a translocation that
suppresses recombination (see Chapter 14) prevents recombination.
Figure 1 Balanced lethals [SGF-1019]
Figure 2. Generic balancer chromosomes. [SGF-1217]
Systems Genetics, Chapter 3 Boxes V1, 4/1/13 14
Box 3-9. [SGB-329] Exchanging compartments between cells allows for
the distinction of nuclear from cytoplasmic effects.
Sometimes the physical mapping of the trait to either the nucleus or the
cytoplasm can be accomplished by the transfer of material from one cell to the other
Figure 1 [SGF-1218]. For example, by transf erring the nucleus from an affected cell to the
cytoplasm of an enucleated unaffected cell (and vice versa), you can determine which
cellular compartment is important for conveying the inheritance of the trait (Table 8). If
the trait followed the nucleus, the conclusion is that the nucleus (and presumably the
chromosomes) is important for inheritance.
Figure 1 [SGF-1218]. Nuclear transplant.
Table 8. Logic of nuclear transplant experiments.
genotype
phenotype
nucleus
cytoplasm
+
+
wild type
+
mutant
+
wild type
wild type
Systems Genetics, Chapter 3 Boxes V1, 4/1/13 15