* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Problems of Macroevolution (Molecular Evolution, Phenotype
Heritability of IQ wikipedia , lookup
Viral phylodynamics wikipedia , lookup
Human genetic variation wikipedia , lookup
Designer baby wikipedia , lookup
Adaptive evolution in the human genome wikipedia , lookup
Genetic drift wikipedia , lookup
Dual inheritance theory wikipedia , lookup
Polymorphism (biology) wikipedia , lookup
Quantitative trait locus wikipedia , lookup
Koinophilia wikipedia , lookup
Population genetics wikipedia , lookup
Natural selection wikipedia , lookup
AMER. ZOOL., 15:295-314 (1975). Problems of Macroevolution (Molecular Evolution, Phenotype Definition, and Canalization) as Seen from a Hierarchical Viewpoint S. N. SALTHE Department of Biology, Brooklyn College, C.U.N.Y., Brooklyn, New York 11210 SYNOPSIS. As seen from a hierarchical viewpoint, macroevolution is neither a functional process nor a series of events in the past. It is a record only. For this reason macroevolutionary laws are all statistical laws. Natural selection is a process that operates from one generation to the next at the population level in the hierarchy. Yet structures at the organism level are found to "evolve." It is possible to formulate only a tautological form of the concept of natural selection at the population level alone; the bridge between levels in this case is the phenotype. The phenotype (i) exists at the boundary between the organismic and population levels of the hierarchy; (ii) is a functional manifestation of the interaction between the genotype and the local environment only during the period of a single generation; (iii) should ideally be defined so as to exclude traits not reviewed by natural selection; (iv) is factorable into many individual functional traits if one views viability selection as being instituted by a sequence of environmental catastrophes, each of which emphasizes a particular set of traits as being temporarily important to survival. It is reemphasized that the action of natural selection on continuously distributed, nonpolymorphic traits curtails variability in proportion to the intensity of selection. The necessity for coadaptation within the organism imposes a bell-shaped curve upon surviving variability. Canalizing selection is proposed as the process that modifies these bell-shaped curves into lognormal parametric distributions. It is also proposed that the per cent variability of the sample populations can serve as a measure of the intensity of natural selection (normalizing and directional together) that has most recently been acting upon the traits in question in the populations used to establish the parametric distributions. What we now call macroevolution was what Darwin hoped to explain using his theory of natural selection. Macroevolution can be defined as phenotypic transformations occurring over periods of time which are long with respect to generation times. There are at least three problems concerning the relationship between macroevolution and neo-Darwinian evolutionary I would like to thank the following persons for helpful criticisms of an earlier version of the last portion of this paper: C. Cans of the University of Michigan; W. C. Kerfoot of the University of Washington; D. C. Simberloff of the Florida State University; D. W. Tinkle of the University of Michigan; L. Van Valen of the University of Chicago. P. F. A. Maderson of Brooklyn College has been contributing to the development of this model since its inception. I have had valuable suggestions and help from R. H. Kaplan and J. Steinberg, both of Brooklyn College. I would like to thank S. J. Gould, Harvard University, for his comments on the manuscript. This work was carried out during the time the author had the following research grants: GB-7749 from N.S.F.; and 1108 and 1594 from CUNY. theory, which has for the most part been concerned with genetic changes at single loci over a period of relatively few generations. First, it is clear that the simple Mendelian "traits" which have served as model loci in the development of evolutionary theory in the Twentieth Century are not typical of those usually dealt with in considering long-term transformations, which are generally multigenetic traits. Attempts at combining the adaptive values of separate Mendelian loci to build up a total adaptive value for an individual have not been quite successful, and some work has been devoted to sidestepping this problem by proposing instead, for example, to view individuals in terms of the proportion of their loci that are heterozygous (Sved et al., 1967). Even more formidable problems have appeared in attempting to understand the effects on population genetic models of tight linkage or of the differential fitnesses of different combinations of alleles at linked Mendelian loci (Kojima and Lewon- 295 296 S. N. SALTHE tin, 1970). Only very few genes can be handled together analytically without overtaxing existing computational technology. A second major problem arising when a link is sought between current evolutionary theory and macroevolution is that of a definition of the "phenotype." This is related to the logical and practical problems mentioned above but those problems can be dealt with on the genetic level without ever explicitly asking, "What is a useable definition of phenotype?" Thus far, the implicit answer to this question appears to have been "anything we can measure." The reason this answer is not adequate, as I hope to suggest in the course of this paper, is that we are capable of measuring properties of living systems that have little or no relationship to their teleological nature—to their functioning and adaptive needs. A third problem arising from an attempt to extend microevolutionary theory over the area of macroevolution derives from differences in the absolute duration of the events involved, on the one hand, in populations, and on the other, in lineages. This problem can be best formulated by reference to a hierarchy of natural phenomena. A NATURAL HIERARCHY Table 1 depicts an incomplete hierarchy of natural phenomena as viewed by biologists. Clearly this is not the only hierarchy that could be constructed traversing some of the same levels of organization. However, as one moves up into higher levels of organization, that statement becomes less and less true. Thus, there are many molecular phenomena not involved in living systems; there are only some organisms (asexual) not integrated into populations; there are a very few populations (for example, lichens living on bare rocks in boreal regions) that are not parts of communities; there are probably no communities not forming parts of an ecosystem, which is presumably the next level of organization in the hierarchy in Table 1. That would imply that the ecosystem level is the final level in the hierarchy depicted; at that level different communities would interact, for example, at ecotones. A geologist, however, could construct a partly different hierarchy of nature, deriving from a different viewpoint. Furthermore, one might look for hierarchical organization, for example, within the organismic level, but that organization would of necessity extend downward into the molecular and nuclear levels below. Prigogine and Nicolis (1971) have made some interesting comments concerning the origin of hierarchies by the emergence of higher levels from lower ones as spontaneous thermodynamic processes (see also Simon, 1969). I believe the hierarchy in Table 1 represents the viewpoints of biologists in many fields. The principle of organization at each level is clearly reductive in that the functional processes at each level are processes which occur over time periods shorter than the duration of the entities or elements found at that level. The lower limit for the inclusion of a process in a given level is a threshold value of absolute time which, if a process occurs to completion in a shorter time, is a process functioning at the next lower level of organization in the hierarchy. Thus, each level of the hierarchy is viewed as a functioning domain. One can make complete and scientifically satisfying explanatory statements and predictions at each level separately. One may do experiments at each level by manipulating the constraints and boundary conditions surrounding it. In order to analyze the processes at work at each level, I have found it useful to work within the following syntax: "given these constraints and boundary conditions, this force drives these mechanisms to produce these processes measurable as . . . . " Constraints are naturally imposed by adjacent levels of organization while boundary conditions are imposed by more distant levels. It is only by altering constraints and boundary conditions that one level may affect another in a hierarchical organization (Pattee, 1973). For the purpose of this paper, I would like to focus upon the population level of organization and its relationship to the organismic. A direct result of the organizational principle of this hierarchy is that macroevolution (in the usual sense of long- TABLE 1. A hierarchy of nature from the xriewpoint of the biologist." Molecular Cellularorganismic Population Community Arbitrary measurement changes in concentration of reactants change of stage of development or in degree of differentiation changes in gene frequencies or in means, modes, and variances of measurements changes in the differentiation and number of ecological niches found in, and in biomass Descriptive mode (process) rearrangement of bonds epigenesis and homeostasis natural selection niche diversification, increase in coadaptations, efficiency, and stability Mechanisms redistributions of bond energies enzyme kinetics, differential synthesis, diffusion, active transport, biological rhythms, etc. intraspecific (including intragenotypic) competition—e.g., differential reproduction interspecific competition, speciation, adaptation, predation unstable chemical bonds—e.g., "free energy" dissipation of energy in structure resource utilization energy flow through, entropic changes in genetic information, etc. critical variable concentration or distribution, protein sequences, temperature critical variable concentration or distribution, which alleles are present at various loci Driving force Constraints (and boundary conditions) -a m \ •a o primary generation of variability, population density, limited resources, rates of environmental change and population growth, etc. Pi •n Z H 5 land mass size, habitat, temperature, preadaptations, informational capacities of genomes a Constraints originate in adjacent levels, boundary conditions from more distant ones. The next level below the molecular would be the nuclear or elementary particle level, the next level above the community would be the ecosystem level. In principle the number of levels is not fixed, although the upper level does become fixed when a particular viewpoint is chosen. For the biological viewpoint the upper level is seemingly the ecosystem. I have not included these other levels here because this paper explores only the organismic and population levels. NO 298 S. N. SALTHE term transformations of structure at the organismic level) is not a functional process in nature. Precisely, the evolutionary transformation of structure, as found in the fossil record, is a record only and not a series of events "in the past." One consequence of this circumstance is that there can be no causal laws of macroevolution. Those few generalities about macroevolution that appear to be broadly valid—Dollo's Law (Gould, 1970; Salthe, 1972) and Van Valen's new Law of Extinction (Van Valen, 1973; see also Raup et al., 1973), for example—are clearly seen to be statistical laws. Other interesting but less general patterns in the fossil record, such as parallel evolution, can also be most easily viewed as the outcome of the interaction of many events with fairly clear probability distributions as opposed to the more unique results of evolutionary transformations in more "specialized" lineages. In the latter the probability distributions are unclear and the term "capricious" (Lewontin, 1967) best describes our lack of feeling for the probabilities that were involved in these events. Within the context of the natural hierarchy, then, evolutionary change is a functional property of the population level (microevolution) and, in some as yet unclear ways, of the community and ecosystem levels of organization. It is not a property of the organismic level, where ontogenetic change (in the broadest sense, including aging and learning) is the name for those events occurring at the appropriate rates. The interesting relationship is the one between the population and organismic levels. Formally, that relationship can only be one where microevolutionary events, guided largely by the process of natural selection, impose constraints upon the outcome of epigenetic and homeostatic processes working at the organismic level. The community level establishes boundary conditions on these ontogenetic events because it constrains the outcome of the workings of natural selection. (It should be noted in passing that constraints may work upward from lower levels upon higher ones too, but the question we are concerned with here would only be obscured by considering this added complexity.) In the remainder of this paper I would like to do two things. First, I will attempt to demonstrate that, given appropriate data, the non-functional nature of long-term change at the organismic level (macroevolution) is a property of great value in dealing with historical questions precisely because the record of events viewed in this way is a statistical accumulation. Because only very limited sorts of data can be treated in this statistical manner over any but the shortest geological time spans, this approach will have limited applicability to evolutionary problems. Therefore, I will subsequently attempt to begin obtaining some practical grip upon the problem posed by the fact that evolutionary change takes place at the population level, while measurements pertinent to macroevolution refer to the organismic level. NON-DARWINIAN EVOLUTION It is possible to measure various parameters of fossil remains and, in the presence of dating techniques, to construct series from which one can induce rates of change (Simpson, 1949, 1953). One can find that an organ dimension in such and such a lineage was transformed in time at such and such a rate (Lerman, 1965). Non-Darwinian evolution (King and Jukes, 1969) might serve as a handy phrase to cover all considerations of "evolution" at the organismic level even though it was invented specifically in connection with protein evolution. I would like to attempt to delineate some of the sorts of information that studies of non-Darwinian evolution can contribute, and it will be convenient, as well as timely, to use data from those dealing indirectly with the evolution of primary gene products for this purpose. Anatomical transformations could presumably be treated in the same ways but the restricted range of taxa that could be covered, and therefore the small time spans to which one would be restricted, make them less suitable. Figure 1 shows the relationship between the number of amino acid sequence differences (normalized per 100) in some proteins from several living species in different PHENOTYPE DEFINITION - - 10O MILLION YEARS cytochronwc 200 SINCE 300 DIVERGENCE FIG. 1. The relationship between number of amino acid sequence differences in several proteins and the time since divergence of the lineages from which the proteins were sampled. The data are taken from Dayhoff (1972). A similar graph can be found in Dickerson (1971). The open circle data represent insulin, and the wide variability of these data in more recent times of divergence seems to be due to the fact that one of the species sampled has a very atypical molecule—i.e., it is a sampling problem. The closed circles are alpha and beta hemoglobins together. taxa and the estimated time since their lineages diverged in the past (based on the fossil record). The slopes of these lines are characteristically different for different proteins, presumably reflecting differences in the overall evolutionary conservatism of the loci in question over time (Dickerson, 1971). Fitch and Markowitz (1970) have presented evidence that when codons showing no substitutions between taxa (very conservative codons) are eliminated from these comparisons, the different proteins will tend to line up on the same slope. It has been suggested that these relationships will allow retrodiction of dates of divergence of various lineages for which there are no fossil records (Zuckerkandl and Pauling, 1962, 1965). For this possibility to be realized it will be necessary to know the true shapes of these curves. A linear relationship between difference and time since divergence has been assumed so far, presumably as a simplifying working assumption; Figure 1 does not give me confidence in that assumption, and, indeed, the obvious requirement that, when two things are being compared as they become more different with time, there must 299 be an assymptotic approach to the maximum possible difference, clearly prevents the relationship from being linear. I have performed some simple simulations in order to test whether simple assumptions about the specific rate of substitution could generate difference curves of the sort in Figure 1. The simulations were essentially those used by Kirsch (1969), on a sequence of 50 elements, with the added stipulation that after every generation the probabilities of change for each "codon" or element are changed randomly except that half of the elements that had a probability of zero change in the last generation would retain that probability in the present one, thus building in a concept of conservative codons. The two simple assumptions about the specific rate of substitution (AY I At. \IY) were (i) constantly decreasing (kit), and (ii) constant (k). As in Kirsch (1969), if a "codon" once becomes different between the two lineages, it is not allowed to change so as to reacquire identity, i.e., no convergence is allowed (see Jukes and Holmquist, 1972, for a discussion of the minor effects of convergence at this level). Figures 2 and 3 show the results of these simulations. For the present purposes the important curves are those showing the accumulation of difference (Z) with time since these are directly comparable with Figure 1. On the assumption of a constantly decreasing specific rate of change (after the suggestion of Goodman—see, for example, Goodman etal., 1971), we obtain a sigmoid, in fact a logistic, mean curve for the accumulation of difference. On the assumption of constant specific rate of substitution we obtain a hyperbolic curve for the mean accumulation of difference. While it is possible that either of these simplifying assumptions could explain the curves in Figure 1 (although I feel the hyperbolic curve is closer), the important point is that the empirical curves are consistent with stochastic models of change. In order to expand the data base somewhat, I have utilized some calculations from Goodman et al (1971). Using the sequence data available for various globins they reconstructed primitive globins for two periods in the past. Figure 4 shows the sequence differ- 300 S. N. SALTHE o CHANGES ACCUMULATI1 ALONG Y WITH DECREASING RATE OF CHANGE Y - ACCUMULATION O F DIFFERENCE ( Z ) W I T H O U T SIZE R E S T R I C T I O N (EXPONENTIAL) ACCUMULATION ( Z ) WITH S I Z E RESTRICTION (LOGISTIC) z UJ IT Hi SO a. UJ t TIME -«—TIME ° •• SINCE D I V E R G E N C E CHANGES ACCUMULATING ALONG Y AT CONSTANT RATE ACCUMULATION OF DIFFERENCE ( Z ) WITHOUT SIZE RESTRICTION (CONSTANT) ACCUMULATION (Z) WITH SIZE RESTRICTION TIME SINCE DIVERGENCE FIGS. 2, 3. The results of a simulation of molecular evolution as a stochastic process with the specific rate of substitution (dY/dt • \IY = kit) constantly decreasing (Fig. 2), and (dK/d( • \IY = k) constant (Fig. 3). The accumulation of difference (Z) with time in the presence of size restrictions is the curve that simulates those in Figure 1. ences between modern globins and both of these projected ancestral sequences plotted against time since divergence. Again, both of the specific rates of change utilized in the simulations could explain the data; Goodman et al. favor a constantly decreasing rate model. Note that in Figures 1 and 4 the dates of divergence are uncertain enough so that no real statistical test of the goodness of fit of the two theoretical curves is possible. Again, the point to be established is that purely stochastic processes working within roughly known constraints are sufficient to generate curves very much like the empirical ones. In Figure 4 I have shown the per cent variability of the data at the two times in the past. The data from the more recent date of divergence are more variable. In the simulations the per cent variability decreases as the sequences approach the assymptote of maximum difference, and so this aspect of the empirical data (Stebbins and Lewontin, 1972, p. 34) are also consistent with stochastic patterns of change. At this juncture it becomes important to point out that I am not moving toward a suggestion that "evolution is random." Much debate has occurred over whether or not these regular patterns of change are consistent with natural selection being the main process in evolution. It has been suggested that in order to explain these regularities we must suppose that most amino acid substitutions are carried in by genetic drift at selectively neutral or near neutral codons (for example, Kimura and Ohta, I97la,b; Ohta and Kimura, 1971). Since this question bears upon whether or not it is reasonable to suppose that we could have constant or constantly decreasing specific rates of amino acid substitutions in our models, I would like to show in two different ways that such constancy is possible even if every substitution is mediated by selection. First, keeping the discussion in the same gaming field as previous ones, we may note that there is ample documentation for the fact that there is no simple one-to-one relationship between phenotype and genotype; for any given phenotype there are a number of equivalent genotypes possible (one could refer to a 301 PHENOTYPE DEFINITION i: or . . • -> "£V= 13.4 !J 1 LU ^ uj IONAL 6OODM/ Qz - »- RECONSTRUCTED PRIMITIVE EUTHERIAN HEMOGLOBIN / . " / • ' •" RECONSTRUCTED ANCESTRAL GLOBIN (GOODMAN ETAL.I97I) • s TIME SINCE FIG. 4. The relationship between the number of amino acid sequence differences/100 between modern and reconstructed ancestral globin sequences (data 1 1 1 1 1800 500 DIVERGENCE (M.Y,) from Goodman et al., 1971) and time since divergence of the modern groups from the ancestor. CV is the coefficient of variability. polygene model for continuously distributed traits or to selectively equivalent alleles at some Mendelian locus). Suppose we have a locus producing an enzyme in a poikilotherm: the environment is warm; the rate of activity of the enzyme is at an optimum. Now the environment becomes colder and the rate of activity of the enzyme is reduced to a suboptimum level. Any mutation that is capable of restoring the rate of activity to an optimal level is favored provided it has no untoward pleiotropic effects. Since more than one allotype is capable of showing the same kinetic rate, there are several to many possible, selectively equivalent alleles that can solve the adaptive problem. Furthermore, since altered rates of activity of other enzymes can compensate for a reduced rate in our enzyme, the adaptive solution is not restricted to mutations at a single locus. Later the environment becomes warmer again: now the enzyme in question has a superoptimal rate that disrupts homeostasis. Will a back muta- tion be most likely to restore the old allele? Or is it not more reasonable to invoke a more probable forward mutation to still a third allele whose product will be phenotypically like the original warmadapted allotype? (It need not be exactly alike since other aspects of the intracellular milieu will have changed in the meantime.) We may observe this locus through a long period of time during which the environmental temperatures fluctuate and never once see a back mutation fixed so that, while the environment oscillates, the gene steadily becomes more and more different from its original, ancestral form—that is, it evolves. Recalling that most selectively relevant phenotypic expressions are multigenetic we must suppose that our locus could be brought to bear upon solutions to quite different physiological problems stemming from many different environmental challenges whose primary effects are "at other loci." Since the functional genotype, like the niche, is an n-dimensional concept, it is ^ 1 1 1 1 100 302 S. N. SALTHE possible that our locus might be called upon be derived from Hennig [1966] on the one to evolve in response to selection pressures hand, and from Eldredge and Gould at any time in what is now an effectively [1972] on the other). If we are dealing with continually changing (or deteriorating) en- populations of two species, we find ourvironment, and the same is true for all selves no longer at the population level of other loci. Under these circumstances any organization, but at the community level locus has some probability of being called (and not even there unless certain other upon to solve any given adaptive problem conditions are met). Thus, no process is generated by the continually deteriorating mediating macroevolution at the organisenvironment, and so the probability of any mic or any lower level. one mutation becoming fixed by selection is Note that I am not claiming that natural roughly the same as the probability of any selection is not involved in the substitution other mutation becoming fixed by selec- of one allele for another. It may be, or getion. Under these circumstances divergent netic drift may be. I am claiming that the evolution of the genotype between differ- accumulation of these substitutions over ent lineages would go on constantly, while long periods of time is not viewed by the difference in magnitudes between for- natural selection. I am suggesting that the ward and back mutations would suppress number of amino acid sequence differences convergence at individual loci. Because this between two taxa, like electrophoretic momodel invokes large numbers of selectively bility differences or immunological differequivalent alleles at any locus (Maynard ences between taxa, are a species of nonSmith, 1970), it is only a contextually richer adaptive character (Simpson, 1961) or or more filled-out (but less elegant) version selectively neutral trait. I am furthermore of the neutral allele drift theory. For this suggesting by extension of this idea that any reason we are fortunate in having a com- comparative macroevolutionary observapletely different approach to this problem tion at the organismic level—for example, deriving from considerations of the hierar- the transformation of an anatomical strucchy of nature (Table 1). ture from one form to another in some lineage (Bock and Von Wahlert, 1965)—is It is clear from the relationships depicted in Table 1 that macroevolution at the or- not mediated as such by natural selection, or ganismic (or any lower) level, including pri- by any other process. Such series are the mary gene products, is not a result of a result of comparative operations by the scifunctional process in nature. Furthermore, entist and have no ontological status of natural selection cannot be the process me- their own in nature. diating macroevolution at any level of the Thus, transformations of structure at the hierarchy, since it is a process operating organismic level are merely post hoc results only at the population level. Natural selec- of meaningful changes occurring at other tion operates from one generation to the levels of organization. As such, one might next, does not skip generations, and cannot, predict that macroevolutionary changes at in any sense, "detect" the future. Any at- this level will occur at random with respect tempt to bring natural selection into the to the teleological requirements of evolving macroevolutionary range by looking for populations except to the degree that estabthe upper limit of duration of a population lished structure at the organismic level in time and then trying to connect two pop- produces (preadaptive) constraints upon ulations in a lineage that have a measurable the directions and rates of change that can difference at some trait is illogical (because occur at the population level of organizamore than a single generation is being view- tion. This last "except" clause is what preed). Furthermore it is easily faulted by not- vents any further elaboration of this idea at ing that the populations being thus viewed the level of phenotypic structure, but no are easily considered to be in different spe- such limitation appears at the genetic or cies if they have enough genetic difference to primary gene product level because of the make a measurable phenotypic difference loose connection between genotype and (the logical structure of this argument can phenotype. This question as to whether a PHENOTYPE DEFINITION 303 random with respect to the adaptive requirements of organisms because the process that operates on these adaptive requirements (natural selection) is not the process, nor is there any process, mediating the accummulating changes making up macroevolution. Of course, adaptively or teleologically meaningless data are not necessarily useless. For example, measurements of protein sequence differences between lineages lacking fossil records have permitted the retrodiction of their possible dates of divergence. Indeed, in some sense data collected at the molecular and nuclear levels of organization by chemists and physicists represent just such collections unrelated to the forms of the macrostructures built hierarchically upon those levels, and these data have not been useless. Interestingly, the well-known law of the irreversibility of evolution proposed by Dollo is deducible as a corollary (proceeding as in Chapter 5, Salthe, 1972) from what may be presented as a fundamental statistical law of evolutionary biology: for any time period, if a lineage accumulates a mean amount (X) of genomic difference per locus from an ancestral population, then any other lineage descending from this ancestral population will also accumulate an average of X genomic differences from that ancestral population during the same time period. However, this law will be irrelevant to most observations that have been made concerning macroevolution, namely observations about phenotypic expressions. Concerning these, the hierarchical viewpoint revives in even more urgent form (and then allows resolution of) the problem of the possible tautological nature of the concept of natural selection by placing various, presumably adaptive, phenotypic expressions (antler tine number, scale number, heart beat rate after exercise, performance on some test, fur thickness, etc.) at one level, the organismic, while placing comparisons between individuals in a The inappropriate viewpoint engen- population in terms of reproductive success dered by counting ideologically informative at another, the population level. This events converts them to noise. The provi- means that no phenotypic attributes of orsional conclusion from all of this, then, is ganisms exist at the population level. Indithat molecular evolution is non-Darwinian viduals in populations are effectively black because all macroevolution at the organis- boxes with varying adaptive values that are mic level is non-Darwinian, that is, occurs at series of events involving some sytem that are not mediated by a mechanism-driven process will occur at random with respect to the adaptive needs or functional requirements of the system is too large to be pursued at length here. Instead, I will attempt to partially justify it by reference to several examples from the realm of human artefacts, where in each case we know the "selective" meaning of each choice. Each series of artefacts was examined in such a way as to deliberately measure parameters which probably could not reflect the adaptive significance of the choices made in order to test the hypothesis that aggregate data about such events would be randomly distributed. I believe that the following analysis will show that they are. Thus, the number of moves made by the white queen in the first 18 moves in 185 master chess games (Horowitz, 1956) forms a Poisson distribution (x2 = 2.67; 6df; P = 0.90 > 0.80), as does the number of times in 120 master games (Horowitz, 1956) that a white knight move is followed by a black pawn move in the first 18 moves (\2 = 5.65; 4df; P = 0.30 > 0.20). The number of times the word "in" appears per 20 words in a long major poem (Little Gidding by T. S. Eliot), scanning 96 segments of 20, is again described by a Poisson distribution (X2 = 0.010; 2df; P > 0.99). Finally, the number of times the note A appears per bar of music in a G-major violin sonatina by Telemann was also found to be randomly distributed (x2 = 7.27; 3df; P = 0.10 > 0.05). It appears as though data collected from a viewpoint unrelated to the teleological aspects of a system will, at least often, be found to be randomly distributed (or to change at random) with respect to the teleological requirements of that system, or with respect to regularities which we would project (or predict) if the system were working according to the forces and mechanisms known to us. 304 S. N. SALTHE non-factorable. Fisher's influential fundamental theorem of natural selection (Fisher, 1929) makes no reference to anything other than the gestalt notion of adaptive value or Darwinian fitness. Yet, Darwin himself wanted to use the notion of natural selection to explain transformations over time of phenotypic traits (e.g., macroevolution), and he was eager to relate fitness to the environment in terms of adaptation (Williams, 1974). From the vantage point of the hierarchy, we see that the environment, working through the ecosystem and community levels of organization, contrains the results of natural selection to within certain values for any given phenotypic attribute, and this in turn constrains the epigenetic and homeostatic processes of any organism sampled as an individual from that population, including those processes that produce one-time structures such as a tooth. Thus, we could sample a population repeatedly, obtaining organismic data from many individuals and feel confident that the statistical population before us then, representing a model of some aspect of the real population we sampled, will be rdatable to the wider environment. This is the way out of the tautology provided by the hierarchical approach, and this without contesting the operational fact that individuals in populations are non-factorable" black boxes without discernible phenotypic attributes. Thus, in order to avoid a tautological statement of natural selection, we note the interplay of constraints between the population and community levels on the one hand and between the population and organismic levels on the other. This paper will continue to examine the latter interaction only. THE NATURE OF THE PHENOTYPE Duration of existence of the phenotype In order to clarify the relationship between evolutionary change at the organismic and population levels, we can use an imaginary experiment. An individual organism is drawn at random from a population of haploids, and one gene, or its pro- duct, is sequenced. In actual fact at the population level there are at this locus two alternative alleles, one (the one we sequenced) a common wild type and the other a rare allele. Now the environment changes in such a way that the gene product of the rare allele is better at producing some adaptation than is the previous wild type. This comparison is associated with a reproductive difference so that the old wild type becomes selected against, and this process will register as changing gene frequencies each generation. After many generations the previously rare allele reaches fixation. We again sample an individual organism from this population and note that the gene in question has "evolved." At the population level the gene frequencies have become altered. Natural selection monitors the probabilities of existence of various alleles in gene pools from one generation to the next. These probabilities constrain the kind of organism you can pick up at any given moment. But long-term evolution is no more ^functional process at the population level than it is at the organism level. A comparison of gene frequencies between populations separated by many generations is no more than a scorecard tally kept by evolutionary biologists. The organism, and its genes and gene products viewed separately, cannot evolve in any functional sense at all. The population can evolve, by natural selection, only from generation to generation, and we must note here that the functional time period of the population level of organization is precisely and only one generation. Recall that the functional time period for the organismic level is the length of life of an individual organism. If the phenotype is to be a functional concept, its existence must be located, then, somewhere between the fastest events that may occur in an organism without crossing the threshold into the molecular level of organization (nerve impulse transmission, rate of production of functional amounts of ATP, etc.) and events that may take a lifetime of an organism to be completed (additions and repairs to a family nesting place, the total number of offspring left, etc.). PHENOTYPE DEFINITION 305 ous randomly chosen traits during artificial selection experiments demonstrates only Does the phenotype exist only in indi- that those traits are potentially parts of the vidual, sampled, organisms? Or do some functioning phenotype. Of course, traits aspects of the phenotype emerge from the that are not currently heritable or modifiinterface between the organismic and able by artificial selection could in the fupopulation levels of organization? (I will ture come under the view of natural selectake the position that the phenotype cannot tion if they became important in the lives of be a property of populations themselves, the organisms involved. since that would place the concept too far The genotype of an organism is clearly outside traditional notions to be useful.) located at the organismic level. The Since evolutionary decisions made at the phenotype, if we use the definition just population level can never be reduced in proposed (which is really not new, only toto to the organismic level because of vari- more clearly stated), is located at the boundous emergent properties of populations ary between the organismic and population (density, for example), the phenotype must levels, and forms, in fact, the bridge bebe both an organismic property and a tween them, the means by which conproperty of the interaction of individuals in straints from each level impinge upon the populations. But does any aspect of a func- other. On the one hand, individuals with tional phenotype really occur in a sampled genetic diseases (disruption of homeostatic individual organism? In other words, is a and/or epigenetic processes by the presence tooth height measurement a phenotypic of non-wild type allotypes) never appear datum? The phenotype is usually viewed as functionally at the population level, thus a product of the interaction of the genotype constraining the kinds of individuals that with the local environment. This rules out can compete at that level. On the other any properties of organisms that do not hand the results of competition between have a measurable heritability as, for individuals at the population level conexample, the spotting patterns on the backs strains the frequencies of various alleles, of leopard frogs (Casler, 1967). On the thus establishing probabilities concerning other hand, it also rules out properties that the rates of enzymatically mediated procould never be elicited in any possible local cesses, for example, in organisms sampled environment—for example, the length of as individuals from some population. In the time an arctic fish can survive in water of first case we have selective elimination contropical temperature, or the nature of cerning absolute fitness, and in the second courtship behavior elicited out of season by we are concerned with relative fitness, hormone injections in a laboratory. Not ev- wherein an individual is "compared" with erything we can measure is usefully consi- other organisms, or with population statisdered as part of a phenotype—not the elec- tics, in respect to its reproductive success. trophoretic mobility of some protein at a given pH in a given temperature on a given substratum, for example, nor the im- Is the phenotype divisible? munological difference between two orThe next problem is to understand in ganisms as viewed by a third (Williams, what manner it is possible to observe the 1964). If we take these restrictions seriously we arrive at an interesting definition of interactions between individual genotypes phenotype as any property of organisms and their local environment. The main that could be involved in intraspecific problem here is that relative reproductive competition—because we have eliminated success arises finally out of all sequential all other properties with our restrictions. and simultaneous interactions between the Thus, we can define as aspects of the genotype and the local environment. As a phenotype only those traits that are visible limiting case, we can note that artificial to natural selection. The fact that we can selection experiments have demonstrated alter the population measurements of vari- that a single dimension of a single trait can Location of the phenotype be detected by selection, and this observa- 306 S. N. SALTHE tion touches upon natural selection conceptually when we imagine isogenic individuals—say, a female cladoceran starts a population in a pond, and these accumulate a few single gene differences by mutation during the summer. In this limiting case we might be near the real situation when we ascribe differences in reproductive success to some single phenotypic attribute, possibly temperature tolerance. That being so, if we could measure the temperature tolerances of each individual in the population and compare these measurements with the normal temperature regime in the pond, we could predict which individuals will out-reproduce others. In one case any individual with tolerance for more than a certain amount of heat in its environment will reproduce maximally; those below this threshold level will not contribute as many individuals to the next generation. Adaptive value is here a simple function of temperature tolerance. In most real cases, however, individuals differ from each other at more than a single trait, thereby having the consequence that more than a single dimension of the environment will be important in determining reproductive success. When a very large number of traits contribute to Darwinian fitness, can any one of them be considered to be visible to natural selection? In order to examine this, imagine many functionally independent normally and unimodally distributed traits. Suppose that on the average any individual whose measurement is within a standard deviation of the mean would be capable of realizing its maximum reproductive potential as far as that trait is concerned. Then the probability of being within a standard deviation of the mean for each trait is 0.66, and the probability of being within a standard deviation of the mean for n traits = P(trait 1) x P (trait 2) X P (traiti).. .P (traitn). Reasoning this way, it can be shown that in a large population with a large number of genes very few individuals (< 1%) can have a large number of their traits (> 50%) measure within a standard deviation of the mean if all the traits are continuously (and therefore simultaneously) visible to selection. This is analogous to Haldane's substitu- tional load problem (Kimura and Ohta, 19716), but there is a way out of this one that is not available to Haldane's dilemma. Consider the effective environment to be constituted by a succession of ecological catastrophes. Some will be highly predictable seasonal phenomena, others more capricious events with longer term periodicities. Each catastrophe emphasizes from one to a few traits (those most important in coping) temporarily. If one were to total the "instantaneous" contributions of various traits to survival as above (Pi x?2 X Pi . .. P„), then these few traits would come "first," "second," "third," and so on, in the calculation at this time, in order of importance, until, after about five traits had been combined, the rest of them would contribute very little each and together towards survival for the moment—i.e., they would not be visible at that time as individual traits. Although the quantitative picture does not change from one catastrophe to the next, the qualitative picture does; there are different traits being considered in different ranking each time. As the intensity of the selection instituted by the catastrophe increases, the number of traits visible (or factorable) at that time will increase and the number of survivors will be correspondingly less (it will take adequacy for more traits to survive an exceptionally cold spell than it takes to survive a milder one—see a similar idea in O'Donald, 1973). Traits relatively unimportant most of the time become more important under the onslaught of an exceptionally severe deterioration of the environment when the required number of adequate traits for survival increases. Selection intensities being equal, traits reviewed more often by catastrophic events (heart beat rate, tibia length) will undergo more rigorous selection than will those that are reviewed fewer times (some factor involved in surviving a relatively rare event). In this model very many, and perhaps most, traits will be scanned at least once by selection during the duration of a cohort, but there will be a ranking of traits by importance based upon both the intensity of the selective episode a trait is typically involved in and the number of such episodes the trait is called upon to function in. The ranking PHENOTYPE DEFINITION may change from generation to generation as the environment changes. As the last few members of a cohort die out, one could in principle rank traits over the whole life of the cohort, but this would be highly artefactual. In this case again it would seem that natural selection scanned very few traits individually (perhaps wing color in industrial melanism would stand out), and one would feel forced to conclude that the organism is a black box. But, if one analyzed the cohort from moment to moment, one would see the intermittent functioning of traits, a block of a few at a time. (If epistasis were to be taken into consideration, more traits could be visible at any given time because many of the probabilities of being within a standard deviation of the mean would be conditionals— P trait 2 P trait 1 X P trait 1 Also, as will be explored below, canalization increases the probability of an individual being found under the area of the [lognormal] curve between plus and minus one standard deviation.) Thus, I conclude that natural selection can view any given trait as a separate or near separate entity. How are we to measure it.? MEASURING THE PHENOTYPE AND THE INTENSITY OF NATURAL SELECTION In order to be clear about what is to be measured, I will begin with a recapitulation and gradually work into a concrete mode. Neo-Darwinian theory has as perhaps its most important axiom: Natural selection works to weed out the ecologically unfit from the 307 a model of natural selection proposed by the early biometricians: Natural selection tends on the whole (at least in multicellular organisms with moderate intrinsic rates of natural increase) to disfavor individuals to the degree that the magnitudes of their trait dimensions depart from the mean for the population (Bumpus, 1898; Weldon, 1901). This can be connected with the observation that such dimensions are typically normally or lognormally distributed (Simpson et al., 1960). These bellshaped distributions are explainable in general as resulting from the requirement for coadaptation among the various phenotypic attributes of organisms, as suggested by the use of the word "unfit" above. Any evolving dimension is constrained by this requirement to within quite small allowable changes in the modal measurement per generation (Lerner, 1954). Put another way, natural selection cannot foresee the future. All modes of selection are constrained by this requirement, including directional. Figure 5 shows how this works in regard to directional selection. Thus, the mode can always be taken to represent the value most recently favored by natural selection, regardless of whether the selection is mostly directional or stabilizing. In other words, it is not possible on this view to distinguish between these modes, and I will make no such distinctions in this paper. Furthermore, in the hierarchical view selection works over the period of time that is found within a single generation. Selectional modes are described on the basis of what a trend in some dimension might be over several generations, and thus represent records of events rather than descriptions of functional events. population (informal paraphrase of axiom D The fact of the bell-shaped curves of trait 4; Williams, 1970). A modern interpreta- dimensions can be used as a premise, tion of the term unfit that I will use here is: which, in combination with another premunfitness arises in individual organisms when ise (to be noted shortly) and with the axiom their traits are not sufficiently harmoniously in- stated above, provide the basis for deriving tegrated to produce adequate responses to all en- a theorem that will form the framework of countered environmental onslaughts (for one the remainder of this paper. The second expression of this idea see Lerner, 1954). premise is that the intensity of selection in reThe import of this is that natural selection spect to some trait being experimentally followed tends always to disfavor extreme phenotyp- is not at all times or under all conditions the same. ic expressions relative to what is common, In one sense this simply means that selecaverage, or normal for the kind of organ- tion coefficients can have many values. The ism in question. This points immediately to justification for this statement can be taker 308 S. N. SALTHE DIRECTIONAL SELECTION FIG. 5. A comparison of directional natural selection with directional artificial selection. The shaded area covers the proportion of the population favored ARTIFICIAL SELECTION by the selective elimination of those in the unshaded portion. from any recent general work, for example, well (Rothstein, 1973), thereby bringing in Dobzhansky (1970). Since continuously dis- an entirely separate and usually unknown tributed dimensions of traits are usually element. Secondly, some provision must be distributed as bell-shaped curves, and made for factoring out and estimating the therefore must be reviewed by selection (by degree of canalization (Waddington, 1957; the axiom) despite the theoretical limita- Rendel, 1967) that has taken place with tions on the number of dimensions that can each dimension observed. This will be atbe simultaneously viewed by selection, the tempted below. Finally, the model assumes theorem is that natural selection must scan that viability selection (all selection directed individual traits sequentially in time. It can do at non-reproductive traits) is more or less this because the intensity of selection faithfully recorded by natural selection as a operating upon any given trait can vary. whole, even though there can be wide difThe detailed argument for this proposal ferences in the reproductive success of viawas presented in the section above entitled ble (ecologically adequate) individuals. Dif"Is the phenotype divisible?" The work of ferences in reproductive success of indithe rest of this section is to set this theorem viduals prescreened by viability selection, into an explicit model, and to explore the but not reflecting the results of that selecgeneral conditions under which the model tion, will be adjusted each successive genershould hold. Extending the biometrical ation by viability selection again. Thus, model of the axiom as stated above, on the even if at one point in time there comes to basis of this theorem, we can derive the be an association between high fertility and observable consequence that if in one popula- values a full standard deviation away from tion, selection has been more intense in connec- the mean for some given dimension, that tion with trait X than it has been with trait Y, then association would not lead to any significant the intrinsic variability of dimensions of trait Y change in the mean dimension when comwill exceed that of those of trait X. paring two or more generations unless the This model can only hold if certain con- environment changes. In other words, such ditions are met. First, polymorphic traits an association would not be likely to lead to must be excluded because for them varia- the formation of a supergene complex unbility can be connected to niche breadth as less the environment changed so as to 309 PHENOTYPE DEFINITON change the optimum value for the dimension in question. If this were not true overall and most of the time, living systems could not maintain their teleological nature, which they patently do. Any dimensions of a trait can be represented by a bell-shaped curve of value distribution. There are, however, three different possible bell-shaped distributions (Fig. 6). The curve may be symmetrical (normal), skewed to the left (positively skewed lognormal), or skewed to the right (negatively skewed lognormal) (Aitchison and Brown, 1957). Which shape one is dealing with is determined by plotting the standard deviations versus the means for all the samples drawn from the parametric domain (Kerfoot, 1969). If the relationship is linear with positive slope, the parametric distribution is a positively skewed lognormal one; if there is no systematic relationship between the means and standard deviations, the distribution can be taken to be PARAMETRIC DISTRIBUTIONS ^ • • , normal; if the relationship is linear with negative slope, the parametric distribution is a negatively skewed lognormal. Cases where the relationship is non-linear will not be discussed in this paper; they clearly represent only complications that can be examined elsewhere. The domain of the parametric distribution is determined arbitrarily, like the domains of adaptive zones. One of them might represent scale counts in different populations of a species of lizard; another might be scale counts in a genus of lizards; still another might be scale counts in a family of lizards. The sample populations would be of different taxonomic rank in these three cases. The upper limit upon possible domains is set by various biological factors, the earliest to come into play of which might be differences in age structure of the populations. Such differences would tend to increase the dispersion of the points on the mean versus standard deviation plot, increasingly so as •-" ' • • / / \ • .' • / o/ /o 1 \ / * ^A • • • • * * / \ . / ' * T—~ • \ \ \ / / * / N • \ / \ \ * ^^ • MAXIMUM MINIMUM SAMPLE STATISTICS CANALIZED' FIG. 6. A comparison of parametric distributions for some trait dimensions and the statistical parameters (standard deviation and the mean) of sample popula- tions drawn from such distributions. See text for further explication. 310 S. N. SALTHE the sample variances increase. At some level of comparison that dispersion could become so great that the usefulness of the model is curtailed. This is so because different survivorship patterns will affect the number, intensity, and spacing of environmental catastrophes so that random samples including all age groups in characteristic proportions from two populations very different in this regard will not contain similar proportions of cohorts that have undergone viability selection for given lengths of time. The lower limit to variate values in the positively skewed parametric distribution is zero; there is no lower limit in the unskewed and negatively skewed cases. On the other hand the latter has an upper limit, while the other two do not. In any case in real biological examples the curves are always truncated, the values never being found over more than a restricted portion of the curve. When the distribution goes to zero in the positively skewed case, the slope of the line is the coefficient of variability, %V = cv a where CV is the coefficient of variability the line would have if it went through zero, and a is the sample mean minus r (Aitchison and Brown, 1957). If we take canalization to be a selective process the result of which is to restrict the possible values a dimensional measurement can take, we can fit the concept into the model being developed here as follows. Suppose we have a near-normal parametric distribution going to the origin. The actual values found in nature occur over a portion of the ascending limb of the distribution. We can view canalization as a process that changes this parametric distribution into a more skewed lognormal one by establishing a threshold larger than zero and then gradually moving it to higher and higher values. Imposed upon this process will be the constraint that the modal value favored by selection can not change, so that the result of changing the threshold will not be simply to displace the original curve to the right. Instead, the curve will become dean accepted measure of intrinsic variability formed, with a larger and larger portion of (Simpson et al., 1960). The problem is to the variability centering around the mode find a way to use validly this coefficient, or (Fig. 7). In fact, the proportion of the dissomething equivalent, in all cases; other- tribution contained between the quantiles wise we can have no general measure of X - SD and X + SD will increase, and, intrinsic variability for all traits. The nega- clearly, the distribution will have become tively skewed curve can be treated simply as "canalized." (Changing the threshold alone a mirror image of the positively skewed one will not reflect the natural canalizing pro(Aitchison and Brown, 1957), thereby cess adequately because in that case the eliminating that as a theoretical problem. curve will simply be deflected to the right The problem arises in those cases where the without changing its shape so as to incorpominimum value of the parametric distribu- rate more variability between the mean plus tion is not zero, but another "threshold" and minus a standard deviation). In terms value greater than zero. In this case the per of the sample statistics, what occurs is that cent variability is not a single coefficient of the line relating the standard deviations to variability (or is not equivalent to the slope the means will move to the right away from of the line over the whole of the plot of the origin and its slope will decrease in value standard deviation versus the mean). (curve C versus curve A in the lower left Rather, per cent variability forms an as- hand plot in Fig. 6). symptotic function approaching the value Now, while this manipulation of the of the slope of the line as the mean increases model is as close to representing canalizain value, as in Figure 10. If T is the amount tion as I think possible, it lacks definitional by which the curve has been displaced from clarity because two operations are being the position it would have if its minimum carried out upon the parametric distribuvalue were zero, the per cent variability tion simultaneously—that is, the threshold function is: of the distribution is being moved off zero PHENOTYPE DEFINITION 311 change of the mean. In the sample standard deviation versus the mean plots, this is MODE reflected by a decreased slope only. Thus, I line B in Figure 6 represents a less variable dimension than does line ,4. The intrinsic variability in these two particular cases is represented by the coefficients of variability of the sample statistics (because the curves go to the origin), and that of line A is larger than that of line B. If two lines like these originated at a threshold value greater than zero, the intrinsic variability of the distributions would be represented by the coefficients of variability of two other lines of identical slope originating at the origin, which, incidentally, would represent the CV's of the parametric distribution themFIG. 7. Comparison of a nearly-normal distribution selves. Thus, canalization alone does not afcurve (A) with a curve derived from it (B, a lognormal fect the parametric distribution. From this distribution) by moving the threshold (T) away from it is concluded that, within the scope of this the origin while at the same time maintaining the orig- model, canalization itself does not necessarinal modal value. This process mimics canalization. ily decrease the intrinsic variability of a diSee text. mension; however, it can be the process that results in such a decrease when certain and further and further to the right, and at the same time the variance of the paramet- constraints are in effect, as they probably ric distribution is being decreased (because mostly are—for example, selection mainof the constraint that the mode should not taining the previous mode. On the other change). It will be more useful to define as hand, normalizing selection always dethe results of canalization only the chang- creases intrinsic variability. ing theshold. This process alone will result All that has been said so far can be taken in increasingly smaller values of per cent to apply directly to anatomical traits. Exvariability since the standard deviation will tending the discussion to physiological (and not change even though the mean is in- presumably behavioral) traits is begun by creasing. It will not, however, affect the as- noting that lognormal parametric distribusymptotic value of the coefficient of varia- tions with negative slope are frequently enbility that the distribution would have if it countered among them, and even, went to the origin. I am proposing that that perhaps, almost completely symmetrical value be taken as the magnitude of the in- distributions. Figure 8 is a schematic sumtrinsic variability for any given trait dimen- mary of the kinds of situations I have found sion instead of the per cent variability that in the physiological literature. As environcan be calculated from any given sample mental conditions change, the parametric population. Therefore, only if the distributions change their character at cerparametric distribution goes to the origin, tain points, as shown. Sometimes there is a will the per cent variability of any sample very abrupt transition from a positively represent the intrinsic variability of the di- skewed to a negatively skewed distribution mension in question. This strategy allows us with no plateau between them, as in a study to factor out the effects of canalization in showing breathing rates in rats under difcomparing the variabilities of two dimen- ferent conditions of oxygen availability sions, which can now be considered to be (Clegg and Ainsworth Harrison, 1967). caused by normalizing selection alone. Figure 9 shows some sample statistics from a study (Stevens and Randall, 1967) that Normalizing selection is then defined as can serve as an example of the kind of the process that results in diminished variance of the sample populations with no analysis proposed. First, both blood pres- 312 S. N. SALTHE Sd X BREATHING RATE FIG. 8. Comparison of parametric distributions with sample statistics for a representative physiological trait, showing the presence of negatively skewed parametric distributions (at low oxygen concentration). See text. sure parametric distributions appear to have been canalized (see also Fig. 10). The intrinsic variability of the ventral aortic systolic blood pressure is less than the intrinsic variability of the "subintestinal" vein maximum blood pressure (see also Fig. 10). This is biologically a reasonable result when it is realized that during swimming the amount of venous return via this vessel is not as great (or as important) as other returns. Therefore, we can provisionally conclude that natural selection is much more "interested" in the ventral aorta under these conditions than it is in the "subintestinal" vein. Alternatively, "subintestinal" vein maximum blood pressure during swimming is a relatively less important trait than is systolic blood pressure in the ventral aorta. This is not to say that these relationships would be maintained under all conditions. After feeding the relationships might be quite different. It must be realized that in defining physiological traits the local environmental conditions bearing must be taken explicitly into account. Thus, in Anolis carolinensis, seminiferous tubule diameter is extremely variable in the winter compared with the breeding season (data from Fox and Dessauer, 1958). Presumably this trait is not up for possible review by selection except during the breeding season. Out of season it may be essentially a selectively neutral trait. In Figure 9 it is shown that the heart rate has a negatively skewed parametric distribution. This is also true for heart rate in Limulus (data from Pax and Sanborn, 1964) and in the chicken (data from Sturkle et al., 1953). The presence of such curves in physiological traits and their evident absence in anatomical traits deserves consideration. Put another way, the seemingly exclusive presence of positively skewed curves in anatomical traits needs to be explained. I suspect the answer will involve canalization, as follows. The negatively skewed curves in the physiological context can be interpreted to indicate that selection has been pushing toward maximum values (as of heart beat rate) resulting in decreased variability as the assymptote is approached (see Fig. 4), and so the parametric distributions are skewed towards the maximum values possible for the trait in question. Conversely, then, positively skewed 0 \ HEART RATE, beots/min, v <> X 14 "SUBINTESTINAL", V E I N MAXIMUM , BLOOD PRESSURE, x'xx 'X XX 12 / 10 Sd HN TROUT,SWIMMING IN DIFFERENT WATER VELOCITIES ! o 8 ;' 6 9 0 f '' ,0 4 0 "00 ' '• VENTRAL AORTA SYSTOLIC BLOOD PRESSURE, mm.Hg ' ^ ' 2 10 20 7^ 30 40 60 FIG. 9. Statistical parameters of three physiological traits monitored simultaneously in fishes swimming under different conditions of water velocity. Each point is a sample from a different water velocity. Notice that the units of measurement differ in one case; this is really two figures combined because the absolute values of the statistical parameters happened to be in a similar range. We are interested only in the slopes and X-origin for each curve. (Data from Stevens and Randall, 1967.) PHENOTYPE DEFINITION 313 to this one can find many subtraits—blood pressure, breathing rates, oxygen loading capacity of hemoglobin, and so on. Each of these subtraits can compensate for a deficiency in any of the others, but presumably to different degrees. The more crucially cv important subcomponents of traits may be identified by appropriate studies from the present viewpoint. Those components that are more often visible to selection during catastrophic events, or that cannot be as easily compensated for by other subcomponents, will undergo more intense selection and consequently be less intrinsically FIG. 10. Some of the same data from Figure 9, this variable than other traits. We could then time plotting the dimensionless per cent of variability (C V) against the sample means. See text for explica- view the phenotype of the organism as a composite of elements that are under different degrees of constraint decided at the population level. parametric distributions may be representative of traits that have had their values REFERENCES "minimized" by selection—for example, maximum blood pressure during swimming in a vessel not important during Aitchison,J.,andJ. A. C. Brown. 1957. The lognormal distribution. Cambridge Univ. Press, Cambridge. swimming, or, indeed, any blood pressure Bock, W. J., and G. von Wahlert. 1965. Adaptation per se; minimum allowable pressures may and the form-function complex. Evolution 19:269incur less overall wear and tear on the sys299. tem. We may then ask why anatomical di- Bumpus, H. C. 1898. The elimination of the unfit as illustrated by the introduced sparrow, Passer domesmensions tend to be "minimized" in this ticus. Biol. Lect. Mar. Biol. Lab. Woods Hole 1898: sense. One possibility is that if selection 209-226. tends to favor increasingly larger dimen- Casler, E. 1967. Pattern variation in isogenic frogs. J. sions in some trait, the efficiency of the deExp. Zool. 166:121-135. velopmental processes are improved if the Clegg, E. J., and G. Ainsworth Harrison. 1967. Some phenotypic modifications in rats exposed to reduced range of possible dimensions has been reatmospheric pressures. Environ. Res. 1:231-239. stricted to a relatively small value, thereby Dayhoff, M. O. 1972. Atlas of protein sequence and initiating canalization to bring the structure. Vol. 5. Nat. Biomed. Res. Foundation, minimum nearer to the real values. This Washington. argument must eventually turn upon ener- Dickerson, R. E. 1971. The structure of cytochromec and the rates of molecular evolution. J. Mol. Evol. getic considerations. From this point of 1:26-45. view all anatomical traits would be consi- Dobzhansky, T. 1970. Genetics of the evolutionary dered to be to some degree canalized, and it process. Columbia Univ. Press, N.Y. raises the problem of what the nature of a Eldredge, N., and S. J. Gould. 1972. Punctuated equilibria: an alternative to phyletic gradualism. zero-valued variate is in biology, or of what Pages 82-115 in T. J. M. Schopf, ed., Models in the minimum ancestral value in some paleontology. Freeman, Cooper & Co., San Francisco. lineage should be considered to have been. Fisher, R. A. 1929. The genetical theory of natural These problems are too large for the preselection. Reprint, Dover Publishing Co., N.Y. sent treatment. Fitch, W. M., and E. Markowitz. 1970. An improved method for determining codon variability in a gene Finally, it should be pointed out that the and its application to the rate of fixations of mutapotential usefulness of this model for dealtions in evolution. Biochem. Genet. 4:579-593. ing with suites of traits hierarchically re- Fox, W., and H. C. Dessauer. 1958. Response of the male reproductive system of lizards t/lnolis carolinenlated to each other is very great. Thus, one sis) to unnatural day-lengths in different seasons. can define as a trait the amount of oxygen Biol. Bull. 115:421-439. utilized per unit of time by an organism Goodman, M., J. Barnabas, G. Matsuda, and G. W. under some given conditions. Contributing Moore. 1971. Molecular evolution in the decent of 0 / ,Q "SUBINTESTINAL'VEIN MAXIMUM BLOOO PRESSURE 314 S. N. SALTHE man. Nature (London) 233:604-613. Gould, S. J. 1970. Dollo on Dollo's law: irreversibility and the status of evolutionary laws. J. Hist. Biol. 3:189-212. Hennig, W. 1966. Phylogenetic systematics. Univ. Illinois Press, Urbana. Horowitz, I. A. 1956. The golden treasury of chess. 2 Ed. Reprint, Simon and Schuster, N.Y. Jukes, T. H., and R. Holmquist. 1972. Estimation of evolutionary changes in certain homologous polypeptide chains. J. Mol. Biol. 64:163-179. Kerfoot, W. C. 1969. Selection of an appropriate index for the study of the variability of lizard and snake body scale counts. Syst. Zool. 18:53-62. Kimura, M., and T. Ohta. 1971a. Protein polymorphism as a phase of molecular evolution. Nature (London) 229:467-469. Kimura, M., and T. Ohta. 19716. Theoretical aspects of population genetics. Princeton Univ. Press, Princeton. King, J. L., and T. H. Jukes. 1969. Non-Darwinian evolution. Science 164:788-798. Kirsch, J. A. W. 1969. Serological data and phylogenetic inference: the problem of rates of change. Syst. Zool. 18:296-311. Kojima, K., and R. C. Lewontin. 1970. Evolutionary significance of linkage and epistasis. Pages 367-388 in K. Kojima, ed., Mathematical topics in population genetics. Springer-Verlag, N.Y. Lerman, A. 1965. On rates of evolution of unit characters and character complexes. Evolution 19:16-25. Lerner, I. M. i954. Genetic homeostasis. Oliver and Boyd, Edinburgh. Lewontin, R. C. 1967. The principle of historicity in evolution. Pages 81-94 in P. S. Moorhead and M. M. Kaplan, eds., Mathematical challenges to the neoDarwinian interpretation of evolution. Wistar Inst. Press, Philadelphia. Maynard Smith, J. 1970. Natural selection and the concept of a protein space. Nature (London) 225:563-564. O'Donald, P. 1973. A further analysis of Bumpus' data: the intensity of natural selection. Evolution 27:398-404. Ohta, T., and M. Kimura. 1971. On the constancy of the evolutionary rate of cistrons.J. Mol. Evol. 1:1825. Pattee, H. H. 1973. The physical basis and origin of hierarchical control. Pages 71-108 in H. H. Pattee, ed., Hierarchy theory. Geo. Braziller, N.Y. Pax, R. A., and R. C. Sanborn. 1964. Cardioregulation in Limulus. I. Physiology of inhibitor neurons. Biol. Bull. 126:133-141. Prigogine, I., and G. Nicolis. 1971. Biological order, structure and instabilities. Quart. Rev. Biophys. 4:107-148. Raup, D. M., S. J. Gould, T. J. M. Schopf, and D. M. Simberloff. 1973. Stochastic models of phylogeny and the evolution of diversity. J. Geol. 81:525-542. Rendel, J. M. 1967. Canalisation and gene control. Academic Press, N.Y. Rothstein, S. I. 1973. The niche-variation model—is it valid? Amer. Natur. 107:598-620. Salthe, S. N. 1972. Evolutionary biology. Holt, Rinehart and Winston, N.Y. Simon, H. A. 1969. The sciences of the artificial. M.I.T. Press, Cambridge. Simpson, G. G. 1949. Rates of evolution in animals. Pages205-228in G. L. Jepsen, G. G. Simpson, and E. Mayr, eds., Genetics, paleontology, and evolution. Princeton Univ. Press, Princeton. Simpson, G. G. 1953. The major features of evolution. Columbia Univ. Press, N.Y. Simpson, G. G. 1961. Principles of animal taxonomy. Columbia Univ. Press, N.Y. Simpson, G. G., A. Roe, and R. C. Lewontin. 1960. Quantitative zoology. Harcourt, Brace, N.Y. Stebbins, G. L., and R. C. Lewontin. 1972. Comparative evolution at the levels of molecules, organisms, and populations. Pages 23-42 in L. M. LeCam, J. Neyman, and E. L. Scott, eds., Darwinian, neoDarwinian, and non-Darwinian evolution. Univ. Calif. Press, Berkeley. Stevens, E. D., and D. J. Randall. 1967. Changes in blood pressure, heart rate, and breathing rate during moderate swimming in rainbow trout. J. Exp. Biol. 46:307-315. Sturkle, P. D., H. S. Weiss, and C. K. Ringer. 1953. Effects of age on blood pressure in the chicken. Amer. J. Physiol. 174:405-407. Sved, J. A., T. E. Reed, and W. F. Bodiner. 1967. The number of balanced polymorphisms that can be maintained in a natural population. Genetics 55:469-481. Van Valen, L. 1973. A new evolutionary law. Evol. Theory 1:1-30. Waddington, C. H. 1957. The strategy of the genes. George Allen & Unwin, London. Weldon, W. F. R. 1901. A first study of natural selection in Clausilia laminata (Montagu). Biometrika 1:109-124. Williams, C. A. Jr. 1964. Immunochemical analysis of serum proteins of the primates: a study of molecular evolution. Pages 25-74 in J. Buettner-Janusch, ed., Evolutionary and genetic biology of the primates. Vol. 2. Academic Press, N.Y. Williams, M. B. 1970. Deducing the consequences of evolution: a mathematical model. J. Theoret. Biol. 29:343-385. Williams, M. B. 1974. The logical status of natural selection and other evolutionary controversies: resolution by axiomatization. In M. Bunge, ed., The methodological unity of science. D. Reidel Publishing Co., Dorderecht. Zuckerkandl, E., and L. Pauling. 1962. Molecular disease, evolution, and genie heterogeneity. Pages 189-225 in M. Kasha and B. Pullman, eds., Horizons in biochemistry. Academic Press, N.Y. Zuckerkandl, E., and L. Pauling. 1965. Evolutionary divergence and convergence at the level of informational macromolecules. Pages 97-166 in V. Bryson and H. J. Vogel, eds., Evolving genes and proteins. Academic Press, N.Y.