* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Philosophy of Science, 69 (September 2002) pp
Survey
Document related concepts
Indeterminism wikipedia , lookup
History of randomness wikipedia , lookup
Random variable wikipedia , lookup
Probabilistic context-free grammar wikipedia , lookup
Dempster–Shafer theory wikipedia , lookup
Probability box wikipedia , lookup
Birthday problem wikipedia , lookup
Infinite monkey theorem wikipedia , lookup
Inductive probability wikipedia , lookup
Conditioning (probability) wikipedia , lookup
Ars Conjectandi wikipedia , lookup
Transcript
Philosophy of Science, 69 (September 2002) pp. S98-S103. 0031-8248/2002/69supp-0009 Copyright 2002 by The Philosophy of Science Association. All rights reserved. It All Adds Up: The Dynamic Coherence of Radical Probabilism S. L. Zabell Northwestern University http://www.journals.uchicago.edu/PHILSCI/journal/issues/v69nS3/693009/693009. html Send requests for reprints to the author, Departments of Mathematics and Statistics, 2033 Sheridan Rd., Northwestern University, Evanston, IL 60208; [email protected]. Acknowledgements. I would like to thank Brian Skyrms for suggesting this topic and providing several references. Thanks also to Persi Diaconis and Teddy Seidenfeld for helpful conversations. Brian Skyrms (1987, 1990, 1993, 1997) has discussed the role of dynamic coherence arguments in the theory of personal or subjective probability. In particular, Skyrms (1997) both reviews and discusses the utility of martingale arguments in establishing the convergence of beliefs within the context of radical probabilism. The classical martingale converence theorem, however, assumes the countable additivity of the underlying probability measure; an assumption rejected by some subjectivists such as Bruno de Finetti (see, e.g., de Finetti 1930 and 1972). This brief note has a very modest goal: to briefly consider the extent to which Skyrms's argument can be extended to the finitely additive case. 1. Dynamic Coherence. Let us begin by briefly reviewing Skyrms's argument. Let ( , , P) be a probability space; that is, is a set (the sample space), is a -algebr> of subsets of , and P is a countably additive probability measure on . By definition, a random variable X on ( , , P) is an -measurable function>X: Let n 1 2 3 R. ... be an increasing sequence of sub- -algebras of> ; intuitively represents the information available to us at time n. Let X1, X2, X3, ... be a sequence of random variables on the probability space. One says that X1, X2, X3, ... is a martingale with respect to { n : n 1} provided: 1. Xn is n measurable; 2. E [ | Xn| ] < ; 3. E [ Xn+1>| n] = Xn almost surely. The celebrated Doob martingale convergence theorem (see, for example, either Billingsley, 1995 or Durrett, 1996) assures us that if the expectations E[| Xn|] are bounded, then such a sequence of random variables converges almost surely to an -measurable random variable>X: Skyrms's clever observation is the following. Suppose that Xn represents your subjective probability regarding an uncertain outcome on the basis of the information available to you at time n; and let n= (X1, X2, ... , Xn), the -field generated b> the random variables X1, X2, ... . , Xn. (No particular model such as Bayesian updating is being assumed here, the revision has taken place in a "black box".) Then the resulting sequence 1, 2, 3, ... is an increasing sequence of sub- -algebras o> and each of the three conditions above in the definition of a martingale must be satisfied: the first because a random variable is (by definition) measurable with respect to the -algebr> it generates; the second because Xn is a probability (and therefore 0 Xn 1); and the third is an instance of the "principal principle", and can be justified on the grounds of a Dutch book coherence argument (see Skyrms, 1997, Section 5). Thus the sequence X1, X2, X3, ... is a martingale. Because of the very special nature of the random variables in question (they are nonnegative and bounded), the martingale convergence theorem applies to the sequence, and therefore (subjective) probability one is necessarily assigned to the event that the sequence of probability assessments will converge to a limiting value. What are the potential vulnerabilities in such an argument? There are two obvious points of attack. First, there is the key martingale assumption that E[ Xn+1 || X1, X2, ... , Xn] = Xn almost surely. It is certainly possible, given the black box nature of the process, that this might not hold but, as Skyrms discusses, this would violate a natural coherence condition. That leaves the assumption of countable additivity, implicitly invoked in the use of the MCT (martingale convergence theorem). Is it really necessary? That question is the focus of this note. 2. The Finitely Additive Version of the MCT. In an important paper, Purves and Sudderth (1976) have discussed the extension of many of the classical limit theorems of probability to the finitely additive setting. Their approach is based on the concept of a strategy. Here are the basic definitions. Let X and Y be nonempty sets. From now on, a probability on a set means a finitely additive probability measure defined on all the subsets of the set; and a conditional probability on Y given X is a function that assigns to each element x X a probability (in this sense) on Y. A strategy on X × Y is then a pair ( 0, consisting of a probability 0 on X and a conditional probability 1 on 1) Y given X. Every such strategy in turn determines a probability on X × Y: given a set S X × Y, if Sx = { y Y: (x, y) S}, then (The existence and uniqueness of this integral is a standard topic in finitely additive integration theory.) Probability measures on X × Y arising in this way are said to be strategic; and it is an unexpected feature of finitely additive probability measures on a product space X × Y that not all such probabilities are strategic. In the martingale convergence theorem, probability statements are made about infinite sequences of observations. Such probabilities are built up from probabilities defined on more basic objects, sets of finite sequences. Thus, given a set X, let X* denote the set of all finite sequences of elements in X (including the null sequence). A strategy in this extended context is then defined to be a conditional probability on X given X*. (The earlier definition of strategy only assigned conditional probabilities given sequences of length zero and one.) Let 0 denote the conditional probability on X that assigns to the empty sequence (that is, the unconditional probability on X). It is apparent that one can, in the manner discussed earlier, use (in conjunction with 0) to obtain a strategic measure on Xn, the Cartesian product of X with itself n times, for all n 1. It is less obvious, but a fundamental result of Dubins and Savage (1965, 7 21) that such a strategy determines a positive linear functional on the bounded real-valued functions on X , the space of infinite sequences of elements of X, endowed with the discrete topology. This in turn means that the strategy determines a probability on certain special subsets of X , the sets that are at once both open and closed in the discrete topology. Purves and Sudderth (1976) prove that the probability can in fact be extended to a much larger class of sets ( ); this class is an algebra (but not a -algebra) and is quit> rich: it contains the Borel sets. This extension of in turn permits the establishment of a martingale convergence theorem in the finitely additive setting (Purves and Sudderth 1976, Theorem 7.3). The set of sequences for which convergence does not occur is contained in an ( )-set that has (extended) probability zero. (Purves and Sudderth (1983) discuss some other examples of zero-one laws in the finitely additive setting.) 3. Strategic Measures and Coherence. One remaining question is the justification for considering strategies and strategic measures. Lane and Sudderth (1985) have advanced an attractive notion of coherence that singles out precisely the strategic measures. This goes briefly as follows. Suppose P is a probability on X × Y, and q is a conditional probability on Y given X. If A X × Y is a set, let IA denote the indicator function of the set (that is, IA(x) = 1, x A, IA (x) = 0, x A), and let Ax = { y Y: (x, y) A}. Interpreting P and q as prices for gambles on (x, y) and y given x, respectively, a bet of c on A yields a net return of and, letting S X and B = {(x, y): y Bx}, a bet of d on Bx conditional on x S yields a net return of The Lane and Sudderth notion of coherence is then very simple: viewing the posted odds as those of a bookie, if a gambler is limited to a finite number of transactions of each of these two types, then the bookie is coherent provided the gambler cannot ensure a positive return. That is, one cannot find functions m and 1, ... , n such that 1, ... , Lane and Sudderth then prove that a pair (P,q) is coherent in this sense if and only if P is a finitely additive probability and (letting P0 denote the marginal of P on X) for every subset A X × Y. 4. Concluding Comments. Strategic probability measures are also sometimes termed disintegrable. Dubins (1975, Theorem 1) demonstrated that this property is equivalent to another, apparently different one, the earlier property of conglomerability, discovered by de Finetti (1930 and 1972, 98). The Lane-Sudderth notion of coherence is closely related to an earlier notion of coherence introduced by Freedman and Purves (1969) and later studied by Heath and Sudderth (1978) and Lane and Sudderth (1985). This earlier notion (and implicitly the later Lane-Sudderth notion) has been criticized by Kadane, Schervish, and Seidenfeld (1996), appealing to a result of Schervish, Seidenfeld, and Kadane (1984). This important result states that every finitely additive probability that is not countably additive fails to be disintegrable (or, as discussed above, conglomerable) with respect to some countable partition. The comments of Lane and Sudderth, however, appear to adequately address this point. They point out that this argument, which appears to argue that a probability must be countably additive in order to be coherent, runs afoul of a simple objection: if countable partitions are permitted, then it seems arbitrary to exclude uncountable partitions. But then even countably additive probabilities can fail to satisfy this restriction. In a later survey paper, Sudderth (1994) returned to this issue, and his brief comment seems very much to the point: Several authors (see, for example, Heath and Sudderth (1972) or Skyrms (1984)) have remarked that if a gambler is allowed to make countably many bets, then P must be countably additive to avoid a sure loss. However, de Finetti (1972, p. 91) considered such arguments to be circular because they rely on the usual conventions about infinite sums which are tantamount to an assumption of countable additivity. References Billingsley, Patrick (1995), Probability and Measure, 3rd ed. New York: John Wiley and Sons. First citation in article de Finetti, Bruno (1930), "Sulla proprietà coglomerativa delle probabilità subordinate," Rend. R. Inst. Lombardo Milano 63: 414 418. First citation in article de Finetti, Bruno (1972), Probability, Induction, and Statistics. New York: John Wiley and Sons. First citation in article Dubins, Lester E. (1975), "Finitely Additive Conditional Probablities, Conglomerability and Disintegrations," The Annals of Probability 3: 89 99. First citation in article Dubins, Lester E., and Leonard J. Savage (1965), How to Gamble If You Must: Inequalities for Stochastic Processes. New York: McGraw-Hill. (Second edition, Dover Press, New York, 1976.) First citation in article Durrett, Richard (1996), Probability: Theory and Examples, 2nd ed. Belmont, Calif.: Duxbury Press. First citation in article Freedman, David A., and Roger A. Purves (1969), "Bayes' Method for Bookies," The Annals of Mathematical Statistics 40: 1177 1186. First citation in article Heath, David C., and William D. Sudderth (1972), "On a Theorem of de Finetti, Oddsmaking, and Game Theory," The Annals of Mathematical Statistics 43: 2072 2077. First citation in article Heath, David C., and William D. Sudderth (1978), "On Finitely Additive Priors, Coherence, and Extended Admissibility," The Annals of Statistics 6: 333 345. First citation in article Kadane, Joseph B., Mark J. Schervish, and Teddy Seidenfeld (1996), "Reasoning to a Foregone Conclusion," Journal of the American Statistical Association 91: 1228 1236. (Paper reprinted in Kadane, Schervish, and Seidenfeld 1999, Section 3.7.) First citation in article Kadane, Joseph B., Mark J. Schervish, and Teddy Seidenfeld (1999), Rethinking the Foundations of Statistics. Cambridge University Press. Lane, David A., and William D. Sudderth (1984), "Coherent Predictive Inference," Sankhy 46, Series A: 166 185. Lane, David A., and William D. Sudderth (1985), "Coherent Predictions Are Strategic," The Annals of Statistics 13: 1244 1248. First citation in article Purves, Roger A., and William D. Sudderth (1976), "Some Finitely Additive Probability," The Annals of Probability 4: 259 276. First citation in article Purves, Roger A., and William D. Sudderth (1983), "Finitely Additive Zero-one Laws," Sankhy 45, Series A: 32 37. First citation in article Schervish, Mark J., Teddy, Seidenfeld, and Joseph B. Kadane (1984), "The Extent of Nonconglomerability of Finitely Additive Measures," Zeitschrift fur Wahrscheinlichkeitstheorie und verwandte Gebiete 66: 205 226. First citation in article Skyrms, Brian (1984), Pragmatics and Empiricism. Yale University Press. First citation in article Skyrms, Brian (1987), "Dynamic Coherence and Probability Kinematics," Philosophy of Science 54: 1 20. First citation in article Skyrms, Brian (1990), The Dynamics of Rational Deliberation. Harvard University Press. First citation in article Skyrms, Brian (1993), "Discussion: A Mistake in Dynamic Coherence Arguments," Philosophy of Science 60: 320 328. First citation in article Skyrms, Brian (1997), "The Structure of Radical Probabilism," Erkenntnis 45: 285 297. First citation in article Sudderth, William D. (1994), "Coherent Inference and Prediction in Statistics," in D. Prawitz, B. Skryms, and D. Westerst;arahl (eds.), Logic, Methodology and Philosophy of Science IX Elsevier. The Annals of Probability 4: 259 276.