Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Tutorial 10, STAT1301 Fall 2010, 30NOV2010, MB103@HKU By Joseph Dong β’ Convergence of a Point Sequence β’ The limit is a point lim ππ = π πββ β’ Local convergence of a random variable sequence at a particular outcome π0 in the sample space requires the value sequence of these random variables evaluated at π0 to converge. β’ Local convergence is the convergence of a point sequence. β’ Local convergence is yes or no. There doesnβt exist any halfway between sure local convergence and sure local divergence. β’ Sure Convergence of a random variable sequence β’ The same as Everywhere Convergence of a function sequence. Using the word βsureβ because we are addressing events rather than sets. The domain of a random variable is the state space. The whole state space maps to the βsure event.β β’ The limit is a random variable. Formal definition of sure convergence of an r.v. sequence: βπ β Ξ©, lim ππ π = π π πββ compactly, ππ β π β’ Sometimes the limit can also be a single value(e.g. π π β‘ π₯0 ). In this case, the limit random variable is a constant, non-random variable, sending each outcome of state space to the single point of sample space. β’ Sure convergence of a random variable sequence is a slack way to say that the random variable locally converges everywhere on the state space. β’ This makes it to become convergence of a random variable sequence. 2 β’ Addresses convergence of a random variable sequence. β’ is sure convergence except for a minuscule set of places on the state space. β’ Is sure convergence except for a null event. β’ A null event is a possible event (non-empty set) with probability zero. β’ E.g. β’ Ξ©, πΉ = ββ, +β , π 0,1 , the event {1, 3, 5,} is a null event because βΞ¦ 1,3,5 = 0. β’ {Randomly draw a point from [0,1] and the outcome is the point 0.5} because the probability implied here is βπ 0,1 and βπ 0,1 0.5 = 0. β’ {Tossing a fair coin infinitely many times and the outcome is a sequence of heads only} because the outcome is the singleton {0000000β¦β¦} and it has the probability zero. β’ is convergence with probability one, but still not on the entire state space. β’ is divergence with probability zero, but still diverges on a possible event. β’ Definition β’ For almost all but maybe not all πβs in Ξ©, lim ππ π = π π . πββ β’ Using probability notation to be precise about βalmost allβ: β π: lim ππ π = π π πββ Compactly, ππ π.π . =1 π 3 β’ The random variable sequence converges on an event with arbitrarily high probability. β’ β1β is the highest possible probability. β’ β1β is an βarbitrarily high probabilityβ, but not the converse. β’ The laymanβs language does not distinguish between βhighestβ and βarbitrarily high.β β’ We resort to mathematical language for a description of the subtle difference. β’ To account for the qualifier βarbitraryβ we need someone to arbitrarily specify a bound so that above it every value is βarbitrarily high.β Say someone specified a very small πΏ so that 1 β πΏ is a very high bound to satisfy her requirement. Convergence in Probability means: there will be some large enough π to let β π: ππ π β π π <π fall within the arbitrarily small πΏ-neighborhood of 1. β’ βFalling at the single point 1β is a much stringent requirement than βFalling within an arbitrarily small neighborhood of 1.β β’ Youβll need a much larger π to fulfill the former than the latter. 4 β’ Very weak form of probabilistic convergence. β’ For each π in the sample space, ππ π need not even be close to π π . β’ The convergence need not take place in the state space. β’ The convergence does take place in sample space on which the CDF function is defined. β’ E.g. Tossing a fair coin gives the sample space Ξ© = π», π . If we want to define a random variable from this sample space to 0,1 we at least have two choices: π1 = π», 0 , π, 1 and π2 = π», 1 , π, 0 . These are completely different random variables but they share exactly the same sample space and the same probability measure on it: both of them measure {0} with probability 0.5 and {1} with probability 0.5 in the sample space. Thus they are equal by distribution. If we define a sequence of r.v.s all equal to π1 , then this sequence should converge to π2 in distribution. β’ Definition: βπ β β, lim πΉπ π = πΉ (π) πββ 5 Sure Convergence Almost Sure Convergence Convergence in Probability Convergence in Distribution Strongest Stronger Strong Weak β’ Convergence to a Constant β’ A special situation happens when the limit random variable is a constant (not random at all), in this case, Convergence in Probability to a constant is equivalent to Convergence in Distribution to that same constant. That is β π ππ β π β ππ β π 6 β’ IID = Independent and Identically Distributed β’ An iid sequence of r.v.s π1 , π2 , β― , ππ means β’ ππ βs are mutually independent r.v.s. β’ All ππ βs follow one same distribution, say, πΉ. β’ IID mean is the arithmetic mean of the iid sequence π1 + π2 + β― + ππ ππ β π β’ IID mean is a transformation of the π random variables. β’ IID mean is a random variable. β’ IID mean is closely related to numerator IID sum. 7 β β’ ππ β π β’ ππ π.π . π π β’ ππ β π π2 π, π β’ CLT β’ WLLN β’ SLLN β’ SLLN β WLLN β’ CLT β WLLN 8 β’ One consequence of CLT is we can now at least approximate the distribution of an IID sum by normal distribution. β’ β’ β’ β’ β’ β’ β’ Binomial is an IID sum of Bernoullis Poisson is an IID sum of Poissons Negative Binomial is an IID sum of Geometrics Negative Binomial is also an IID sum of Negative Binomials Gamma is an IID sum of Exponentials Gamma is also an IID sum of Gammas Chisq is an IID sum of Chisqs β’ All of the above (and many others) can be approximated by appropriately parameterized Normal distributions. 9