Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Tutorial 10, STAT1301 Fall 2010, 30NOV2010, MB103@HKU
By Joseph Dong
β’ Convergence of a Point Sequence
β’ The limit is a point
lim ππ = π
πββ
β’ Local convergence of a random variable sequence at a particular outcome π0 in the
sample space requires the value sequence of these random variables evaluated at π0 to
converge.
β’ Local convergence is the convergence of a point sequence.
β’ Local convergence is yes or no. There doesnβt exist any halfway between sure local convergence and sure
local divergence.
β’ Sure Convergence of a random variable sequence
β’ The same as Everywhere Convergence of a function sequence. Using the word βsureβ because we are
addressing events rather than sets. The domain of a random variable is the state space. The whole state
space maps to the βsure event.β
β’ The limit is a random variable. Formal definition of sure convergence of an r.v. sequence:
βπ β Ξ©, lim ππ π = π π
πββ
compactly,
ππ β π
β’ Sometimes the limit can also be a single value(e.g. π π β‘ π₯0 ). In this case, the limit random variable is a
constant, non-random variable, sending each outcome of state space to the single point of sample space.
β’ Sure convergence of a random variable sequence is a slack way to say that the random
variable locally converges everywhere on the state space.
β’ This makes it to become convergence of a random variable sequence.
2
β’ Addresses convergence of a random variable sequence.
β’ is sure convergence except for a minuscule set of places on the state space.
β’ Is sure convergence except for a null event.
β’ A null event is a possible event (non-empty set) with probability zero.
β’ E.g.
β’ Ξ©, πΉ = ββ, +β , π 0,1 , the event {1, 3, 5,} is a null event because βΞ¦ 1,3,5 = 0.
β’ {Randomly draw a point from [0,1] and the outcome is the point 0.5} because the probability
implied here is βπ 0,1 and βπ 0,1 0.5 = 0.
β’ {Tossing a fair coin infinitely many times and the outcome is a sequence of heads only} because
the outcome is the singleton {0000000β¦β¦} and it has the probability zero.
β’ is convergence with probability one, but still not on the entire state space.
β’ is divergence with probability zero, but still diverges on a possible event.
β’ Definition
β’ For almost all but maybe not all πβs in Ξ©, lim ππ π = π π .
πββ
β’ Using probability notation to be precise about βalmost allβ:
β
π: lim ππ π = π π
πββ
Compactly,
ππ
π.π .
=1
π
3
β’ The random variable sequence converges on an event with arbitrarily high
probability.
β’ β1β is the highest possible probability.
β’ β1β is an βarbitrarily high probabilityβ,
but not the converse.
β’ The laymanβs language does not distinguish
between βhighestβ and βarbitrarily high.β
β’ We resort to mathematical language
for a description of the subtle difference.
β’ To account for the qualifier βarbitraryβ we need someone to arbitrarily specify a
bound so that above it every value is βarbitrarily high.β Say someone specified a
very small πΏ so that 1 β πΏ is a very high bound to satisfy her requirement.
Convergence in Probability means: there will be some large enough π to let
β π: ππ π β π π
<π
fall within the arbitrarily small πΏ-neighborhood of 1.
β’ βFalling at the single point 1β is a much stringent requirement than βFalling
within an arbitrarily small neighborhood of 1.β
β’ Youβll need a much larger π to fulfill the former than the latter.
4
β’ Very weak form of probabilistic convergence.
β’ For each π in the sample space, ππ π need not even be close to π π .
β’ The convergence need not take place in the state space.
β’ The convergence does take place in sample space on which the CDF function
is defined.
β’ E.g. Tossing a fair coin gives the sample space Ξ© = π», π . If we want to define a
random variable from this sample space to 0,1 we at least have two choices:
π1 = π», 0 , π, 1 and π2 = π», 1 , π, 0 . These are completely different random
variables but they share exactly the same sample space and the same probability
measure on it: both of them measure {0} with probability 0.5 and {1} with probability
0.5 in the sample space. Thus they are equal by distribution. If we define a sequence
of r.v.s all equal to π1 , then this sequence should converge to π2 in distribution.
β’ Definition:
βπ β β, lim πΉπ π = πΉ (π)
πββ
5
Sure
Convergence
Almost Sure
Convergence
Convergence
in Probability
Convergence
in Distribution
Strongest
Stronger
Strong
Weak
β’ Convergence to a Constant
β’ A special situation happens when the limit random variable is a constant
(not random at all), in this case, Convergence in Probability to a constant
is equivalent to Convergence in Distribution to that same constant. That is
β
π
ππ β π β ππ β π
6
β’ IID = Independent and Identically Distributed
β’ An iid sequence of r.v.s π1 , π2 , β― , ππ means
β’ ππ βs are mutually independent r.v.s.
β’ All ππ βs follow one same distribution, say, πΉ.
β’ IID mean is the arithmetic mean of the iid sequence
π1 + π2 + β― + ππ
ππ β
π
β’ IID mean is a transformation of the π random variables.
β’ IID mean is a random variable.
β’ IID mean is closely related to numerator IID sum.
7
β
β’ ππ β π
β’ ππ
π.π .
π
π
β’ ππ β π
π2
π,
π
β’ CLT
β’ WLLN
β’ SLLN
β’ SLLN β WLLN
β’ CLT β WLLN
8
β’ One consequence of CLT is we can now at least approximate
the distribution of an IID sum by normal distribution.
β’
β’
β’
β’
β’
β’
β’
Binomial is an IID sum of Bernoullis
Poisson is an IID sum of Poissons
Negative Binomial is an IID sum of Geometrics
Negative Binomial is also an IID sum of Negative Binomials
Gamma is an IID sum of Exponentials
Gamma is also an IID sum of Gammas
Chisq is an IID sum of Chisqs
β’ All of the above (and many others) can be approximated by
appropriately parameterized Normal distributions.
9