Download Theorems about the Convergence of iid mean

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
Transcript
Tutorial 10, STAT1301 Fall 2010, 30NOV2010, MB103@HKU
By Joseph Dong
β€’ Convergence of a Point Sequence
β€’ The limit is a point
lim π‘Žπ‘› = π‘Ž
π‘›β†’βˆž
β€’ Local convergence of a random variable sequence at a particular outcome πœ”0 in the
sample space requires the value sequence of these random variables evaluated at πœ”0 to
converge.
β€’ Local convergence is the convergence of a point sequence.
β€’ Local convergence is yes or no. There doesn’t exist any halfway between sure local convergence and sure
local divergence.
β€’ Sure Convergence of a random variable sequence
β€’ The same as Everywhere Convergence of a function sequence. Using the word β€œsure” because we are
addressing events rather than sets. The domain of a random variable is the state space. The whole state
space maps to the β€œsure event.”
β€’ The limit is a random variable. Formal definition of sure convergence of an r.v. sequence:
βˆ€πœ” ∈ Ξ©, lim 𝑋𝑛 πœ” = 𝑋 πœ”
π‘›β†’βˆž
compactly,
𝑋𝑛 β†’ 𝑋
β€’ Sometimes the limit can also be a single value(e.g. 𝑋 πœ” ≑ π‘₯0 ). In this case, the limit random variable is a
constant, non-random variable, sending each outcome of state space to the single point of sample space.
β€’ Sure convergence of a random variable sequence is a slack way to say that the random
variable locally converges everywhere on the state space.
β€’ This makes it to become convergence of a random variable sequence.
2
β€’ Addresses convergence of a random variable sequence.
β€’ is sure convergence except for a minuscule set of places on the state space.
β€’ Is sure convergence except for a null event.
β€’ A null event is a possible event (non-empty set) with probability zero.
β€’ E.g.
β€’ Ξ©, 𝐹 = βˆ’βˆž, +∞ , 𝑁 0,1 , the event {1, 3, 5,} is a null event because β„™Ξ¦ 1,3,5 = 0.
β€’ {Randomly draw a point from [0,1] and the outcome is the point 0.5} because the probability
implied here is β„™π‘ˆ 0,1 and β„™π‘ˆ 0,1 0.5 = 0.
β€’ {Tossing a fair coin infinitely many times and the outcome is a sequence of heads only} because
the outcome is the singleton {0000000……} and it has the probability zero.
β€’ is convergence with probability one, but still not on the entire state space.
β€’ is divergence with probability zero, but still diverges on a possible event.
β€’ Definition
β€’ For almost all but maybe not all πœ”β€™s in Ξ©, lim 𝑋𝑛 πœ” = 𝑋 πœ” .
π‘›β†’βˆž
β€’ Using probability notation to be precise about β€œalmost all”:
β„™
πœ”: lim 𝑋𝑛 πœ” = 𝑋 πœ”
π‘›β†’βˆž
Compactly,
𝑋𝑛
π‘Ž.𝑠.
=1
𝑋
3
β€’ The random variable sequence converges on an event with arbitrarily high
probability.
β€’ β€œ1” is the highest possible probability.
β€’ β€œ1” is an β€œarbitrarily high probability”,
but not the converse.
β€’ The layman’s language does not distinguish
between β€œhighest” and β€œarbitrarily high.”
β€’ We resort to mathematical language
for a description of the subtle difference.
β€’ To account for the qualifier β€œarbitrary” we need someone to arbitrarily specify a
bound so that above it every value is β€œarbitrarily high.” Say someone specified a
very small 𝛿 so that 1 βˆ’ 𝛿 is a very high bound to satisfy her requirement.
Convergence in Probability means: there will be some large enough 𝑛 to let
β„™ πœ”: 𝑋𝑛 πœ” βˆ’ 𝑋 πœ”
<πœ–
fall within the arbitrarily small 𝛿-neighborhood of 1.
β€’ β€œFalling at the single point 1” is a much stringent requirement than β€œFalling
within an arbitrarily small neighborhood of 1.”
β€’ You’ll need a much larger 𝑛 to fulfill the former than the latter.
4
β€’ Very weak form of probabilistic convergence.
β€’ For each πœ” in the sample space, 𝑋𝑛 πœ” need not even be close to 𝑋 πœ” .
β€’ The convergence need not take place in the state space.
β€’ The convergence does take place in sample space on which the CDF function
is defined.
β€’ E.g. Tossing a fair coin gives the sample space Ξ© = 𝐻, 𝑇 . If we want to define a
random variable from this sample space to 0,1 we at least have two choices:
πœ‰1 = 𝐻, 0 , 𝑇, 1 and πœ‰2 = 𝐻, 1 , 𝑇, 0 . These are completely different random
variables but they share exactly the same sample space and the same probability
measure on it: both of them measure {0} with probability 0.5 and {1} with probability
0.5 in the sample space. Thus they are equal by distribution. If we define a sequence
of r.v.s all equal to πœ‰1 , then this sequence should converge to πœ‰2 in distribution.
β€’ Definition:
βˆ€π‘Ÿ ∈ ℝ, lim 𝐹𝑛 π‘Ÿ = 𝐹 (π‘Ÿ)
π‘›β†’βˆž
5
Sure
Convergence
Almost Sure
Convergence
Convergence
in Probability
Convergence
in Distribution
Strongest
Stronger
Strong
Weak
β€’ Convergence to a Constant
β€’ A special situation happens when the limit random variable is a constant
(not random at all), in this case, Convergence in Probability to a constant
is equivalent to Convergence in Distribution to that same constant. That is
β„™
π’Ÿ
𝑋𝑛 β†’ 𝑐 ⇔ 𝑋𝑛 β†’ 𝑐
6
β€’ IID = Independent and Identically Distributed
β€’ An iid sequence of r.v.s 𝑋1 , 𝑋2 , β‹― , 𝑋𝑛 means
β€’ π‘‹π‘˜ ’s are mutually independent r.v.s.
β€’ All π‘‹π‘˜ ’s follow one same distribution, say, 𝐹.
β€’ IID mean is the arithmetic mean of the iid sequence
𝑋1 + 𝑋2 + β‹― + 𝑋𝑛
𝑋𝑛 ≔
𝑛
β€’ IID mean is a transformation of the 𝑛 random variables.
β€’ IID mean is a random variable.
β€’ IID mean is closely related to numerator IID sum.
7
β„™
β€’ 𝑋𝑛 β†’ πœ‡
β€’ 𝑋𝑛
π‘Ž.𝑠.
π’Ÿ
πœ‡
β€’ 𝑋𝑛 β†’ 𝑁
𝜎2
πœ‡,
𝑛
β€’ CLT
β€’ WLLN
β€’ SLLN
β€’ SLLN β‡’ WLLN
β€’ CLT β‡’ WLLN
8
β€’ One consequence of CLT is we can now at least approximate
the distribution of an IID sum by normal distribution.
β€’
β€’
β€’
β€’
β€’
β€’
β€’
Binomial is an IID sum of Bernoullis
Poisson is an IID sum of Poissons
Negative Binomial is an IID sum of Geometrics
Negative Binomial is also an IID sum of Negative Binomials
Gamma is an IID sum of Exponentials
Gamma is also an IID sum of Gammas
Chisq is an IID sum of Chisqs
β€’ All of the above (and many others) can be approximated by
appropriately parameterized Normal distributions.
9