Download PDF

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Degrees of freedom (statistics) wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

Taylor's law wikipedia , lookup

Law of large numbers wikipedia , lookup

Transcript
variance∗
stevecheng†
2013-03-21 12:39:36
Definition
The variance of a real-valued random variable X is
Var X = E (X − m)2 , m = EX ,
provided that both expectations EX and E[(X − m)2 ] exist.
2
, or simply σ 2 . The expoThe variance of X is often denoted by σ 2 (X),√σX
nent on σ is put there so that the number σ = σ 2 is measured in the same
units as the random variable
X itself.
√
The quantity σ = Var X is called the standard deviation of X; because
of the compatibility of the measuring units, standard deviation is usually the
quantity that is quoted to describe an emprical probability distribution, rather
than the variance.
Usage
The variance is a measure of the dispersion or variation of a random variable
about its mean m.
It is not always the best measure of dispersion for all random variables, but
compared to other measures, such as the absolute mean deviation, E[|X − m|],
the variance is the most tractable analytically.
The variance is closely related to the L2 norm for random variables over a
probability space.
Properties
1. The variance of X is the second moment of X minus the square of the
first moment:
Var X = E[X 2 ] − E[X]2 .
This formula is often used to calculate variance analytically.
∗ hVariancei
created: h2013-03-21i by: hstevechengi version: h30510i Privacy setting:
h1i hDefinitioni h62-00i h60-00i h81-00i h83-00i h82-00i h55-00i
† This text is available under the Creative Commons Attribution/Share-Alike License 3.0.
You can reuse this document or portions thereof only if you do so under terms that are
compatible with the CC-BY-SA license.
1
2. Variance is not a linear function. It scales quadratically, and is not affected
by shifts in the mean of the distribution:
Var[aX + b] = a2 Var X ,
for any a, b ∈ R.
3. A random variable X is constant almost surely if and only if Var X = 0.
4. The variance can also be characterized as the minimum of expected squared
deviation of a random variable from any point:
Var X = inf E[(X − a)2 ] .
a∈R
5. For any two random variables X and Y whose variances exist, the variance
of the linear combination aX + bY can be expressed in terms of their
covariance:
Var[aX + bY ] = a2 Var X + b2 Var Y + 2ab Cov[X, Y ] ,
where Cov[X, Y ] = E[(X − EX)(Y − EY )], and a, b ∈ R.
6. For a random variable X, with actual observations x1 , . . . , xn , its variance
is often estimated empirically with the sample variance:
n
Var X ≈ s2 =
1 X
(xi − x̄)2 ,
n − 1 i=1
2
n
x̄ =
1X
xj .
n j=1