Download Probability

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Statistics wikipedia , lookup

Transcript
-
Expected values
Trees, tables and Venn diagrams
Factorials, combinations and permutations
Conditional probability




This standard has the highest failure rate of
all standards in statistics.
Last year approximately 75% of those who
sat the old probability standard from KBHS
did not achieve.
A standard with very simple principles and
mathematical processes.
Commonly misunderstood questions and
incorrect methods used.

1. Repeated practice (hundreds of times)
▪ By familiarising yourself with questions you can ‘feel’ the
correct method to use.

2. Understanding the question
▪ You are incredibly likely to use the wrong methods if you
do not carefully READ the question several times.

3. Common sense
▪ Like most things in statistics, the most simple way to
approach a question is to forget about the mathematics
and simply consider the problem.

The number 1 skill you will need to develop
throughout this topic is to select the correct
representation for a situation.

This will generally be one of 3 things- a tree
diagram, a Venn diagram or a table.

The situation will dictate which one makes
most sense.

Expected values are estimates of a mean
outcome from the set of all possibly
outcomes which are weighted by probability.

We take the value of each event outcome and
multiply it by the outcome probability. The
sum of these results gives us the expected
value.

Expected value (µ):
𝐸(𝑋) =
(𝑥𝑖 𝑝𝑖 )

Variance and standard deviation are two very
common measures of spread. The standard
deviation is the square root of the variance.

VAR(X) = SD(X)2 or (σ2)

And

σ =
𝑉𝐴𝑅

The variance of a random variable is the
expected squared distance of each individual
x from its mean.

These distances are weighted as with E(X).

𝑉𝐴𝑅 𝑋 = 𝐸 𝑋 2 − 𝐸(𝑋)2
x
1
2
3
p
0.5
0.25
0.25
E(X):
Each individual x is weighted:
1 × 0.5 + 2 × 0.25 + 3 × 0.25
0.5
+
0.5
+
0.75
E(X) = 1.75
E(X²):
Each x value is squared and weighted
12 × 0.5 + 22 × 0.25 + (32 × 0.25)
1 × 0.5 + 4 × 0.25 + (9 × 0.25)
0.5
+
1
+
2.25
E(X2) = 3.75
VAR(X):
The expected value of X is squared and subtracted from the expected value of X²
VAR(X) = 3.75 − 1.752
VAR(X) = 0.6875

When given a linear function ‘y’ that acts
upon a random variable ‘x’ we can also find
expected values and variance of that
function- E(Y) and VAR(Y).

From working through a simple example we
can see how the function affects E(X) and
VAR(X).
x
1
2
3
p
0.5
0.25
0.25
E(X):
Each individual x is weighted:
1 × 0.5 + 2 × 0.25 + 3 × 0.25
0.5
+
0.5
+
0.75
E(X) = 1.75
E(X²):
Each x value is squared and weighted
12 × 0.5 + 22 × 0.25 + (32 × 0.25)
1 × 0.5 + 4 × 0.25 + (9 × 0.25)
0.5
+
1
+
2.25
E(X2) = 3.75
VAR(X):
The expected value of X is squared and subtracted from the expected value of X²
VAR(X) = 3.75 − 1.752
VAR(X) = 0.6875

Now lets say a function of y=2x+3 is applied
to this events outcome.
x
1
2
3
p
0.5
0.25
0.25
y
5
7
9
p
0.5
0.25
0.25
Earlier we worked
out that:
E(X) = 1.75
VAR(X) = 0.6875
y
5
7
9
p
0.5
0.25
0.25
E(Y):
Each individual y is weighted:
(5 x 0.5) + (7 x 0.25) + (9 x 0.25)
2.5 + 1.75 +
2.25
= 6.5
VAR(Y):
E(Y²) – E(Y) ²
45 – 6.5²
45 – 42.25
= 2.75
E(Y²):
Each individual y² is weighted:
( 5² x 0.5) + (7² x 0.25) + (9² x 0.25)
(25 x 0.5) + (49 x 0.25) + (81 x 0.25)
12.5 + 12.25 + 20.25
= 45
For reference:
E(X) = 1.75
VAR(X) = 0.6875

The linear function is y=2x+3
E(X) = 1.75
E(Y) = 6.5

By putting 1.75 in for x we find that:

E(aX + b) = a x E(X) + b


▪ or

E(Y) = aE(X) + b

The linear function is y=2x+3
VAR(X) = 0.6875
VAR(Y) = 2.75

VAR(Y) = a² x VAR(X)

The relationship is difficult to see at first, but
makes sense we consider the reasons.



Remember, the variance measures the
squared distance, it is influenced by the
square of the co-efficient in the function (a²).

Also remember, variance measures spread.
By adding a constant we change all the
outcome values but their relative difference is
the same (the difference between the
numbers). Constants have no effect.


If we have two independent variables that
are combined we establish several
relationships. Most of these are based on
theories outside the scope of this course so
the theory behind these are not required
though you do need to be able to compute
them.
If variables are not independent they cannot
be combined.

If we have two random variables (in this case
labelled X and Y) that are to be added we find
that:

E(T) = E(X+ Y) = E(X) + E(Y)
▪ And

VAR(T) = VAR(X+Y) = VAR(X) + VAR(Y)

The mean weight of a can is estimated to be
30 grams with a standard deviation of 3
grams. The mean weight of beans in the can
is estimated as 470 grams with a standard
deviation of 6 grams. Find the mean total
weight of a can of beans along with its
variance.






E(X) = 30 grams
SD(X) = 3 grams
VAR(X) = 9 grams
E(T) = E(X) + E(Y)
E(T) = 30 + 470
E(T) = 500 grams
VAR(T) = VAR(X) + VAR(Y)
E(Y) = 470 grams
VAR(T) = 9 + 36
SD(Y) = 6 grams
VAR(Y) = 36 grams VAR(T) = 45 grams
SD(T) = 6.7 grams

The difference between two variables is a
little bit different. As we expect:

E(T) = E(X –Y) = E(X) – E(Y)

But…

VAR(T) = VAR(X –Y) = VAR(X) + VAR(Y)

This is because of the squared term in
functions of variables.

Consider:

T = X –Y
VAR(T) = (+1)²VAR(X) (-1)²VAR(Y)

▪ so

VAR(T) = (+1)VAR(X) (+1)VAR(Y)

The average diameter of the inside of a nut is
4.5mm with a variance of 0.005mm. The
average diameter of a bolt is 4.4mm with a
variance of 0.004mm. Find the average gap
between a nut and a bolt.

E(X) = 4.5 mm
E(Y) = 4.4 mm

VAR(X) = 0.005 mm
VAR(Y) = 0.004 mm






E(T) = E(X –Y) = E(X) – E(Y)
E(T) = 4.5 – 4.4 = 0.1 mm
VAR(T) = VAR(X –Y) = VAR(X) + VAR(Y)
VAR(T) = 0.005 + 0.004 = 0.009 mm
SD(T) = √0.009
SD(T) = 0.094 mm

Combining all of the information we have
learned so far we can see that:

Z = aX + bY

E(Z) = E(aX + bY) = aE(X) + bE(Y)

VAR(Z) = VAR(aX + bY) = a²VAR(X) + b²VAR(Y)
▪ Assuming x and y are independent.

A truck carts a mixture of two different types
of metal. When loading, the truck received 3
scoops of small metal and 4 scoops of large
metal. From experience, the driver knows the
average weight of a scoop of small metal is
500kgs and the average weight of a scoop of
large metal is 400kgs. If the variance of
weights are 200kgs for small metal and
250kgs for large metal, estimate the trucks
weight and variance.

Z = 3X + 4Y
▪ The total load is 3 small scoops (X) and 4 large scoops (Y).


E(Z) = 3(500) + 4(600)
E(Z) = 3900 kgs

VAR(Z) = 3² x 200 + 4² x 250
VAR(Z) = 5800 kgs

SD(Z) = 76.16 kgs
