Download 5-1 Random Variables and Probability Distributions

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Fisher–Yates shuffle wikipedia , lookup

Transcript
5-1 Random Variables and
Probability Distributions
The Binomial Distribution
Random Variables
Discrete – These variables take on a finite number of
values, or a countable number of values
Number of days absent
Number of students taking a course
Continuous – These variables can take on an infinite
number of values on a number line
Time it takes for students to drive home
Gallons of gas you buy each time you go to the gas
station
(Typically length, temperature, volume, time, etc)
Probability Distribution
A probability distribution is an
assignment of probabilities to the
specific values of a random variable,
or to a range of values of the random
variable.
Discrete: probability assigned to each
value of the random variable (and the
sum = 1)
What does this mean?
Lets look at a graph of a probability
distribution of the discrete model, and
see what it is:
30.00
25.00
20.00
Series1
15.00
10.00
5.00
0.00
1
2
3
4
5
6
7
8
9
10
How do we look at it?
30.00
25.00
20.00
Series1
15.00
10.00
5.00
0.00
1
2
3
4
5
6
7
8
9
10
It looks like
A histogram, where the height is the
relative probability (i.e. a RELATIVE
FREQUENCY) and the bin is the
particular number.
Does this look familiar??
So what is the probability of choosing a
7 or a 3?
P(7 or 3) =
Mean and Standard Deviation
Probability distributions have a mean and
standard deviation.
For discrete population probability
distributions, the mean and the standard
deviation are given by formulas…
Which letters will we use?  and s or μ and σ?
Population…. Anyone??
Mean and Standard Deviation
μ   xP(x)
σ
(x  μ) P(x)
2
Sometimes this is called
the expected value of a
distribution - it is an
AVERAGE value, or what
can be thought of as a
central point (cluster
point)
Where x is the value of the random variable,
P(x) is the probability of that variable and
The sum is taken for all the values of that
random variable.
Notice – we are now discussing mean and
standard deviation of something other than
= Risk – the likelihood that a random variable is
raw data…
different from the mean
Lets consider the ways you can
toss a coin four times
Assume the coin is balanced (i.e. H and
T equally likely)
Assume also there is no memory (the
coin doesn’t remember that the last
toss was heads).
There are sixteen outcomes, right? (1/2)4
Lets draw out ALL possible outcomes
To assign a discrete random
variable
Let x = heads, so
x=0
TTTT
x=1
HTTT, THTT, TTHT, TTTH
x=2
HHTT, HTHT, HTTH, THHT, THTH,
TTHH
x=3
HHHT, HHTH, HTHH, THHH
x=4
HHHH
So the probability model would
be…
# of heads
0
1
2
3
4
Probability
.0625
.25
.375
.25
.0625
Calculate the mean and SD of
the distribution
mean = 2
Standard Deviation = 1
Linear Functions of Random
Variables
Suppose I have a and b, which are
constants. A new function L = a + bx
(where x is a random variable) ALSO
has a mean, variance and standard
deviation.
Linear Functions of Random
Variables
Suppose I have a and b, which are
constants. A new function L = a + bx
(where x is a random variable) ALSO
has a mean, variance and standard
deviation.
μL  a  bμ
2
L
2
σ b σ
σL  b σ
2
Combining independent
random variables
To make a linear combination of two
independent random variables x1 and
x2 ,
W = ax1 + bx2
and
Combining independent
random variables
To make a linear combination of two
independent random variables x1 and
x2 ,
W = ax1 + bx2
and
μW  aμ1  bμ2
2
W
2
2
1
2
σ  a σ b σ
2
2
1
2
2
2
σW  a σ  b σ
2
2
What?
All this is a way to look what happens
when you transform data (for instance,
if I take all the data and multiply by 2
then add 10, for rescaling purposes).
Application
The manager of a computer company quickly
shipped 2 computers to a client on the
same day as the order. Unfortunately, the
two computers were accidentally chosen
from a stockroom with an inventory of 15
computers, 4 of which were refurbished.
If one of the computer is refurbished it will be
sent back at your expense ($100). If both
are refurbished, the client will cancel the
order this month and you will lose $1000.
What is the expected value and standard
deviation of your loss?
Sources
http://www.mathworks.com/access/helpdesk/help/toolbox/stats
/index.html?/access/helpdesk/help/toolbox/stats/f4218.htm