Download binomial random variable

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Central limit theorem wikipedia , lookup

Transcript
AP Statistics
β€’ Section 8.1: The Binomial Distribution
β€’ Objective: To be able to understand and
calculate binomial probabilities.
β€’ Criteria for a Binomial Random Variable:
1. Each observation can be classified as a success or
failure.
2. n is the number of trials and n is fixed.
3. p is the probability of success and p is fixed.
4. The observations are independent.
If data are produced in a binomial setting and X = the
number of successes, then X is called a binomial random
variable.
The binomial probability distribution is as follows:
β€’
𝑿 = π’™π’Š
0
1
2
…
n
P(𝑿 = π’™π’Š )
π’‘πŸŽ
π’‘πŸ
π’‘πŸ
…
𝒑𝒏
F(𝑿 = π’™π’Š )
π’‘πŸŽ
π’‘πŸŽ + π’‘πŸ
π’‘πŸŽ + π’‘πŸ + π’‘πŸ
…
1
Points:
β€’ A binomial random variable is always discrete.
β€’ To graph use a probability histogram.
β€’ Notation: X~B(n,p)
β€’ When the population > 10n (sample size) and the sample
is selected without replacement we have an approximate
binomial distribution. It is approximate because the
condition of independence is violated. Since the effect is
minimal, we can still use the binomial formula for
calculations.
Ex. Let p = 0.3, Population (N) = 50,000, n = 5
β€’ Binomial Probability Formula:
β€’ Let X be a binomial random variable and k be the
number of successes. If X~B(n,p), then
𝑛 π‘˜
β€’ 𝑃 𝑋=π‘˜ =
𝑝 βˆ™ (1 βˆ’ 𝑝)(π‘›βˆ’π‘˜) where
π‘˜
𝑛
𝑛!
β€’
=𝑛 πΆπ‘˜ =
which is called the binomial
π‘˜! π‘›βˆ’π‘˜ !
π‘˜
coefficient.
β€’ (1 - p) = q
q represents the probability of failure.
𝑛 π‘˜ (π‘›βˆ’π‘˜)
β€’ 𝑃 𝑋=π‘˜ =
𝑝 βˆ™π‘ž
(revised with q)
π‘˜
Ex. A large shipment of toys contains 30% that are red,
40% that are blue and 30% that are yellow. Let X = the
number of red toys. You select 5 toys at random.
a. Does this example meet the criteria for a binomial
setting?
b. Find P(X = 0)
c. Find P(X = 1)
d. Find P(X = 2)
Calculator notation:
For P(X = k) --- use binompdf(n,p,X)
For P(X ≀ k) --- use binomcdf(n,p,X)
e. Create the probability distribution of X. Include the
cumulative distribution.
𝑿 = π’™π’Š
P(𝑿 = π’™π’Š )
F(𝑿 = π’™π’Š )
β€’ F( X = x) is the cumulative density function.
𝐹 𝑋 = π‘₯𝑖 = 𝑃(𝑋 ≀ π‘₯𝑖 )
β€’ The shape of the distribution of 𝐹 𝑋 = π‘₯𝑖 is always ________
β€’ Which parameter, n or p, has the greatest influence in the
shape of the probability distribution?
β€’ If p = 0.5 then the distribution is ____________________
β€’ If p < 0.5 then the distribution is ____________________
β€’ If p > 0.5 then the distribution is ____________________
β€’ Ex. 7 couples buy new homes. It is known that in this
large community 24% of new homes are built with
electric heat. Let X = the number that choose electric
heat.
β€’ Find the probability that 3 couples choose electric heat. (show the
formula that you would use)
β€’ Find the probability that at most 1 couple chooses electric heat.
β€’ Find the probability that at least 3 couples choose electric heat.
β€’ Find the probability that more than 3 couples choose electric heat.
β€’ Binomial Means and Standard Deviations
β€’ If the random variable X has a binomial distribution with
n observations and the probability of success p, then the
mean and standard deviation of X are
β€’ πœ‡ = 𝑛𝑝
and
𝜎 = π‘›π‘π‘ž
β€’
β€’
β€’
β€’
β€’
Normal Approximation to the Binomial Distribution
Suppose 𝑋~𝐡(𝑛, 𝑝). When n is large,
𝑋~π΄π‘π‘π‘Ÿπ‘œπ‘₯𝑁(𝑛𝑝, π‘›π‘π‘ž).
Points:
Normal Approximation condition: Both 𝑛𝑝 β‰₯
10 π‘Žπ‘›π‘‘ π‘›π‘ž β‰₯ 10
β€’ If either condition is not met then the distribution is too
skewed to use a normal approximation.
β€’ This method was used prior to the creation of calculators
and computers because as n gets larger and larger the
calculations become more time consuming.
β€’ As n increases, the binomial distribution approaches a
normal distribution.
β€’ Results for the normal approximation will be optimal
when n is large and p is close to 0.5
β€’ We use an π΄π‘π‘π‘Ÿπ‘œπ‘₯𝑁 notation because it will never be
exactly normal because the binomial distribution is
discrete and the normal distribution is continuous.
Ex. Let X = number of successes; n = 100, p = 0.75.
a. Find the exact solution to P(X < 70)
b. Using the Normal Approximation, find the P(X < 70).
Ex. Suppose from previous exit polls it is known that the
probability that a voter will choose candidate A is 0.56. In
an upcoming election, you plan on asking 1000 randomly
selected voters who they will vote for. Using a Normal
Approximation, find the probability that between 501 and
550 voters inclusive will vote for candidate A.
(OPTIONAL)
Binomial Simulation:
Methods:
1. The book method: Use randbin(1, p, n). This will
choose the number 1 (for success) with probability p
and 0 (for failure) with probability q. This emphasizes
the binomial philosophy of success and failure. Then
you must sum the 1s to arrive at the total number of
successes.
2. Recommended method: Use randbin(n, p, number of
simulations). This will give you the number of
successes in n trials with a probability of success of p. It
will do this for as many simulations as desired.
3. Use the methods learned in chapter 5.
Ex. In the Hershey Kiss activity that we did in chapter 6, we
found that approximately 30% of the time a Hershey Kiss
lands point up. Suppose you flip a kiss 100 times. How
likely is it that the kiss to lands point up more than 35
times?
a. Use simulation to determine an approximate answer.
b. Use the binomial formula to find the exact answer.
c. Use the Normal Approximation to arrive at an
approximate answer.