Download Notes

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Statistics wikipedia , lookup

History of statistics wikipedia , lookup

Ars Conjectandi wikipedia , lookup

Probability wikipedia , lookup

Probability interpretations wikipedia , lookup

Transcript
Stats CH 6 Intro Activity
1
1. Purpose – can you tell the difference between bottled water and tap water?
You will drink water from 3 samples. 1 of these is bottled water.
2. You must test them in the following order:
First:
Second:
Third:
3. After testing each, circle the letter that you feel is the bottled water and show the teacher.
Do NOT discuss your results with your classmates.
4. Make a chart of the number of people who stated each sample
Sample
Frequency
A
B
C
Total
Relative
Frequency
The true bottled water sample is…
5. What percent correctly identified the bottled water?
6. Assume that no one can distinguish tap water from bottled water. If this is the case, then what is the
probability that a person would guess correctly?
7. What would the percentage that can distinguish tap water from bottled water have to be in order for you to
be convinced that the students are not guessing?
8. Perform a simulation of the 3 cups example to test how many times the correct response was discovered
among 50 students. The strings are 10 digits long. There are 5 strings. None of the strings include the digit 0.
What digits will you consider correct:
total correct:
percent:
What digits will you consider incorrect:
total incorrect:
percent:
5231676879
2781234823
9165396471
1248164528
8981692471
9. Make a histogram for the number of correct responses in class
1
Stats CH 6 Intro Activity
2
1. Purpose – can you tell the difference between bottled water and tap water?
You will drink water from 3 samples. 1 of these is bottled water.
2. You must test them in the following order:
First:
Second:
Third:
3. After testing each, circle the letter that you feel is the bottled water and show the teacher.
Do NOT discuss your results with your classmates.
4. Make a chart of the number of people who stated each sample
Sample
Frequency
A
B
C
Total
Relative
Frequency
The true bottled water sample is…
5. What percent correctly identified the bottled water?
6. Assume that no one can distinguish tap water from bottled water. If this is the case, then what is the
probability that a person would guess correctly?
7. What would the percentage that can distinguish tap water from bottled water have to be in order for you to
be convinced that the students are not guessing?
8. Perform a simulation of the 3 cups example to test how many times the correct response was discovered
among 50 students. The strings are 10 digits long. There are 5 strings. None of the strings include the digit 0.
What digits will you consider correct:
total correct:
percent:
What digits will you consider incorrect:
total incorrect:
percent:
8699344889
3373967234
7414855135
5434738399
7973697488
9. Make a histogram for the number of correct responses in class
2
AP STATS
Section 6.1: Discrete and Continuous Random Variables
Random Variable: A variable, usually represented by an X, which has a single numerical value (determined by
chance) for each outcome of an experiment.
Examples:
1) X = The number of seniors who get into college early.
2) X = The number of defective tires on a car
3) X = A random number chosen between 0 and 1
4) X = The lifetime of a light bulb.
a) Discrete Random Variable: Has a finite or countable number of values. (Examples 1 and 2)
b) Continuous Random variable: Has infinitely many values and the values can be associated with a
continuous scale so that there are no gaps or interruptions. (Examples 3 and 4)
Probability Distribution: Gives the probability of each value of a random variable.
Example 1 - Eggs-ample: Suppose the random variable X is the number of broken eggs in a randomly selected
carton of one dozen “store brand” eggs at a certain supermarket. Since the number of broken eggs is a discrete
random variable, the probability distribution is a list or a table of the possible values of X and the corresponding
probabilities
Number of Broken Eggs:
Probability:
0
.65
1
.20
2
.08
3
.04
4
.02
5
.01
Requirements for a probability distribution (legitimate):
1)
! ! =1
says all the probabilities must add to 1
2) 0 ≤ p(X) ≤ 1
for all values of x
Here is a probability Histogram that represents the above probability distribution:
Probability
Probability Histogram
P(X > 3) =
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
P(X ≥ 1) =
P(X < 2) =
0
1
2
3
Number of Broken Eggs
4
5
P(X ≥ 4) =
3
Example 2 - Tossing Coins:
What is the probability distribution of the discrete random variable X that counts the number of heads in three
tosses of a coin?
Number of heads
Probability
0
1
2
3
Let’s create a probability histogram for this distribution.
For a continuous random variable we use density curves, not histograms, to graphically represent the
distribution:
Example 3 - Random Number Example: X is a random number between 0 and 2. The distribution is a
continuous distribution and is represented by the uniform density curve below:
P(X)
0.5
0
2
X
The probability of any event is the area under the density curve
P(1 ≤ x ≤ 2) =
P(0 ≤ x ≤ 0.5) =
P(x ≥ 0.3) =
4
If the curve is not uniform, you would need to use geometry or
calculus to find the probability of an event
And don’t forget, the most familiar density curve is the standard
normal curve. The Normal Distribution is a probability
distribution.
The Expected Value of a random variable represents the average value of the outcomes. The expected value is
the mean of a probability distribution. It is found by finding the value of:
Expected Value:
E = (! • ! ! ) = !
Mean: µ =
∑ x ⋅ p(x)
Example 4 - Looking back at the Probability distribution from example 1, we can calculate, on average, how
many broken eggs there will be in a carton:
Number of Broken Eggs:
Probability:
0
.65
1
.20
2
.08
3
.04
4
.02
5
.01
This means that “on average” we can expect that _________ eggs will be broken in a randomly selected carton
of eggs.
Example 5 - Getting the Flu: The probability that 0, 1, 2, 3, or 4 people will seek treatment for the flu during
any given hour at an emergency room is show in the probability distribution below.
x
P(x)
0
.12
1
.25
2
?
3
.24
4
.06
a) What is the probability that 2 people will seek treatment for the flu during any given hour at an emergency
room?
b.) What is the probability that at least 1 person will be treated for the flu in the next hour?
c.) What is the probability that 3 or more people will be treated for the flu in the next hour?
d.) What is the average number of people that an emergency room can expect to treat for the flu during any
given hour?
HW Section 6-1 A: 1,5,7,9,13
5
More Examples of Expected Value and Games:
•
Expected value can also be used to determine whether or not a particular game is “fair” and therefore worth
playing.
Example 6 - Patrick offers Christine to play a dice game whereby he will pay her $6 if she rolls a six, but
Christine would have to pay Patrick $1 every time she rolls a number other than 6. Should Christine agree to
play this game over a long period of time?
To find the expected value of a game multiply each payoff by its probability and then add.
Based on the above definition, the expected value for Christine is:
1
This means that for every roll of the die, Christine will earn, on average, $ , which also means that Patrick will
6
1
lose, on average $ for every roll of the die.
6
If Patrick and Christine were to play this game 100 times, how much money (on average) can Christine expect
to make? How much can Patrick expect to lose?
Fair Game: We say that a game is “fair” whenever its expected value is equal to zero. For example, you win a
dollar every time you toss a coin and get heads, and lose a dollar every time you get tails.
Example 7 - Zoot Suit Example: Would you be willing to play this game? You pay me $25 to play. I take a
standard deck of cards. I Shuffle the cards well. You pick 1 card at random. The suit of that card becomes
your winning suit. You do not put the card back. You pick 2 more cards at random. If those 2 cards are both
the winning suit, I will pay you $500. If they are not, I pay you nothing.
6
Variance, and standard deviation of a probability distribution
Variance
2)
Standard Deviation
∑ µ
σ
µ
∑
σ =
∑[x ⋅ p(x)] − µ
1) σ 2 =
3)
(the square root of the variance)
2
=
( x − ) 2 ⋅ p ( x)
[ x 2 ⋅ p( x)] −
2
2
2
Note: The mean and standard deviation formulas are in the stat formula packet that you get on the AP
Example 8 - Find the mean, variance, and standard deviation of the following probability distribution:
x
0
1
2
3
4
P(x)
.3
.1
.2
.1
.3
Calculator Method:
HW Section 6-1 B: 14,18,19,23,25
7
Section 6.2: Transforming and Combining Random Variables
Review of Chapter 2: Operations on statistics:
Addition of constant a
Mean
Increase by a
Subtraction of
constant a
Decrease by a
Multiplication of
constant a
Multiply by a
Division of constant a
Median
Increase by a
Decrease by a
Multiply by a
Divides by a
Standard Deviation
No change
No change
Multiply by |a|
Divides by |a|
Quartiles
Increase by a
Decrease by a
Multiply by a
Divides by a
IQR
No change
No change
Multiply by |a|
Divides by |a|
Min and Max
Increase by a
Decrease by a
Multiply by a
Divides by a
Range
No change
No change
Multiply by |a|
Divides by |a|
Percentiles
Increase by a
Decrease by a
Multiply by a
Divides by a
Divides by a
Rules for Random Variables:
1. If the value “b” is multiplied by each value of a random variable:
Measures of Center (mean, median, quartiles, percentiles) multiply by b
Measures of Spread (range, IQR, standard deviation) multiply by |b|
Shape is NOT changed
2. If the value “b” is divided into each value of a random variable:
Measures of Center (mean, median, quartiles, percentiles) divide by b
Measures of Spread (range, IQR, standard deviation) divide by |b|
Shape is NOT changed
3. If the value “b” is added to each value of a random variable:
Adds b to measures of Center (mean, median, quartiles, percentiles)
Measures of Spread (range, IQR, standard deviation) are NOT changed
Shape is NOT changed
4. If the value “b” is subtracted from each value of a random variable:
Subtracts b from measures of Center (mean, median, quartiles, percentiles)
Measures of Spread (range, IQR, standard deviation) are NOT changed
Shape is NOT changed
5. If Y = a + bX is a linear transformation of the random variable X, then
The probability distribution of Y is the same as X
The mean for Y = a + b•(mean of X)
!! = ! + ! • !!
The st. dev. of Y = |b|•(st. dev. of X)
!! = |!| • !!
8
Example 1
Pete owns a Jeep Tour Company. There must be at least 2 passengers for the trip and no more than 6. The
discrete random variable X describes the number of passengers with the following distribution
No. of passengers xi 2
Probability pi
0.15
3
0.25
4
0.35
5
0.20
6
0.05
a) Find the expected value (mean) and the standard deviation for X. Describe it in context.
Example 2
Pete charges $150 per passenger. The discrete random variable C is equal the total amount of money that Pete
collects on a randomly selected trip. Therefore if X = 2 passengers, then C = $300.
Then C = 150X
Total Collected ci
Probability pi
300
0.15
450
0.25
0.35
0.20
0.05
b) Complete the values for the total collected in the table above
c) Find the expected value (mean) and the standard deviation for C. Describe it in context.
d) How do the answers to “c” compare to the answer to “a”?
Example 3
It costs Pete $100 to buy permits, gas, and a ferry pass for each half-day trip. The amount of profit V that Pete
makes from the trip is the total amount of money C that he collects from passengers minus $100.
Then V = C – 100
If Pete has only two passengers on the trip (X = 2), then C = 300 and V = 200.
Total Profit vi
Probability pi
200
0.15
350
0.25
0.35
0.20
0.05
e) Complete the values for the total profit in the table above
f) Find the expected value (mean) and the standard deviation for V. Describe it in context.
g) How do the answers to “F” compare to the answer to “a” and “c”?
9
Example 4
At a car dealership, let the discrete random variable X = cars sold on a given day with distribution:
Cars sold
Probability
0
.3
1
.4
2
.2
3
.1
1. Find the mean and standard deviation
2. The owner offers the manager a $500 bonus for each car sold. The manager attempts to attract
customers by offering free coffee and donuts at a cost of $75. Define a profit variable and find the
mean and standard deviation for profit.
HW Section 6-2 C: 27-30,37, 39-41,43,45
10
Combining Random Variables Activities
11
Rules for means & variances for two random variables
Here is an illustration of these important statistical formulas (which are NOT included on your AP Stat Formula
packet)
If X and Y are any two random variables, then
or if T = X + Y
µ x+ y = µ x + µ y
! ! = !! + !!
If X and Y are independent random variables, then σ 2 x + y = σ 2 x + σ 2 y
or if T = X + Y
µ x− y = µ x − µ y
σ 2 x− y = σ 2 x + σ 2 y
!!! = !!! + !!!
Note: These "nice" rules do not hold for standard deviations
Example 5: Let’s say you started your own business on weekends. The business can bring you a profit on
Saturday and Sunday based on various (independent) scenarios. The profits and their respective probabilities
for Saturday and Sunday are listed below:
X= Saturday profit
x
p(x)
$10.00
$50.00
$100.00
SUM
0.1
0.5
0.4
1
Y= Sunday profit
y
$20.00
$60.00
$80.00
SUM
p(y)
0.2
0.3
0.5
1
Based on the tables to the left please calculate:
µx =
µy =
µx + µ y =
µx − µ y =
σ x2 =
σ y2 =
σ x2 + σ y2 =
σx =
σy =
σ x +σ y =
On the AP Formula Page
12
Example 6: A company makes packages containing Oreo cookies. The mean of an Oreo cookie is 40 grams
with standard deviation = 2 grams.
The company rejects cookies weighing less than 36 gram. In a batch of 20,000 cookies, how many will be
discarded?
The wrapper for a package containing 5 cookies has a mean of 6 grams with a standard deviation of 1.3 grams.
– What is the mean weight of the package (cookies and wrapper).
– What is the standard deviation of the package?
– Define the random variable A as the average weight of the 5 cookies. Find the mean and standard
deviation of A.
The Law of Large Numbers:
Draw independent observations at random from any population with finite mean µ. Decide how accurately you
would like to estimate µ. As the number of observations drawn increases, the mean ! of the observed values
eventually approaches the mean µ of the population as closely as you specified and then stays that close.
HW Section 6-2 D: 49,51,57-59,63
13
Section 6.3: The Binomial Distribution
There are many experiments and situations that result in what are called dichotomous responses – responses for
which there exist two possible choices (True / False, Yes / No, Defective / Non-defective, Male / Female, etc.).
A simple example of such an experiment is that of tossing a coin, where there are only two possibilities, Heads
or Tails. There are many other types of experiments similar to a coin toss where you are observing the “success”
or “failure” of a certain outcome. Such experiments give us a probability distribution called a binomial
distribution.
Requirements (Use this when asked if a situation is binomial):
1. Binary – there can only be 2 ways the event can turn out
2. Independent – events must be independent for each trial
3. Fixed – there must be a set number of trials
4. Probability of success is constant for all trials and the we are counting successes
Extra Rule – the sample size must be less than 10% of the entire population size.
Example 1: Tom is about to take a five-question true/false quiz for which he is not prepared. He will be
guessing on all five questions.
What is the probability that:
1) He gets all the answers correct?
2) He gets all the answers wrong?
3) He gets exactly three answers correct?
4) He will pass the quiz?
Solution: First we need to know how many total
answer combinations are there. Using the
counting rule:
2 ⋅ 2 ⋅ 2 ⋅ 2 ⋅ 2 = 32
Use ‘c’ to denote a correct answer and ‘w’ denote
a wrong answer. All the possible combinations of
correct and incorrect are shown !
The probability distribution for
this situation is shown below.
(X = number of correct responses):
x
0
1
2
3
4
5
p(x)
1/32
5/32
10/32
10/32
5/32
1/32
This can be done using the AP Stat Program.
14
The long process used in example 1 can be avoided by using the notion that this is a binomial probability
distribution. There is a formula we can use to find the probability of any value in a binomial distribution:
n!
P( x) =
⋅ p x ⋅ q n− x for x = 0,1, 2, 3, ...., n
(n − x)! x!
n = # of trials
x = # of successes among n trials
p = probability of success in any one trial
q = probability of failure in any one trial (q = 1 – p)
! ! !!!
This formula can be written as ! ! = ! = !
! !
where the parenthesis is called n
!
choose k. This is the combination formula for choosing k items from n total items. In
combinations, order does NOT matter.
Formulas for The binomial distribution:
µ = np
σ 2 = np (1 − p)
σ = np (1 − p)
Example 2: Suppose Charlie manages to manipulate a coin in such a way that it lands on heads with a .7
probability and lands on tails with a .3 probability. Suppose John then flips the coin 20 times, what is the
probability that it will land on heads for exactly 13 of the twenty flips? On average, how many flips will result
in a heads? What is the standard deviation of this binomial distribution?
15
Example 3: Suppose that the probability that any random freshman girl will agree to go with Sam to the senior
prom is 0.1. Suppose Sam asks 20 random freshman girls to the prom, what is the probability that exactly 1 will
say “yes”? That at least 1 will say “yes”? On average, how many freshman girls will agree to go with Sam to
the prom? What is the standard deviation of this binomial distribution?
HW Section 6-3
E: 61,65,66,69, 71,73,75,77
F: 79,81,83,85,87,89
The Geometric Distribution
Let’s start with a simulation: Roll a die until the number six appears and keep a record of how many rolls it
took before the six was obtained.
While in a binomial distribution the random variable was the number of successes in a fixed number of trials, in
a geometric distribution the random variable is the number of trials it takes to achieve a success.
Examples:
1) flip a coin until you get heads
2) Roll a die until you get a 6
3) Throw darts at a dartboard until you hit the bull’s-eye
A geometric distribution must have the following properties:
1) Each trial in the experiment must have only two possible outcomes (success or failure)
2) The probability of success, p, doesn’t change from trial to trial
3) The trials in the experiment are independent
4) The variable of interest is the number of trials required to reach the first success.
P(X = n) = (1-p)n-1p
where p = probability of success
The mean, µ, of a geometric distribution (the average number of times we can expect to repeat the trials before
a success occurs) is simply 1/p where p is the probability of success.
µ = 1/p
The variance of a geometric distribution
variance = (1 - p) / p2
16
Example 4: We will take our simulation example to analyze various aspects of the geometric distribution.
Lets first find the various probabilities associated with our dice simulation and come up with a probability
distribution:
a. Find the probability that a 6 will come up on the first roll.
b. Find the probability that a 6 will come up on the second roll.
P(won’t come up on the first roll) =
P(will come up on the second roll =
c. Find the probability that a 6 will come up on the third roll.
d. If we proceed in the manner above will come up with the following probability distribution:
X
P(X)
1
2
3
4
5
1/6
5/6•1/6
5/6•5/6•1/6
5/6•5/6•5/6•1/6
5/6•5/6•5/6•5/6 •1/6
n
(5/6)n-1(1/6)
e. How many rolls can we expect, on average, to roll the die before getting the number 6?
f. The probability that it takes more than n trials to see a success is:
g. Find the variance of a geometric distribution
h. Find the probability that it would take more than 5 rolls for us to get the number 6
HW Section 6-3 G: 93,95,97, 99,101-104
Quiz Review: Complete the AP Prep Questions on pages 409-411
17