Download A small store keeps track of the number X of customers that make a

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Probability wikipedia , lookup

History of statistics wikipedia , lookup

Statistics wikipedia , lookup

Transcript
Statistics 2011 – 2012 Final Exam Review Packet; page 1
Name ______________________________________________________ Date __________________
Statistics Final Exam Review Packet: Chapters 7-12
Write your final answers in math form: P( ) =
Show your work clearly with solutions or calculator commands. ex: normalcdf (Z, e99)
Chapter 7 : Random Variables
Mean of a Discrete Random Variable:
   X i Pi  X1P1  X 2 P2  ...
Concept:
Differences between discrete and continuous random variables. Law of large numbers.
1. A small store keeps track of the number X of customers that make a purchase during the first hour that
the store is open each day. Based on the records, X has the following probability distribution.
(a) Construct a probability histogram of X using the space below.
(b) What is the probability that no customers make a purchase during the first hour that the store is open?
(c) What is the probability that the number of customers that make a purchase during the first hour that
the store is open is more than 1 but no greater than 3?
(d) Find the mean number of customers that make a purchase during the first hour that the store is open.
Show products used to arrive at your answer.
(e) Find P  2.0  X  4.0 
(f) Find P  2.0  X  4.0 
(g) Circle the correct answer: This distribution is an example of a ( discrete ) ( continuous ) random
variable.
2. I toss a fair coin a large number of times. Assuming tosses are independent, which of the following
is true?
A) Once the number of flips is large enough (usually about 10,000) the number of heads will
always be exactly half the total of the number of tosses. For example, after 10,000 tosses I
should have exactly 5000 heads.
B) The proportion of heads will be about 1/2, and this proportion will tend to get closer and closer
to 1/2 as the number of tosses increases.
C) As the number of tosses increases, any long run of heads will be balanced by a corresponding
run of tails so that the overall proportion of heads is 1/2.
D) All of the above.
Statistics 2011 – 2012 Final Exam Review Packet; page 2
3. A lottery offers one $1,000 prize, one $500 prize, and five $100 prizes. One thousand tickets are sold
at $3 each.
(a) Make a probability model of the player’s winning amount and probabilities.
(b) What is player’s expected winning amount per bet (hint: find mean)?
(c) In the long run, how much does the lottery company make from each single bet?
Chapter 8 : Binomial and Geometric Distributions
Binomial mean and standard deviation:
  np
  np 1 p 
Geometric mean and standard deviation:
1  p 
1


p
p2
geometpdf or geometcdf (p, x)
binompdf or binomcdf (n, p, x)
Concept:
Differences between binomial and geometric distributions. Normal approximation.
4. Identify whether the following setting is binomial, geometric, or neither. Explain clearly with the four
requirements.
(a) Draw a card from a standard deck of 52 playing cards, observe the card, and replace the card within
the deck. Count the number of times you draw a card in this manner until you observe a jack.
(b) UPS Company advertises that it ships 90% of its order within three working days. You select a
random sample of 100 mail orders received in the past week and examine the probability that 90 of them
were shipped on time.
5. In a large population of college students, 20% of the students have experienced feelings of math
anxiety. If you take a random sample of 10 students from this population, the probability that exactly
2 students have experienced math anxiety is
A) 0.3020
C) 0.2013
E)
1
B) 0.2634
D) 0.5
F)
None of the above
6. Refer to the previous problem # 5. Determine the standard deviation of the number of students in the
sample who have experienced math anxiety.
Statistics 2011 – 2012 Final Exam Review Packet; page 3
7. A manufacturer produces a large number of toasters. From past experience, the manufacturer knows
that approximately 2% are defective. In a quality control procedure, we randomly select 20 toasters for
testing. We want to determine the probability that the number of these toasters is defective.
(a) Is this a binomial or geometric distribution? Explain clearly with four requirements.
(b) Determine the probability that exactly one of the toasters is defective.
(c) Find the probability that at most two of the toasters are defective.
(d) Find the mean and standard deviation for the random variable X in the toaster problem.
8. What rules of thumb must be satisfied in order to use the normal approximation to the binomial
distribution?
A) np  10
B) n 1 p  10
C) np 10 and n 1 p  10
D) none of the above
9. When a computerized generator is used to generate random digits, the probability that any particular
digit in the set {0, 1, 2, . . . , 9} is generated on any individual trial is 1/10 = 0.1. Suppose that we are
generating digits one at a time and are interested in tracking the first occurrence of the digit 0.
(a) Is this a binomial or geometric distribution? Explain clearly with four requirements.
(b) Determine the probability that the first 0 occurs as the fifth random digit generated.
(c) How many random digits would you expect to have to generate in order to observe the first 0?
Statistics 2011 – 2012 Final Exam Review Packet; page 4
Chapter 9: Sampling Distributions
Sampling Distribution of p̂ :
1.   p
Sampling Distribution of x :
1.   x
p 1  p 

2.  
n
n
Concepts:
1. Differences between parameter and statistics.
2. Differences between sample distributions for sample means and sample proportions.
3. Central Limit Theorem
2.  
10. Identify each of the bolded figures in the problem below as statistics or parameters. Use the
appropriate notation to indicate each statistic and parameter.
A 1993 survey conducted by the local paper in Columbus, Ohio, one week before election day asked
voters who they would vote for in the City Attorney’s race. Thirty-seven percent said they would vote
for the Democratic candidate. On election day, 41 % actually voted for the Democratic candidate.
11. The distribution of the values taken on by a statistic in all possible samples from the same
population is called
A) the parameter.
C) the sampling distribution.
B) bias.
D) a table of random digits.
12. Below are histograms of the values taken by three sample statistics in several hundred samples from
the same population. The true value of the population parameter is marked with an arrow.
(a) Which statistic has the largest bias among these three? Explain.
(b) Which statistic has the lowest variability among these three? Explain.
(c) Which statistic has the largest sample size?
Explain.
(d) Based on the performance of the three statistics in many samples, which is preferred as an estimate of
the parameter? Why?
Statistics 2011 – 2012 Final Exam Review Packet; page 5
13. Power companies kill trees growing near their lines to avoid power failures due to falling limbs in
storms. Applying a chemical to slow the growth of the trees is cheaper than trimming, but the chemical
kills some of the trees. Suppose that one such chemical would kill 20% of sycamore trees. The power
company tests the chemical on 250 sycamores.
(a) What are the unbiased mean and standard deviation of the sampling proportion of trees that are killed?
(b) Can you use the normal approximation for this study? Explain clearly with the two rules.
(c) What is the probability that at least 60 trees (24% of the sample) are killed?
14. The weights of newborn children in the United States vary according to the normal distribution with
mean 7.5 pounds and standard deviation 1.25 pounds. The government classifies a newborn as having
low birth weight if the weight is less than 5.5 pounds.
(a) What is the probability that a baby chosen at random weighs less than 5.5 pounds at birth?
(b) What is the probability that three randomly chosen babies have average birth weight less than 5.5
pounds?
Chapter 10: Statistical Inference
Margin of Error: Z
*

n
Concepts:
1. Stating null ( H 0 ) and alternate ( H a ) hypotheses.
2. Confidence Interval
3. Z test and interpret your results with p-value and α level.
4. Type I error (α error): falsely rejecting H 0
5. Type II error (β error): falsely accepting H 0
Statistics 2011 – 2012 Final Exam Review Packet; page 6
15. Suppose you administer a certain aptitude test to a random sample of 9 students in your school, and
that the average score is 105. We want to determine the mean of the population of all students in the
school. Suppose the population standard deviation is 15.
(a) What is the critical value, z*, for a 98% confidence interval (2 answers)?
(b) Calculate a 98% confidence interval for the mean score for the whole school. Sketch a graph to
display the center, confidence interval and 3 probabilities. Circle the margin of error.
(c) Calculate a 90% confidence interval for the mean score for the whole school. Sketch a graph to
display the center, confidence interval and 3 probabilities. Circle the margin of error.
(d) Compare your answers in (b) and (c), what is the relationship between margins of error and confidence
levels?
16. It is believed that the average amount of money spent per U.S. household per week on food is about
$110, with standard deviation $25. A random sample of 200 households in a certain affluent
community yields a mean weekly food budget of $105. We want to test the hypothesis that the mean
weekly food budget for all households in this community is lower than the national average.
(a) State two hypotheses in notations and perform a significance test at the   0.05 significance level.
Interpret your results and state your conclusion clearly.
(b) Describe a Type I error in the context of this problem. What is the probability of making a Type I
error?
(c) Describe a Type II error in the context of this problem.
Statistics 2011 – 2012 Final Exam Review Packet; page 7
17. At the bakery where you work, loaves of bread are supposed to weigh 1 pound, with standard
deviation  = 0.11 pounds. You believe that new personnel are producing loaves that differ from 1
pound. As supervisor of Quality Control, you want to test your hypotheses at the 0.05 significance level.
You weigh 20 loaves and obtain the following weights, in pounds:
0.98
1.07
0.99
1.07
1.08
1.06
1.12
1.03
1.04
0.99
1.06
1.05
1.05
1.11
1.08
1.12
1.03
1.00
1.04
1.03
(a) State the hypotheses both in words and symbols.
(b) Find the sample mean.
(c) Calculate the P-Value.
(d) Is the result significant at the   0.05 level? What is your conclusion?
(e) Sketch a normal curve for your significance test. Mark the suggested population mean, sample mean,
sampling standard deviation, P-value, and   0.05 . Shade the area under the curve to show the Pvalue.
18. A machine is designed to fill 16-ounce bottles of shampoo with a standard deviation of 0.1 ounce if
the machine is working properly. Four bottles are randomly selected each hour and the number of
ounces in each bottle is measured. The sample mean amount poured into the bottles is 16.05 ounces.
What is 95% of the observations should occur in which interval?
(A) 16.05 and 16.15 ounces
(B) –.30 and +.30 ounces
(C) 15.95 and 16.15 ounces
(D) 15.90 and 16.20 ounces
(E) None of the above
Statistics 2011 – 2012 Final Exam Review Packet; page 8
Chapter 11 and 12: More about Statistical Inference
19. Data is collected from a certain medical sample. If the sample mean from a simple random sample
of 28 observations is11.4 and the sample standard deviation s = 1.7, what is the standard error?
20. Hallux abducto valgus (call it HAV) is a deformation of the big toe that is uncommon in youth and
often requires surgery. Doctors used X-rays to measure the angle (in degrees) of deformity in 38 patients
under the age of 21 who came to a medical center for surgery to correct HAV.
The angle is a measure of the seriousness of the deformity. Here are the data:
28
32
25
34
38
26
25
18
30
26
21
17
16
21
23
14
32
25
21
22
16
30
30
20
50
25
26
28
31
38
28
20
32
13
18
21
20
26
(a) We are willing to consider these patients as a random sample of young patients who require HAV
surgery. Give a 95 % confidence interval for the mean HAV angle in the population of all such
patients. Sketch a graph to display the center, confidence interval and 3 probabilities. Circle the
margin of error.
(b) Is there convincing evidence that the mean reading of the 38 angles of deformity collected in the
previous exercise is less than the expected population mean of 28?
State your hypotheses clearly in words and notations. Carry out a t-test, find the p-value, interpret
your result at 0.05 significance level and state your conclusion clearly.
(c) Based on your result in (b), sketch a normal curve for your significance test. Mark the suggested
population mean, sample mean, sampling standard deviation, P-value, and   0.05 . Shade the area
under the curve to show the P-value.
Statistics 2011 – 2012 Final Exam Review Packet; page 9
21. Which of the following is an example of a matched-pairs design?
A) A track coach compares the long jump measurements of her upperclassmen
long jumpers with those of her underclassmen long jumpers.
B) A track coach compares the long jump measurements of her long jumpers
with the long jump measurements of a near-by high school’s track team.
C) A track coach compares the long jump measurements of her long jumpers
before and after a workshop on long jumping techniques.
D) A track coach compares the long jump measurements of her long jumpers
with nationally ranked long jumpers.
22. Mutual fund performance. Many mutual funds compare their performance with that of a
benchmark, an index of the returns on all securities of the kind the fund buys. The Vanguard
International Growth Fund, for example, takes as its benchmark the Morgan Stanley EAFE (Europe,
Australasia, Far East) index of overseas stock market performance. Here are the percent returns for
the fund and for the EAFE from 1982 (the first full year of the fund’s existence) to 2000. Does the
fund significantly outperform its benchmark (EAFE)?
(a) Explain these hypotheses in words:
Ho: µ1 = µ2
H1: µ1 > µ2
(b) Complete the following table by entering the data into your graphing calculator.
# of years (
)
Sample Mean of Returns (
)
Sample Standard Deviation (
1 Funds
2 EAFE
(c) Find the combined standard error.
(d) T-test: find t score and P-value.
(e) Interpret your results at significance level 0.05. State your conclusion clearly.
)
Statistics 2011 – 2012 Final Exam Review Packet; page 10
23. Do students tend to improve their Math SAT score the second time they take the test? A random sample
of four students who took the test twice received the following scores:
Student
First score
Second score
1
450
440
2
520
600
3
720
720
4__
600
630
Assume that the change in SAT math scores (second score - first score) for the
population of all students taking the test twice is normally distributed with mean  .
You wish to test the hypotheses: H 0 : 1  2 ; H a : 1  2 . The p-value for the test is
A) 0.304
B) 0.386
C) 0.152.
D) 0.2328.
24. In a matched-pairs experiment, 50 subjects tasted two unmarked cups of coffee (brand A and B) and
said which he/she prefers. 31 of them prefer brand A coffee.
(a) Determine a 95 % confidence interval for this sample proportion. Sketch a graph to display the center,
confidence interval and 3 probabilities. Circle the margin of error.
(b) Is there evidence to conclude that more than 50% of people prefer brand A coffee? Make two
hypotheses, perform a Z-test, find P-value, interpret your result at significance level 0.05 and state
your conclusion clearly.
(c) Based on your result in (b), sketch a normal curve for your significance test. Mark the suggested
population proportion, sample proportion, sampling standard deviation, P-value, and   0.05 . Shade
the area under the curve to show the P-value.