Download Final Exam

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Confidence interval wikipedia , lookup

Foundations of statistics wikipedia , lookup

Inductive probability wikipedia , lookup

Multi-armed bandit wikipedia , lookup

Secretary problem wikipedia , lookup

Probability amplitude wikipedia , lookup

Simulated annealing wikipedia , lookup

Law of large numbers wikipedia , lookup

Misuse of statistics wikipedia , lookup

Transcript
Name:
ID#:
Final Exam
Elementary Statistics
May 15, 2014
PLEASE READ THE FOLLOWING INFORMATION.
• This is a 110-minute exam. Calculators are allowed. Books, notes, formula sheets, and
other aids are not allowed.
• You are required to show all your work and provide the necessary explanations everywhere
to get full credit.
1. The quality assurance engineer of a television manufacturer inspects TVs in lots of 100.
He selects 5 of the 100 TVs at random and inspects them thoroughly. Assuming that 6
of the 100 TVs in the current lot are defective, find the probability that exactly 2 of the
5 TVs selected by the engineer are defective.
Solution: We apply the Hypergeometric Distribution formula
b
a
·
n−x
x
f (x) =
a+b
n
with n = 5, a = 6, b = 94, and x = 2. We have
94
6
94
6
·
·
15 · 134, 044
2, 010, 660
5−2
2
3
2
= =
f (2) = =
≈ 0.027
6 + 94
100
75, 287, 520
75, 287, 520
5
5
So, there is a 2.7% chance that exactly 2 of the 5 TVs selected by the engineer will be
defective.
Info: The average in the class for this problem was 64.1%.
2. On New Year’s Eve, the probability of a person having a car accident is 0.09. The
probability of a person driving while intoxicated is 0.32 and probability of a person having
a car accident while intoxicated is 0.15. What is the probability of a person driving while
intoxicated or having a car accident?
Solution: Let I be the event that a person is driving while intoxicated and let A be the
event that a person is having a car accident. Then
P (I ∪ A) = P (I) + P (A) − P (I ∩ A) = 0.32 + 0.09 − 0.15 = 0.26
Info: The average in the class for this problem was 88.1%.
3. In the paper “Persistent Pulmonary Hypertension of the Neonate and Asymmetric Growth
Restriction” (Obstetrics & Gynecology, Vol. 91, No. 3, pp. 336-341), M. Williams et
al. reported on a study of characteristics of neonates. Infants treated for pulmonary
hypertension, called the PH group, were compared with those not so treated, called
the control group. One of the characteristics measured was head circumference. The
mean head circumference of the 10 infants in the PH group was 34.2 centimeters (cm).
Assuming that head circumferences for infants treated for pulmonary hypertension are
normally distributed with standard deviation 2.1 cm, determine the sample size required
to have a margin of error of 0.5 cm with a 95% confidence level.
z · σ 2
α/2
Solution: To find the sample size, we use the formula n =
. We know that
E
σ = 2.1 and E = 0.5. The confidence level is 0.95, which means that α = 0.05 and
zα/2 = z0.025 = 1.96. Thus
2
1.96 · 2.1
n=
≈ 67.7658
0.5
which, rounded up to the nearest whole number, is 68.
Info: The average in the class for this problem was 63.7%.
4. For a given population with σ = $15, we want to test the null hypothesis µ = $80 on the
basis of random sample of size n = 50. If the null hypothesis is rejected when x is greater
than or equal to $83 and otherwise it is accepted, find
(a) the probability of Type I error;
(b) the probability of Type II error when µ = $82;
(c) the probability of Type II error when µ = $84.
15
σ
Solution: First we note that σx = √ = √ .
n
50
(a) We have
83 − 80 √
√ = 2 ≈ 1.41
15/ 50
and the probability is 0.5 − 0.4207 = 0.0793.
z=
(b) We have
√
2
83 − 82
√ =
z=
≈ 0.47
3
15/ 50
and the probability is 0.5 + 0.1808 = 0.6808.
(c) We have
√
83 − 84
2
√ =−
z=
≈ −0.47
3
15/ 50
and the probability is 0.5 − 0.1808 = 0.3192.
Info: The average in the class for this problem was 50.6%.
5. The table below provides a sample of 19 exam grades randomly selected from a large
class.
76
76
86
92
74
78
84
82
82
72
62
74
96 66
52 68
76 78
88
8
4 6
4 6
6 6 8
8
Construct a stem-and-leaf display.
Solution:
5
6
7
8
9
2
2
2
2
2
6
4
2
6
8
Info: The average in the class for this problem was 96.7%.
6. Consider the following data for shipments of peanuts from a hypothetical U.S. exporter
to five Canadian cities.
City
Montreal
Ottawa
Toronto
Vancouver
Winnipeg
Peanuts (Thousands of Bags)
64.0
15.0
285.0
228.0
45.0
Let’s assume that shipments to the respective cities will be sold at the following profits
per thousand bags: $15.00, $13.50, $15.50, $12.00, and $14.00. Find the average profit
per thousand bags.
Solution: By the formula for the weighted mean, we get
(64)($15.0) + (15)($13.5) + (285)($15.5) + (228)($12.0) + (45)($14.0) $8946
=
≈ $14.04
64 + 15 + 285 + 228 + 45
637
Info: The average in the class for this problem was 87.3%.
7. An urn contains 6 red marbles and 4 black marbles. Two marbles are drawn without
replacement from the urn. What is the probability that both of the marbles are black?
Solution: Let B1 and B2 be the events that the first and second marbles are black,
respectively. Then we want P (B1B2 ). Note that we easily have that P (B1 ) = 4/10. Also,
P (B2 |B1 ) = 3/9 because the reduced sample space B1 consists of 3 black marbles and 6
red marbles, and we are asking for the probability of choosing a black marble. Therefore,
4 3
2 1
2
· = · =
10 9
5 3
15
10
4
.
/
Note, that we could also do it straightaway as
2
2
P (B1B2 ) = P (B1 )P (B2 |B1 ) =
Info: The average in the class for this problem was 79.4%.
8. In 1908, W. S. Gosset published the article “The Probable Error of a Mean” (Biometrika,
Vol. 6, pp. 125). In this pioneering paper, written under the pseudonym “Student,”
Gosset introduced what later became known as Students t-distribution. Gosset used the
following data set, which gives the additional sleep in hours obtained by a sample of 10
patients using laevohysocyamine hydrobromide.
1.9 0.8 1.1 0.1
4.4 5.5 1.6 4.6
-0.1
3.4
(a) Find the mean of the sample.
Solution: We have
x=
23.3
1.9 + 0.8 + 1.1 + 0.1 + (−0.1) + 4.4 + 5.5 + 1.6 + 4.6 + 3.4
=
= 2.33 hr
10
10
(b) Find the standard deviation of the sample.
Solution: We will use the formula
r
X
Sxx
, where Sxx =
x2 −
n−1
X
X
First we calculate
x and
x2 , getting
s=
X
and
X
X 2
x
n
x = 1.9 + 0.8 + 1.1 + 0.1 + (−0.1) + 4.4 + 5.5 + 1.6 + 4.6 + 3.4 = 23.3
x2 = 1.92 + 0.82 + 1.12 + 0.12 + (−0.1)2 + 4.42 + 5.52 + 1.62 + 4.62 + 3.42 = 90.37
Therefore
Sxx =
hence
X
x2 −
s=
r
X 2
x
n
Sxx
=
n−1
r
= 90.37 −
23.32
≈ 36.08
10
36.08
≈ 2.002 hr
9
(c) Obtain a 95% confidence interval for the additional sleep that would be obtained on
average for all people using laevohysocyamine hydrobromide.
Solution: We want a 95% confidence interval, so α = 1 − 0.95 = 0.05. For n = 10, we
have df= 10 − 1 = 9. From Table II,
tα/2 = t0.05/2 = t0.025 = 2.262
The confidence interval for µ is from
s
x − tα/2 · √
n
s
to x + tα/2 · √ .
n
We know that x = 2.33 and s = 2.002, therefore a 95% confidence interval for µ is from
2.002
2.33 − 2.262 · √
10
2.002
to 2.33 + 2.262 · √
10
or 0.90 to 3.76 hr.
(d) Interpret your answer from part (c).
Solution: It appears so because, based on the confidence interval, we can be 95% confident
that the mean additional sleep is somewhere between 0.90 hr and 3.76 hr and that, in
particular, the mean is positive.
Info: The average in the class for this problem was 73.1%.
9. Researchers G. Evans and E. Kantrowitz explored the health consequences for exposure
to many different environmental risks in the journal article “Socioeconomic Status and
Health: The Potential Role of Environmental Risk Exposure” (Annual Review of Public
Health, Vol. 23, No. 1, pp. 303-331). According to research, 65% of preschool children
living in poverty have been exposed to cigarette smoke at home. If 200 preschool children
living in poverty are selected at random, what is the probability that at least 125 have
been exposed to cigarette smoke at home?
Solution: We have n = 200 and p = 0.65, therefore
np = 200 · 0.65 = 130 and n(1 − p) = 200 · 0.35 = 70
Both np and n(1 − p) are greater than 5, so we can approximate the binomial distribution
by normal. We get
p
√
µ = np = 130 and σ = np(1 − p) = 200 · 0.65 · 0.35 ≈ 6.745
To make the correction for continuity, we subtract 0.5 from 125. Thus we need to find
the area under the normal curve with parameters µ = 130 and σ = 6.745 that lies to the
right of 124.5. We have
124.5 − 130
≈ −0.82
z=
6.745
therefore the probability is about 0.2939 + 0.5 = 0.7939 that least 125 have been exposed
to cigarette smoke at home.
Info: The average in the class for this problem was 42.9%.
10. A saleswoman has been offered a new job with a fixed salary of $290. Her records from
her present job show that her weekly commissions have the following probabilities:
Commission
Probability
0
$100 $200 $300 $400
0.05 0.15 0.25 0.45 0.1
Should she change her present job?
Solution: The expected value from her present job is
(0 · $0.05) + (0.15 · $100) + (0.25 · $200) + (0.45 · $300) + (0.1 · $400) = $240
Since $290 > $240, she should change her present job.
Info: The average in the class for this problem was 87.6%.
11. Data on investments in the high-tech industry by venture capitalists are compiled by
VentureOne Corporation and published in America’s Network Telecom Investor Supplement. A random sample of 18 venturecapital investments in the fiber optics business
sector yielded the following data, in millions of dollars.
5.60 6.27 5.96 10.51 2.04 5.48
5.74 5.58 4.13 8.63 5.95 6.67
4.21 7.71 9.21 4.98 8.64 6.66
Determine a 95% confidence interval for the mean amount, µ, of all venture-capital investments in the fiber optics business sector. Assume that the population standard deviation
is $2.04 million. (Note: The sum of the data is $113.97 million.)
Solution: From Table I, zα/2 = z0.05/2 = z0.025 = 1.96. The confidence interval for µ is
from
σ
σ
x − zα/2 · √
to x + zα/2 · √
n
n
We know σ = 2.04, n = 18, and zα/2 = 1.96. To compute x for the data in the Table
above, we apply the usual formula:
X
xi
113.97
=
≈ 6.332
x=
n
18
to one decimal place. Consequently, a 95% confidence interval for µ is from
2.04
2.04
6.332 − 1.96 · √
to 6.332 + 1.96 · √
18
18
or $5.389 million to $7.274 million.
Info: The average in the class for this problem was 76.7%.
12. The scores achieved by students in America make the news often, and all kinds of conclusions are drawn based on these scores. The ACT Assessment is designed to assess high
school students’ general educational development and their ability to complete collegelevel work. One of the categories tested is Science Reasoning. The mean ACT test score
for all high school graduates in 2008 in Science Reasoning was 20.8 with a standard deviation of 4.6. According to Chebyshev’s theorem, at least what percent of high school
graduates’ ACT scores in Science Reasoning were between 11.6 and 30.0?
Solution: Since
it follows that
30.0 − 20.8 = 20.8 − 11.6 = 9.2
k=
9.2
=2
4.6
therefore at least
3
1
= = 75%
2
2
4
of high school graduates’ ACT scores in Science Reasoning were between 11.6 and 30.0.
1−
Info: The average in the class for this problem was 78.7%.
13. A variable is normally distributed with mean 6 and standard deviation 2. Find the
percentage of all possible values of the variable that
(a) lie between 1 and 7.
Solution: We have
z1 =
1−6
7−6
= −2.5 and z2 =
= 0.5
2
2
therefore the probability is 0.4938 + 0.1915 = 0.6853, so the percentage is 68.53%.
(b) exceed 5.
Solution: We have
5−6
= −0.5
2
therefore the probability is 0.1915 + 0.5 = 0.6915, so the percentage is 69.15%.
z=
(c) are less than 4.
Solution: We have
4−6
= −1
2
therefore the probability is 0.5 − 0.3413 = 0.1587, so the percentage is 15.87%.
z=
Info: The average in the class for this problem was 75.6%.
14. If ice cream sundaes come in 5 flavors with 4 possible toppings, how many different
sundaes can be made with one flavor of ice cream and one topping?
Solution: By the Basic Principle of Counting, the number of different sundaes is 5·4 = 20.
Info: The average in the class for this problem was 96%.
15. Scores on the SAT test have a mean of 1518 and a standard deviation of 325. Scores on
the ACT test have a mean of 21.1 and a standard deviation of 4.8. Which is relatively
better: a score of 1840 on the SAT test or a score of 26.0 on the ACT test? Why?
Solution: We have
z1 =
1840 − 1518
26.0 − 21.1
≈ 0.99 and z2 =
≈ 1.02
325
4.8
The score of 26.0 is relatively better because it has the larger z score.
Info: The average in the class for this problem was 88.9%.
16. A math teacher gave her class two tests. 25% of the class passed both tests and 42% of
the class passed the first test. What percent of those who passed the first test also passed
the second test?
Solution: 0.25/0.42=0.6
Info: The average in the class for this problem was 79.8%.
17. In how many ways can the letters of the word “statistics” be arranged?
Solution: We have
10!
= 50400
3!3!2!
Info: The average in the class for this problem was 80.2%.