Download Practice Exam Questions

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Bootstrapping (statistics) wikipedia , lookup

History of statistics wikipedia , lookup

Regression toward the mean wikipedia , lookup

Categorical variable wikipedia , lookup

Transcript
Solutions To Course Guide for Test #1: Statistics 1024
_____________________________________________________________________________
Practice Exam Questions
_____________________________________________________________________________
A1. To display the distribution of grades on a statistics test (A, B, C, D, F), it would be correct
to use:
a) a bar graph
b) a bar graph or a pie chart
c) a histogram
d) a stemplot
Solution:
The answer is b) because both pie charts and bar graphs can be used for categorical data.
A2. A study of 5000 university graduates recorded the sex of the graduates, the total university
debt and the average annual salary of the university graduates. Which of the following is true?
a) sex, university debt and annual salary are all categorical variables
b) sex, university debt and annual salary are all quantitative variables
c) sex is a categorical variable and university debt and annual salary are quantitative variables
d) sex is a quantitative variable and university debt and annual salary are categorical variables
Solution:
The answer is c).
A3. The following marks were obtained on a mathematics quiz: 34, 56, 78, 89, 23, 66, 65, 77,
99
To make a stemplot of these data, you would use stems:
a) 10, 20, 30, 40, 50, 60, 70, 80, 90
b) 0, 1, 2, 3, 4, 5, 6, 7, 8, 9
c) 20, 30, 50, 60, 70, 80, 90
d) 2, 3, 5, 6, 7, 8, 9
Solution:
You must include all stems, even if there are no data in them
a) can not be correct because 20 on the left of the line and 3 on the right, would mean 203 and
not 23 as required.
The answer is b).
A4. The following stemplot represents the marks of some statistics students on the first test.
The stems are whole percents in tens and the leaves are percents in ones. What is the outlier?
3
4
5
6
7
8
1
246
345
136
2468
a) 3.1%
b) 31%
c) 4%
d) 40%
Solution:
The answer is b).
A5. An M&M’s chocolate fan is interested in studying the colour distribution of the sugar
coating of the chocolate candies. He opens a bag of M&M’s chocolate candies and classifies
the candies according to the colour of the coating. Which of the following are true?
I) The colour of the sugar coating is a categorical variable
II) A bar graph can be used to display the distribution of the colour variable
III) A pie graph can be used to display the distribution of the colour variable
a) I
b) II
c) III
d) all of I,II, III
e) II and III
Solution:
The answer is d).
A6. Environmental Protection Agency regulations require automakers to give the city and
highway gas mileages for their cars. Here are the highway mileages (miles per gallon, MPG)
for 15 car makes.
Make
MPG Make
MPG Make
MPG
BMW
Buick
Chevrolet
Chrysler
Dodge
23
29
27
27
27
Ford
Honda
Hyundai
Infiniti
Lexus
27
28
27
22
23
Mazda
Mercedes-Benz
Nissan
Rolls-Royce
Toyota
30
22
26
15
28
Which of the following graph(s) is (are) appropriate for displaying the distribution for
the gas mileage variable? Check the most appropriate answer.
(i) bar chart and histogram
(ii) bar chart and boxplot
(iii) stemplot and histogram
(iv) pie chart
Solution:
The answer is (iii).
A7. Harris recently installed a spam filter software, but he still saw spam emails in his inbox.
He made a daily record of the number of spam emails that were delivered to his inbox over the
past 20 days. The following is a frequency histogram for his data. The frequency refers to the
number of days.
Histogram of spam email data
6
frequency
5
4
3
2
1
0
20
40
60
80
100
120
# spam emails
Harris also plotted a stemplot for the data. Which of the following is a correct
stemplot for his data? Check only one answer.
(i) Stemplot A
(ii) Stemplot B
(iii) Stemplot C
A. 2|011355
3|01467
4|12479
5|56
6|
7|
8|0
9|
10|5
B. 2|011355
3|01467
4|12479
5|56
8|0
10|5
Solution:
The answer is (i), Stemplot A.
C. 1|05
2|011355
3|01467
4|12479
5|56
6|
7|
8|0
A8. A consumer recorded the price of a loaf of “DiBaggio” Italian bread at her local
supermarket each week in 2007. The data are presented in a histogram below:
Weekly Price of “DiBaggio”loaf, 2007
12
Frequency
10
8
6
4
2
0
1.76
1.92
2.08
2.24
Price ($)
The distribution displayed above would be described as bimodal. True or false?
Solution:
False. There is only one peak.
A9. Given the following pie graph, about what percent of the people travel by car?
Solution:
Cars
Therefore, 52% travel by car.
52%
A10. A survey was conducted in an undergraduate mathematics course. The students were
asked their:
i) age (in years)
ii) height (in metres)
iii) present major (Math, Chemistry, Physics, Sociology, Psychology, Music, Business, Art,
Other
iv) amount of student loans at the present time (in dollars)
v) present annual salary (in dollars)
vi) plans after graduation (Masters, Work, Undecided)
Which of the above are categorical variables and which are quantitative variables?
Solution:
Age, height, amount of student loans and present annual salary are quantitative variables.
Present major and plans after graduation are categorical variables.
A11. The marks in a statistics course containing 10 students are given below:
5
6
7
8
9
46
2358
14
3
2
Which of the following is true?
a) the distribution is right-skewed
b) the lowest mark is 54 %
c) the second highest mark is 83%
d) all of the above
Solution:
The answer is d).
Practice Exam Questions
B1. Calculate the mean, median and mode of the following marks:
34, 44, 67, 98, 78, 56, 66, 78, 88, 67, 50
Solution:
Mode- occurs most often...Therefore, 67 and 78 (bi-modal)
Median- Write numbers in ascending order and take the middle # which is 67.
34, 44, 50, 56, 66, 67, 67, 78, 78, 88, 98
Mean=
⋯
66
B2. Benjamin has four tests all worth 25% each. Benjamin received three marks on tests: 70,
80, and 90. If Benjamin receives an 80 on the fourth test, which of the following is true about
his average mark?
a) the median mark will change, the mean mark will stay the same
b) the mean mark will change, the median mark will stay the same
c) the mean mark and the median mark will both increase
d) the mean mark will increase, and the median mark will stay the same
e) both the median and the mean remain the same
Solution:
Originally, the numbers are 70, 80, 90
Median= 80 (middle # when written in ascending order)
Mean=
80
After adding another mark of 80... the numbers are 70, 80, 80, 90
Median=
Mean=
The answer is e).
80
80
B3. Samuel had four tests in a mathematics course. He scored 80, 75 and 90 on three of the
tests and his average on all four tests was 78.75. What was the mark on the fourth test?
Solution:
80
75
90
78.75
4
80+75+90+x = 315
x = 315 - 90 - 75 - 80
x=70
Therefore, the mark on the fourth test was 70.
B4. A consumer recorded the price of a loaf of “DiBaggio” Italian bread at her local
supermarket each week in 2007. The data are presented in a histogram below:
Weekly Price of “DiBaggio”loaf, 2007
12
Frequency
10
8
6
4
2
0
1.76
1.92
2.08
2.24
Price ($)
Which of the following is the most likely value of the mean of the prices recorded?
(i) $1.80
(ii) $1.90
(iii) $2.00
(iv) $2.10
(v) $2.20
Solution:
Let X be the price of a loaf. Then the mean of X is
  E( X ) 
 xp( x)
x
 1.68 
1
52
 1.76  521  1.84  527  1.92  10
 2.00  11
 2.08  529
52
52
 2.16  526  2.24  525  2.40  521  2.48  521 
The answer is (iii).
B5. Given the data below, what is the median and the mean?
105.28
52
 2.0246 .
5
6
7
8
9
46
2358
14
3
2
Solution:
The numbers represented by the stemplot are 54, 56, 62, 63, 65, 68, 71, 74, 83, 92
Median=
66.5
Mean=
B6.
⋯
68.8
Which of the following statements is true?
(i) When the distribution is skewed to the left, mean > median > mode.
(ii) When the distribution is skewed to the right, mean < median < mode.
(iii) When the distribution is symmetric and unimodal, mean = median = mode.
(iv) When the distribution is symmetric and bimodal, mean = median = mode.
Solution:
The answer is (iii). Think of the Normal distribution as an example.
B7. Find the mean and median given the following stemplot.
3
4
5
6
7
8
1
246
345
136
2468
Solution:
In ascending order, the numbers are 31, 52, 54, 56, 63, 64, 65, 71, 73, 76, 82, 84, 86, 88
68
Median=
Mean=
⋯
67.5
_____________________________________________________________________________
Practice Exam Questions
_____________________________________________________________________________
C1. Which of the following is false?
a) the standard deviation is resistant measure
b) the median is a resistant measure
c) the quartiles are resistant measures
d) the five number summary is a resistant measure
Solution:
a) is false.
C2. The mean of a sample is zero. Which of the following must be true?
(Choose all that apply.)
(i) The sample standard deviation is zero.
(ii) The sample variance must be nonzero.
(iii) The IQR is zero.
(iv) The sample total is zero.
(v) The sample median is zero.
Solution:
Since x 
1 n
 xi , if x  0 then
n i 1
n
x
i 1
i
 0 . The answer is (iv).
C3. If the observations in a data set are all equal, then
(i) the variance of the data set equals 0.
(ii) the mean of the data set equals 0.
(iii) the IQR of the data set equals 0.
(iv) both (i) and (iii).
(v) both (ii) and (iii).
Solution:
IQR= Q3 - Q1 = 0 since all data are the same number
Mean = whatever # all of the data are
Variance = 0
There is no dispersion in the data. The answer is (iv).
C4. Environmental Protection Agency regulations require automakers to give the city and
highway gas mileages for their cars. Here are the highway mileages (miles per gallon, MPG)
for 15 car makes.
Make
MPG Make
MPG Make
MPG
BMW
23
Ford
27
Mazda
30
Buick
29
Honda
28
Mercedes-Benz
22
Chevrolet
27
Hyundai
27
Nissan
26
Chrysler
27
Infiniti
22
Rolls-Royce
15
Dodge
27
Lexus
23
Toyota
28
Sketch a boxplot for the highway mileage data (the observations have been arranged
in ascending order for you). Use the scale provided when you draw the plot. Show
your calculations below the plot.
15, 22, 22, 23, 23, 26, 27, 27, 27, 27, 27, 28, 28, 30, 30
mileage per gallon
13
15
17
19
21
23
25
27
29
31
27
29
31
Solution:
mileage per gallon
13
15
17
19
21
min
23
Q1
25
Q2 Q3
max
min  15 , max  30 ;
The number of observations is n  15 (odd), so the median is the
.
15 1
2
 8 th observation  27
Excluding the median, the upper half of the data is: 27, 27, 27, 28, 28, 30, 30;
Q3  median of the above list  28 .
Excluding the median, the lower half of the data is: 15, 22, 22, 23, 23, 26, 27;
Q1  median of the above list  23 .
IQR  Q3  Q1  28  23  5 ;
Q3  1.5  IQR  35.5 ;
Q1  1.5  IQR  15.5 .
The observation 15 is below 15.5, so it is an outlier.
The second smallest observation is 22, and it is not flagged by the outlier rule.
C5. Harris recently installed a spam filter software, but he still saw spam emails in his inbox.
He made a daily record of the number of spam emails that were delivered to his inbox
over the past 20 days. The following is a frequency histogram for his data. The
frequency refers to the number of days.
Histogram of spam email data
6
frequency
5
4
3
2
1
0
20
40
60
80
100
120
# spam emails
(a)
Which of the following is a correct statement about the distribution of the spam
email data? Check only one answer.
(i) The distribution is roughly symmetric, and the mean is about the same as the
median.
(ii) The distribution is skewed, and the mean is larger than the median.
(iii) The distribution is skewed, and the mean is smaller than the median.
Solution:
(ii) The distribution is skewed to the right, so the mean is larger than the median.
(b) What is the percentage of days over the past 20 days that Harris received fewer than
50 spam emails? Check only one answer.
(i) 16%
(ii) 19%
(iii) 50%
(iv) 80%
Solution:
16
 100%  80% . The answer is (iv).
20
(c) What is the third quartile of the number of spam emails? Use the stemplot you have
chosen in part (a) to answer this question. Check only one answer.
(i) 23
(ii) 25
(iii) 47
(iv) 48
Solution:
The upper half of the data set that is above the median consists of the data points
37, 41, 42, 44, 47, 49, 55, 56, 80, 105.
The third quartile is the median of the above list: Q3 
47  49
 48 . The answer is (iv).
2
(d)
Identify any outliers in the data set using the stemplot you have chosen in part (a). It is
given to you that the IQR is 23.
Solution:
There are no outliers in the lower end (the distribution does not have a long left tail).
Q3  1.5 IQR  48  1.5  23  82.5 .
Since 105  82.5 , the data point 105 is an outlier, the only outlier in the data set.
(e) Which of the following pairs of summary statistics best describe the centre and the
spread of the number of spam emails received daily? Check only one answer and
explain briefly.
(i) mean and standard deviation
(ii) mean and IQR
(iii) median and IQR
(iv) median and variance
Solution:
Both the median and IQR are insensitive to outliers, so in the presence of outliers (105), we
should report these two summary statistics. The answer is (iii).
C6. Calculate the mean, median, range, IQR, and sample standard deviation of the following
data set. Find any outliers, if they exist.
24, 46, 49, 51, 64, 64, 67, 81, 88, 89, 97, 103
Solution:
If we look, we can see that the data is in order. There are 12 data points, so the median is
the average of the 6th and 7th data points. Q1 is in between the 3rd and 4th data points, and Q3
is between the 9th and 10th data points.
Q1  50
median  65.5
Q3  88.5
Range  103  24  79
IQR  88.5  50  38.5
We can use the formulas above to calculate the mean and sample standard deviation:
x 
s 
24  46  97  103
 68.58
12
(24  68.58) 2  (46  68.58) 2  (97  68.58) 2  (103  68.58) 2
 23.69
11
We now check for outliers:
Q3  1.5 IQR  88.5  1.5 (38.5)  146.25
Q1  1.5 IQR  50  1.5 (38.5)   7.75
Since no data point is above 146.25 or below -7.75, there are no outliers in the data set.
C7.
A very hardworking chemist performs a chemical reaction 36 times, but he does not
seem to be very satisfied with the outcome (so he might do more!). He weighs the yield of the
reaction every time and sends the weight data to his statistician friend. This friend uses a
stemplot to display the weights (measured in grams) from the 36 reactions.
15
16
17
18
19
20
21
22
23
24
5
014
123559
11456899
0245667899
00137
2445
(a) Describe the shape of the distribution of the weights of the chemical yields.
Solution:
Unimodal (one peak).
Asymmetric – skewed to the left.
Has a suspected outlier ( 155 g ).
(b) The chemist later realizes that he has made a mistake when entering the data. The
smallest yield of his 36 reactions should be 195 grams instead of 155 grams. Based
on the wrong data, his statistician friend calculates the mean weight to be 218 grams.
(i) Find the correct value of the mean weight.
Solution:
With the wrong data:
With the correct data:
155  190  191   244  245
 218
36
195  190  191   244  245
x 
36
x 

195 155  190  191   244  245 155


36
36
36

195
155
 218 
 219.11 grams
36
36
(ii) How will the values of the following summary statistics change after the data
correction is made?
Choose your answer from the following list:
I. The value becomes smaller after the correction is made.
II. The value becomes larger after the correction is made.
III. There is no change in the value after the correction is made.
Summary Statistics
Your answer is (I, II or III?)
median
______________________
IQR
______________________
standard deviation
______________________
75th percentile
______________________
Solution:
Summary Statistics
median
III
IQR
III
standard deviation
I
th
75 percentile
III
C8. Calculate the mean, median, mode, range, standard deviation and variance of the following
marks: 50, 65, 70, 55, 80, 95, 35, 65, 80
Solution:
Write the numbers in ascending order: n=9
35, 50, 55, 65, 65, 70, 80, 80, 95
Mode=65
Median=middle # = 5th number = 65
Mean=
⋯
66
Range= max - min= 95 - 35 = 60
Make a chart to find the standard deviation:
̅ 2
̅
35-66
961
50-66
256
55-66
121
65-66
1
65-66
1
70-66
16
80-66
196
80-66
196
95-66
841
2589
8
̅
1
17.99
C9. Find the five-number summary and draw a box-plot for the following list of marks:
55, 76, 88, 56, 77, 89, 66
Solution:
55, 56, 66, 76, 77, 88
Min= 55
Max= 88
Q1= 56
Q3=77
Median=
71
C10. If a distribution is skewed to the right, which of the following is true?
a) the mean is equal to the median
b) the mean is less than the median
c) the mean is greater than the median
d) the mean and the mode are equal
Solution:
The answer is c).
C11. What percentage of the observations in a distribution lie between the first quartile and the
third quartile?
a) 25%
b) 50%
c) 75%
d) 100%
Solution:
The answer is b).
C12. What percentage of the observations in a distribution lie before the first quartile?
a) 25%
b) 50%
c) 75%
d) 100%
Solution:
The answer is a).
C13. Which of the following represent possible values of the standard deviation for a set of
data?
a) 0,1.5,2.3
b) -1, 0,1, 2
c) -2, -1, 0, 1, 2
d) -1, 0 , 1.2
Solution:
The standard deviation is always greater or equal to 0.
The answer is a).
C14. Find the interquartile range for the following data and determine if there is a suspected
outlier.
30, 59, 66, 56, 67, 62, 78, 81, 82
Solution:
30, 56, 59, 62, 66, 67, 78, 81, 82
IQR= Q3 - Q1
First, we need the median and then the 1st and 3rd quartiles
Median=66
Q1=middle of 56 and 59=
Q3=
=57.5
79.5
IQR=Q3-Q1=79.5 - 57.5=22
To find outliers
Q3 + 1.5 IQR= 79.5 + 1.5(22) = 112.5
Q1 + 1.5IQR= 57.5 - 1.5(22)= 24.5
An outlier is any number above 112.5 or below 24.5.
Therefore, there are no outliers.
C15. Find the standard deviation and the variance for the following set of data of temperatures
during the summer in London, Ontario:
34, 32, 26, 28, 27, 40, 23, 21, 34, 37, 38, 41
Solution:
21, 23, 26, 27, 28, 32, 34, 34, 37, 38, 40, 41
̅
21
23 ⋯
12
41
31.75
Make a chart to find the standard deviation.
̅
21-31.75
23-31.75
26-31.75
27-31.75
28-31.75
32-31.75
34-31.75
34-31.75
37-31.75
38-31.75
40-31.75
41-31.75
478.16
11
The variance is
̅
115.56
76.56
33.06
22.56
14.06
0.06
5.06
5.06
27.56
39.06
68.06
85.56
6.6
43.5
_____________________________________________________________________________
Practice Exam Questions
_____________________________________________________________________________
D1. For the standard normal random variable Z, find the value of Pr[Z<1.8].
A. 0.0359
B. 0.9641
C. 0.8599
D. 0.1401
E. 0.7881
Solution:
Pr (Z<1.8)=0.9641
The answer is b).
D2. For the standard normal random variable Z, find the value of Pr[Z>-0.60].
A.0.2743
B. 0.7257
C. 0.6
D. 0.5239
E. 0.4761
Solution:
same area as
Pr(Z> - 0.60) = Pr(Z<0.60) = 0.7257
The answer is b).
D3. For the standard normal random variable Z, find the value of
Pr[-1.10<Z<1.10].
A. 0.7286
B. 1.10
C. 0.1357
Solution:
Pr (-1.10<Z<1.10)= Pr(Z<1.1) - Pr(Z<-1.1)
=0.8643 - 0.1357
=0.7286
The answer is a).
D. 0.8643
E. 0.8438
D4. Consider the standard normal random variable Z. Which of the following statements is
false?
A. The mean of Z is   0
B. The standard deviation of Z is
 1
C. For any value of k, Pr[Z>-k]=1 -Pr[Z<K]
D. For any value of k, Pr[Z<-k]=1 - Pr[Z>k]
E. Pr[Z<1.00]=0.8413
Solution:
The answer is c).
D5. For the standard normal random variable Z, what is the value of Pr[Z>1.55]?
A.0.9332
B. 0.0668
C. 0.5596
D. 0.0606
E. 0.9394
Solution:
Pr (Z> 1.55)= 1 - Pr(Z< 1.55)
= 1 - 0.9394
=0.0606
The answer is d).
D6. If Z is the standard normal random variable, find Pr[-0.9 < Z < 0.9].
A. 0.8159
B. 0.1841
C. 0.5359
Solution:
Pr (-0.9<Z<0.9)= Pr(Z<0.9) - Pr(Z<-0.9)
=0.8159 - 0.1841
=0.6318
The answer is d).
D. 0.6318
E. 0.4641
D7. Find the value of k if it is known that Pr[k<Z<2.1]= 0.0489 , where Z is the standard normal
random variable.
A. 1.5
B. 0.0489
C. 0.9511
D. 0.9821
E. -1.5
Solution:
Pr(Z<2.1)= 0.9821
o.9821 - 0.0489 = 0.9332
Look up area 0.9332 and you get k=1.5
The answer is a).
D8. Use the table for the standard normal random variable Z to find
Pr[-0.67<Z<1.81].
A. 0.2155
B. 0.7135
C. 0.2163
Solution:
Pr (-0.67<Z<1.81)=Pr(Z<1.81) - Pr(Z<-0.67)
=0.9649 - 0.2514
= 0.7135
The answer is b).
D. 0.9649
E. 0.7486
D9. Use the table for the standard normal random variable Z to find a value of k for which
Pr[Z<k]= 0.0968.
A. -1.85
B. 1.85
C. -1.86
D. -1.3
E. 1.3
Solution:
Pr(Z<k)= 0.0968
Look up on the table and k is negative....k=-1.3
The answer is d).
_____________________________________________________________________________
Practice Exam Questions
_____________________________________________________________________________
D10. X is a normal random variable with mean
 =100 and standard deviation  =10. Find
Pr[X>90].
A. 0.8413
B. 0.1587
Solution:
90
100
10
1
Pr(X>90)=Pr(Z>-1)= Pr(Z<1)=0.8413
C. 0.8289
D. 0.8531
E. 0.1469
D11. X is a normal random variable with mean
 =10 and standard deviation  =2. Find
Pr[12<X<15].
A. 1.5
B. 0.9938
C. 0.8413
D. 0.9332
E. 0.1525
Solution:
Pr(12<X<15)=Pr
=Pr(1<Z<2.5)
=Pr(Z<2.5) - Pr(Z<1)
=0.9938 - 0.8413
=0.1525
The answer is e).
D12. The amount of cola dispensed by an automated drink dispenser is normally distributed
with mean  =500 ml and standard deviation  =10ml. What is the probability that when you
buy a cola from this machine you will receive less than 480ml?
Solution:
Pr(X<480)=Pr(Z<
)=Pr(Z<-2)=0.0228
D13. X is a normal random variable with mean   25 and standard deviation
the Pr[30 < X < 40].
Solution:
 5 . Find
Pr 1
Pr(30<X<40)=Pr(
3
=Pr(Z<3) - Pr(Z<1)
=0.9987 - 0.8413
=0.1574
D14. The scores on a national achievement test are Normally distributed with mean 500 and
standard deviation 100. What percentage of those who took the test had a score greater
than 630?
Solution:
Let X be the test score. Then X ~ N(  , )
with   500 ,   100 . So
X   630  500 

Pr( X  630)  Pr  Z 



100 

 Pr( Z  1.3)  1  Pr( Z  1.3)  1  0.9032  0.0968 .
D15. The IQ’s of a large population of children are Normally distributed with mean 100.4 and
standard deviation 11.6.
(a) What percentage of the children have IQ’s greater than 125?
Solution:
Let X be the child’s IQ. Then X ~ N(  , )
with   100.4 ,   11.6 . So
X   125  100.4 

Pr( X  125)  Pr  Z 



11.6


 Pr( Z  2.12)  1  Pr( Z  2.12)  1  0.9830  0.0170 .
(b) About 90% of the children have IQ’s greater than what value?
Solution:
Pr( Z  z )  0.90 
z 
x


z   1.28 .
x    z    1.28
So x  100.4  1.28  (11.6)  85.6 .
Thus, 90% of the children have IQ’s greater than 85.6.
D16. The length of human pregnancies from conception to birth varies according to a
distribution that is approximately Normal with mean 266 days and standard deviation 16
days.
(a) What percentage of pregnancies last less than 240 days?
Solution:
Let X be the length of the pregnancy in days. Then X ~ N(  , ) with   266 ,   16 .
X   240  266 

Pr( X  240)  Pr  Z 

  Pr( Z  1.625)  0.0526 .
16



So
(b) What percentage of pregnancies last between 240 and 270 days?
Solution:
270  266 
 240  266
Pr(240  X  270)  Pr 
Z 
  Pr(1.625  Z  0.25)
16
16


 Pr( Z  0.25)  Pr( Z  1.625)  0.5987  0.0521  0.5466 .
(c) How long do the longest 20% of pregnancies last?
Solution:
Pr( Z  z )  0.20 
z 
x


z  0.84 .
x    z    0.84
 266  0.84  (16)  279.44 .
Therefore, the longest 20% of pregnancies last more than 279 days.
(d) 95% of all pregnancies last between x=a and x=b days. Find a and b.
Solution:
Pr(a<X<b)=0.95
Pr(Z< k)= 1-0.025 = 0.975
Look up on the table for Z and k=1.96
266
16
1.96
X=297
AND
1.96
266
16
X=234
Therefore, 95% of all pregnancies last between 234 and 297 days.
D17. A set of final examination grades in an introductory statistics course was found to be
Normally distributed with a mean of 73 and a standard deviation of 8.
(a) What is the probability of getting a 91 or less on the exam?
Solution:
Let X be the final grade. Then X ~ N(  , ) with   73 ,   8 . Then
X   91  73 

Pr( X  91)  Pr  Z 

  Pr( Z  2.25)  0.9878 .
8 


(b) What percentage of students scored between 65 and 89?
Solution:
89  73 
 65  73
Pr(65  X  89)  Pr 
Z 
  Pr(1  Z  2)
8 
 8
 Pr( Z  2)  Pr( Z  1)  0.9772  0.1587  0.8185 .
c) Only 5% of the students taking the test scored higher than what grade?
Solution:
Pr( Z  z )  0.05  z  1.645 .
z 
x


x    z    1.645
So x  73  1.645  (8)  86.16 .
Therefore, 5% of the students scored higher than 86%.
(d) If the professor grades on a curve (i.e. gives A’s to the top 10% of the class regardless
of the score), are you better off with a grade of 81 on this exam or a grade of 68 on a
different exam, where the mean is 62 and the standard deviation is 3?
Solution:
Let X be the score on the first exam and let Y be the score on the second exam. Then
X ~ N(  X , X ) with  X  73 ,  X  8 , and Y ~ N( Y , Y ) with Y  62 ,  Y  3 . So

X   X 81  73 

Pr( X  81)  Pr  Z 

8

X



Y  Y 68  62 

Pr(Y  68)  Pr  Z 

3

Y


 Pr( Z  1)  1  Pr( Z  1)
 Pr( Z  2)  1  Pr( Z  2)
 1  0.8413  0.1587
 1  0.9772  0.0228 .
It would be better to get the grade of 68% on the second exam.
D18. The weight of adult male rhesus monkeys is Normally distributed, with a mean of 15
pounds and a standard deviation of 3 pounds. A rhesus monkey is randomly selected.
(a) Find the probability that the monkey’s weight is less than 13 pounds.
Solution:
Let X be the rhesus monkey’s weight in pounds. Then X ~ N(  , ) with   15 ,   3 .
So
X   13  15 

Pr( X  13)  Pr  Z 

  Pr( Z  0.667)  0.2525 .
3 


(b) Find the probability that the weight is between 13 and 17 pounds.
Solution:
17  15 
 13  15
Pr(13  X  17)  Pr 
Z 
  Pr(0.667  Z  0.667)
3 
 3
 Pr(Z  0.667)  Pr(Z  0.667)  0.7475  0.2525  0.4950 .
(c)
Find the probability that the monkey’s weight is more than 17 pounds.
Solution:
17  15 

Pr( X  17)  Pr  Z 
  Pr( Z  0.667)  1  Pr( Z  0.667)
3 

 1  0.7475  0.2525 .
(d) If 50 rhesus monkeys are randomly selected, about how many would you expect to
weigh less than 12 pounds.
Solution:
12  15 

Pr( X  12)  Pr  Z 
  Pr( Z  1)  0.1587 .
3 

We would expect about 16% of the 50 monkeys to weigh less than 12 pounds, i.e. we would
expect about 8 monkeys to weigh less than 12 pounds.
(e) What is the probability of getting exactly 10 of the 50 monkeys (chosen at random)
that weigh less than 12 pounds?
Solution:
Let Y be the number of monkeys out of 50 that weigh less than 12 pounds. Then
Y ~ Bin(n  50 , p) , where p  Pr( X  12)  0.1587 .
n
 50 
So p( y )  Pr(Y  y )    p y (1  p ) n  y    (0.1587) y (0.8413) 50  y .
 y
y
Therefore, the probability of getting exactly 10 of the 50 monkeys that will weigh less than
12 pounds is
 50 
Pr(Y  10)  p(10)    (0.1587)10 (0.8413) 40  0.1036 .
 10 
D19. The time spent (in days) waiting for a heart transplant for patients with type A+ blood can
be approximated by a Normal distribution with a mean of 127 days and a standard
deviation of 23.5 days.
(a) What is the shortest time spent waiting for a heart transplant that would still place a
patient in the top 30% of waiting times?
Solution:
Let X be the waiting time (in days). Then
X ~ N(  , ) with   127 ,   23.5 .
So Pr( Z  z )  0.30 
z 
x


z  0.52 .
x    z    0.52
 127  0.52  (23.5)  139.2 .
Therefore, 30% of the patients must wait for more than 139 days for a heart transplant.
(b) What is the longest time spent waiting for a heart transplant that would still place a
patient in the bottom 10% of waiting times?
Solution:
Pr( Z  z )  0.10  z   1.28 .
z 
x


x    z    1.28
 127  1.28  (23.5)  96.9 .
Therefore, 10% of the patients have to wait for less than 97 days for a heart transplant.
D20. A statistics test has scores that follow a normal distribution with a mean score of 80 and a
standard deviation of 5.
a) What percent of students scored greater than 90?
b) What percent of students scored below 75?
c) If a students score has a z-score of 0.9772, what is the students score on the statistics test?
Solution:
a) Pr(X>90)= Pr
b) Pr(X<75)=Pr
Pr
2
)=Pr(Z<-1)=0.1587
c) Pr(Z<k)=0.9772
Look up the area 0.9772 on the table and k=2.
2
80
5
10 = X - 80
X=90
1
Pr
2
1
0.9772
0.0228
Therefore, the student scored 90.
D21. The monthly income of pharmacists at a particular pharmacy is normally distributed with
a mean of $3700 and a standard deviation of $400. What percentage of pharmacists earn less
than $3000 per month?
Solution:
3700
400
Pr(X<3000)=Pr(Z<
Pr
1.75
0.0401
D22. The scores on a university finite examination are normally distributed with a mean of 65
and a standard deviation of 10. How high must a student score to place in the top 5% of all
students taking this exam?
Solution:
65
10
Pr(Z<k)=0.95
Look up the area 0.95 on the table and k=1.645
1.645
65
10
X=81.5
Therefore, a student must score 81.5
D23. To tell the exact shape of a Normal distribution, you need:
a) the five number summary
b) the mean, median and mode
c) the mean and the median
d) the mean and the standard deviation
Solution:
The answer is d).
D24. The scores of adults on an IQ test are approximately Normal with a mean of 100 and a
standard deviation of 15. What percent of adults are considered gifted, with an IQ of 130 or
higher? What percent of adults have a score between 110 and 120?
Solution:
100
15
Pr(X>130)=Pr(Z>
Pr(110<X<120)=Pr(
=Pr(Z<1.33) - Pr(Z<0.67)
=0.9082 - 0.7486
=0.1596
Pr
2
0.9772
Pr 0.67
1.33
D25. The scores of adults on an IQ test are approximately Normal with a mean of 100 and a
standard deviation of 15. Helen scores 120 on this test. This means she scores higher than
what percent of all adults?
Solution:
100
15
120
100
15
1.33
Pr(Z<1.33)=0.9082
Therefore, she scores higher than 90.8% of all adults.
D26. Height of women aged 20 to 29 are approximately normal with mean 64 inches and
standard deviation 2.7 inches. Find the z-score for a woman who is 5 feet and 10 inches.
Solution:
Her height is 70 inches.
64
2.7
70 64
2.7
The Z-scores is 2.22
2.22