Download Duffy Template

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Chapter 03 - Descriptive Statistics: Numerical Methods
CHAPTER 3—Descriptive Statistics: Numerical Methods
3.1
3.2
a.
Population parameter: number calculated using the population measurements that
describe some aspect of the population.
Point estimate: a one-number estimate of the value of a population parameter.
b.
The sample mean is the point estimate of the population mean.
a.
Equal or roughly equal.
b.
Mean is less than the median.
c.
Mean is greater than the median.
3.3
a
b
N
10
9
MEAN
9.600
103.33
MEDIAN
10.000
100.00
MODE
10.00
90.00
a
b
N
10
7
MEAN
20
503
MEDIAN
20
501
MODE
20
501
3.4
3.5
3.6
3.7
3.8
a.
Sample mean = 42.954. Yes, because x  42 .
b.
The sample median is 43.0
The mean of 42.954 is slightly less than the median of 43.
The histogram is slightly skewed to the left so the mean should be less than the median.
a.
Sample mean = 5.46. Yes, because x  6 .
b.
Sample median = 5.25
The mean is slightly larger than the median. The histogram is somewhat skewed right
which is consistent with the mean being larger than the median.
a.
Sample mean = 50.575. Yes, because x  50.
b.
Sample median = 50.65. The mean and median are very similar which is consistent with
the histogram being close to symmetric.
Mean = 99 and the median = 40. Median is better measure of typical because of the high
outlier (350).
3-1
Chapter 03 - Descriptive Statistics: Numerical Methods
3.9 – 3.13 MINITAB output for all variables on data comparing lifestyles.
a.
Variable
voters
income tax
video reentals
PCs
Religion
b.
3.9
3.10
3.11
3.12
3.13
c.
N
9
9
8
8
6
Mean
70.70
49.44
4.03
18.94
31.65
Median
71.50
50.00
2.10
17.50
26.25
StDev
11.81
7.30
4.49
7.10
18.21
Min
49.1
40.0
0.7
11.5
10.0
Max
85.0
60.0
13.8
35.0
55.8
Q1
61.35
42.00
1.13
14.50
17.50
Somewhat skewed left; the U.S. has the lowest percentage.
Slightly skewed left, the U.S. has the lowest percentage, tied with Britain.
Skewed right; U.S. is the highest.
Skewed right; U.S. is the highest.
Skewed right; U.S. is above the mean and median.
All dot plots below show the same skewness as noted above.
DotPlot
40
50
60
70
80
90
Voters
DotPlot
30
40
50
60
70
Income Tax
DotPlot
0
5
10
Video Rentals
3-2
15
Q3
80.40
55.50
6.45
20.00
52.65
Chapter 03 - Descriptive Statistics: Numerical Methods
DotPlot
10
15
20
25
30
35
40
40
50
60
PCs
DotPlot
0
10
20
30
Religion
d.
3.14
U.S. has the lowest percentage.
U.S. has the lowest percentage, tied with Britain.
U.S. is the highest.
U.S. is the highest.
U.S. is above the mean and median.
Sample mean = 83.6 and sample median = 78.5 indicating data is skewed right.
3.15 a.
3.16
3.9
3.10
3.11
3.12
3.13
Skewed to the right. Most players earned the league minimum.
b.
Earning more than the mean: about 33%
Earning more than the median: 50%
c.
Explanations will vary. All quotes are a distortion/misrepresentation of the true earnings by
the majority of players.
The range is the difference between the largest and smallest measurements in a population or
sample. It is a simple measure of variation.
The variance is the average of the squared deviations from the average. It measures variability
in square units.
The standard deviation is the square root of the variance.
3.17
Both the variance and standard deviation measure the spread of the individual values about the
mean. The larger the spread, the more variation in the data.
3-3
Chapter 03 - Descriptive Statistics: Numerical Methods
3.18
The variance and standard deviation are calculated by using all of the data, while the range is
calculated using only the largest and smallest values.
3.19
range = 15 – 5 = 10
5
(x
i
–  )2
(5 – 10) 2  (8 – 10) 2  (10 – 10) 2  (12 – 10) 2  (15 – 10) 2
5
5
25  4  0  4  25

 11.6
5
  11.6  3.4059
2 
i 1

3.20
Housing Affordability in Texas
x
57.5
61.7
32.5
67.4
55.7
49.2
(x – 54)
3.5
7.7
–21.5
13.4
1.7
-4.8
(x – 54)**2
12.25
59.29
462.25
179.56
2.89
23.04
sum(x – 54)**2 PopVariance PopStDev
739.28
123.21
11.10
Mean = 54
Range = 67.4 – 32.5 = 34.9
Variance = 123.21
Std Deviation = 11.10
3-4
Chapter 03 - Descriptive Statistics: Numerical Methods
3.21
a
Airline Revenues
Revenue
$ Billions
(Rev - 11.43)^2
22.6
19.3
17.2
13.1
12.6
11.6
9.1
3.3
3.1
2.4
124.7689
61.9369
33.2929
2.7889
1.3689
0.0289
5.4289
66.0969
69.3889
81.5409
114.3
446.641
Mean =
11.43
Variance = 446.641 / 10 =
44.664
Sum
Std. Dev. = Sqrt (Variance)=
6.683
Range = 22.6 – 2.4 =
20.200
Airline Profits
Profits
$ millions
(Rev - 1530.7)^2
231
22,876
-6,203
343
-2,835
304
499
-53
146
-1
1689220.09
455621832.1
59810115.69
1410631.29
19059336.49
1504792.89
1064404.89
2508105.69
1917394.09
2346104.89
Sum
15307
546931938.1
Mean =
Variance =
1530.7
Std. Dev. =
Range =
54693193.810
7395.485
29079
3-5
Chapter 03 - Descriptive Statistics: Numerical Methods
b.
United Airlines is a high outlier and the only observation above the average. All other
airlines are within two standard deviations of the average.
Airline
American
Airlines
United Airlines
Delta Air Lines
Continental
Airlines
Northwest
Airlines
US Airways
Group
Southwest
Airlines
Alaska Air
Group
SkyWest
Jetblue Airways
3.22 a.
x
Profits
($
millions)
z-score
231
22,876
-6,203
-0.18
2.89
-1.05
343
-0.16
-2,835
-0.59
304
-0.17
499
-0.14
-53
146
-1
-0.21
-0.19
-0.21
157  132  109  145  125  139
 134.5
6
(157 – 134 .5) 2  (132 – 134 .5) 2  (139 – 134 .5) 2
s2 
= 276.7
6 –1
s  276.7  16.63
 x  157  132  109  145  125  139  109,925
 x   (157  132  109  145  125  139)  (807 )  651,249
2
i
Also
2
2
2
2
2
2
2
2
2
i
s2 
1 
n – 1 
x
2
i
–
1
n
 x    6 1– 1 109,925 – 16 (651,249)  276.7

2
i
b.
[ x  s]  [134.5  16.63]  [117.87,151.13]
[ x  2s]  [134.5  2(16.63)]  [101.24,167.76]
[ x  3s]  [134.5  3(16.63)]  [84.61,184.39]
c.
Yes, because $190 is not within the 99.73% interval.
d.
z157=
157  134.5
= 1.353
16.63
z132 = –0.150
z109 = –1.533
z145 = 0.631
3-6
Chapter 03 - Descriptive Statistics: Numerical Methods
z125 = –0.571
z139 = 0.271
Each expense is within 2 standard deviations of the mean.
3.23 a.
The histogram indicates that the breaking strengths are approximately normally distributed.
Therefore, the empirical rule is appropriate.
b.
68.26%: [50.575 ± 1(1.6438)] = [48.9312, 52.2188]
95.44%: [50.575 ± 2(1.6438)] = [47.2875, 53.8626]
99.73%: [50.575 ± 3(1.6438)] = [45.6436, 55.5064]
c.
We estimate that the breaking strengths of 99.73% of the trashbags are between 45.6436 lbs
and 55.5064 lbs. Thus, almost any trashbag will have a breaking strength that exceeds 45 lbs.
d.
27 out of 40 (or 67.5%) actually fall within the interval [ x  s].
38 out of 40 (or 95%) actually fall within the interval [ x  2s].
40 out of 40 (or 100%) actually fall within the interval [ x  3s].
These percentages compare favorably with those given by the Empirical Rule
(68.26%, 95.44%, and 99.73%). This suggests that our inferences are valid.
3.24 a.
It is somewhat reasonable.
b.
[ x  s]  [5.46  2.475]  [2.985,7.935]
[ x  2s]  [5.46  2(2.475)]  [.51,10.41]
[ x  3s]  [5.46  3(2.475)]  [–1.965,12.885]
c.
Yes, because the upper limit of the 68.26% interval is less than 8 minutes.
d.
66% fall into [ x  s], 96% fall into [ x  2s] , 100% fall into [ x  3s] .
Yes, they are reasonably valid.
3.25 a.
It is somewhat reasonable.
b.
[ x  s]  [42.95  2.6424 ]  [40.3076,45.5924 ]
[ x  2s]  [42.95  2(2.6424 )]  [37.6652,48.2348]
[ x  3s]  [42.95  3(2.6424 )]  [35.0228,50.8772 ]
c.
Yes, because the lower limit of the interval is greater than 35.
d.
63% fall into [ x  s], 98.46% fall into [ x  2s], 100% fall into [ x  3s] .
Yes, they are reasonably valid.
3-7
Chapter 03 - Descriptive Statistics: Numerical Methods
3.26 a.
Stem and Leaf plot for
stem unit =
leaf unit =
Time
10
1
Frequency
27
Stem
3
22
11
1
3
4
4
2
63
5
Leaf
222222222333333333334444444
555555566677777889999
9
00011122234
6
12
Histogram
35
30
20
15
10
5
Time
Distribution is skewed to the right.
3-8
54
52
50
48
46
44
42
40
38
36
34
0
32
Percent
25
Chapter 03 - Descriptive Statistics: Numerical Methods
b.
[ x  s]  [36.56  4.475]  [32.085,41.035] :73%
[ x  2s]  [36.56  2(4.475)]  [27.61,45.51] :95.23%
[ x  3s]  [36.56  3(4.475)]  [23.135,49.985] :96.83%
c.
They are inconsistent with the empirical rule, but consistent with Chebyshev’s Theorem.
d.
The transaction times are skewed to the right.
3.27 a.
RS Internet Age Fund:
Franklin Income A Fund:
Jacob Internet Fund:
[10.93 – 2*41.96, 10.93 + 2*41.96] = [–72.99, 94.85]
[13 – 2*9.36, 13 + 2*9.36] = [–5.72, 31.72]
[34.45 – 2*41.16, 34.5 + 2*41.16] = [–47.87, 116.77]
b.
RS has the lowest average and highest variability.
Franklin has the second lowest average and the smallest variability.
Jacob has the highest average return and the second highest variability.
c.
RS Internet Age Fund:
Franklin Income A Fund:
Jacob Internet Fund:
Coefficient of Variation = 41.96 / 10.93 * 100 = 383.9%
Coefficient of Variation = 9.36 / 13 * 100 = 72%
Coefficient of Variation = 41.14 / 34.45 * 100 = 119.4%
RS Internet is riskiest, Jacob is second and Franklin is least risky.
3.28 a.
The value such that a specified percentage of the measurements in a population or sample fall
at or below it.
b.
A value below which approximately 25% of the measurements lie
c.
The 75th percentile
d.
IQR  Q3  Q1 contains the middle 50% of the data.
3.29
In a box and whisker plot a point that is more than 1.5 times the IQR above Q3 or below Q1 is
considered to be an outlier.
3.30
Median = 8. Q3 = 9. Q1 = 7.5.
3-9
Chapter 03 - Descriptive Statistics: Numerical Methods
3.31 a.
192 b.
152
c.
141
d.
171
e.
132
f.
30
g.
Income
(1000's)
127
241
141.00
152.00
171.00
minimum
maximum
1st quartile
median
3rd quartile
BoxPlot
100
120
140
160
180
200
220
240
260
280
Income (1000's)
3.32 a.
Explanations will vary.
b.
High-tech sector is most variable. Retailing sector is least variable.
c.
High-tech and Insurance sectors
3.33
Overall, the thirty-year rates are higher than the 15-year rates. The variability is similar.
Average of the differences = .4444
3.34 a.
Q1 = 7.5, Q3 = 9
lower hinge = 7.5
upper hinge = 9
(7 + 8) / 2 = 7.5
(9 + 9) / 2 = 9
b.
Q1 = 138, Q3 = 177
lower hinge = 139.5 (138 + 141) / 2 = 139.5
upper hinge = 174 (171 + 177) / 2 = 174
c.
explanations will vary
3.35 a.
b.
All categories showed improvement.
Most: strategic quality planning, quality and operational results
Least: information and analysis, human resource development and management
3-10
Chapter 03 - Descriptive Statistics: Numerical Methods
c.
Increase: leadership, human resource development, customer focus and satisfaction
Decrease: strategic quality planning, management of process quality
Others stayed about the same.
d.
The skewness changed in leadership, strategic quality planning, human resource development,
and quality and operational results.
3.36
Both are measurements of linear association between two quantitative variables. If they are
positive, then the relationship is positive and if they are negative, then the relationship is
negative.
3.37
For a specific value of x you predict a value of y by plugging the value of x into the equation
of the least squares line.
3.38
sxy = -179.6475 / 7 = -25.66, sx = sqrt(1404.355 / 7) = 14.16, sy = sqrt(25.548 / 7) = 1.91,
r = -25.66 / (14.16 * 1.91) = -0.949, b1 = -25.66 / 14.162 = 0.1279, b0 = 10.2125 – (-0.1279) *
43.98 = 15.84
Predicted value at x = 40 degrees is 15.84 – 0.1279 * 40 = 10.724
3.39 a.
b.
There is a strong positive linear relationship between these two variables
Predicted service time is 11.4641 + 24.6022 * 5 = 134.4751 minutes.
3.40
Classess are weighted by credit hours. The xi values are the class grades and the weights are
the credit hours per class.
3.41
The midpoint equals the average of the measurements in the class.
3.42
The actual measurements are unknown.
3.43

100,000 (10.7)  500,000 (21.7)  500,000 (9.9)  200,000 (5.8)  50,000 (5.5)
100,000  500,000  500,000  200,000  50,000

18,305,000
 13.56%
1,350,000
a.
b.
unweighted mean = 10.72%
More money was invested in funds with larger gains.
3.44
a.

1.62(5.1)  5.09(7.1)  6.45(5.6)  3.18(5.4)  3.08(4.8)
1.62  5.09  6.45  3.18  3.08

112 .477
19.42
 5.79%
b.
unweighted mean = 5.6%
3-11
Chapter 03 - Descriptive Statistics: Numerical Methods
The weighted mean accounts for the different sizes of labor force.
3.45
3.46
2 

53(2)  118(5)  21(8)  3(11)
 4.6lbs
195
a.
x=
b.
53(2  4.6) 2  118(5  4.6) 2  21(8  4.6) 2  3(11  4.6) 2 742.8
s=

 3.829
195  1
194

2
1(24.5)  17(74.5)  5(124.5)  4(174.5)  1(224.5)  2(274.5) 3385

 112.83%
30
30
1(24 .5  112 .83) 2  17 (74 .5  112 .83) 2  5(124 .5  112 .83) 2  4(174 .5  112 .83) 2  1(224 .5  112 .83) 2  2(274 .5  112 .83) 2
20
113416 .667
 3780 .56
30
  3780 .56  61.49%
3.47
Midpoint
30
35
40
45
50
55
60
65
70
75
Freq
1
3
3
13
14
12
9
1
3
1
sample mean = 51.5
(M – mean)**2
462.25
272.25
132.25
42.25
2.25
12.25
72.25
182.25
342.25
552.25
M*f
30
105
120
585
700
660
540
65
210
75
variance = 81.61017
s = 9.033835
3-12
F*diff^2
462.25
816.75
396.75
549.25
31.5
147
650.25
182.25
1026.75
552.25
Chapter 03 - Descriptive Statistics: Numerical Methods
3.48
The geometric mean return for an investment is the constant return on the investment for the
investment period.
3.49
Yes, the ending value is I(1+Rg)n , where I is the initial investment.
3.50
a.
Rg  3 (1  .1)(1  .1)(1  .25)  1  1.0736  1  .0736  7.36%
b.
$5,000,000(1+.0736)3=$6,187,500
1,000,000(1+Rg)4=4,000,000
3.51
(1+Rg)4 = 4
(1+Rg) =
4
4
Rg = 1.4142 – 1
Rg = .4142
3.52
a.
b.
3.53
2000–01:
1283 .27  1455 .22
 11.8%
1455 .22
2001–02:
1154 .67  1283 .27
 10.0%
1283 .27
2002–03:
909 .03  1154 .67
 21.3%
1154 .67
2003–04:
1108 .48  909 .03
 21.9%
909 .03
2004–05:
1211 .92  1108 .48
 9.3%
1108 .48
c.
Rg= 5 (1  .118)(1  .1)(1  .213)(1  .219 )(1  .093)  1  .9640  1  .036
d.
$1,000,000(1 – .036)5 = $832,501
a.
1990–95:
1096  789
 .3891
7891
1995–2000:
Rg =
1534  1096
 .3996
1096
(1.3891)(1.3996)  1  .39434
3-13
Chapter 03 - Descriptive Statistics: Numerical Methods
x  1534
 .39434
1534
b.
x  $2,139
3.54
Times greater than 17 are high outliers and they should be investigated.
BoxPlot
0
5
10
15
20
25
30
Time
3.55
Explanations will vary.
3.56 a.
Fixed Annuities:
68.26%
99.73%
[ ± ] = [8.31 ± .54] = [7.77, 8.85]
[ ± 3] = [8.31 ± 3(.54)] = [6.69, 9.93]
Cash Equivalents:
68.26%
99.73%
[ ± ] = [7.73 ± .81] = [6.92, 8.54]
[ ± 3] = [7.73 ± 3(.81)] = [5.30, 10.16]
U.S. Treasury Bonds:
68.26%
99.73%
[ ± ] = [8.80 ± 5.98] = [2.82, 14.78]
[ ± 3] = [8.80 ± 3(5.98)] = [–9.14, 26.74]
U.S. Corporate Bonds:
68.26%
99.73%
[[ ± ] = [9.33 ± 7.92] = [1.41, 17.25]
[ ± 3] = [9.33 ± 3(7.92)] = [–14.43, 33.09]
Non U.S. Gov’t Bonds:
68.26%
99.73%
[ ± ] = [10.95 ± 10.47] = [.48, 21.42]
[ ± 3] = [10.95 ± 3(10.47)] = [–20.46, 42.36]
Domestic Large Cap Stocks:
68.26%
99.73%
[ ± ] = [11.71 ± 15.30] = [–3.59, 27.01]
[ ± 3] = [11.71 ± 3(15.30)] = [–34.19, 57.61]
International Equities:
68.26%
99.73%
[ ± ] = [14.02 ± 17.16] = [–3.14, 31.18]
[ ± 3] = [14.02 ± 3(17.16)] = [–37.46, 65.5]
3-14
Chapter 03 - Descriptive Statistics: Numerical Methods
Domestic MidCap Stocks
68.26%
99.73%
[ ± ] = [13.64 ± 18.19] = [–4.55, 31.83]
[ ± 3] = [13.64 ± 3(18.19)] = [–40.93, 68.21]
Domestic Small Cap Stocks
68.26%
99.73%
[ ± ] = [14.93 ± 21.82] = [–6.89, 36.75]
[ ± 3] = [14.93 ± 3(21.82)] = [–50.53, 80.39]
b. and c. are for fixed annuities; others follow similar methods. The results for b are obtained
by using Chebyshev’s Theorem for k=2 and k=3. The results for c are 3 standard deviations
above the mean and 3 standard deviations below the mean.
b.
At least 75%: [7.23, 9.39]; at least 88.89%: [6.69, 9.93]
c.
9.93, 6.69
d.
Domestic small cap stocks: 80.39, domestic midcap stocks: 68.21
e.
Fixed annuities: 6.69; cash equivalents: 5.30
f.
Least risky: fixed annuities
coefficient of variation = (.54/8.31)100 = 6.498
Riskiest: domestic small cap stocks
coefficient of variation = (21.82/14.93)100 = 146.149
3.57 a.
b.
3.58 a.
b.
3.59 a.
About 70%
410
538  3(41) = 538  123 = [415, 661]
Yes, it is beyond the interval.
Average Price with Pools = $276,056
Average Price without Pools = $226,900
Difference = $49,156
% Recouped = $49,156/32,500 = 151%
b.
The assumption that the only variable causing the difference in average price is whether
or not there is a pool is really suspect.
Internet Exercise
3.60
Analyses will vary
3-15
Related documents