Download 251y9911

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

Taylor's law wikipedia , lookup

German tank problem wikipedia , lookup

Misuse of statistics wikipedia , lookup

Student's t-test wikipedia , lookup

Transcript
251y9911 10/7/99251 (Open this document in 'Page Layout' view!)
ECO251 QBA1
Name __________________
FIRST HOUR EXAM
SECTION MWF TR 11AM 12:30
OCTOBER 7, 1999
Note that in 2000 all exams will be open book. The exam below is a closed book exam. This means that
parts I and II will be changed considerably. Questions like d) and l) in part I and j), k) an l) in part II will
still appear.
 pN  F 
x1 p  L p  
w
 f p 
Part I. Explain or Define the Following (13 Points Maximum)
a) Field (1) See diagram at right
b) Cell (1) A location in a field
c) Stubhead (1) The top area of a stub –
see diagram.
Table Number
Title
Headnote

Stub
Master Caption
Stub Head
R
O
W
L
A
B
E
L
S
Footnotes
Source Note
Column Labels
C
E
L
L
S
 Boxhead
Field
Field


d) I have a table describing the grades received by a class (A, B, C, D, F or NG) and their year (Fr,
So, Jr, Sr). there are no individuals in the class who are not regularly enrolled undergraduates. If
we define the following categories:
C1 'Person received A or B'
C3 ' Person is Freshman or Sophomore’'
C2 'Person received C, D, or F’
C4 'Person is Junior or Senior'
Are the following classes: (3)
Mutually Exclusive?
Collectively Exhaustive?
C2 and C3
__n__
___n___
C3 and C4
__y___
___y___
C1, C3, C4
__n___
___y___
C3 and C4 are mutually exclusive because no individual in C3 can also be in C4.
C1, C3 and C4 are collectively exhaustive because everyone must be in at least one of these
classes.
e) The mean, median and mode have something in common as to what they measure, which they
do not share with , say, the variance. What is it? Make a sketch showing where they would be
relative to one another in a distribution that is skewed to the left. (4) They are all measures of
central tendency. The sketch should show the mean on the left, the mode on the right and the
median between them .
f) Frequency Polygon (What does it look like, what does it show? A labeled sketch might work
best.) (1) A line graph with x on the x-axis and frequency on the y-axis.
g) Interval Data (1) Quantitative data like temperature where the interval between two values has
a constant meaning and there is no real bottom or top. Contrast it with ordinal or ratio data.
251y9911 10/7/99
h) Third Quintile (1) A point with 3/5 or 60% of the data below it.
i) Mesokurtic (1) Not flat or sharp peaked, or, better, with a zero coefficient of excess,  2 .
j) Box Plot (2) See text page 85. A double firecracker diagram that shows the median and the
quartiles.
k) Is the mean of a population a statistic or a parameter? (1) Numbers that describe a population
are parameters.
l) My firm has a large number of stores throughout the country with annual sales between 12 and
50 million dollars. If this data is to be presented in nine classes, what intervals would you use?
highest - lowest 50  12
Explain your reasoning using the appropriate formula. (3) Use

no. of classes
9
 4.2222 . So the interval should be above 4.2222, maybe 4.5. You might try 10-14.49, 14.518.99, 19-23.49, 2.5-2.99, 28-32.49, 32.5 to 36.99, 37-41.49, 41.5 to 45.99, and 46 to 50.49.
m) If I pick an item at random from a very large population with a median of 535, what is the
approximate probability that the item I pick will be above 535? (2) Since half the numbers are
above the median, the probability is 50%.
n) If you answered question m) and the distribution in m) is skewed to the right and I pick another
item from the population, is the chance that it is above the mean greater, less than the
probability you gave in m), or is it unchanged? ( A picture may help you decide) (1)
If the distribution is skewed to the right, the mean is above the median. If half the numbers are
above the median, less than half must be above the mean.
2
251y9911 9/27/99
Part II. Give formulas for the following or compute an appropriate answer, showing your work (12 Points
maximum):
a) Population Variance (Definitional formula for
ungrouped data) (2)
b) Root-mean-square (2)
c) Sample Mean (Grouped data) (1)
d) Sample Standard Deviation (Use words!) (1)
e) Coefficient of Variation ( For a population)(1)
f) Population Skewness (1)
g) Sample Variance (Grouped Data Computational Formula (2)
h) Kurtosis (Population - Ungrouped) (2)
i) Pearson's Measure of Skewness (2)
j) Compute the Harmonic Mean of the numbers 4,
40 and 400 (2)
k) If a population of 1500 items has a mean of
13555 and a standard deviation of 100, according
to Chebyshef's Inequality, what is the minimum
number of items that could be between 13355 and
13755? (3)
l) If you answered k) and you now learn that the
population is unimodal and symmetrical, would
you raise or lower your estimate in k), or would
you leave it unchanged? (1)
Solution: a) Population Variance (Definitional formula for ungrouped data) (2) 
 x or
 fx
c) Sample Mean (Grouped data) (1) x 
b) Root-mean-square (2) x rms 2 
1
n
x rms 
2
1
n
x
2
 x   

2
N
2
n
d) Sample Standard Deviation (Use words!) (1) The square root of the sample variance.
e) Coefficient of Variation ( For a population)(1) C 
f) Population Skewness (1)  3 
1
N
 x   
3

1
N


 x
3
 3
x
g) Sample Variance (Grouped Data - Computational Formula (2) s
h) Kurtosis (Population - Ungrouped) (2)
4  
2
2
 2n 3
 fx

x   
2

 nx 2
n 1
4
N
3mean  mode 
i) Pearson's Measure of Skewness (2) SK 
std .deviation
j) Compute the Harmonic Mean of the numbers 4, 40 and 400 (2)
1
1

xh n
 x so
1
1
1 1 1
1  1  100 10
1  1  111  111
1200
  

 


 

and x h 
 10 .811



x h 3  4 40 400  3  400 400 400  3  400  1200
111
k) If a population of 1500 items has a mean of 13555 and a standard deviation of 100, according
to Chebyshef's Inequality, what is the minimum number of items that could be between 13355
1
and 13755? (3) Chebyshef’s Inequality says Px    k   2 which means that the
k
1
proportion in the tails outside the interval   k to   k is at most 2 , so that the
k
13755  13555
1
 2 , so the
proportion within the same interval is at least 1  2 . In this case k 
100
k
3
1
1 3
proportion is at least 1  2  1   , and 1500   1125 , so the number is at least 1125.
4
4 4
k
l) If you answered k) and you now learn that the population is unimodal and symmetrical, would
you raise or lower your estimate in k), or would you leave it unchanged? (1) The empirical rule
says that 95% is within 2 standard deviations, so raise the estimate.
3
251y9911 10/7/99
Part III. Do the Following Problems (25 Points)
In a period of 8 days you make the following numbers of sales(in millions):
Day :
1
2
3
4
5
6
Sales: 9.2
10.2
9.2
11.2
19.2
12.2
Compute the Following (assuming that the numbers are a sample):
a) Mean Sales (1)
b) The Median (1)
c) The Standard Deviation (3)
d) The 8th Decile (2)
Solution: Compute the Following:
Note that x is in order
n  6 ,  x  98 .6 ,
x
 x is not  x   98.6
 x  x  equal to  x  x 
2
2
2
2
Index x
x2
1
9.2
84.64
2
9.2
84.64
3 10.2 104.04
4 11.2 125.44
5 12.2 148.84
6 13.2 174.24
7 14.2 201.64
8 19.2 368.64
98.6 1292.12
 1292 .12 ,
xx
-3.125
–3.125
–2.125
–1.125
-0.125
0.875
1.875
6.875
0.000
7
13.2
8
14.2
 x  x 2
9.7656
9.7656
4.5156
1.2656
0.0156
0.7656
3.5156
47.2657
76.8750
 x  x   0.00,  x  x 2  76.875 .
2
as some of you seem to have fooled yourselves into believing. Nor is
2
 98 .6  12 .325 2 . If you had tried these in any of the homework
problems, you would have found that these tricks didn’t work.
a) x 
 x  98.6  12.325
n
8
b) position  pn  1  a.b  .59  4.5
x1 p  xa  .b( xa1  xa ) so x1.5  x.5  x 4  .5( x5  x 4 )  11 .2  .5(12 .2  11 .2) 
c) s
2
x

2
 nx 2
n 1
1292 .12  812 .325 2

 10 .9821
7
s
2
 x  x 

n 1
2

11 .2  12 .2
 11 .7
2
76 .875
 10 .9821
7
s  10.9821  3.3139
d) The 8th decile has 80% below it. position  pn  1  a.b  .89  7.2
x1 p  xa  .b( xa1  xa ) so x1.8  x.2  x 7  .2( x8  x 7 )  14.2  .2(19.2  14.2)  15.2
4
251y9911 10/7/99
2. A retailer finds that the lead time (time between receipt and fulfillment) for a sample of orders is as
below:
Days
a. Calculate the Cumulative Frequency (1)
b. Calculate The Mean (1)
c. Calculate the Median (2)
d. Calculate the Mode (1)
e. Calculate the Variance (3)
f. Calculate the Standard Deviation (2)
g. Calculate the Interquartile Range (3)
h. Calculate a Statistic showing Skewness and
Interpret it (3)
i. Make an Ogive of the Data (Neatness
Counts!)(2)
Frequency
0-4.9
5-9.9
10-14.9
15-19.9
20-24.99
7
8
12
10
13
Solution: x is the midpoint of the class.
7
8
12
10
13
50
7
15
27
37
50
n   f  50,
 f x  x 
2
x
F
f
fx 2
fx
fx3
2.5 17.5
43.75
109
7.5 60.0 450.00
3375
12.5 150.0 1875.00 23437
17.5 175.0 3062.50 53594
22.5 292.5 6581.25 148078
695.0 12012.5 228594
 fx
 695 .0,
 2352.0, and
 fx
 f x  x 
3
2
 12012 .5, and
xx
-11.4
-6.4
-1.4
3.6
8.6
 fx
3
f x  x  f x  x 2 f x  x 3
–79.8 909.72 –10370.8
–51.2 327.68 -2097.2
-16.8 23.52
-32.9
36.0 129.60
466.6
111.8 961.48
8268.7
0.0 2352.0 –3765.6
 228594 ,
 f x  x   0.00,
 3765.60. As usual half of the people who tried to compute
these last three sums got them wrong.
a. Calculate the Cumulative Frequency (1) (See above) The cumulative frequency is the whole F column.
b. Calculate the Mean (1) x 
 fx  695 .0  13.9
n
50
c. Calculate the Median (2) position  pn  1  .551  25.5 . This is above 15 and below 27, so the
 pN  F 
 .550   15 
interval is 10-14.9. x1 p  L p  
 w so x1.5  x.5  10  
 5  14 .1667
12


 f p 
d. Calculate the Mode (1) The mode is the midpoint of the largest group. Since 13 is the largest frequency,
the modal group is 20 to 24.99 and the mode is 22.5.
e. Calculate the Variance (3) s 2 
s2 
 f x  x 
n 1
2

 fx
2
 nx 2
n 1

12012 .5  50 13 .92
 48 .0000 or
49
2352 .0
 48 .0000
49
f. Calculate the Standard Deviation (2) s  48.0000  6.9282
5
251y9911 10/7/99
g. Calculate the Interquartile Range (3) First Quartile: position  pn  1  .2551  12.75 . This is above
 pN  F 
F  7 and below F  15 , so the group is 5 to 9.9. x1 p  L p  
 gives us
 f p 
 .2550   7 
Q1  x1.25  x.75  5  
 5  8.4375 .
8


Third Quartile: position  pn  1  .7551  38.25 . This is above 37 and below 50, so the group is 20 to
 .7550   37 
24.9. x1.75  x.25  20  
 5  20 .1923 . IQR  Q3  Q1  20.1923 8.4375  11.7548.
13


h. Calculate a Statistic showing Skewness and interpret it (3)
k 3
n
(n  1)( n  2)
 fx
3
 3x  fx 2  2nx 3
  495048 228594  313.912012 .5  25013.9 
3
 0.02126 3765 .35   80.0457 . The computer got –80.0502 for this calculation.

n
50
 3765 .60   80.0510 The computer got –80.0509 for this
f x  x 3 
(n  1)( n  2)
49 48 
calculation.
k
 80 .051
 0.2407
or g 1  33 
s
6.9280 3
or k 3 
3mean  mode 313 .9  22 .5

 3.7239
std .deviation
6.9282
Because of the negative sign, the measures imply skewness to the left.
i. Make an Ogive of the Data (Neatness Counts!)(2)
An ogive is a graph of the cumulative distribution. It
must begin at y  0 (not necessarily x ) and should end with a horizontal line. It cannot include negative
slopes.
or Pearson's Measure of Skewness SK 
Your points are:
Up to x  0
F 0
5
7
10
15
15
27
20
37
25
50
above 25
50
6