Download SOLITIONS ON UNIVARIATE DESCRIPTIVE ANALYSIS (FIRST

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
SOLITIONS ON UNIVARIATE DESCRIPTIVE ANALYSIS
(FIRST PART OF THE COURSE)
EXERCISE 1


The variable Evaluation of instructor is a qualitative variable.
Frequency distribution table.
Evaluation of instructor
n
E
A
A
B
P
6
5
3
4
2
Sum



20
Relative frequencies and percentages for all categories.
Evaluation of instructor
ni
fi
%
E
AA
A
B
P
Sum
6
5
3
4
2
20
0.3
0.25
0.15
0.2
0.1
1
30%
25%
15%
20%
10%
100%
What percentage of these students ranked this instructor as above average? 25%
Draw a bar graph for the percentage distribution.
35%
30%
25%
20%
15%
10%
5%
0%
E
AA
A
B
P
The two possible measures of central tendency are the mode and the median. Compute the possible
measures of central tendency.
a.
Mode = E
b.
Median.
Middle position: (20+1)/2 = 10.5
Median = AA
EXERCISE 2
The following data give the numbers of computer keyboards assembled at the Twentieth Century Electronics
Company for a sample of 25 days.
45

52
43

48
52
41
50
56
54
46
47
44
44
42
47
48
50
53
49
51
48
Numbers of computer
keyboards assembled
n
41-44
45-48
49-52
53-56
Sum
5
9
7
4
25
53
51 48 46
Calculate the percentage distribution, using the classes.
Numbers of computer
keyboards assembled
41-44
45-48
49-52
53-56
Sum
ni
fi
%
5
9
7
4
25
0.2
0.36
0.28
0.16
1
20%
36%
28%
16%
100%
What percentage of days corresponds to the numbers of computer keyboards assembled between
45 and 48? 36%

Calculate the cumulative percentage distribution.
Numbers of computer
keyboards assembled
41-44
45-48
49-52
53-56
Sum
ni
Cumulative freq
Cumulative
relative freq
Cumulative%
freq
5
9
7
4
25
5
14
21
25
-
0.2
0.56
0.84
1
-
20%
56%
84%
100%
-
What percentage of days corresponds to the numbers of computer keyboards assembled between 41 and 48?
56%
40%
35%
30%
25%
20%
15%
10%
5%
0%
41-44
45-48
49-52
53-56
EXERCISE 3

The variable Amount of telephone Bill is a continuous quantitative variable.
Amount of
Telephone Bill
Number of Families
(ni)
Cumulative
freq.
cumulative
relative freq.
% percentage
freq.
8
13
17
9
3
50
8
21
38
47
50
-
0.16
0.42
0.76
0.94
1
-
16%
42%
76%
94%
100%
-
20 -| 40
40 -| 60
60 -| 80
80 -| 100
100 -| 120
Sum
How many families in this sample had a bill of $80 or less? 38
What is the corresponding percentage? 76%

Amount of
Number of Families
Telephone Bill
(ni)
(dollars)
20 -| 40
8
40 -| 60
13
60 -| 80
17
80 -| 100
9
100 -| 120
3
Sum
50
Modal-class interval = 60 -|80
EXERCISE 4


Defines the elements and the variable in this data set.
The elements are the cities, the variable is hours spent annually in gridlock.
Construct a frequency distribution table. Take the classes as 44-50, 51-57, 58-64, 65-71, 72-78,
79-85.
Hours
44 -| 50
51 -| 57
58 -| 64
65 -| 71
72 -| 78
79 -| 85
Sum
Cities
(ni)
6
5
4
3
1
1
20
%
30
25
20
15
5
5
100

Construct a histogram for the percentage distribution.
35
30
25
20
15
10
5
0
44 -| 50

58 -| 64
65 -| 71
72 -| 78
79 -| 85
In what percentage of these cities do the motorists spend an average of 64 hours or less annually in
gridlock? 75%
Hours
Cities
(ni)
6
5
4
3
1
1
20
44 -| 50
51 -| 57
58 -| 64
65 -| 71
72 -| 78
79 -| 85
Sum

51 -| 57
%
30
25
20
15
5
5
100
Cumulative
percentage
30
55
75
90
95
100
Starting from the frequency distribution table, compute the mode.
Modal-class interval = 44 -|50
EXERCISE 5
Median.
Ranked data:
0.20 0.80
4.30
5.00
5.25
6.00
Middle position: ((6/2)+(6/2+1))/2 = 3.5
0.20 0.80
4.30
5.00
5.25
6.00
Median = (4.30 + 5.00)/2 = 4.65
Mean.
Mean = (0.20 + 0.80 + 4.30 + 5.00 + 5.25 + 6.00)/6 = 3.59
There is no mode in this data set. Each value in this data set occurs only once.
EXERCISE 6

Compute the mode, the median and the mean.
Numbers of children
Number of Families
0
1
2
3
10
15
21
4
Mode = 2
Median
Middle position: ((50/2)+ ((50/2)+1))/2 = 25.5 Median = (1 + 2)/2 = 1.5
Mean = (10*0 + 15*1 + 2*21 + 3*4)/50 = 1.38

Compute the range, variance and standard deviation.
Numbers of children
xi
Number of Families
ni
xi*ni
xi2
xi2*ni
0
1
2
3
Sum
10
15
21
4
50
0
15
42
12
69
0
1
4
9
0
15
84
36
135
Range = 3 – 0 = 3
Variance = s2= 1/50(135) – 1.90 = 0.80
Standard deviation = s = 0.89
EXERCISE 7
Compute the mode and the mean.
Computers sold
Number of Store
ni
Midpoint
mi
mi*ni
4-12
13-21
22-30
31-39
40-48
Sum
6
9
14
7
4
40
8
17
26
35
44
48
153
364
245
176
986
Modal-class interval = 22-30
Mean = (986)/40 = 24.65
EXERCISE 8

Calculate the mean and the median for these data.
Mean=(17.487+5189+3045+2616+2298+1630+1604+1410)/8= 4409.875
Median :
1. Rank the data:
1410 1604 1630 2298 2616 3045 5189 17487
2. Compute the middle position: ((8/2)+((8/2)+1))/2=4.5 Median=(2298+2616)/2=2457

Do they contain an outlier? Yes, the value 17487

Drop the outlier and recalculate the mean and the median.
Mean=(5189+3045+2616+2298+1630+1604+1410)/8=2541.714
Median Middle position=(7+1)/2=4 Median= 2298
EXERCISE 9


Range = 15 – 2 = 13
Mean = (6+3+7+11+4+3+8+7+2+6+9+15)/12 = 6.75
Variance = s2= 1/12(36+9+49+121+16+9+64+49+4+36+81+225) - 45.56 = 12.69

Standard deviation = s = 3.56
EXERCISE 10
Compute the variance and standard deviation.
No. of days
absent
0-2
3-5
6-8
9-11
12-14
Sum
Number of
employees
ni
10
14
9
4
3
40

Mean = 208/40 = 5.2
Variance = 2= 1/40(1582) – 27.04 = 12.51

Standard deviation =  = 3.54
midpoint
mi
mi*ni
mi2
mi2*ni
1
4
7
10
13
10
56
63
40
39
208
1
16
49
100
169
10
224
441
400
507
1582
Related documents