Download Average

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
Statistics for Economist
Chap 3. The Average and the Standard Deviation
1.
2.
3.
4.
5.
6.
7.
The Center and the Spread
The Average
The mean, median and mode
The Root-Mean-Square
The Standard Deviation
The Degrees of Freedom
The Measurement Error
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
INDEX
STATISTICS
1
The Center and the Spread
2
The Average
3
The mean, median and mode
4
The Root-Mean-Square
5
The Standard Deviation
6
The Degrees of freedom
7
The Measurement Error
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
2/27
STATISTICS
1. The Center and the Spread
Center and Spread(I)
center
center
0
25
50
75
100
0
25
spread
50
75
100
spread
 A histogram can be used to summarize large amounts
of data giving the center of the histogram and the
spread around the center.
- center : mean, median
- spread : deviation, interquartile range
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
3/27
1. The Center and the Spread
STATISTICS
Center and Spread(II)
 The histogram can be
summarized by the center and
the spread, but things do not
always work out so well.
 Reporting only the center
and spread of this histogram
would miss the two peaks
-8
-6
-4 -2
0
해발 (km)
2
4
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
4/27
INDEX
STATISTICS
1
The Center and the Spread
2
The Average
3
The mean, median and mode
4
The Root-Mean-Square
5
The Standard Deviation
6
The Degrees of freedom
7
The Measurement Error
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
5/27
STATISTICS
2. The Average
Average
The average of a list of numbers equals
their sum, divided by how many there
are.
EX) data : 9, 1, 2, 2, 0
sum : 9+1+2+2+0 = 14
number of data : 5
average : 14/5 = 2.8
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
6/27
STATISTICS
2. The Average
Average and the variance
average
average
average
The average is important but not sufficient.
The distribution with the same average
varies if the variances are different.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
7/27
2. The Average
STATISTICS
Age-specific average heights and weights
남성
여성
70
남성
여성
180
평균 신장(cm)
평균몸무게(kg)
80
60
50
40
30
170
160
150
140
130
120
10
20
30
40
연
50
60
70
80
령
10
20
30
40
50
연
령
60
70
80
Does the average height of people decrease after
age 20, dropping about 10cm in 60years?
Does the average weight of people decrease after
age 20, dropping about 15kg in 60years?
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
8/27
2. The Average
STATISTICS
Lexis diagram
100
Cross-sectional data
O Time-series data
90
80
O3
70
Longitudinal data
연령
60
50
X
40
30
O2
20
10
O1
Examine O and X along each
45 degree line.
X2
At 1978
X1
45 o
0
1900 1910 1920 1930 1940 1950 1960 1970 1980 1990 2000 2010
연도
O : age 70, 157cm
X : age 21, 162cm
You need to know the type of data first!
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
9/27
INDEX
STATISTICS
1
The Center and the Spread
2
The Average
3
The mean, median and mode
4
The Root-Mean-Square
5
The Standard Deviation
6
The Degrees of freedom
7
The Measurement Error
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
10/27
3. The Mean, Median and Mode
STATISTICS
Histogram for the weight
1400
1200
1000
800
600
400
200
0
25 30 35 40
45 50 55 60 65 70 75 80
85 90 95
Weight (kg)
 The left and the right of the mean are not always same.
It is the median of a histogram that has the value with half the
area to the left and half to the right.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
11/27
3. The Mean, Median and Mode
STATISTICS
Histogram for the list 1,2,2,3
50%
25%
0%
0
1
2
3
4
If the histogram is symmetric around a median,
it equals the average.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
12/27
3. The Mean, Median and Mode
STATISTICS
The average and the histogram
list 1,2,2,3
50%
0%
0
1
3
4
5
6
7
8
list 1,2,2,5
50%
0%
0
1
2
0
1
2
3
4
5
6
4
5
6
50%
7
8
list 1,2,2,7
0%
7
8
The histograms balance when supported at the average
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
13/27
3. The Mean, Median and Mode
STATISTICS
The tails of a histogram
Long right hand tail
symmetric
(right-skewed)
Average is
bigger than
median
Long left hand tail
(left-skewed)
Average is
about the same
as median
Average is
smaller than
median
Half the area under the histogram lies to the left
of the median and half to the right.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
14/27
3. The Mean, Median and Mode
STATISTICS
Histogram and the mode
 mode: the most frequent value
Think of ‘fashion’. Fashion Mode!
Histogram is highest at the mode.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
15/27
INDEX
STATISTICS
1
The Center and the Spread
2
The Average
3
The mean, median and mode
4
The Root-Mean-Square
5
The Standard Deviation
6
The Degrees of freedom
7
The Measurement Error
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
16/27
4. The Root-Mean-Square
STATISTICS
Root-mean-square
Root-Mean-Square
how to do the arithmetic, read it backwards:
(1) SQUARE all the entries, getting rid of the
signs.
(2) Take the MEAN of the squares.
(3) Take the square ROOT of the mean.
EX) For the list 0, 5, -8, 7, -3 find the RMS
0 2  52  (8) 2  7 2  (3) 2
 29.4  5.4
5
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
17/27
INDEX
STATISTICS
1
The Center and the Spread
2
The Average
3
The mean, median and mode
4
The Root-Mean-Square
5
The Standard Deviation
6
The Degrees of freedom
7
The Measurement Error
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
18/27
5. The Standard Deviation
STATISTICS
Calculating SD
 SD is a sort of average deviation
deviation from the average = each value – average
SD  RMS of deviations from the average
Ex) for the list 20, 10, 15, 15 find the SD.
average = (20+10+15+15)/4 = 15
each deviation from the average : 5, -5, 0, 0
SD:
52  (5) 2  02  02
 16.7  4.1
4 1
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
19/27
5. The Standard Deviation
STATISTICS
Meaning of SD
 SD is often helpful to think of the way a list of
numbers spreads out around the average.
Roughly 68% of the entries on a list are within 1 SD of
the average. And roughly 95% are within 2 SDs of the
average
68%
mean
-1SD
mean
mean+1SD
95%
mean-2SD
mean
mean
+2SD
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
20/27
5. The Standard Deviation
STATISTICS
Histogram for the heights of women
The heights of the 5,257 women in the Health and Nutrition Examination Survey
sample. The region within 1SD of the average is shaded in (i) and 2SD in (ii)
(i)
(ii)
110 120 130 140 150 160 170 180 190 110 120 130 140 150 160 170 180 190
68 – 95 rule
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
21/27
INDEX
STATISTICS
1
The Center and the Spread
2
The Average
3
The mean, median and mode
4
The Root-Mean-Square
5
The Standard Deviation
6
The Degrees of freedom
7
The Measurement Error
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
22/27
STATISTICS
6. The Degrees of Freedom
Definition of DF
 The degrees of freedom is as same as the number of actual
independent values among all the values.
 When calculating SD, we get the average by dividing the
sum of the squares of the deviations by “number of data – 1”
and then get the root squares of the average.
 Why do we use “number of data – 1” instead of “number of
data”? The sum of the deviation has to be 0, so they cannot
all vary freely. The constraint that the sum equals 0
eliminates one degree of freedom.
DF when calculating SD = number of data - 1
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
23/27
INDEX
STATISTICS
1
The Center and the Spread
2
The Average
3
The mean, median and mode
4
The Root-Mean-Square
5
The Standard Deviation
6
The Degrees of freedom
7
The Measurement Error
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
24/27
STATISTICS
7. The Measurement Error
Definition
 Measurement Error
No matter how carefully it was made, a measurement
could have come out a bit differently than it did. There
exists difference between the exact value and the
measurement. Such a difference is a measurement error.
Measurement error makes an individual measurement
deviate from the exact value.
Measurement error = individual measurement – exact value
Individual measurement = exact value + measurement error
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
25/27
STATISTICS
7. The Measurement Error
Size of the measurement error
 The SD of a series of repeated measurements
becomes SD of the measurement error.
 The SD is an estimate of the likely size of the
measurement error in a single measurement.
Individual measurement = exact value + measurement error
SD of individual measurement = SD of measurement error
The size of the measurement error is obtained by
SD of individual measurements
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
26/27
STATISTICS
7. The Measurement Error
Bias
 Bias is the systematic error.
Ex) A butcher weighs a steak with his thumb on the
scale.
Ex) A draper store uses a cloth tape measure shrunk.
Bias affects all measurements the same way, pushing
them in the same direction. When bias is present, the
long-run average of repeated measurements will
itself be either too high or too low. The error should
not cancel out.
Individual measurement = exact value + bias + chance error
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics
27/27
Related documents