Download Practical Stats

Document related concepts

Bootstrapping (statistics) wikipedia , lookup

Taylor's law wikipedia , lookup

Foundations of statistics wikipedia , lookup

Statistics education wikipedia , lookup

History of statistics wikipedia , lookup

Misuse of statistics wikipedia , lookup

Transcript
Practical Statistics
Descriptive Statistics
There are six statistics that will
answer 90% of all questions!
There six are:
1. Descriptive
There six are:
1. Descriptive
2. Chi-square
There six are:
1. Descriptive
2. Chi-square
3. Z-tests
There six are:
1.
2.
3.
4.
Descriptive
Chi-square
Z-tests
t-tests
There six are:
5. Correlation (association)
There six are:
5. Correlation
6. Regression
Descriptive Statistics
These are utilized to describe the characteristic
of a population (parameter), or
the characteristics of a sample (a statistic)
Many business problems can be solved by
knowing only descriptive statistics.
Geodemography
http://www.youtube.com/watch?v=7UCxzYpXhpI
Geodemography
Geodemography
2012 Map
http://www.geocurrents.info/2012/11#!gallery/6/
Descriptive Statistics
Simple counts: frequency
Descriptive Statistics
Simple counts: frequency
The amount of time (in seconds) needed for assembly-line workers
to complete a weld was recorded for 40 workers.
69 60 75 74 68 66 73 76 63 67 69 73 65 61 73 72 72 65 69 70
64 61 74 76 72 74 65 63 69 73 75 70 60 62 68 74 71 73 68 67
a. What is the frequency for each amount of time?
Descriptive Statistics
Simple counts: histograms
5
Frequency
4
3
2
1
Mean =68.975
Std. Dev. =4.76896
N =40
0
60.00
65.00
70.00
VAR00001
75.00
On SPSS: Open the file called WeldDat.sav
Do the following:
Graphs
Graphboard Template…
Select WeldTime
Select histogram with Normal…
OK
Descriptive Statistics
Simple counts: histograms
5
Frequency
4
3
2
1
Mean =68.975
Std. Dev. =4.76896
N =40
0
60.00
65.00
70.00
VAR00001
75.00
Descriptive Statistics
Simple counts: ogive
100.0%
Cumulative Percent
80.0%
60.0%
40.0%
20.0%
0.0%
60.00
65.00
70.00
VAR00001
75.00
80.00
Descriptive Statistics
Simple counts: ogive
http://mathworld.wolfram.com/Ogive.html
Symmetry
Right
Left
Descriptive Statistics
Back to the workers making welds…..
The amount of time (in seconds) needed for assembly-line workers
to complete a weld was recorded for 40 workers.
69 60 75 74 68 66 73 76 63 67 69 73 65 61 73 72 72 65 69 70
64 61 74 76 72 74 65 63 69 73 75 70 60 62 68 74 71 73 68 67
Descriptive Statistics
Central tendency:
A single number to represent a group.
Statistics
Time
N
Mean
Std. Error of Mean
Median
Mode
Std. Deviation
Skewness
Std. Error of Skewness
Kurtos is
Std. Error of Kurtosis
Sum
Valid
Missing
40
0
68.9750
.75404
69.0000
73.00
4.76896
-.379
.374
-.985
.733
2759.00
Descriptive Statistics
The average is 68.98.
The standard deviation is 4.77.
The amount of time (in seconds) needed for assembly-line workers
to complete a weld was recorded for 40 workers.
69 60 75 74 68 66 73 76 63 67 69 73 65 61 73 72 72 65 69 70
64 61 74 76 72 74 65 63 69 73 75 70 60 62 68 74 71 73 68 67
Descriptive Statistics
Central tendency:
1. Mode
The most frequent score.
Descriptive Statistics
Central tendency:
1. Mode
2. Median
Half the numbers are larger, half are smaller.
Descriptive Statistics
Central tendency:
1. Mode
2. Median
3. Mean
The center of gravity of a distribution
Descriptive Statistics
Central tendency:
1. Mode
2. Median
3. Mean
Arithmetic
Harmonic
Geometric
Descriptive Statistics
Central tendency:
1. Mode
2. Median
3. Mean
Arithmetic
n
x
x
i 1
n
i
Descriptive Statistics
Central tendency:
3. Mean
Harmonic
Clayson Power Data
180
160
140
120
100
80
60
40
20
0
1/6/00 1/2/02 1/10/03 1/6/05 1/2/07 1/10/08 1/6/10 1/2/12 1/10/13 1/6/15
Legend
Temp
KWH
Cost
Descriptive Statistics
Central tendency:
3. Mean
Harmonic
Clayson Power Data
80
70
60
50
40
30
20
10
0
1/6/00 1/2/02 1/10/03 1/6/05 1/2/07 1/10/08 1/6/10 1/2/12 1/10/13 1/6/15
Legend
Temp
Descriptive Statistics
Central tendency:
3. Mean
Harmonic
Clayson Power Data
80
70
60
Legend
Temp
Linear Trend
50
40
30
20
10
0
1/5/00 1/1/02 1/9/03 1/5/05 1/1/07 1/9/08 1/5/10 1/1/12 1/9/13 1/5/15
Descriptive Statistics
Central tendency:
1. Mode
2. Median
3. Mean
Harmonic
n
1 1
1
 
H n i 1 xi
Descriptive Statistics
Central tendency:
1. Mode
2. Median
3. Mean
Geometrical (For pure ratio numbers)
G
 x 
n
i 1 i
1
n
Geometric Mean:
Does pricing following perception or volume?
If a package holds twice as much should it be
priced as twice that of a
package that holds half
as much?
Problem: Customers do not perceive volume correctly.
Geometric Mean:
Does pricing following perception or volume?
The perception of volume follows:
Seven’s Law:
  kS
How should this be done?
p
Descriptive Statistics
Measure of Variation:
Again a single number to represent a group.
Statistics
Time
N
Std. Error of Mean
Std. Deviation
Variance
Skewness
Std. Error of Skewness
Kurtos is
Std. Error of Kurtosis
Range
Minimum
Maximum
Valid
Missing
40
0
.75404
4.76896
22.743
-.379
.374
-.985
.733
16.00
60.00
76.00
Descriptive Statistics
Measure of Variation:
1. Range
a. (Highest – Lowest) + 1
b. Subject to deviancy
Descriptive Statistics
Measure of Variation:
1. Range
2. Mean Deviation
a. Problem: Variation logically would be
the average deviation around a measure
of central tendency.
 x  x / n
i
Descriptive Statistics
Measure of Variation:
1. Range
2. Mean Deviation
a. Problem: But…
 x  x  0
i
Always!!
Descriptive Statistics
Measure of Variation:
1. Range
2. Mean Deviation
a. Problem: So negative can be removed by
looking at the absolute value.
Abs(  xi  x ) / n
Descriptive Statistics
Measure of Variation:
1. Range
2. Mean Deviation
a. Problem: Negative can be removed
by squaring the value.
 x
i
 x / n
2
Descriptive Statistics
Measure of Variation:
1. Range
2. Mean Deviation
a. Problem: This is the definition of:
3. Variance
 x
i
 x  / (n  1)
2
Descriptive Statistics
Measure of Variation:
1. Range
2. Mean Deviation
3. Variance
a. Problem: This is difficult to work with
descriptively, so the square root can be taken.
x  x
n
Sx 
i 1
i
n 1
2
Descriptive Statistics
Measure of Variation:
1. Range
2. Mean Deviation
3. Variance
a. This is called the:
4. Standard Deviation
x  x
n
Sx 
i 1
i
n 1
2
The standard deviation is:
Roughly the average difference scores are
from the average.
Or:
It is roughly the average difference between
two measures taken from the same population.
It is an intuitive measure.
The amount of time (in seconds) needed for assembly-line workers to
complete a weld was recorded for 40 workers.
69 60 75 74 68 66 73 76 63 67 69 73 65 61 73 72 72 65 69 70
64 61 74 76 72 74 65 63 69 73 75 70 60 62 68 74 71 73 68 67
a. Give the central tendency measures of this sample.
b. What is the range, variance and standard deviation?
c.
Explain what you now know about the time it takes to make welds.
d.
When should a worker be reprimanded or fired for working too slow?
a. Give the central tendency measures of this sample.
Statistics
Time
N
Mean
Std. Error of Mean
Median
Mode
Std. Deviation
Variance
Skewness
Std. Error of Skewness
Kurtos is
Std. Error of Kurtosis
Range
Minimum
Maximum
Valid
Missing
40
0
68.9750
.75404
69.0000
73.00
4.76896
22.743
-.379
.374
-.985
.733
16.00
60.00
76.00
a. Give the central tendency measures of this sample.
b. What is the range, variance and standard deviation?
Statistics
Time
N
Mean
Std. Error of Mean
Median
Mode
Std. Deviation
Variance
Skewness
Std. Error of Skewness
Kurtos is
Std. Error of Kurtosis
Range
Minimum
Maximum
Valid
Missing
40
0
68.9750
.75404
69.0000
73.00
4.76896
22.743
-.379
.374
-.985
.733
16.00
60.00
76.00
a. Give the central tendency measures of this sample.
b. What is the range, variance and standard deviation?
c. Explain what you now know about the time it takes to make welds.
Statistics
Time
N
Mean
Std. Error of Mean
Median
Mode
Std. Deviation
Variance
Skewness
Std. Error of Skewness
Kurtos is
Std. Error of Kurtosis
Range
Minimum
Maximum
Valid
Missing
40
0
68.9750
.75404
69.0000
73.00
4.76896
22.743
-.379
.374
-.985
.733
16.00
60.00
76.00
a.
b.
c.
d.
Give the central tendency measures of this sample.
What is the range, variance and standard deviation?
Explain what you now know about the time it takes to make welds.
When should a worker be reprimanded or fired for working
too slow?
Statistics
Time
N
Mean
Std. Error of Mean
Median
Mode
Std. Deviation
Variance
Skewness
Std. Error of Skewness
Kurtos is
Std. Error of Kurtosis
Range
Minimum
Maximum
Valid
Missing
40
0
68.9750
.75404
69.0000
73.00
4.76896
22.743
-.379
.374
-.985
.733
16.00
60.00
76.00