Download Cont.

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Degrees of freedom (statistics) wikipedia , lookup

History of statistics wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

Mean field particle methods wikipedia , lookup

Taylor's law wikipedia , lookup

Regression toward the mean wikipedia , lookup

Transcript
Review
Mean—arithmetic average, sum of all
scores divided by the number of scores
Median—balance point of the data, exact
middle of the distribution, 50th percentile
Mode—highest frequency, can be more
than one
1
Review
X
5
4
3
2
1
f
2
5
3
2
2
Find the mean,
median, mode
2
Review
X
5
4
3
2
1
f fX
2
5
3
2
2
N ∑fX
Find the mean,
median, mode
Mean=sum of all
scores(∑fX) /number
of scores(N)
3
Review
X
5
4
3
2
1
f fX
2
5
3
2
2
Find the mean,
median, mode
Mean=sum of all
scores(∑fX) /number
of scores(N)
Median=middle point
(N-1/2)th position
4
Review
X
5
4
3
2
1
f fX
2
5
3
2
2
Find the mean,
median, mode
Mean=sum of all
scores(∑fX) /number
of scores(N)
Median=middle point
(N-1/2)th position
Mode=greatest f
5
Measures of
Variability
Major Points
The general problem
Range and related statistics
Deviation scores
The variance and standard deviation
Boxplots
Review questions
7
The General Problem
Central tendency only deals with the
center
Dispersion


Variability of the data around something
The spread of the points
Example: Mice and Music
8
Mice and Music
Study by David Merrell
Raised some mice in quiet environment
Raised some mice listening to Mozart
Raised other mice listening to Anthrax
Dependent variable is the time to run a
straight alley maze after 4 weeks.
Borrowed from David Howell, 2000
9
Results
Anthrax mice took much longer to run
Much greater variability in Anthrax group

See following graphs for Anthrax and Mozart
We often see greater variability with larger
mean
10
Mozart Group
12
10
8
6
4
2
Std. Dev = 36.10
Mean = 114.6
N = 24.00
0
27.8
83.3
138.9 194.4 250.0 305.6 361.1 416.7 472.2
WEEK4
Anthrax Group
10
8
6
4
2
Std. Dev = 103.14
Mean = 1825.9
N = 24.00
0
1600.0
1700.0
1650.0
WEEK4
1800.0
1750.0
1900.0
1850.0
2000.0
1950.0
2050.0
Range and Related Statistics
The range


Distance from lowest to highest score
Too heavily influenced by extremes
The interquartile range (IQR)



Delete lowest and highest 25% of scores
IQR is range of what remains
May be too little influenced by extremes
13
Trimmed Samples
Delete a fixed (usually small) percentage
of extreme scores
Trimmed statistics are statistics computed
on trimmed samples.
14
Deviation Scores
Definition


distance between a score and a measure of
central tendency
usually deviation around the mean
(X  X )
Importance
15
Variance
Definitional formula
( X  X )
s 
N 1
2
2
Example

See next slide
16
Computing the Variance
X
2
4
5
8
7
4
30
X-X
¯
(X - X)
¯2
Definitional formula
2

(
X

X
)
s2 
N 1
Find the mean
N=6
∑X=30
30/6=5
17
Computing the Variance
X
X-X
¯
2
-3
4
-1
5
0
8
3
7
2
4
-1
30
0
(X - ¯
X)2
2

(
X

X
)
s2 
N 1
Calculate the
difference between
each score and the
mean and sum
18
Computing the Variance
X
X-X
¯
(X - ¯
X)2
2
-3
9
4
-1
1
5
0
0
8
3
9
7
2
4
4
-1
1
30
0
24
( X  X ) 2 24
s 

 4.80
N 1
5
2
Calculate the square
of the difference
between each score
and the mean and
sum
Standard Deviation is
the square root
19
Standard Deviation
Definitional formula

The square root of the variance
( X  X )
s s 
N 1
2
2
Computational formula based on algebraic
manipulation

Makes it easier to calculate
20
Computational Formula
( X )
30
2
2
2
2
2
2
X 
2  4 5 8 7  4 
2
N
6
s 

N 1
5
2
2
2
 4.80
( X )
X 
N  4.8  2.19
s
N 1
2
2
21
Try one
X
5
4
3
2
1
f
2
5
3
2
2
N
(X )
X 
N
s
N 1
2
2
22
Try one
X
5
4
3
2
1
f fX
2
5
3
2
2
N ∑fX
(X )
X 
N
s
N 1
2
2
23
Try one
X
5
4
3
2
1
f
2
5
3
2
2
14
fX
10
20
9
4
2
45
(X )
X 
N
s
N 1
2
2
24
Try one
X
5
4
3
2
1
f
2
5
3
2
2
14
fX
10
20
9
4
2
45
X2
fX2
(X )
X 
N
s
N 1
2
2
∑fX2
25
Try one
X
5
4
3
2
1
f
2
5
3
2
2
14
fX
10
20
9
4
2
45
X2
25
16
9
4
1
fX2
50
80
27
8
2
167
(X )
X 
N
s
N 1
2
2
26
X
5
4
3
2
1
f
2
5
3
2
2
14
fX
10
20
9
4
2
45
X2
25
16
9
4
1
fX2
50
80
27
8
2
167
(X )
X 
N
s
N 1
2
2
(45)
167 
14
s
14  1
2
2025
167 
14  167  144.64  22.36  1.72  1.31
s
13
13
13
27
Estimators
Mean

Unbiased estimate of population mean ()
Define unbiased

Long range average of statistic is equal to the parameter
being estimated.
Variance

2

(
X

X
)
s2 
N 1
Unbiased estimate of 2
28
Cont.
Estimators--cont.

Using
2

(
X

X
)
s2 
N
gives biased estimate

Standard deviation
use square root of unbiased estimate.
29
Merrell’s Music Study SPSS
Printout
WEEK4
Treatment Mean
N
Std. Deviation
Quiet
307.2319 23
71.8267
Mozart
114.5833
24
36.1017
Anthrax 1825.8889 24
103.1392
Total
755.4601 71
777.9646
30
Boxplots
The general problem

A display that shows dispersion for center
and tails of distribution
Calculational steps (simple solution)




Find median
Find top and bottom 25% points
(quartiles)
eliminate top and bottom 2.5% (fences)
Draw boxes to quartiles and whiskers to
fences, with remaining points as outliers
31
Combined Merrell Data
3000
2000
1000
0
-1000
N =
71
WEEK4
32
Merrell Data by Group
3000
2000
1000
WEEK4
0
-1000
N =
23
24
24
Quiet
Mozart
Anthrax
Treatment Condition
33
Review Questions
What do we look for in a measure of
dispersion?
What role do outliers play?
Why do we say that the variance is a
measure of average variability around
the mean?
Why do we take the square root of the
variance to get the standard deviation?
34
Cont.
Review Questions--cont.
How does a boxplot reveal dispersion?
What do David Merrell’s data tell us about
the effect of music on mice?
35