Download 003 The Normal Curve

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Standard Scores
and
The Normal Curve
0.45
Standard Normal Curve
0.4
0.35
Frequency
0.3
0.25
0.2
0.15
0.1
0.05
0
-3
-2
-1
0
1
2
3
Z Score
5/24/2017
HK 396 - Dr. Sasho MacKenzie
1
Z-Score
• Just like percentiles have a known basis of
comparison (range 0 to 100 with 50 in the
middle), so does the z-score.
• Z-scores are centered around 0 and indicate
how many standard deviations the raw score
is from the mean.
• Z-scores are calculated by subtracting the
population mean from the raw score and
dividing by the population standard deviation.
5/24/2017
HK 396 - Dr. Sasho MacKenzie
2
Z-score Equation
z
x

• x is a raw score to be standardized
• σ is the standard deviation of the population
• μ is the mean of the population
5/24/2017
HK 396 - Dr. Sasho MacKenzie
3
Z-score for 300lb Squat
• Assume a population of weight lifters had a
mean squat of 295 ± 19.7 lbs.
z
x

300  295
z
 .25
19.7
• That means that a squat of 300 lb is .25
standard deviation above the mean. This
would be equivalent to the 60th percentile.
5/24/2017
HK 396 - Dr. Sasho MacKenzie
4
What about a 335 lb Squat
• How many standard deviation is a 335 lb
squat above the mean?
z
x

335  295
z
2
19.7
• That means that a squat of 335 lb is 2
standard deviation above the mean. This
would be equivalent to the 97.7th percentile.
5/24/2017
HK 396 - Dr. Sasho MacKenzie
5
From z-score to raw score
• A squat that is 1 standard deviation below the
mean (-1 z-score) would have a raw score of?
x  295  (1)(19.7)  275.3 lbs
• What would you know about the raw score if it
had a z-score of 0 (zero)? Right on the mean.
5/24/2017
HK 396 - Dr. Sasho MacKenzie
6
Z-score for 10.0 s 100 m
• Assume a population of sprinters had a mean
100 m time of 11.4 ± 0.5 s.
z
x

10  11.4
z
 2.8
0.5
• That means that a sprint time of 10 s is 2.8
standard deviations below the mean. This
would be equivalent to the 99.7th percentile.
5/24/2017
HK 396 - Dr. Sasho MacKenzie
7
Converting Z-scores to Percentiles
• The cumulative area under the standard
normal curve at a particular z-score is equal
to that score’s percentile.
• The total area under the standard normal
curve is 1.
5/24/2017
HK 396 - Dr. Sasho MacKenzie
8
Histogram of Male 100 m
300
Frequency
250
200
150
300
250
100
150
50
0
250
150
50
<10.0
50
10.1
to
10.5
10.6
to
11.0
11.1
to
11.5
11.6
to
12.0
12.1
to
12.6
>12.7
Time (s)
5/24/2017
HK 396 - Dr. Sasho MacKenzie
9
The Histogram
• Each bar in the histogram represents a range
of sprint times.
• The height of each bar represents the
number of sprinters in that range.
• We can add the numbers in each bar moving
from left to right to determine the number of
sprinters that have run faster than the current
point on the x-axis.
• Dividing by the total number of sprinters
yields the proportion of sprinters that have
run faster.
5/24/2017
HK 396 - Dr. Sasho MacKenzie
10
Proportion
• For example, 50 sprinters ran less than 10.0 s.
• That means that, (50/1200)*100 = 4%, of the
sprinter ran < 10.0 s.
• Notice that the area of each bar reflects the
number of scores in that range. Therefore, we
could just look at the amount of area.
• If there are a sufficient number of scores, the
bars can be replaced by a smooth line.
5/24/2017
HK 396 - Dr. Sasho MacKenzie
11
Male NCAA 100 m Sprint
300
Frequency
250
200
150
300
250
100
250
150
50
150
50
0
9.8
10.2
50
10.6
11.0
11.4
11.8
12.2
12.6
13.0
Time (s)
5/24/2017
HK 396 - Dr. Sasho MacKenzie
12
Male NCAA 100 m Sprint
300
Frequency
250
200
150
100
50
0
9.8
10.2
10.6
11.0
11.4
11.8
12.2
30
16
12.6
13.0
Time (s)
97
90
84
70
50
10
3
Percentile
5/24/2017
HK 396 - Dr. Sasho MacKenzie
13
Normal Distribution
• If the data are normally distributed, then
the raw scores can be converted into zscores.
• This yields a standard normal curve with
a mean of zero instead of 11.4 s.
5/24/2017
HK 396 - Dr. Sasho MacKenzie
14
Frequency
Male NCAA 100 m Sprint
-3
-2
-1
0
1
2
3
z-score
(standard deviations)
5/24/2017
HK 396 - Dr. Sasho MacKenzie
15
Male NCAA 100 m Sprint
50%
Frequency
15.9%
84.1%
Cumulative %
34.1%
34.1%
2.3%
97.7%
13.6%
0.14%
-3
0.1%
5/24/2017
13.6%
2.2%
-2
2.2%
-1
0
1
z-score
(standard deviations)
HK 396 - Dr. Sasho MacKenzie
2
99.9%
3
0.1%
16
Excel
• The function NORMSDIST() calculates the
cumulative area under the standard normal
curve.
• The function NORMSINV() performs the
opposite calculation and reports the z-score
for a given proportion.
• NORMDIST() and NORMINV() perform the
same calculations for scores that have not
been standardized.
5/24/2017
HK 396 - Dr. Sasho MacKenzie
17
Z-score and Percentile Agreement
• Converting a z-score to a percentage will
yield that score’s percentile.
• However, the population must be normally
distributed.
• The less normal the population the greater
discrepancy between the converted z-score
and the percentile.
5/24/2017
HK 396 - Dr. Sasho MacKenzie
18