Download Measures of Variability

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
POLS 7000X
STATISTICS IN POLITICAL SCIENCE
CLASS 3
BROOKLYN COLLEGE – CUNY
SHANG E. HA
Leon-Guerrero and Frankfort-Nachmias,
Essentials of Statistics for a Diverse Society
Chapter 3: Measures of Central Tendency







The Mode
The Median
The Mean
Finding the Mean in a Frequency Distribution
The Shape of the Distribution
Considerations for Choosing a Measure of Central
Tendency
Statistics in Practice: Representing Income
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
What is a measure of Central
Tendency?
•
Numbers that describe what is average or typical of the distribution
•
You can think of this value as where the middle of a distribution lies.
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
The Mode
•
The category or score with the largest frequency (or
percentage) in the distribution.
•
The mode can be calculated for variables with levels of
measurement that are: nominal, ordinal, or interval-ratio.
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
The Mode: An Example

Example: Number of Votes for Candidates
for Mayor of Camarillo, California. The
mode, in this case, gives you the
“central” response of the voters: the most
popular candidate.
Sheriff Tupper – 11,769 votes
The Mode:
Jessica Fletcher – 39,443 votes
“Dr. Seth Hazlett”
Dr. Seth Hazlett – 78,331 votes
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
The Median


The score that divides the distribution into
two equal parts, so that half the cases are
above it and half below it.
The median is the middle score, or
average of middle scores in a distribution.
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
The Mean

The arithmetic average
obtained by adding up all
the scores and dividing by
the total number of scores.
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Formula for the Mean
åY
Y=
N
Where ΣY = sum of all scores
N = the number of scores.
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
An Example

Annual per capita carbon dioxide emissions (metric tons) for n
= 10 largest nations in population size
 Bangladesh 0.3, Brazil 1.8, China 2.3, India 1.2,
Indonesia 1.4, Pakistan 0.7, Russia 9.9, U.S. 20.1, Japan
1.4, Nigeria 0.6




Ordered sample: 0.3, 0.6, 0.7, 1.2, 1.4, 1.4, 1.8, 2.3, 9.9,
20.1
Mode = 1.4
Median = (1.4 + 1.4)/2 = 1.4
Mean = (0.3 + 0.6 + 0.7 + …+ 20.1)/10 = 3.97
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Calculating the mean with
grouped scores
Y 
fY
N
where: f Y = a score multiplied by its frequency
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Example: Mean of Grouped Scores
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Grouped Data: the Mean & Median
Calculate the median and mean for the grouped frequency below.
Number of People Age 18 or older living in a U.S. Household in
1996 (GSS 1996)
Number of People
1
2
3
4
5
6
TOTAL
Frequency
190
316
54
17
2
2
581
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Shape of the Distribution


Symmetrical (mean is about equal to median)
Skewed
 Negatively
(example: years of education)
mean < median
 Positively (example: income)
mean > median


Bimodal (two distinct modes)
Multi-modal (more than 2 distinct modes)
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Distribution Shape
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Considerations for Choosing a Measure
of Central Tendency



For a nominal variable, the mode is the only
measure that can be used.
For ordinal variables:
-Use the mode to show what is the most common
value in the distribution.
-Use the median to show which value is located
exactly in the middle of the distribution.
For interval-ratio variables, the mode, median, and
mean may all be calculated. The mean provides
the most information about the distribution, but the
median is preferred if the distribution is skewed.
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Central Tendency
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Chapter 4: Measures of Variability






The Importance of Measuring Variability
The Range
The Inter-Quartile Range
The Variance and the Standard Deviation
Considerations for Choosing a Measure of Variation
Reading the Research Literature: Differences in
College Aspirations and Expectations Among Latino
Adolescents
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
The Importance of Measuring Variability


Central tendency - Numbers that describe
what is typical or average (central) in a
distribution
Measures of Variability - Numbers that
describe diversity or variability in the
distribution.
These two types of measures together help us to sum up a distribution
of scores without looking at each and every score. Measures of central
tendency tell you about typical (or central) scores. Measures of
variation reveal how far from the typical or central score that the
distribution tends to vary.
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Notice that both distributions have the same
mean, yet they are shaped differently
The Range
Range = highest score - lowest
score

Range – A measure of variation in
interval-ratio variables. It is the difference
between the highest (maximum) and the
lowest (minimum) scores in the distribution.
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Percentiles

A score below which a specific percentage
of the distribution falls.
example, the 75th percentile is a score
that divides the distribution so that 75% of
the cases are below it.
 For example, the 25th percentile is a score
that divides the distribution so that 25% of
the cases are below it.
 For
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Inter-Quartile Range




Inter-Quartile Range (IQR) – A measure of
variation for interval-ratio data. It indicates the
width of the middle 50 percent of the
distribution and is defined as the difference
between the lower and upper quartiles (Q1
and Q3.)
IQR = Q3 – Q1
Q3 = 75th percentile
Q1 = 25th percentile
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
The Difference Between the Range
and IQR
These
values fall
together
closely
Shows greater
variability
Importance of
the IQR
Yet the ranges
are equal!
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Variance
• Variance – A measure of variation for
interval-ratio variables; it is the
average of the squared deviations from
the mean
2
s 
2
Y
 (Y Y )
N 1
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Standard Deviation
• Standard Deviation – A measure of variation
for interval-ratio variables; it is equal to the
square root of the variance.
2
sy  s 
2
Y
 (Y Y )
N 1
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Finding the Mean
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Finding the Standard Deviation
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Considerations for Choosing a Measure
of Variability

For nominal variables, you can only use IQV
(Index of Qualitative Variation)



Not discussed in this class!
For ordinal variables, you can calculate the
IQV or the IQR (Inter-Quartile Range). Though,
the IQR provides more information about the
variable.
For interval-ratio variables, you can use IQV,
IQR, or variance/standard deviation. The
standard deviation (also variance) provides the
most information, since it uses all of the values
in the distribution in its calculation.
Leon-Guerrero/Frankfort-Nachmias: Essentials of Social Statistics for a Diverse Society
© 2012 SAGE Publications
Related documents