Download unit4_MeasuresOfDisp..

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
MEASURES OF DISPERSION
(VARIABILITY)
•
•
•
•
Range
Variance and Standard Deviation
Coefficient of Variation
Non-central Locations: Inter-fractile
Ranges
Range
Ungrouped data: range = max data value – minimum data value
grouped data: range = upper limit of largest class – lower limit of smallest class
Standard Deviation
Statistical measure which expresses the
average deviation (spread) about the mean.
The sample standard deviation is given by:
s
s
2
2
x

(
x
)
  /n
n 1
2
2
fx

(
fx
)

 /n
n 1
(ungrouped data)
(grouped data)
COEFFICIENT OF VARIATION:
 Relative measure of dispersion
 Ratio of the standard deviation to the arithmetic
mean, expressed as a percentage:
s
CV  (100%)
x
The higher the value of CV, the greater the variability
Example: A sample of movie theaters in a large metropolitan area
tallied the total number of movies showing in a certain week.
Class interval
1–3
4–6
7–9
10 – 12
13 – 15
Frequency (f)
1
4
3
2
1
Σ = 11
Compute each of the following quantities:
(a) range
(b) standard deviation
(c) coefficient of variation.
3- 5
(a)
Range = 15-1 = 14
(b)
s
 fx
2
 ( fx) / n
2
n 1
Movies frequency class
f
showing
midpoint
x
1–3
1
2
fx
fx2
2
4
4–6
4
5
20
100
7–9
3
8
24
192
10- 12
2
11
22
242
13 - 15
1
14
14
196
82
734
Total
11
734  82  82 / 11
s
 3.5
10
(c)
s
CV  (100%)
x
s  3.5, x  7.46
s
3.5
CV  
100%  46.9%
x 7.46
3- 9
Empirical Rule: For any symmetrical, bell-shaped
distribution:

About 68% of the observations will lie within one
standard deviation of the mean

About 95% of the observations will lie within two
standard deviations of the mean

Virtually all the observations will be within three
standard deviations of the mean
3- 10
Bell-shaped curve showing relationship
between m and σ
68%
95%
99.7%
m3s
m2s ms
m
m+s
m+2s m+ 3s
Symmetric Distribution
Zero skewness → :Mean =Median = Mode
Mean
Median
Mode
The Relative Positions of the Mean, Median, and Mode:
3- 11
Positively skewed
(right skewed)
Mean>Median>Mode
Mode
Mean
Median
The Relative Positions of the Mean, Median and Mode.
3- 12
3- 13
Negatively Skewed
(left skewed)
Mean<Median<Mode
Mean
Mode
Median
The Relative Positions of the Mean, Median and Mode
Non-Central Location Measures
(Fractiles or Quantiles)
Fractiles (Quantiles)
Divide size-ordered data sets into subsets of equal
frequency:
• Quartiles
• Sextiles
• Octiles
• Deciles
• percentiles
Most commonly used are quartiles and percentiles.
Quartiles:
Divide size-ordered data sets into four equalfrequency subsets, i.e they define boundaries
of datasets split into 4 equal-frequency
classes.
Lower (first) quartile Q1: Identifies the value below
which the lower 25% of the ordered data set lies. (Q1is
also the 25th percentile)
Middle (second) quartile Q2: Identifies the value that
separates the lower 50% of the data set from the upper
50% of the data set.
Note: Q2 is also the median or 50th percentile)
Upper (third) quartile Q3: Identifies the value
above which the top 25% of the ordered data set
lies.
(Q3 is also the 75th percentile)
Percentiles divide an ordered data set into 100
equal-frequency parts.
A Percentile is a data value below which a
specified percentage of data values in an
ordered data set fall.
CALCULATING QUARTILES FOR
GROUPED DATA
The jth quartile for grouped data is given by:
 jn


F

c
4

Qj  L + 
fQ j
n = sample size
L = lower limit of jth quartile class
F = < cumulative frequency of immediately preceding class.
fQj = frequency of jth quartile class.
Measures of Dispersion using Fractiles
Interquartile Range (IQR):
Difference between Q3 and Q1.
Includes the middle 50% of the observations.
IQR = Q3 - Q1
Middle 80% range = P90 – P10
Middle 40% range = P70 –P30
Quartile deviation QD= (Q3-Q1)/2
Quartile Deviation (QD):
 Measure of spread about the median.
 Equals half the difference between Q3 and Q1.
Q3  Q1
IQR
QD 

2
2
CALCULATING PERCENTILES
GROUPED DATA
The jth percentile for grouped data is given by:
 jn

 F c

100

Pj  L + 
fP j
n = sample size.
L = lower limit of jth percentile class.
F = < cumulative frequency of immediately preceding class.
fPj = frequency of jth percentile class.
3- 23
Example: A sample of movie theaters in a large
metropolitan area tallied the total number of movies
showing in a certain week. Find the 70th percentile
and the IQR.
Class interval
Frequency (f)
1–3
1
4–6
4
7–9
3
10 – 12
2
13 – 15
1
Σ = 11
Movies showing
frequency
f
< cum freq (F)
1–3
1
1
4–6
4
5
7–9
3
8
10- 12
2
10
13 - 15
1
11
Pos P70 = 70x11/100 = 7.7
Movies freq f
1–3
4–6
7–9
10 – 12
13 – 15
1
4
3
2
1
< cum
freq (F)
1
5
8
10
11
(7.7  5)
P70  7 +
 3  7 + 2.7  9.7
3
Pos Q1 = 1x11/4 = 2.75
Pos Q3 = 3 x 2.75 = 8.25
Movies
freq f
1–3
4–6
7–9
10 – 12
13 – 15
1
4
3
2
1
(2.75  1)
Q1  4 +
 3  5.3
4
< cum freq
(F)
1
5
8
10
11
(8.25  8)
Q3  10 +
 3  10.4
2
IQR = Q3 - Q1=10.4 - 5.3 = 5.1
Related documents