Download Ch3

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
3
Numerical Summary Measures
Measures of central tendency
Mean = 2.95 liters
75% percentile = 3.38
Three values are larger than 3.38
and nine are smaller.
25% percentile = 2.60
Rank of FEV
2.15, 2.25, 2.30, 2.60, 2.68, 2.75, 2.82, 2.85, 3.00, 3.38, 3.50, 4.02, 4.05
Chapter3 p39
0 = female
1 = male
Mean = 0.615
61.5% of the study subjects are males
Median
- Not as sensitive to the value of each
measurement
- the 50th percentile of a set of measurements
- N is odd, the median is the middle value,
(N+1)/2
- N is even, the median is the average of the
middlemost values, the N/2 th and (N/2)+1 th
observations
Chapter3 p40
Mode
The mode of a set of values is the
observation that occurs most frequently
Table 3.1 do not have a unique mode, since
each of the value occurs only once.
The mode for the data in Table 3.2 is 1.
Fig. 3.1(a) – symmetric, and unimodal
mean ~ median ~ mode
Fig. 3.1(b) – symmetric and bimodal
mean ~ median, two peaks  two
distinct group
Fig. 3.1(c) – skewed to the right
not symmetric, mean lies to the right of
the median
Fig. 3.1(d) – skewed to the left
not symmetric, not symmetric, mean lies
to the left of the median
median mean
When the data are not symmetric – median
is the best measure of central tendency
Chapter3 p42
mean = median = mode
Need to have some idea about the
variation among the measurements.
Chapter3 p39
Measures of Dispersion
Range = max. – min.
Its usefulness is limited,
because it considers
only the extreme values
of a data set.
Chapter3 p45
Interquartile range
= 75% percentile – 25% percentile
= 3.38 – 2.60 = 0.78 liters
How to find the kth percentile ?
Let n = number of observations (obs.)
Rank the data from the smallest to the largest
If nk/100 = integers
kth percentile  ½ (nk/100)th + (nk/100 + 1)th
largest obs.
If nk/100 is not an integers
kth percentile  (j+1)th largest measurement
where j is the largest integer that is < nk/100
+
In all cases, mean > median  skewed
(means – median) is smaller for post-AIDS
Example - Table 3.2
The 25% percentile
13(25)/100 = 3.25, so the 25% percentile is the 3
+ 1 = 4th largest measurement = 2.60 liters
The 75% percentile
13(75)/100 = 9.75, so the 75% percentile is the 9
+ 1 = 10th largest measurement = 3.38 liters
Rank of FEV
2.15, 2.25, 2.30, 2.60, 2.68, 2.75, 2.82, 2.85, 3.00, 3.38, 3.50, 4.02, 4.05
Chapter3 p46
Variance (s2) and standard deviation
n
1
2
s2 
(
x

x
)
 i
n  1 i 1
Table 3.1
Mean = 2.95 liters
Variance = 0.39 liters2
Standard deviation = 0.62 liters
Coefficient of variation
s
0.62
CV  100% 
100%  21.0%
x
2.95
Chapter3 p46
Group mean = ungroup mean
Table 3.3
Two/three subjects have 12/11 years of duration
Grouped mean =
[3(5)+1(6)+1(8)+3(11)+2(12)]/10=8.6 years
k
x

i 1
mi f i
k 
 f i 
 i 1 
k  number _ of _ int ervals
mi  midpo int_ of _ the _ ith _ int erval
f i  frequency _ associated _ with _ the _ ith _ int erval
Chapter3 p49
Grouped variance
k
s 
2
Midpoint
99.5
139.5
 (m  x )
i 1
i
2
fi
k 
 f i   1
 i 1 
k  number _ of _ int ervals
mi  midpo int_ of _ the _ ith _ int erval
f i  frequency _ associated _ with _ the _ ith _ int erval
Chapter3 p51
Chebychev’s inequality
For symmetric and unimodal distribution
~67% of the data lie in the interval mean ± 1s
~95% of the data lie in the interval mean ± 2s
~99% of the data lie in the interval mean ± 1s
Not symmetric and unimodal
 Chebychev’s inequality
For any number , at least [1- (1/k)2] of the measurements in the set of data
lie within k standard deviation of their mean
Given k = 2  1 – (1/2)2 = ¾, or 75% of the data lie within mean ± 2s
k (std. dev.)
Chebychev (%)
Symmetric and unimodal (%)
1
≧0
~ 67
2
≧ 75
~ 95
3
≧ 89
~ 99
Chapter3 p49
10 patients
(beats/min.)
9 patients without
#7 (beats/min.)
mean
130.8
140.9
median
143
150
mode
150
150
range
127
47
interquartile
25%percentile=
120
75%percentile=
150
=150 – 120 = 30
Std.
deviation
35.5
Chapter3 p54
outliner
Chapter3 p56
Grouped mean
= 3348.2 grams
Grouped std.dev.
= 616.1 grams
Chapter3 p57
Chapter3 p59
Chapter3 p59
Using Excel
Statistical analysis – AVERAGE, STDEV, VAR, MAX, MIN,
MODE, PERCENTILE, TDIST, TINV, TTEST ….
Graph plotting using Excel
Exercises
Chapter3 p61
Chapter3 p62
Related documents