Download View

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Session 2 Handout
Clarification of Some Common Misconceptions
•
Mean, median, variance and other descriptive statistics are NOT exclusive to the normal
distribution.
•
The normal distribution is a case of symmetrical distribution in which theoretically,
mean=median
•
“(Empirical) distribution” aka sampling distribution, means where the data points are;
theoretical distribution means where the data points “should be”.
•
In empirical distributions, mean & median rarely exactly equal, so whereas it cannot be PROVED
that a set of data points is normally distributed, it can be shown that the set of data points is not
too deviant from the (theoretical) normal distribution.
•
Descriptive statistics of a sample are conceptually and computationally different from
descriptive statistics of a population, ie the sample mean is defined and calculated differently
from population mean, the sample variance is defined and calculated differently from
population variance, and so on.
Demo – Mean & Median
http://opl.apa.org/contributions/Rice/rvls_sim/stat_sim/descriptive/index.html
Descriptive Statistics Exercise
•
Suppose a waitress recorded the amount of tips she got from a few tables at a birthday party.
Calculate & interpret the following statistics:
Sample Median, Sample Mean, Sample Variance, Sample Standard Deviation
Data Set
Index
A
Data points (ie Sample)
17, 15, 23, 7, 9, 13, 39, 5
Sample
median
14
Sample
mean ( )
16
Sample
2
variance (s )
120
Sample standard
deviation (s)
≈10.95445
B
18, 16, 24, 8, 10, 14, 40, 6
15
17
120
≈10.95445
C
19, 17, 25, 9, 11, 15, 41, 7
16
18
120
≈10.95445
D
20, 18, 26, 10, 12, 16, 42, 8
17
19
120
≈10.95445
In this exercise, all data points were systematically increased by 1 unit across data sets, hence only
sample medians and sample means changed. In general, if the sample distribution only shifts location
across samples and the relative distances between data points within the sample do not change (ie the
change of data points between samples is systematic), then only the sample median and sample mean
change; Measures of within-sample variability (sample variance and sample standard deviation) do not
change, as the within-sample relative distances between individual data points with respect to the
sample mean do not change.