Download Sampling Distributions:

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Bootstrapping (statistics) wikipedia , lookup

Regression toward the mean wikipedia , lookup

Taylor's law wikipedia , lookup

Student's t-test wikipedia , lookup

Transcript
Research Methods 1 Handouts, Graham Hole,COGS - version 1.0, September 2000: Page 1:
SAMPLING DISTRIBUTIONS:
In Psychology we generally make inferences about populations on the basis of
limited samples. We therefore need to know what relationship exists between samples and
populations.
A population of scores has a mean (µ), a standard deviation (δ) and a shape (e.g.,
normal distribution).
If we take repeated samples of scores, each sample also has a mean, a standard
deviation and a shape. Due to random fluctuations, each sample is different - from other
samples and from the parent population. Fortunately, these differences are predictable and hence we can still use samples in order to make inferences about their parent
populations.
What are the properties of the distribution of Sample Means?
(a) The mean of the sample means is the same as the mean of the population of
raw scores:
µX = µX
(b) The standard deviation of the sample means has a special name: "standard
error". It is NOT the same as the standard deviation of the population of raw scores.
σ X ≠ σX
The standard error underestimates the population s.d.:
σX =
σX
n
The bigger the sample size, the smaller the standard error: i.e., variation between
samples decreases as sample size increases. (Because producing a sample mean reduces
the influence of any extreme raw scores in that sample).
(c) The distribution of sample means tends to be normally distributed, no matter
what the shape of the original distribution of raw scores in the population.
This is called the "Central Limit Theorem".
This effect is more marked the bigger the sample size.
This means that we can use the normal distribution as a simple "model" for many
different populations - even though we don't know their true characteristics.
Research Methods 1 Handouts, Graham Hole,COGS - version 1.0, September 2000: Page 2:
The Central Limit Theorem in action:
(a) Frequency distribution of raw scores:
high
frequency
low
population mean
(b) Frequency distribution of means of samples taken from the population above:
high
frequency
low
population mean
Review of the normal distribution:
(a) We know that various proportions of scores fall within certain limits of the
mean (i.e. 68% fall within the range of the mean +/- 1 s.d.; 95% within +/- 2 s.d., etc.).
(b) Using z-scores, we can represent a given score in terms of how different it is
from the mean of the group of scores.
Relationship of a raw score to the mean of the group of scores:
Mean of all scores
A particular raw score
Research Methods 1 Handouts, Graham Hole,COGS - version 1.0, September 2000: Page 3:
We can do exactly the same thing with sample means:
(a) we obtain a particular sample mean;
(b) we can represent this in terms of how different it is from the mean of its parent
population.
Relationship of a sample mean to the population mean:
Population mean
A particular sample mean
Means of samples will vary randomly, but most of them should be reasonably
close to the population mean. (e.g., 68% of sample means are likely to fall within +/- 1 s.d.
of the population mean, etc.).
Using z-scores, we could work out more precisely what proportion of sample
means are likely to be higher than our obtained sample mean; lower than it; etc.
If we obtain a sample mean that is much higher or lower than the population mean,
there are two possible reasons:
(a) our sample mean is a rare "fluke" (a quirk of sampling variation);
(b) our sample has not come from the population we thought it did, but from some
other, different, population.
A concrete example:
The human population I.Q. is 100.
Suppose we take a random sample of people and find their mean I.Q. is 170. This
sample mean (170) is much higher than the population mean (100).
There are two possible explanations:
(a) the sample is a fluke: by chance our random sample contained a large number
of highly intelligent people.
(b) the data do not come from the population we thought they did: our sample was
actually from a different population - e.g., aliens masquerading as humans.
Research Methods 1 Handouts, Graham Hole,COGS - version 1.0, September 2000: Page 4:
population mean I.Q. (100)
high
sample mean I.Q. (170)
frequency
low
This logic can be extended to the difference between two samples from the same
population:
A common experimental design:
Suppose we compare two groups of people: an experimental group and a control
group. The experimental group get a "wolfman" drug. The control group get a harmless
placebo.
The D.V. is the number of dog-biscuits consumed.
At the start of the experiment, they are two samples from the same population
("humans"). (We make sure this is the case by allocating people randomly to one
condition or the other, so that the only difference between them initially should be which
group they belong to).
At the end of the experiment, are they:
(a) still two samples from the same population (i.e., still two samples of "humans" our experimental treatment has left them unchanged);
OR
(b) now samples from two different populations - one from the "population of
humans" and one from the "population of wolfmen"?
We can decide between these alternatives as follows:
The differences between any two sample means from the same population are
normally distributed around a mean difference of zero.
Research Methods 1 Handouts, Graham Hole,COGS - version 1.0, September 2000: Page 5:
Most differences will be relatively small, since the Central Limit Theorem tells us
that most samples will have similar means to the population mean (and hence similar
means to each other). If we obtain a very large difference between our sample means, it
could have occurred by chance, but this is very unlikely - it is more likely that the two
samples come from different populations.
How large is "large"?
In psychology, we have a convention: if a difference would occur by chance only
5% of the time, then we assume that it has not arisen by chance, but instead reflects a
"real" difference between the two samples.
Possible differences between two sample means:
big difference between mean of sample
A and mean of sample B:
high
frequency
low
mean of sample A
low
sample means
mean of sample B
high
small difference between mean of
sample A and mean of sample B:
high
frequency
low
mean of sample A
low
mean of sample B
sample means
high
Research Methods 1 Handouts, Graham Hole,COGS - version 1.0, September 2000: Page 6:
Frequency distribution of differences between sample means:
high
frequency
low
mean A smaller
than mean B
no
difference
mean A bigger
than mean B
size of difference between sample means ( "sample mean A" minus "sample mean B")
A small difference between sample
means (quite likely to occur by
chance)
high
frequency
low
mean A smaller
than mean B
no difference
mean A bigger
than mean B
size of difference between sample means
A large difference between sample
means (quite unlikely to occur by
chance)
high
frequency
low
mean A smaller
than mean B
no difference
mean A bigger
than mean B
size of difference between sample means