Download Statistical Sampling

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Bootstrapping (statistics) wikipedia , lookup

Resampling (statistics) wikipedia , lookup

Taylor's law wikipedia , lookup

Sampling (statistics) wikipedia , lookup

Student's t-test wikipedia , lookup

Transcript
Statistical Sampling
Sample
A subset of units selected from
the population to represent it.
Hopefully it is representative
Reasons to Sample - review
 Reduced Cost
 For populations having an extremely large number of individuals (like forest trees),
measuring each individual would be impractical. Neither the time nor the money is
available to do such an inventory. Additionally, the inventory costs of doing such an
inventory would exceed the value of the sale.
 Greater Speed
 Sampling reduces the magnitude of the job, allowing the task to be completed in a
shorter period of time. Not many timber sales would make it to market if the volume is
measured on all of the trees and it takes ten months to do the sale prep work.
 Greater Scope
 Sampling provides the ability to study a larger area and include diverse information
about the population. Additional information such as species present in the population,
type of defect, or other conditions in the population can be included when sampling is
used.
 Greater Accuracy
 Often overlooked is the quality of work suffers when budgets and resources are
stretched too thin. Good measurements on a sample of individuals provide more
reliable information than bad measurements on the entire population.
Sampling Types
Simple Random Sampling
 Every possible combination of sample units has an equal and
independent chance of being selected.
 However…
Systemic Sampling
 Beware coincidental bias of sample interval and natural area.
 Ridges
 River bends
 Etc.
Stratified Random Sampling
The point is to reduce variability within strata.
Example: if you were measuring average
estrogen levels in humans, you would stratify
male versus female.
Can you think of some forest examples?
Population sampling
Involves studying a portion of the population to
gain information about the entire population.
When applying sample information to the
population, it is called making an inference.
Inferences include estimation, prediction,
hypothesis testing and determining relationships.
Populations and Samples
Sample Size
The number of units selected randomly from
the population for observation.
Sample size is designated by the lower case n
Sample Total
The sum of the values for all units in the sample
Sample Mean
 The sum of the individual unit values divided by the sample size.
Variability
The differences
between individuals
or units in a
population
Taking
measurements on
these two stands
should result in
values that reflect
their differences.
Sample Variance
 Estimates of the variability of the population. This is a measure of
how closely the observations are clustered around the mean. The
notation for the variance is usually shown as lower case sigma
squared with a caret or as lower case s squared or a capital V. All
squared differences are summed and then divided by the sample
size minus one. Both equations below give the same answer.
Sample Standard Deviation
 Equals the square root of the
variance.
 Returns the variability index to the
units of the mean.
 The notation can be lower case
sigma with a caret, lower case s or
SD.
In Microsoft Excel the function is…
=STDEV(A1:An) or =STDEV.S(A1:An)
Sample Coefficient of Variation
 Because populations with large means tend to have larger
standard deviations than those with small means, the
coefficient of variation permits a comparison of relative
variability about different-sized means. The sample
coefficient of variation is an expression of the standard
deviation as a percentage of the mean. It is usually
represented by upper case CV. It is calculated as the
standard deviation divided by the mean multiplied by 100 to
convert to a percentage.
Sampling Summary
In Excel
=AVERAGE(A1:An)
mean of the
squared deviations
Square root of
variance
Exercise in Random Sampling
 Student heights equals population
 Calculate population mean, etc.
 Take a systemic 20% sample compare estimates of population.
 Take a 50% sample (systemic or random) and compare results.
 Calculate mean, variance, SD and CV of both population and samples.