Download week_6_data_collection_sampling

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Data Collection &
Sampling
Dr. Guerette
Gathering Data

Three ways a researcher collects data:
By asking questions
 By direct observation
 By using written records


The fundamental issue in data collection is its’
representativeness.
Logic of Probability Sampling


Allows researchers to generalize to unobserved
cases.
Conscious and Unconscious Sampling Bias

Bias in this case means that selections are not typical
or representative of the population from which they
are drawn. This could be result of personal beliefs or
many other reasons.
Logic of Probability Sampling

Representativeness and Probability of Selection

All members of a population have an equal chance
of being selected for a sample in probability
selections.
Probability Theory &
Sampling Distribution


Probability theory permits inferences about how
sampled data are distributed around the value
found in a larger population.
Sample Element

The unit about which information is collected and it
provides the basis of analysis. It is the grouping of
study elements.
Probability Theory &
Sampling Distribution

Population parameter


Sample statistic


The summary description of a given variable in the
population.
The summary description of a given variable in the
sample.
Sampling distribution

The range of sample statistics that would be
obtained when many samples are selected.
From Sampling Distribution to
Parameter Estimate

Sampling frame


Binomial variable


A list of elements in the population
A variable that has only two values
Estimating sampling error

When independent random samples are selected from a
population and sample statistics are calculated from those
samples they will be distributed around the population
parameter in a known way.
From Sampling Distribution to
Parameter Estimate

Standard error


Is the way of estimating how closely the sample
statistics are clustered around the true value.
Confidence levels and Confidence Intervals

Use probability theory to indicate sample estimates
that fall within one, two or three standard errors of
the parameter.
Standard Error Diagram
Source: Jeremy Kemp (2005)
SE = √[p(1-p)/n]
Or
SE = √[p x q/n]
In Class Exercise –
Probability Sampling


You have selected a probability sample and want to
determine the sampling error in order to estimate the
population parameter. In your sample you compute that
70 percent of your sample opposes establishing a
hurricane relief fund derived from Miami-Dade tax
dollars, while 30 percent favor such a fund. You had a
sample of 400 (N = 400).
Report your confidence level and confidence interval.
Probability Sampling

Simple random sampling


Systematic sampling


All elements in the population have an equal chance
of being selected for the sample.
Drawing a sample of every Nth element in the
population.
Stratified sampling

Based upon prior knowledge of a population, a
sample is drawn that will offer a greater degree of
representativeness.
Probability Sampling

Disproportionate Stratified Sampling


Specifically produce samples that are not
representative of a population on some variable.
Multistage Cluster Sampling

Used when populations cannot easily be listed for
sampling purposes. Generally involves geographic
dispersion. While this technique increases efficiency
it decreases accuracy.
Probability Sampling

Multistage Cluster Sampling with Stratification

Used to increase the homogeneity of the sample.
Non-Probability Sampling


Used when the likelihood of any given element
will be selected is not known (e.g. when random
probability sampling is not possible).
Purposive or Judgemental Sampling


Selection based upon prior knowledge of the
population.
Quota sampling

Uses a matrix or table to describe the characteristics
of the population and the sample is drawn to reflect
the cells of the matrix.
Non-Probability Sampling

Reliance on available subjects


Using people that are readily available seldom
produces data that have great value.
Snowball Sampling

Begins by identifying a single member of a
population and then having that subject identify
others like him/her.
Related documents