Download 2.3 - 2.4

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Collecting Samples
Chapter 2.3 – In Search of Good Data
Mathematics of Data Management (Nelson)
MDM 4U
Why Sampling?




sampling is done because a census is too
expensive or time consuming
the challenge is being confident that the
sample represents the population accurately
convenience sampling occurs when you
simply take data from the most convenient
place (for example collecting data by walking
around the hallways at school)
convenience sampling is not representative
Random Sampling





representative samples involve random sampling
random events are events that are considered to
occur by chance
random numbers are described as numbers that
occur without pattern
random numbers can be generated using a
calculator, computer or random number table
random choice is used as a method of selecting
members of a population without introducing bias
1) Simple Random Sampling




this sample requires that all selections be
equally likely and that all combinations of
selections be equally likely
the sample is likely to be representative of
the population
but if it isn’t, this is due to chance
example: put entire population’s names in a
hat and draw them
2) Systematic Random Sampling



you decide to sample a fixed percent of the
population using some random starting point
and you select every nth individual
n in this case is determined by calculating the
sampling interval (population size ÷ sample
size)
example: you decide to sample 10% of 800
people. n = 800 ÷ 80 = 10, so generate a
random number between 1 and 10, start at
this number and sample each 10th person
3) Stratified Random Sampling



the population is divided into groups called
strata (which could be MSIPs or grades)
a simple random sample is taken of each of
these with the size of the sample determined
by the size of the strata
example: sample CPHS students by MSIP,
with samples randomly drawn from each
MSIP (the number drawn is relative to the
size of the MSIP)
4) Cluster Random Sampling



the population is ordered in terms of groups
(like MSIPs or schools)
groups are randomly chosen for sampling
and then all members of the chosen groups
are surveyed
example: student attitudes could be
measured by randomly choosing schools
from across Ontario, and then surveying all
students in those
5) Multistage Random Sampling


groups are randomly chosen from a
population, subgroups from these groups are
randomly chosen and then individuals in
these subgroups are then randomly chosen
to be surveyed
example: to understand student attitudes a
school might randomly choose one period,
randomly choose MSIPs during that period
then randomly choose students from within
those MSIPs
6) Destructive Sampling



sometimes the act of sampling will restrict the
ability of a surveyor to return the element to
the population
example: cars used in crash tests cannot be
used again for the same purpose
example: taking a standardized test
(individuals may acquire learning during
sampling that would introduce bias if they
were tested again)
Example: do students at CPHS want a
longer lunch? (sample 200 of 800 students)

Simple Random Sampling


Create a numbered, alphabetic list of students,
have a computer generate 200 names and
interview those students
Systematic Random Sampling



sampling interval n = 800 ÷ 200 = 4
generate a random number between 1 and 4
start with that number on the list and interview
each 4th person after that
Example: do students at CPHS want a
longer lunch?

Stratified Random Sampling

group students by grade and have a computer generate
a random group of names from each grade to interview
the number of students interviewed from each grade is
probably not equal, rather it is proportional to the size of
the group

if there were 180 grade 10’s, 180 ÷ 800 = 0.225

800 × 0.225 = 45 so we would need to interview 45

grade 10s
Example: do students at CPHS want a
shorter lunch?

Cluster Random Sampling



randomly choose enough MSIPs to sample 200
students
say there are 25 per MSIP, we would need 8
MSIPs, since 8 x 25 = 200
interview every student in each of these rooms
Example: do UCDSB high school students
want a shorter lunch?

Multi Stage Random Sampling





Randomly select 4 high schools in the UCDSB
Randomly choose a period from 1-5
randomly choose 2 MSIP classes of 25
interview every student in those MSIPs
200 students total
Sample Size



the size of the sample will have an effect on
the reliability of the results
the larger the better
factors:



variability in the population (the more variation,
the larger the sample required to capture that
variation)
degree of precision required for the survey
the sampling method chosen
Techniques for Experimental Studies



Experimental studies are different from
studies where a population is sampled as it
exists
in experimental studies some treatment is
applied to some part of the population
however, the effect of the treatment can only
be known in comparison to some part of the
population that has not received the
treatment
Vocabulary

treatment group


the part of the experimental group that receives
the treatment
control group

the part of the experimental group that does not
receive the treatment
Vocabulary

placebo



a treatment that has no value given to the control
group to reduce bias in the experiment
no one knows whether they are receiving the
treatment or not (why?)
double-blind test

in this case, neither the subjects or the
researchers doing the testing know who has
received the treatment (why?)
Class Activity





How would we take a sample of the students
in this class using the following methods:
a) 40% Simple Random Sampling
b) 20% Systematic Random Sampling?
c) 40% Stratified Random Sampling?
d) 50% Cluster Random Sampling?
MSIP / Homework


p. 99 #1, 5, 6, 10, 11
For 6b, see Ex. 1 on p. 95
Creating Survey Questions
Chapter 2.4 – In Search of Good Data
Mathematics of Data Management (Nelson)
MDM 4U
Surveys





A series of carefully designed questions
Commonly used in data collection
Types: interview, questionnaire, mail-in,
telephone, WWW, focus group
Bad questions lead to bad data (why?)
Good questions may create good data (why?)
Question Styles
Open Questions
 respondents answer in their own words (written)
 gives a wide variety of answers
 may be difficult to interpret
 offer the possibility of gaining data you did not know
existed
 sometimes used in preliminary collection of
information, to gain a sense of what is going on
 can clarify the categories of data you will end up
studying
Question Styles
Closed Questions
 questions that require the respondent to select from
pre-defined responses
 responses can be easily analyzed
 the options present may bias the result
 options may not represent the population and the
researcher may miss what is going on
 sometimes used after an initial open ended survey
as the researcher has already identified data
categories
Types of Survey Questions

Information


ex: Circle your Age: 16 17 18+
Checklist

ex: Courses currently being taken (check all
that apply):
□ Data Management
□ Advanced Functions
□ Calculus and Vectors
□ Other _________________
Types of Survey Questions

Ranking Questions



ex: rank the following in order of importance (1 =
most important, 3 = least important)
__ Work __ Homework __ Sports
Rating Questions
ex: How would you rate your teacher?
(choose 1)
□ Great □ Fabulous □ Incredible □ Outstanding

Questions should…





Be simple, relevant, specific, readable
Be written without jargon/slang,
abbreviations, acronyms, etc.
Not lead the respondents
Allow for all possible responses on closed Qs
Be sensitive to the respondents
MSIP / Homework

Complete p. 105 #1, 2, 4, 5, 8, 9, 12
References

Wikipedia (2004). Online Encyclopedia.
Retrieved September 1, 2004 from
http://en.wikipedia.org/wiki/Main_Page