Download Day 1

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Foundations of statistics wikipedia , lookup

Statistics wikipedia , lookup

Transcript
Day 1
Introduction to the Course

What is Statistics?
Statistics is the art and science of decision making. That is, the art and science of using
data to answer research questions.
Explore the following research question: Is Rock-Paper-Scissors a fair game?
o How can we decide if Rock-Paper-Scissors is a fair game?
o What assumptions are made?
o Model the Statistical Process:
 Define Research Question
 Collect Data
 Summarize Data
 Make Conclusions

Discuss Basic Definitions:
Sample vs. Population, Statistic vs. Parameter, Categorical vs. Numerical
Sampling



Weighing a "Random" Sample Activity (Gelman & Nolan, 2002)
If conclusions are to be valid, data must be collected according to a well-developed plan.
How can we choose elements for a sample?
o Simple Random Sampling
o Stratified Sampling
o Cluster Sampling
o Systematic Sampling
o Convenience Sampling
Random Rectangles Activity (Scheaffer, Watkins, Witmer, & Gnanadesikan, 2004)
Experimental Design


Jumping Frogs Activity (Scheaffer et al., 2004)
Principles of Experimental Design (Connect to Jumping Frogs Activity)
o Randomization
o Replication
o Control
o Other Important Concepts: Confounding, Experimental Unit
Observational vs. Experimental Studies

Discuss differences between each type of study. Why don't we always use randomized
experiments?
Graphical Displays (Connect to Jumping Frogs Activity)





Dot Plots
Stem and Leaf Plots
Histograms - Histogram Bin Width Applet (Garcia, Lima, Chance, Holmes, & Gill, n.d.)
Examples of Bad/Misleading Graphs
Describing Numerical Data Distributions:
o Shape of the Distribution
o Center of the Distribution
o Spread of the Distribution
Measures of Center

Mean vs. Median
o Demo 2: Mean and Median (Erickson, 2008)
o When is one more appropriate to use than the other?
Other Numerical Summaries (Connect to Jumping Frogs Activity)


Five Number Summary: Min, Q1, Median, Q3, Max
Boxplots
Measures of Spread

Variance
Compare data sets with different variances. For example,
Data Set 1: 20, 20, 20, 20, 20
Data Set 2: 18, 19, 20, 21, 22
Data Set 3: 1, 5, 7, 8, 79
Use a sample data set (e.g., Data Set 2) to derive its standard deviation by discussing how
you could measure the “average” distance between each point and the mean.



Calculate sample variance/standard deviation with a graphing calculator.
What Makes the Standard Deviation Larger or Smaller? Activity (delMas, 2001)
Demo 4: Transforming the Mean and Standard Deviation (Erickson, 2008)
Day 2
Random Variables


Definition, Examples
Discrete vs. Continuous Random Variables
Probability

Basic Probability Rules & Definitions (De Veaux, Velleman, & Bock, 2006, pp. 325327)
o Any probability cannot be less than zero or greater than one.
o Something Has to Happen Rule
o Complement Rule
o Addition Rule
o Equally Likely Rule
o Conditional probability is the probability of an event A occurring, given that
event B has already occurred. If two events are independent, then knowing that
event B has already occurred does not change the probability that event A occurs.
o Multiplication Rule

Examples (De Veaux et al., 2006)
o Problem 10, p. 334
o Problems 12 & 14, p. 334
o Problem 2, p. 398

Simulation to Estimate Probability
o Birthday Paradox Activity (National Council of Teachers of Mathematics, 2008)
Normal Distribution






Properties: Shape, Center, Spread
68-95-99.7 Rule: Problem 17, p. 123 (De Veaux et al., 2006)
Z-scores
Find probabilities and percentiles with a graphing calculator.
Standardized Test Reports: Discuss reported percentiles and percents.
Dear Abby Example (Larsen & Marx, 1990, pp. 333-334)
The following letter appeared in the "Dear Abby" column in the Tennessean (Nashville)
on January 20, 1973:
Dear Abby,
You wrote in your column that a woman is pregnant for 266 days. Who said so? I carried
my baby for ten months and five days, and there is no doubt about it because I know the
exact date my baby was conceived. My husband is in the Navy and it couldn't possibly
have been conceived any other time because I saw him only once for an hour, and I didn't
see him again until the day before the baby was born.
I don't drink or run around, and there is no way this baby isn't his, so please print a
retraction about the 266-day carrying time because otherwise I am in a lot of trouble.
San Diego Reader
Assume that the standard deviation of pregnancy length is 16 days. The San Diego
Reader’s 10 month, 5 day pregnancy is about the same as 310 days. How likely is it that a
pregnancy would last at least 310 days?
Sampling Distributions

Sampling Distributions of Sample Means and Variances: Dear Abby Activity
o Create a stem-and-leaf plot of 100 pregnancy lengths simulated from a Normal
model with a mean of 266 days and a standard deviation of 16 days. Students
select multiple random samples of pregnancy lengths (three of each size: n = 4
and n = 9) from this simulated population distribution. Students add their sample
averages and variances to classroom plots for each statistic and sample size
combination.
o Discuss the shape, center, and spread of each plot. How do these characteristics
vary with the sample size? statistic?

Sampling Distributions of Sample Proportions: Coins and Dice Activity
o Create a dot plot of student observations for each of the following random
variables:
 Number of heads observed in 20 flips of a coin.
 Number of {0, 1, 2}s observed in 20 rolls of a 10-sided die.
 Number of {0}s observed in 20 rolls of a 10-sided die.
o Discuss the shape, center, and spread of each plot. How do these characteristics
vary with the true probability of success? What happens when converted to
sample proportions?
Central Limit Theorem (CLT)


Cents and the Central Limit Theorem Activity (Scheaffer et al., 2004)
Summary of Sampling Distributions and the CLT
Day 3
Introduction to Confidence Intervals (CIs)

Spinning Pennies Activity (Scheaffer et al., 2004) adapted to use 68-95-99.7 Rule for
introducing confidence intervals
Confidence Intervals for Proportions



Land or Water? Activity (Gelman & Nolan, 2002)
Find confidence intervals for proportions with a graphing calculator.
Examples (De Veaux et al., 2006)
o Problem 13, p. 444
o Problem 24, p. 515
Margin of Error



In general, confidence intervals look like: Estimate ± Margin of Error.
Use Simulating Confidence Intervals for Population Parameter - Newer Version Applet
(Garcia et al., n.d.) to explore the following questions:
o What does 95% confidence really mean?
o What happens to the confidence interval when we change the confidence level?
o What happens to the confidence interval when the sample size changes?
Example: Problem 6, p. 443 (De Veaux et al., 2006)
Confidence Intervals for Means




Use recently discussed concepts of sampling distributions of sample means and
confidence intervals to brainstorm the general formula for confidence intervals for means.
Briefly explain why using t-distribution.
Find confidence intervals for means with a graphing calculator.
Examples (De Veaux et al., 2006)
o Problem 15, p. 541
o Problem 17, p. 541
o Problem 34 (a-c), p. 543
Hypothesis Tests (HTs) for Proportions



Ducks and Green - An Introduction to the Ideas of Hypothesis Testing Activity (Seier &
Robe, 2002)
Do hypothesis tests for proportions with a graphing calculator.
Examples (De Veaux et al., 2006)
o Problem 27, p. 469
o Problem 30, p. 470
o Problem 2, p. 467
Day 4
Hypothesis Tests for Means



Do hypothesis tests for means with a graphing calculator.
Examples (De Veaux et al., 2006)
o Problem 26, p. 542
o Problem 35 (a-d), pp. 543-544
More Practice with HTs and CIs for Proportions & Means (De Veaux et al., 2006)
o Problem 32, p. 543
o Problem 3, p. 596
o Problem 4, p. 596
o Problem 20, p. 599
o Problem 30, p. 601
Hypothesis Tests and Confidence Intervals: Relationship between the Two Methods

Discuss relationship between confidence intervals and hypothesis tests
Inference for Two Means, Independent Samples




Example: Problem 4, p. 565 (De Veaux et al., 2006)
Do hypothesis test and confidence interval for a difference between two independent
means with a graphing calculator.
Discuss difference between equal variance and unequal variance tests.
Example: Problem 7, p. 566 (De Veaux et al., 2006)
Inference for Two Means, Dependent Samples




Matched Pairs Activity
o Carry out an activity to compare two dependent samples. For example, average
number of snaps on dominant vs. non-dominant hand, average heart rate before
and after exercise, etc.
o Discuss appropriateness of previously discussed methods.
Dependent vs. Independent Samples
Do hypothesis test and confidence interval for mean difference with a graphing
calculator.
Examples (De Veaux et al., 2006)
o Problem 26, pp. 591-592
o Problem 15, p. 589
o Problem 19, pp. 598-599
Lab Time

Provide students with a data set for them to explore. Each group should create and answer
their own research questions.
Day 5
Group Presentations

Each group presents research questions and conclusions from previous day's explorations.
Inference for Two Proportions, Independent Samples



Hershey Kisses Activity
o Carry out a study to answer the research question:
Is the probability a Hershey's Special Dark Kiss lands on its base when spilled
from a cup different than the probability a Hershey's Milk Chocolate with
Almonds Kiss does?
Do hypothesis test and confidence interval for difference between proportions with a
graphing calculator.
Examples (De Veaux et al., 2006)
o Problem 30, page 510
o Problem 36, p. 516
Relationships between Two Categorical Variables



Contingency Tables
Use data from Problem 26, p. 630 (De Veaux et al., 2006) to discuss:
o Row and Column Percents
o Risks and Relative Risks
o Odds and Odds Ratios
Example: Problem 25, p. 630 (De Veaux et al., 2006)
Relationships between Two Numerical Variables



Scatterplots
o Explanatory and Response Variables; Independent and Dependent Variables
o Direction/Association (Positive vs. Negative), Pattern (Linear, etc.), Outliers,
Strength of Pattern
o Examples (De Veaux et al., 2006)
 Problem 2, p. 158
 Problem 6, p. 159
Correlation Coefficient, r
o Facts about r
o Correlation ⇏ Causation
o Examples (De Veaux et al., 2006)
 Problem 11, pg. 160
 Problem 25, p. 163
Least-Squares Regression Equation
o Simple Examples of Least Squares Activity (Gelman & Nolan, 2002)
o
o
o
o

Find regression statistics and least squares regression lines with a graphing
calculator.
Interpret the slope and intercept.
Lurking Variables (Levitt & Dubner, 2005, pp. 161-176)
Coefficient of Determination
 Class Examples, 4 - Price vs. Number of Pizza Toppings (De Veaux et al.,
2006, p. IG 8-C)
Barbie Bungee Activity (Zordak, 2008)
Wrap-Up

Summarize Big Ideas in Statistics
References
De Veaux, R. D., Velleman, P. F., & Bock, D. E. (2006). Introductory statistics: Instructor's
edition (2nd Ed.). Boston, MA: Pearson.
delMas, R. C. (2001). What makes the standard deviation larger or smaller? Statistics Teaching
and Resource Library. Retrieved from http://www.oercommons.org/courses/star-librarystandard-deviation/view
Erickson, T. (2008). Fifty fathoms: Statistics demonstrations for deeper understanding.
Emeryville, CA: Key Curriculum.
Garcia, F., Lima, C., Chance, B., Holmes, E., & Gill, R. (n.d.). Histogram bin width.
Rossman/Chance Applet Collection. Retrieved June 27, 2011, from
http://www.rossmanchance.com/applets/
Gelman, A., & Nolan, D. (2002). Teaching statistics: A bag of tricks. New York: Oxford
University.
Larsen, R. J., & Marx, M. L. (1990). Statistics: Annotated instructor's edition. Englewood Cliffs,
NJ: Prentice Hall.
Levitt, S.D., & Dubner, S. J. (2005). Freakonomics: A rogue economist explores the hidden side
of everything. New York: HarperCollins.
National Council of Teachers of Mathematics. (2008). Birthday paradox: Random integers on
TI83/TI84Plus. Retrieved from
http://illuminations.nctm.org/lessons/CommonSheets/RandInts-OV-UsingCalc.pdf
Scheaffer, R. L., Watkins, A., Witmer, J., & Gnanadesikan, M. (2004). Activity-based statistics:
Student guide (2nd ed., revised by Tim Erickson). Emeryville, CA: Key College
Publishing.
Seier, E., & Robe, C. (2002). Ducks and green - An introduction to the ideas of hypothesis
testing. Teaching Statistics, 24(3), 82-86.
Zordak, S. E. (2008). Barbie bungee. Retrieved from National Council of Teachers of
Mathematics Illuminations website:
http://illuminations.nctm.org/LessonDetail.aspx?id=L646