Download AP Stats Notes

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Confidence interval wikipedia , lookup

Psychometrics wikipedia , lookup

Taylor's law wikipedia , lookup

Bootstrapping (statistics) wikipedia , lookup

Foundations of statistics wikipedia , lookup

Statistical hypothesis testing wikipedia , lookup

Omnibus test wikipedia , lookup

Resampling (statistics) wikipedia , lookup

Student's t-test wikipedia , lookup

Misuse of statistics wikipedia , lookup

Transcript
AP Statistics Notes
10.2 Day 1: Tests of Significance
Name_______________________________
Date__________________________
Ex 1: Confidence Intervals on the Graphing Calculator
 A manufacturer of high-resolution video terminals must control the tension on
the mesh of fine wires that lies behind the surface of the viewing screen.
Careful study has shown that when the process is operating properly, the
standard deviation of the tension readings is σ = 43 mV. Here are the tension
readings from an SRS of 20 screens from a single day’s production
269.5
297.0
269.6
283.3
304.8
280.4
233.5
257.4
317.5
327.4
264.7
307.7
310.0
343.3
328.1
342.6
338.8
340.1
374.6
336.1
 Construct a 90% confidence interval using your calculator.
Confidence Intervals vs. Tests of Significance
 You use confidence intervals when you are trying to estimate ______________
_________________________.
 You use ________________________________ when you are trying to assess
the evidence provided by data about some claim concerning a population.
Ex 2: I’m a great free-throw shooter
 Let’s say that I claim I can make 80% of my basketball free-throws.
 You challenge me to make 20 free throws and prove to you that I can shoot
80%.
 I only make 8 out of 20 free throws.
 The probability that I would only make 8 out of 20 free-throws if I really was
an 80% free-throw shooter is 0.0001.
Ex 3: Call the Paramedics!
 Accidents can result in serious injuries to drivers and passengers. Slow
paramedic response time can have serious effects.
 In one city, the mean response time to all accidents last year was μ = 6.7
minutes with a standard deviation of σ = 2 minutes.
 At the end of the following year, the city manger selects a SRS of 400 calls and
examines the response times.
 For this sample, the mean response time was x-bar = 6.48 minutes.
 Does this data provide good evidence that response times have decreased since
last year?
1
 If the claim that μ = 6.7 minutes is true, the sampling distribution of x-bar will
be approximately Normal (CLT) with mean
μ = 6.7 minutes and standard
deviation 0.10 minutes.
Null and Alternative Hypotheses
 The statement being tested in a significance test is called the null hypothesis.
(Ho) The significance test is designed to ______________________________
__________________________________________. Usually the null
hypothesis is a statement of “no effect,” or no change from historical values.
 The claim about the population that we are ___________________________
is the alternative hypothesis. (Ha)
What are the hypotheses for the paramedic example?
 What is the null hypothesis?
 What is the alternative hypothesis?
Population vs. Sample
 Hypotheses always refer to some population, not to a particular outcome.
 Be sure to state Ho and Ha in terms of the population, not the sample.
 It doesn’t make sense to state hypotheses about something you already know,
i.e. the sample.
 On the AP exam, you will be penalized if you state the hypothesis using x-bar.
You must state the hypotheses in terms of μ!
One-Sided vs. Two-Sided
 It is not always clear, in particular, whether Ha should be one-sided or twosided.
 If you do not have a specific direction firmly in mind in advance, use a twosided alternative.
2
Ex 4: Job Satisfaction
 Does the job satisfaction of assembly workers differ when their work is
machine-paced rather than self-paced?
 One study chose 18 subjects at random from a group of people who assembled
electronic devices.
 Half of the subjects were assigned at random to each of two groups. Both
groups did similar assembly work, but one work setup allowed workers to pace
themselves, and the other featured an assembly line that moved at fixed time
intervals so that the workers were paced by machine.
 After two weeks, all subjects took the Job Diagnosis survey (JDS), a test of job
satisfaction.
 Then they switched work setups, and took the JDS again after two more weeks.
 What type of design is illustrated by this example?
 What is the parameter of interest?
 What is the null hypothesis?
 What is the alternative hypothesis?
Don’t Cheat!
 The alternative hypothesis Ha should express the hopes or suspicions we have
before we see the data.
 It is cheating to first look at the data and then frame Ha to fit what the data
show.
Conditions for Significance Tests
1) SRS from the population of interest
2) Normality
 For means, the population distribution must be normal or the sample
size must be large (n ≥ 30)
 For proportions, the population must be at least ten times larger than
the sample
3) Independent observations
Test Statistics
 A significance test uses data in the form of a test statistic.
3
Ex 5: Call the Paramedics!
 Find the test statistic given that Ho: μ = 6.7 minutes,
the estimate of μ is x-bar = 6.48 minutes, and the standard deviation σ = 2
minutes.
 Because the sample result is over two standard deviations below the
hypothesized mean 6.7, it gives good evidence that the response time is
probably less than 6.7 minutes.
P-values
 The probability, computed assuming that Ho is true, that the observed outcome
would take a value as extreme as or more extreme than that actually observed
is called the P-value of the test.
 The smaller the P-value is, the stronger the evidence against Ho provided by
the data.
Ex 6: Call the Paramedics
Ex 7: Calculating a two-sided P-value

Suppose that the z test statistic for a two-sided test is z = 1.7. The two sided pvalue is the probability that Z ≤ -1.7 or Z ≥ 1.7.
Significance Level




Small P-values are evidence against Ho.
How small must a P-value be to persuade us?
There is no fixed rule, but the generally accepted rule is 0.05.
In general, a result with a small P-value, say less than 0.05, is called
statistically significant.
4
Statistical Significance
 If the P-value is as small as or smaller than alpha, we say that the data are
statistically significant at a level of α .
 Typically, we will choose 0.05 to represent alpha, thus making a p-value of
0.05 or less statistically significant.
 But, if we decided to make alpha 0.01, a p-value of 0.05 would not be
statistically significant.
(we can decide on a different alpha)
Significance Tests
 To test a claim about an unknown population parameter:
Step 1: Hypotheses
Step 2: Conditions
Step 3: Calculations
Step 4: Sentence
Ex 8: Can you balance your checkbook?
 In a discussion of the education level for the American workforce, someone
says, “The average young person can’t even balance a checkbook.” The
NAEP survey says that a score of 275 or higher on its quantitative test reflects
the skill needed to balance a checkbook. The NAEP random sample of 840
young Americans had a mean score of x-bar = 272, a bit below the checkbookbalancing level. Is this sample result good evidence that the mean for all
young people is less than 275? Assume that σ = 60.
5
Step 1: State the Hypotheses
Step 2: Choose the appropriate inference procedure and verify conditions.
 Since σ is known, we will use a one-sample z test for a population mean.
1) The data come from an SRS from the population of interest.
2) The sampling distribution of x-bar is approximately normal (n = 840)
by the CLT.
3) Observations are independent since the entire population of young
adults is more than 10 times larger than the sample.
Step 3: Calculations
Step 4: Interpret your results in context.
6