Download Powerpoint 5b

yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Statistics wikipedia , lookup

Chapter 9
Hypothesis Testing II
Chapter Outline
 Introduction
 Hypothesis Testing with Sample
Means (Large Samples)
 Hypothesis Testing with Sample
Means (Small Samples)
 Hypothesis Testing with Sample
Proportions (Large Samples)
Basic Logic
 We begin with a difference between
sample statistics (means or
 The question we test:
 “Is the difference between statistics
large enough to conclude that the
populations represented by the
samples are different?”
Basic Logic
 The H0 is that the populations are the
 There is no difference between the parameters
of the two populations
 If the difference between the sample
statistics is large enough, so that a
difference of this size is unlikely, assuming
that the H0 is true, we will reject the H0 and
conclude there is a difference between the
Basic Logic
 The H0 is a statement of “no difference”
 The 0.05 level will continue to be our
indicator of a significant difference
 We change the sample statistics to a Z
score, place the Z score on the sampling
distribution and use Appendix A to
determine the probability of getting a
difference that large if the H0 is true.
The Five Step Model
1. Make assumptions and meet test
2. State the H0.
3. Select the Sampling Distribution and
Determine the Critical Region.
4. Calculate the test statistic.
5. Make a Decision and Interpret
Example: Hypothesis Testing in the
Two Sample Case
 Problem 9.7b.
 Middle class families average 8.7 email
messages/wk (s = 3.1, N=125) and
working class families average
5.7(s=2.9, N=104) messages.
 The middle class families seem to use
email more but is the difference
Step 1 Make Assumptions and
Meet Test Requirements
 Model:
 Independent Random Samples
 The samples must be independent of each
 LOM is Interval Ratio
 Number of email messages has a true 0 and
equal intervals so the mean is an appropriate
 Sampling Distribution is normal in shape
 each N is 100 or more so the Central Limit
Theorem applies
Step 2 State the Null
 H0: μ1 = μ2
 The Null asserts there is no significant
difference between the populations.
Step 2 State the Alternative or
Research Hypothesis
 H1: μ1  μ2
 The research hypothesis contradicts the
H0 and asserts there is a significant
difference between the populations.
 This will be a two-tailed test. Why?
Step 3 Select the S. D. and
Establish the C. R.
 Sampling Distribution = Z distribution
because N1 + N2 = 100 or more
 Alpha (α) = 0.05
 Z (critical) = ± 1.96
 We could also use a t-distribution
 Degrees of freedom = N1+N22=125+104=229 t (critical) = 1.96
 But then we use formula 9.5 and 9.6
Step 4 Compute the Test
 Use Formula 9.4 to compute the
pooled estimate of the standard
 Use Formula 9.2 to compute the
obtained Z score.
Step 5 Make a Decision
 The obtained test statistic (Z = 7.5) falls in
the Critical Region so reject the null
 The difference between the sample means
is so large that we can conclude (at α =
0.05) that a difference exists between the
populations represented by the samples. (
 The difference between the email usage of
middle class and working class families is
Factors in Making a Decision
 The size of the difference (e.g.,
means of 8.7 and 5.7 for problem
 The value of alpha (the higher the
alpha, the more likely we are to
reject the H0
Factors in Making a Decision
 The use of one- vs. two-tailed tests
(we are more likely to reject with a
one-tailed test)
 The size of the sample (N). The
larger the sample the more likely we
are to reject the H0.
Significance Vs. Importance
 We test for significance whenever we
generalize from a sample to a
 Significance is not the same thing as
 Differences that are otherwise trivial or
uninteresting may be significant.
Significance Vs Importance
 A sample outcome could be:
 significant and important
 significant but unimportant
 not significant but important
 not significant and unimportant
Test of hypothesis between two
sample proportions
Appropriate only with large samples (N1
+N2 = 100 or more)
The logic is basically the same, but the
formulae get more and more complicated
See formulas 9.7, 9.8, 9.9,. 9.10 on p. 226.
Again, the basic idea to establish the
standard error (the standard deviation of
the sampling distribution) and then do the
five-step testing procedure.