Download Overview Section 11-1 & 11-2 Overview and Multinomial Experiments: Goodness of Fit

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Statistics wikipedia , lookup

History of statistics wikipedia , lookup

Transcript
Section 11-1 & 11-2
Overview and Multinomial
Experiments: Goodness of Fit
Key Concept
Overview
We focus on analysis of categorical (qualitative
or attribute) data that can be separated into
different categories (often called cells).
Use the χ2 (chi-square) test statistic (Table A- 4).
The goodness-of-fit test uses a one-way
frequency table (single row or column).
Given data separated into different
categories, we will test the hypothesis that
the distribution of the data agrees with or
“fits” some claimed distribution.
The hypothesis test will use the chi-square
distribution with the observed frequency
counts and the frequency counts that we
would expect with the claimed distribution.
The contingency table uses a two-way frequency
table (two or more rows and columns).
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
1
Definition
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
2
Slide
4
Example: Last Digits of Weights
Multinomial Experiment
This is an experiment that meets the following
conditions:
When asked, people often provide
weights that are somewhat lower
than their actual weights. So how
can researchers verify that weights
were obtained through actual
measurements instead of asking
subjects?
1. The number of trials is fixed.
2. The trials are independent.
3. All outcomes of each trial must be classified
into exactly one of several different
categories.
4. The probabilities for the different categories
remain constant for each trial.
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
3
Example: Last Digits of Weights
Example: Last Digits of Weights
Test the claim that the digits in Table 11-2 do not
occur with the same frequency.
Table 11-2
summarizes the last
digit of weights of 80
randomly selected
students.
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Verify that the four conditions of a
multinomial experiment are satisfied.
1. The number of trials (last digits) is the fixed number 80.
2. The trials are independent, because the last digit of
any individual’s weight does not affect the last digit of any
other weight.
3. Each outcome (last digit) is classified into exactly 1 of
10 different categories. The categories are 0, 1, … , 9.
4. Finally, in testing the claim that the 10 digits are
equally likely, each possible digit has a probability of 1/10,
and by assumption, that probability remains constant for
each subject.
Slide
5
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
6
Definition
Goodness-of-Fit Test
Notation
Goodness-of-fit Test
A goodness-of-fit test is used to
test the hypothesis that an
observed frequency distribution
fits (or conforms to) some claimed
distribution.
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
7
Expected Frequencies
represents the expected frequency of an outcome.
k
represents the number of different categories or
outcomes.
n
represents the total number of trials.
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
8
E=np
the sum of all observed frequencies
divided by the number of categories
Slide
Each expected frequency is found by
multiplying the sum of all observed
frequencies by the probability for the
category.
9
Goodness-of-Fit Test in
Multinomial Experiments
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
10
Goodness-of-Fit Test in
Multinomial Experiments
Requirements
Test Statistics
1. The data have been randomly selected.
2. The sample data consist of frequency counts
for each of the different categories.
3. For each category, the expected frequency is at
least 5. (The expected frequency for a category
is the frequency that would occur if the data
actually have the distribution that is being
claimed. There is no requirement that the
observed frequency for each category must be
at least 5.)
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
E
If expected frequencies are not all
equal:
n
k
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
represents the observed frequency of an outcome.
Expected Frequencies
If all expected frequencies are equal:
E=
O
Slide
χ2 = Σ
(O – E)2
E
Critical Values
1. Found in Table A-4 using k – 1 degrees of
freedom, where k = number of categories.
2. Goodness-of-fit hypothesis tests are always
right-tailed.
11
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
12
Relationships Among the χ2 Test
Statistic, P-Value, and Goodness-of-Fit
Goodness-of-Fit Test in
Multinomial Experiments
Figure 11-3
A close agreement between observed and
expected values will lead to a small value of χ2
and a large P-value.
A large disagreement between observed and
expected values will lead to a large value of χ2
and a small P-value.
A significantly large value of χ2 will cause a
rejection of the null hypothesis of no difference
between the observed and the expected.
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
13
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Example: Last Digit Analysis
Example: Last Digit Analysis
Test the claim that the digits in Table 11-2
do not occur with the same frequency.
Test the claim that the digits in Table 11-2
do not occur with the same frequency.
H0: p0 = p1 = … = p9
Slide
14
Slide
16
Because the 80 digits
would be uniformly
distributed through
the 10 categories, each
expected frequency
should be 8.
H1: At least one of the
probabilities is different
from the others.
α = 0.05
k–1=9
χ2.05, 9 = 16.919
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
15
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Example: Last Digit Analysis
Example: Last Digit Analysis
Test the claim that the digits in Table 11-2
do not occur with the same frequency.
Test the claim that the digits in Table 11-2
do not occur with the same frequency.
From Table 11-3, the test statistic is χ2 = 156.500.
Since the critical value is 16.919, we reject the
null hypothesis of equal probabilities.
There is sufficient evidence to support the claim
that the last digits do not occur with the same
relative frequency.
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
17
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
18
Example: Detecting Fraud
Unequal Expected Frequencies
Example: Detecting Fraud
In the Chapter Problem, it was noted that statistics can
be used to detect fraud. Table 11-1 lists the percentages
for leading digits from Benford’s Law.
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
19
Example: Detecting Fraud
Test the claim that there is a significant discrepancy
between the leading digits expected from Benford’s
Law and the leading digits from the 784 checks.
Observed Frequencies and Frequencies Expected with
Benford’s Law
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
20
Example: Detecting Fraud
Test the claim that there is a significant discrepancy
between the leading digits expected from Benford’s
Law and the leading digits from the 784 checks.
Test the claim that there is a significant discrepancy
between the leading digits expected from Benford’s
Law and the leading digits from the 784 checks.
H0: p1 = 0.301, p2 = 0.176, p3 = 0.125, p4 = 0.097, p5 = 0.079,
p6 = 0.067, p7 = 0.058, p8 = 0.051 and p9 = 0.046
The test statistic is χ2 = 3650.251.
H1: At least one of the proportions is different from the
claimed values.
Since the critical value is 20.090, we reject the null
hypothesis.
α = 0.01
There is sufficient evidence to reject the null
hypothesis. At least one of the proportions is
different than expected.
k–1=8
χ2.01,8 = 20.090
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
21
Example: Detecting Fraud
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
22
Example: Detecting Fraud
Test the claim that there is a significant discrepancy
between the leading digits expected from Benford’s
Law and the leading digits from the 784 checks.
Figure 11-5
Figure 11-6 Comparison of Observed Frequencies and
Frequencies Expected with Benford’s Law
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
23
Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.
Slide
24