Download H 0 - iuma.ulpgc

Document related concepts

Student's t-test wikipedia , lookup

Transcript
Chapter
Thirteen
Hypothesis
Testing
Hypotheses Testing
• Oversimplified or incorrect assumptions must
be subjected to more formal hypothesis
testing
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 2
Interesting Hypotheses
• Bankers assumed high-income earners are more
profitable than low-income earners
• Clients who carefully balance their checkbooks every
month and minimize fees due to overdrafts are
unprofitable checking account customers
• Old clients were more likely to diminish CD balances
by large amounts compared to younger clients
– This was nonintutive because conventional wisdom
suggested that older clients have a larger portfolio of
assets and seek less risky investments
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 3
Data Analysis
• Descriptive
– Computing measures of central tendency and
dispersion,as well as constructing one-way
tables
• Inferential
– Data analysis aimed at testing specific
hypotheses is usually called inferential
analysis
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 4
Null and Alternative Hypotheses
H0 -> Null Hypotheses
Ha -> Alternative Hypotheses
• Hypotheses always pertain to population
parameters or characteristics rather than to
sample characteristics. It is the population,
not the sample, that we want to make an
infernece about from limited data
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 5
Steps in Conducting a Hypothesis Test
• Step 1. Set up H0 and Ha
• Step 2. Identify the nature of the sampling
distribution curve and specify the appropriate
test statistic
• Step 3. Determine whether the hypothesis
test is one-tailed or two-tailed
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 6
Steps in Conducting a Hypothesis Test
(Cont’d)
• Step 4. Taking into account the specified significance
level, determine the critical value (two critical values
for a two-tailed test) for the test statistic from the
appropriate statistical table
• Step 5. State the decision rule for rejecting H0
• Step 6. Compute the value for the test statistic from
the sample data
• Step 7. Using the decision rule specified in step 5,
either reject H0 or reject Ha
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 7
Launching a Product Line Into a
New Market Area
• Karen, product manager for a line of apparel, to
introduce the product line into a new market area
• Survey of a random sample of 400 households in that
market showed a mean income per household of
$30,000. Karen strongly believes the product line will
be adequately profitable only in markets where the
mean household income is greater than $29,000.
Should Karen introduce the product line into the new
market?
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 8
Karen’s Criterion for Decision Making
• To reach a final decision, Karen has to make
a general inference (about the population)
from the sample data
• Criterion: mean income across across all
households in the market area under
consideration
• If the mean population household income is
greater than $29,000, then Karen should
introduce the product line into the new market
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 9
Karen’s Hypothesis
• Karen’s decision making is equivalent to
either accepting or rejecting the hypothesis:
– The population mean household income in the
new market area is greater than $29,000
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 10
One-Tailed Hypothesis Test
• The term one-tailed signifies that all - or zvalues that would cause Karen to reject H0,
are in just one tail of the sampling distribution
 -> Population Mean
H0:   $29,000
Ha:   $29,000
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 11
Type I and Type II Errors
• Type I error occurs if the null hypothesis is
rejected when it is true
• Type II error occurs if the null hypothesis is
not rejected when it is false
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 12
Significance Level
• -> Significance level—the upper-bound
probability of a Type I error
• 1 -  ->confidence level—the complement of
significance level
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 13
Summary of Errors Involved in
Hypothesis Testing
Inference
Based on
Sample Data
H0 is True
H0 is False
Real State of Affairs
H0 is True
Correct decision
Confidence level
= 1- 
H0 is False
Type II error
P (Type II error) = 
Correct decision
Type I error
Significance level Power = 1-
= *
*Term  represents the maximum probability of
committing a Type I error
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 14
Level of Risk
• Two firms considering introducing a new product that
radically differs from their current product line
– Firm ABC
• Well-established customer base, distinct reputation for its
existing product line
– Firm XYZ
• No loyal clientele, no distinct image for its present
products
• Which of these two firms should be more cautious in
making a decision to introduce the new product?
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 15
Scenario - Firms ABC & XYZ
• Firm ABC
– ABC should be more cautious
• Firm XYZ
– XYZ should be less cautious
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 16
Exhibit 13.1 Identifying the Critical Sample
Mean Value – Sampling Distribution
Sample mean (x) values greater than $29,000--that is x-values on the right-hand side
of the sampling distribution centered on µ = $29,000--suggest that H0 may be false.
More important the farther to the right x is , the stronger is the evidence against H0
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 17
Karen’s Decision Rule for Rejecting
the Null Hypothesis
• Reject H0 if the sample mean exceeds xc
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 18
Criterion Value
Every mean x has a corresponding equivalent
standard Normal Deviate:
The expression for z
x-
Z = --------sx
x =  + zsx
Substituting xc for x and zc for z
xc =  + zcsx where zc is standard normal deviate
corresponding to the critical sample mean, xc.
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 19
Computing the Criterion Value
Standard deviation for the sample of 400 households is
$8,000. The standard error of the mean (sx ) is given by
S
s x = ---- = $400
n
Critical mean household income xc through the
following two steps:
1. Determine the critical z-value, zc. For  =.05, From
Appendix 1, zc = 1.645.
2. Substitute the values of zc, s, and  (under the assumption
that H0 is "just" true ), xc =  + zc s = $29,658.
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 20
Karen’s Decision Rule
• If the sample mean household income is
greater than $29,658, reject the null
hypothesis and introduce the product line into
the new market area.
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 21
Test Statistic
The value of the test statistic is simply the zvalue corresponding to = $30,000.
x-
Z = ------ = 2.5
s
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 22
Exhibit 13.2 Critical Value for Rejecting
the Null Hypothesis
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 23
P - Value – Actual Significance Level
• The probability of obtaining an x-value as high as
$30,000 or more when  is only $29,000 = .0062
• This value is sometimes called the actual significance
level, or the p-value
• The actual significance level of .0062 in this case
means the odds are less than 62 out of 10,000 that
the sample mean income of $30,000 would have
occurred entirely due to chance (when the population
mean income is $29,000 or less)
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 24
T-test
Conduct T-Test when sample is small
Let the sample size, n = 25
X = $30,000 , s = $8,000
From the t-table in Appendix 3, tc = 1.71 for  = .05
and d.f. = 24.
Decision rule: “Reject H0 if t  1.7l.”
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 25
T-test (Cont’d)
The value of t from the sample data:
S = 8000/25 = $1,600
x-
t = ------ = 0.625
sx
The computed value of t is less than 1.71, H0 cannot
be rejected.
Karen should not introduce the product line into the
new market area.
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 26
Two-Tailed Hypothesis Test
• Two-tailed test is one in which values of the
test statistic leading to rejectioin of the null
hypothesis fall in both tails of the sampling
distribution curve
H0 :  = $29,000
Ha :   $29,000
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 27
Test of Two Means
• A health service agency has designed a public
service campaign to promote physical fitness and the
importance of regular exercise. Since the campaign
is a major one, the agency wants to make sure of its
potential effectiveness before running it on a national
scale
– To conduct a controlled test of the campaign’s
effectiveness, the agency needs two similar cities
– The agency identified two similar cities
• city 1 will serve as the test city
• city 2 will serve as a control city
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 28
Test of Two Means (Cont’d)
• Random survey of was conducted to measure the
average time per day a typical adult in each city
spent on some form of exercise
– 300 adults in city 1,
– 200 adults in city 2
• Results of the survey :
– average was 30 minutes per day (with a standard
deviation of 22 minutes) in city 1
– Average was 35 minutes per day (with a standard
deviation of 25 minutes) in city 2
• Question
– From these results, can the agency conclude
confidently that the two cities are well matched for the
controlled test?
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 29
Basic Statistics and Hypotheses
City 1: n1 = 300
x1 = 30 s1 = 22
City 2: n2 = 200
x2 = 35 s2 = 25
The hypotheses are
H0: 1 =2
or
1 -2 = 0
Ha: 1  2
or
1 -2  0
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 30
Test Statistic
Test statistic is the z-statistic, given by
z
=
(x1 - x 2) - (1 - 2 )
------------------------------ s12/n1 + s22/n2
n1 and n2 are greater than 30.
The z-statistic can therefore be used as the
test statistic.
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 31
Decision – Two-Tailed Test
• For Two-Tailed tests
– Identify two critical values of z, one for each tail of the
sampling distribution
– The probability corresponding to each tail is .025, since
 = .05
– From the Normal Table, the z-value, for /2 =.025 is
1.96
• Decision rule : “Reject H0 if z  -1.96 or if z  1.96.”
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 32
Computing Z-value – Two-Tailed Test
Computing the value of z from the survey results
and under the customary assumption that the null
hypothesis is true (i.e., 1 - 2 = 0):
z=
(30 - 35) - (0)
--------------------------------- = -2.29
(22)2/300 + (25)2/200
Since z  -1.96, we should reject H0.
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 33
Exhibit 13.5 Hypothesis Test Related to
Mean Exercising in Two Cities
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 34
T- Test for Independent Samples
Test statistic
(x1 - x2) - (1 - 2 )
t = ------------------------s* (  1/n1 + 1/n2 )
with d.f. = n1 + n2 - 2. In this expression, s* is the pooled
standard deviation, given by
(n1 – 1)s12 + (n2 – 1)s22
s* = --------------------------------n1 + n2 - 2
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 35
T- Test for Independent Samples Two Cities
n1 = 20
n2 = 10
x1 = 30
x2 = 35
s1 = 22
s2 = 25
The degrees of freedom for the t-statistic are
d.f. = 28
Critical value of t with 28 d.f for a tail probability
of .025 is 2.05.
Decision rule : “Reject H0 if t  -2.05 or if t 
2.05." The pooled standard deviation is
s* =  529 (approximately) = 23
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 36
T- Test for Independent Samples
The test statistic is
t = -.56
Since t is neither less than -2.05 nor greater than 2.05,
we cannot reject H0
The sample evidence is not strong enough to conclude
that the two cities differ in terms of levels of
exercising activity of their residents.
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 37
National Insurance Company Study – Perceived Service
Quality Differences Between Males and Females
• Test of Two Means Using the SPSS T-TEST
Program
– On the 10-point scale,
• males gave a mean rating of approximately
7.87
• females gave a mean rating of approximately
7.83.
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 38
National Insurance Company Study – Perceived Service
Quality Differences Between Males and Females
• In SPSS,
1. Select ANALYZE from the menu,
2. Click COMPARE MEANS
3. Select INDEPENDENT-SAMPLES T -TEST
4. Move “OQ – Over all Service Quality” to the “TEST
VARIABLES(S)” box
5. Move “gender” to “GROUPING VARIABLE” box
6. DEFINE GROUPS (SEX = 1 for male and 2 for female)
7. Click OK.
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 39
National Insurance Company Study – Perceived Service
Quality Differences Between Males and Females
OQ – Overall Perceived Service Quality
Copyright © Houghton Mifflin Company. All rights reserved.
Gender – Sex = 1 for male
Sex = 2 for female
13 | 40
National Insurance Company Study – Perceived Service
Quality Differences Between Males and Females
Group Statistics
OQ
gender
male
female
N
137
126
Copyright © Houghton Mifflin Company. All rights reserved.
Mean
7.87
7.83
Std. Deviation
2.26
2.31
Std. Error
Mean
.19
.21
13 | 41
National Insurance Company Study – Perceived Service
Quality Differences Between Males and Females
P-value >  = 0.05 -- Do
not Reject, Equal variance
assumed is correct
Use this
row
when the
null
hypothesi
s of
equality
of
variance
is
rejected
F-Test--to see if the variance of the
2 groups are assumed to be equal
p-value = .210 --> null hypothesis
cannot be rejected at  = 0.05
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 42
National Insurance Company Study – Perceived Service
Quality Differences Between Males and Females
P-value=.88 is
greater than the  =
of 0.05.
Do
not reject Ho.
The p-value implies that the odds are 88 to 100
that a difference of magnitude .04 (i.e., 7.87 7.83) could have occurred from chance. The null
hypothesis cannot be rejected at the customary
significance level of .05.
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 43
Test of Two Means When
Samples Are Dependent
• The need to check for significant differences
between two mean values when the samples
are not independent
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 44
Test of Two Means When
Samples Are Dependent (Cont’d)
• A retail chain ran a special promotion in a
representative sample of 10 of its stores to
boost sales
• Weekly sales per store before and after the
introduction of the special promotion are
shown
• Did the special promotion lead to a significant
increase in sales?
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 45
Sales Per Store Before and After a
Promotional Campaign
Sales per Store (In Thousands)
Store
Before
Number (i) Promotion
(xbi )
After
Promotion
(xai )
Change in
Sales (In
Thousands)
xdi = xai - xbi
1
250
260
10
2
3
4
5
235
150
145
120
240
151
140
124
5
1
-5
4
6
98
100
2
75
85
180
212
70
95
200
220
-5
10
20
8
50
7
8
9
10
Total
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 46
Test of Two Means When
Samples Are Dependent (Cont’d)
One-Tailed Hypothesis Test:
H0: d  0; Ha: d  0.
The sample estimate of d is xd, given
by
n
Xdi
i=1
xd = ----n
where n is the sample size.
xd = 50/10 = 5
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 47
Test of Two Means When
Samples Are Dependent (Cont’d)
Test statistic is
t =
xd - 
----------- = 2.10
s/n
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 48
Test of Two Means When
Samples Are Dependent (Cont’d)
Standard deviation (s) = 7.53,  = 0.05,
tc for 9 d.f = 1.83 from the Appendix 3
Decision rule: “Reject H0 if t  1.83.”
Test Statistic, t  1.83, we reject H0 and
conclude that the mean change in sales per
store was significantly greater than zero.
The special promotion was indeed effective.
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 49
Exhibit 13.6 Hypothesis Test Related to
Change in Weekly Sales Per Store
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 50
Test for a Single Proportion
• Ms.Jones wants to substantially increase her firm's
advertising budget.
• The firm sells a variety of personal computer
accessories
• Random sample : 20/100 (20%) know the brand
name
• True awareness rate for the brand name across all
personal computer owners is less than .3
• Should Ms. Jones increase the advertising budget on
the basis of survey results?
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 51
Test for a Single Proportion (Cont’d)
• Need to test the population proportion of
personal computer owners who are aware of
the brand:
H0:   .3
Ha:   .3
( is the symbol for population proportion)
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 52
Test for a Single Proportion (Cont’d)
The test statistic:
p-
Z = -------------------- (1- )/n
where p is the sample proportion.
From the Normal Table, zc, = -1.645 for  = .05.
Decision rule here is: “Reject Ho if z  - 1.645.”
p = .2,  = .3, and n = 100, z = -2.174
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 53
Test for a Single Proportion
Since -2.174  -1.645, we reject H0;
The sample awareness rate of .2 is too low to support the
hypothesis that the population awareness rate is .3 or more.
The actual significance level (p-value) corresponding to
z = -2.174 is approximately .015 (from Appendix 1).
Level of significance implies that the odds are lower than
15 in 1,000 that the sample awareness rate of .2 would have
occurred entirely by chance (that is, when the population
awareness rate is .3 or higher).
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 54
Exhibit 13.4 Hypothesis Test Related to
Proportion of Personal Computer Owners
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 55
Test of Two Proportions: Choosing Between
Commercial X & Commercial Y For a New Product
• Tom, advertising manager for a frozen-foods,
company, is in the process of deciding between two
TV commercials (X and Y) for a new frozen food to
be introduced
– Commercial X
• Runs for 20 seconds
• Random sample: 20 % awareness out of 200
respondents
– Commercial Y
• Runs for 30 seconds
• Random sample: 25 % awareness out of 200
respondents
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 56
Test of Two Proportions (Cont’d)
• Question
– Can Tom conclude that commercial Y will be
more effective in the total market for the new
product?
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 57
Criterion for Decision Making
• To reach a final decision, Tom has to make a
general inference (about the population) from
the sample data
• Criterion: relative degrees of awareness likely
to be created by the 2 commercials in the
population of all adult consumers
• Tom should conclude that commercial Y is
more effective than commercial X only if the
anticipated population awareness rate for
commercial Y is greater than that for X
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 58
Hypothesis
• Tom’s decision-making is equvalent to either
accepting or rejecting the hypothesis
– The potential awareness rate that commercial
Y can generate among the population of
consumers is greater than that which
commercial X can generate
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 59
Null and Alternative Hypotheses
Sample sizes:
Sample proportions:
Commercial
X
n1 = 200
p1 = .25
Commercial
Y
n2 = 200
p2 = .20
The hypotheses are:
H0:
1  2
or
1 - 2
 0
Ha:
1  2
or
1 - 2
 0
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 60
Test of Two Proportions –
Sample Standard Error
(p1 – p2) - (1 - 2)
z = -----------------------p1 - p2 -- is estimated by the sample
standard error formula
Sample Standard Error
sp1 - p2 = PQ ( 1/n1 + 1/n2)
n1p1 + n2p2
P = ------------------n1 + n2
Q = 1-P
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 61
Test of Two Proportions
For  = .05, the critical value of z (from Appendix 1)
is 1.645.
Decision rule: “Reject H0 if z  1.645.”
First compute P and Q, then sp1
P=
- p2
and z:
200(.25) + 200(.2)
----------------------- = .225
200 + 200
Q = 1 - .225
= .775
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 62
Test of Two Proportions
sp1
- p2
= (.225)(.775) (1/200 + 1/200)
=0.042
z=
(.25 - .20) - (0)
---------------------- = 1.19
.042
Since z  1.645, we cannot reject H0.
The sample evidence is not strong enough to suggest that
commercial Y will be more effective than commercial X.
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 63
Hypothesis Test Related to Awareness
Generated by Two Commercials
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 64
Cross-Tabulations:
Chi-square Contingency Test
• Technique used for determining whether
there is a statistically significant relationship
between two categorical (nominal or ordinal)
variables
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 65
Telecommunications Company
• Marketing manager of a telecommunications
company is reviewing the results of a study of
potential users of a new cell phone
– Random sample of 200 respondents
• A cross-tabulation of data on whether target consumers
would buy the phone (Yes or No) and whether the cell
phone had Bluetooth wireless technology (Yes or No)
• Question
– Can the marketing manager infer that an association
exists between Bluetooth technology and buying the
cell phone?
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 66
Table 13.3 Two-Way Tabulation of Bluetooth Technology and
Whether Customers Would Buy Cell Phone
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 67
Cross Tabulations - Hypotheses
H0: There is no association between wireless
technology and buying the cell phone (the two
variables are independent of each other).
Ha: There is some association between the Bluetooth
feature and buying the cell phone (the two
variables are not independent of each other).
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 68
Conducting the Test
• Test involves comparing the actual, or
observed, cell frequencies in the crosstabulation with a corresponding set of
expected cell frequencies (Eij)
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 69
Expected Values
Eij =
n inj
----n
Where ni and nj are the marginal frequencies,
that is, the total number of sample units in
category i of the row variable and category j
of the column variable, respectively
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 70
Computing Expected Values
The expected frequency for the first-row, first-column
cell is given by
100  100
E11 =
------------ = 50
200
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 71
Table 13.4 Observed and
Expected Cell Frequencies
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 72
Chi-square Test Statistic
c
(Oij - Eij)2

-----------------
i=1 j=1
Eij
r
2 = 
= 72.00
Where r and c are the number of rows and columns,
respectively, in the contingency table. The number of
degrees of freedom associated with this chi-square
statistic are given by the product (r - 1)(c - 1).
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 73
Chi-square Test Statistic
in a Contingency Test
For d.f. = 1, Assuming  =.05, from Appendix 2, the
critical chi-square value (2c) = 3.84.
Decision rule is: “Reject H0 if 2  3.84.”
Computed 2 = 72.00
Since the computed Chi-square value is greater than
the critical value of 3.84, reject H0.
The apparent relationship between “Bluetooth
technology"and "would buy the cellular phone"
revealed by the sample data is unlikely to have
occurred because of chance
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 74
Interpretation
• The actual significance level associated with
a chi-square value of 72 is less than .001
(from Appendix 2). Thus, the chances of
getting a chi-square value as high as 72
when there is no relationship between
Bluetooth technology and purchase of cell
phones are less than 1 in 1,000.
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 75
Cross-Tabulation Using SPSS
for National Insurance Company
• One crucial issue in the customer survey of
National Insurance Company was how a
customer's education was associated with
whether or not she or he would recommend
National to a friend.
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 76
Need to Conduct Chi-square
Test to Reach a Conclusion
• The hypotheses are
– H0:There is no association between
educational level and willingness to
recommend National to a friend (the two
variables are independent of each other)
– Ha:There is some association between
educational level and willingness to
recommend National to a friend (the two
variables are not independent of each other)
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 77
Association Between Education and Customer’s
Willingness to recommend National to a Friend
For two-way tabulation:
1. Select ANALYZE on the SPSS menu,
2. Click on DESCRIPTIVE STATISTICS,
3. Select CROSS-TABS.
4. Move the “highest level of schooling” to ROW(S) box,
5. Move “rec” variable to “COLUMN(S) box.
6. Click on CELLS,
7. Select OBSERVED, and ROW PERCENTAGES.
8. Click CONTINUE and
9. Click OK.
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 78
Association Between Education and Customer’s Willingness
to recommend National to a Friend
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 79
Association Between Education and Customer’s
Willingness to recommend National to a Friend
COUNT
represents the
actual number
of customers
in each cell.
The
percentages
are based on
the
corresponding
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 80
Association Between Education and Customer’s
Willingness to recommend National to a Friend
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 81
National Insurance Company Study Chi-Square Test
For Chi-Square Assessment:
1.
2.
3.
4.
5.
6.
7.
Select ANALYZE
Click on DESCRIPTIVE STATISTICS
Select CROSS-TABS
Move the variable “highest level of schooling” to ROW(s) box
Move “rec” to COLUMN(s) box;
Click on “STATISTICS”
Select CHI-SQUARE, CONTINGENCY COEFFICIENT, and
CRAMER’S V
8. Click on CELLS,
9. Select OBSERVED and EXPECTED FREQUENCIES
10.Click CONTINUE
11.Click OK.
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 82
National Insurance Company Study Chi-Square Test (Cont’d)
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 83
National Insurance Company Study –
Expected Frequency Table
Interpret
the Table
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 84
National Insurance Company Study
Computed
Chi-square
value
P-value
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 85
National Insurance Company Study –
P-Value Significance
• The actual significance level (p-value) =
0.019
• The chances of getting a chi-square value as
high as 10.007 when there is no relationship
between education and recommendation are
less than 19 in 1000
• The apparent relationship between education
and recommendation revealed by the sample
data is unlikely to have occurred because of
chance
• Jill and Tom can safely reject null hypothesis
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 86
Precautions in Interpreting
Cross Tabulation Results
• Two-way tables cannot show conclusive
evidence of a causal relationship
• Watch out for small cell sizes
• Increases the risk of drawing erroneous
inferences when more than two variables are
involved
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 87
Two-way Table Based on a Survey of 200
Hospital Patients:
Patients who Patients who
jog
do not jog
Patients with
heart disease
20
40
Patients
without heart
disease
80
60
100
100
Is there a causal relationship between patients
who jog and patients with hearth disease?
Copyright © Houghton Mifflin Company. All rights reserved.
13 | 88