Download 63 - KFUPM Faculty List

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Bootstrapping (statistics) wikipedia , lookup

Psychometrics wikipedia , lookup

Taylor's law wikipedia , lookup

Degrees of freedom (statistics) wikipedia , lookup

Omnibus test wikipedia , lookup

Misuse of statistics wikipedia , lookup

Resampling (statistics) wikipedia , lookup

Student's t-test wikipedia , lookup

Transcript
63. The Department of Weights and Measures in a southern state has the responsibility for making sure
that all commercial weighing and measuring devices are working properly. For example, when a gasoline
pump indicates that 1 gallon has been pumped, it is expected that 1 gallon of gasoline will actually have
been pumped. The problem is that there is variation in the filling process. The state’s standards call for the
mean amount of gasoline to be 1.0 gallon with a standard deviation not to exceed 0.010 gallons. Recently,
the department came to a gasoline station and filled 10 cans until the pump read 1.0 gallon. They then
measured precisely the amount of gasoline in each can. The following data were recorded:
Based on these data what should the Department of Weights and Measures conclude if they wish to test
using a 0.05 level of significance?
ANSWER:
This situation calls for two different hypothesis tests. The first one is to test whether the mean fill
is meeting the standard. The second test deals with the variation in the fill volume. With respect to the
mean fill of 1.0 gallon, a two-tailed hypothesis should be formed with the following null and alternative
hypotheses:
H o :   1.0 gallon
H a :   1.0 gallon
.
Since we do not know the population standard deviation, the test statistic for this hypothesis test is a t
statistic computed as follows:
t
x
s
n
We need to compute the sample mean and sample standard deviation from the data. The mean is:
x
x
n

10.013
 1.001 and the standard deviation is s 
 (x  x)
10
test statistic is: t 
1.001  1.0
0.0388
n 1
2
 .0388 Given this, the
 0.0815
10
From the t-distribution with a two-tail area equal to 0.05 and 10-1 = 9 degrees of freedom, the critical t =
2.2622. Since 0.0815 < 2.2622, we do not reject the null hypothesis. Thus, based on the data, there is no
reason to say that the mean fill amount is not correct.
With respect to the variation requirement that specifies that the standard deviation can not exceed
0.01 gallon, we need to use a test for a single population variance. The appropriate null and alternative
hypotheses are:
H o :  2  .0001
H a :  2  .0001
.
Note that we have converted from standard deviation to variances since there is no test for standard
deviation directly. In this situation, the test statistic is a chi-square value computed as follows:
 
2
(n  1) s 2
2
(10  1).0388 2
=
=135.49.
.012
This test statistic can be compared to the chi-square value from the table with 10-1 = 9 degrees of freedom
and a one-tail area of 0.05. That value is 16.9190. Since  2  135.49  16.9190 , the null hypothesis is
rejected. This means that the variation standard is being exceeded based on these data. The gasoline
station will need to get the pump checked out and the variation of fill around the mean reduced.
66. There are two major companies that provide SAT test tutoring for high school students. At issue is
whether Company 1 that has been in business for the longer time provides better results than Company 2,
the newer company. Specifically of interest is whether the mean increase in SAT scores for students who
have already taken the SAT-test one time is higher for Company 1 than for Company 2. A test of this is to
be conducted using a 0.10 level of significance. Two random samples of students are selected. The first
group uses the tutoring services of Company 1 and the second uses Company 2’s services. The following
data reflect the number of points higher (or lower) that the students scored on the SAT-test after taking the
tutoring.
Based on these data, can you conclude that the students who use Company 1 score higher on average than
students who use Company 2?
ANSWER:
This test calls for us to test whether the population means are equal. Since the sample sizes are
small and the population variances are unknown, the t-distribution should be employed if we assume that
the populations are normally distributed and the populations have equal variances. We can test for the
equal variance assumptions as follows.
The following null and alternative hypotheses would be appropriate in this situation:
H o :  12   22
H o :  12   22
.
This will be a two-tailed test since we are testing to see whether a difference exists and are not predicting
which company will have more or less variability.
We first need to compute the sample variances for each sample using:
s
2
 (x  x)

n 1
2
.
The following variances are computed from the sample data:
s1  1145 .43 and s 2  345.70 .
2
2
In a two-tailed test of variances, the test statistic is an F-value that is formed by the ratio of the two sample
variances and by placing the larger sample variance in the numerator:
2
F 
s1
2

1145 .43
s2
 3.3134 .
345 .7
We then compare this value to a critical value from the F-distribution table. Since this is a two tail test
with alpha = .10, we use the table with 0.05 in the upper-tail of the F-distribution.
Two sets of degrees of freedom are used with the F-distribution. Across the top of the F-table we look for
degrees of freedom corresponding to n-1 where n is the sample size associated with the sample variance
that was placed in the numerator of the F-test statistic. In our case, n = 8, so the degrees of freedom is 8-1
= 7 for the numerator. The degrees of freedom down the side in the F table is n-1 where n corresponds to
the sample size for the sample variance in the denominator of the F-test statistic. In our case that would be
n = 8. Thus, the denominator degrees of freedom are 8-1 = 7. Then the critical F from the .05 table with 7
and 7 degrees of freedom is 3.787. The decision rule is:
If F calculated > 3.787 reject Ho, otherwise do not reject.
Since our calculated F-test statistic is F = 3.3134 < 3.787, we do not reject the null hypothesis. Thus, based
on the sample information, we have no basis for believing that there is a difference in the two companies
with respect to population variance.
Given this conclusion, we can now test the following null and alternative hypotheses:
H o : 1   2
H a : 1   2
.
The critical value for a t-test for a one-tailed test is found from the t-distribution with degrees of freedom
equal to n1 + n2 – 2 = 8 + 8 – 2 = 14. Given a 0.10 significance level, the critical value from the tdistribution with degrees of freedom equal to 14 is equal to 1.3450.
The test statistic is computed using:
t
( x1  x 2 )  0
sp
sp 
1
1

n1 n2
where sp is the pooled standard deviation and is computed as:
2
2
( n1  1) s1  ( n 2  1) s 2
n1  n 2  2
( 7 )1145 .43  ( 7 )345 .7

 27.31 .
14
Then the test statistic is:
t 
( x1  x 2 )  0
1
sp
n1


(53.5  34.375 )  0
1
1
27.31
n2
8

 1.401 .
1
8
Since 1.401 > 1.3450 from the t-distribution, we reject the null hypothesis. Based on the sample data, there
is sufficient evidence to conclude that Company 1 does produce higher average gain in SAT scores.