Download docx Hypothesis testing-0

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Foundations of statistics wikipedia , lookup

Student's t-test wikipedia , lookup

Time series wikipedia , lookup

Misuse of statistics wikipedia , lookup

Transcript
Running head: HYPOTHESIS TESTING
1
Hypothesis Testing
Name
Institution
HYPOTHESIS TESTING
2
Introduction
A confidence interval is basically the range of the values which are derived from the
sample statistics which is likely to encompass the values of unknown parameters of the
population. Due to the random nature of the population, it is likely that they will yield same
confidence interval. However, when a statistician repeats a given sample of data for several
times, there is high chance that a certain percentage of the outcome in the confidence interval
would have indefinite parameters of the population. Thus, the confidence intervals which contain
the factors are the interval of the confidence interval. Mostly, the confidence intervals are used to
bind the average or the mean as well as the standard deviation. On the other hand, it can be used
in obtaining the regression coefficients, the proportions of the occurrences and to note the
differences of a given data. A hypothesis is mainly s proposed for any two data sets. In this case,
it gives both the null hypothesis and the alternative hypothesis which gives the relationship
between two data sets which will aid in determining the level of significance. To do so, I
decided to sample data of a given production firm. The aim of doing so was to determine if the
renewal of the machine in the production process improves the performance. I sampled data
about amount of time the old machine takes to pack ten cartons. I collected the data randomly on
different days chosen randomly within the firm. It was important to collect the data as it will aid
in determining whether the firm needs to buy the new machines to improve its production
process.
Process of Collecting Data
To collect the data it was necessary to follow some of the procedures I set down. I
decided that it was good to use observation and interviewing method to collect the data. After,
HYPOTHESIS TESTING
3
seeking permission with the authorities of the firm, I observed and recorded the time taken by the
old machine to pack ten cartons with the use of stop watch. After, the purchase of then new
machine I went and observed as I determined the time taken by the new machine to pack ten
cartons of the jars. I also collected several data for different days which I chose randomly. I
collected the data below as displayed in figure 1 and then plotted a bar graph to represent the two
data as shown in figure 2.
Figure 1
New
42.1
41.3
42.4
43.2
41.8
41.8
41.0
42.8
41.3
42.7
42.7
43.8
42.5
43.1
44.0
43.3
43.6
43.5
41.7
44.1
Machine
Old
Machine
The bar of the above data is as shown below in the figure 2.
HYPOTHESIS TESTING
4
Figure 2
44.5
44
43.5
43
42.5
42
New
Old
41.5
41
40.5
40
39.5
39
Focus
Comparison of the Packing Machines
In the firm, the machine packs the cartons jars in the cartons. It was stipulated by the
manager that the new machine will pack faster than the old machine. To taste the hypothesis, I
used the table of the data as shown in figure 1. I formulated hypothesis concerning the data I
collected.
I wanted to determine if the data I collected provided enough evidence to conclude that
on an average basis a new machine packs faster than the old machine. I performed the hypothesis
test at a 5% significance level. Therefore the data is given as follows:
HYPOTHESIS TESTING
5
Ӯ1= 42.14, s1=0.683
Ӯ2= 43.23,
s2=0.750
First Assumption
It is always good to ensure that the samples are independent. The data given above are
independent since they are from two machines.
Second Assumption
Are the samples provided above large or mainly normal population? In this data sample
we have n1<30 and the n2<30.Thus, we do not have enough samples to work with, which means
that it is necessary to check the normality assumption of the two data (Milton, 2009). Thus, it is
necessary to plot a normality plots for the data. They are as shown in Figure 3 and figure 4.
Figure 3
HYPOTHESIS TESTING
Figure 4
6
HYPOTHESIS TESTING
7
As the two plots display a normal distribution, it is right for to conclude that the data
come from normal distributions.
Third Assumption
It is important to determine if the two populations have equal variance. According to the
calculations, s1 and s2 are not so much different. I concluded this using the rule of the thumbs
where by the ratio of two sample data standard deviation is from 0.5 to 2. They are not that
different. It is because s1/s2 = 0.683/0.750= 0.91. The result is close to 1. Let’s continue the
pooled t- test. Let’s use μ1 to denote the mean of the new machine while μ2 to represent the
mean of the old one.
Step One
H0: μ1−μ2 =0H
Ha: μ1−μ2< 0
Step Two
Significance level
α=0.05
Step Three
Computation of the t-statistic
sp = √{9 × (0.683)2 + 9 × ( 0.750)2 } ÷ {10 + 10 − 2} = 0.717
HYPOTHESIS TESTING
8
t= 42.14 - 43.23/ {0.717*√ (1/10 +1/10)} = -3.40
Step Four
Left-tailed test
Critical value = −tα = −t0.05
Degrees of freedom = 10+10−2 =18
−t 0.05= −1.734 Thus; Rejection region t∗<−1.734
Step Five
It is also important to check if the test statistic falls inside the rejection region or not so that we
can decide whether to reject Ho or not.
t∗=−3.40<−1.734
Reject Ho at α=0.05
HYPOTHESIS TESTING
9
Step Six
At this point we are able to conclude the above hypothesis testing. At a significance of
5% the data has provided us with evidence that the new machine packs at a faster rate than the
old machine averagely.
HYPOTHESIS TESTING
10
References
Milton, M. (2009). Head first data analysis. Beijing: O'Reilly.