Download Chapter 1 Hypothesis Testing

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Statistics wikipedia , lookup

Transcript
Chapter 1
Hypothesis Testing
Principles
of Hypothesis Testing
tests for one sample case
Econometrics
Prof. Monica Roman
1
Statistical Hypotheses
They are defined as assertion or conjecture about the parameter
or parameters of a population, for example the mean or the
variance of a normal population.
They may also concern the type, nature or probability distribution
of the population.
Statistical hypotheses are based on the concept of proof by
contradiction.
For example, say, we test the mean (m) of a population to see if
an experiment has caused an increase or decrease in m. We do
this by proof of contradiction by formulating a null hypothesis.
Econometrics
Prof. Monica Roman
2
Elements of statistical
hypothesis
There are five ingredients to any statistical
test :
(a) Null Hypothesis
(b) Alternate Hypothesis
(c) Test Statistic
(d) Rejection/Critical Region
(e) Conclusion
Econometrics
Prof. Monica Roman
3
Null Hypothesis
It is a hypothesis which states that there is no
difference between the procedures and is denoted
by H0.
For the above example the corresponding H0 would
be that there has been no increase or decrease in
the mean.
Always the null hypothesis is tested, i.e., we want to
either accept or reject the null hypothesis because
we have information only for the null hypothesis.
Econometrics
Prof. Monica Roman
4
Alternative Hypothesis
It is a hypothesis which states that there is a
difference between the procedures and is
denoted by H1.
Econometrics
Prof. Monica Roman
5
Various types of H0 and H1
Null Hypothesis
Case
H0
Alternate Hypothesis
H1
1
µ1 = µ2
µ1 ≠ µ2
2
µ1 < µ2
µ1 > µ2
3
µ1 > µ2
µ1 < µ2
Econometrics
Prof. Monica Roman
6
Reason for Rejecting H0
Sampling Distribution
It is unlikely
that we would
get a sample
mean of this
value ...
... Therefore, we
reject the null
hypothesis that
µ = 50.
... if in fact this were
the population mean.
20
µ = 50
H0
Econometrics
Prof. Monica Roman
Sample Mean
7
Test Statistic
It is the random variable X whose value is tested to arrive at a
decision.
The Central Limit Theorem states that for large sample sizes (n >
30) drawn randomly from a population, the distribution of the
means of those samples will approximate normality, even when the
data in the parent population are not distributed normally.
A z statistic is usually used for large sample sizes (n > 30)
but often large samples are not easy to obtain, in which case the tdistribution can be used. The population standard deviation s is
estimated by the sample standard deviation, s. The t curves are
bell shaped and distributed around t=0.
t
=
X
S
−
µ
n
Econometrics
Prof. Monica Roman
8
Rejection Region
It is the part of the sample space (critical region) where the null
hypothesis H0 is rejected.
The size of this region, is determined by the probability (α) of the
sample point falling in the critical region when H0 is true. α is also
known as the level of significance, the probability of the value
of the random variable falling in the critical region.
Also it should be noted that the term "Statistical significance"
refers only to the rejection of a null hypothesis at some level α. It
implies only that the observed difference between the sample
statistic and the mean of the sampling distribution did not occur
by chance alone.
Econometrics
Prof. Monica Roman
9
Conclusion
Conclusion : If the test statistic falls in the
rejection/critical region, H0 is rejected, else
H0 is accepted.
Econometrics
Prof. Monica Roman
10
Errors
Type I Error
Reject True Null Hypothesis (“False Positive”)
Has Serious Consequences
Probability of Type I Error Is α
Called Level of Significance
Set by researcher
Type II Error
Do Not Reject False Null Hypothesis (“False Negative”)
Probability of Type II Error Is β (Beta)
Econometrics
Prof. Monica Roman
11
Errors
Test Result –
True State
H0 True
H0 False
H0 True
H0 False
Correct
Decision
Type I Error
Type II Error
Correct
Decision
α = P(Type I Error ) β = P (Type II Error )
Goal: Keep α, β reasonably small
Econometrics
Prof. Monica Roman
12
Level of Significance, α and
the Rejection Region
α
H0: µ ≥ 3
H1: µ < 3
H0: µ ≤ 3
H1: µ > 3
Rejection
Regions
0
0
H0: µ = 3
H1: µ ≠ 3
Critical
Value
α
α/2
0
Econometrics
Prof. Monica Roman
13
Steps
1.
2.
State the null and alternative hypotheses
Choose α. The value should be small,
usually less than 10%. It is important to
consider the consequences of both types of
errors.
Econometrics
Prof. Monica Roman
14
3.
Select the test statistic.
Determine its value from the sample data.
This value is called the observed value of the
test statistic.
! Remember that a t statistic is usually
appropriate for a small number of
samples; for larger number of samples, a
z statistic can work well if data are
normally distributed.
Econometrics
Prof. Monica Roman
15
4
Compare the observed value of the statistic
to the critical value obtained for the chosen α.
Econometrics
Prof. Monica Roman
16
5.
Make a decision:
If statistic falls in the rejection region
Reject H0 in favour of H1.
If the test statistic does not fall in the
critical region:
Conclude that there is not enough evidence
to reject H0.
Econometrics
Prof. Monica Roman
17
Hypothesis Testing on mean
t-Test: s Unknown
Assumptions
Population is normally distributed
If not normal, only slightly skewed & a large
sample taken (Central limit theorem applies)
Parametric test procedure
t test statistic, with n-1 degrees of freedom
X −µ
t =
S
n
Econometrics
Prof. Monica Roman
18
Hypothesis Testing: Steps
Test the Assumption that the true mean of
monthly cinema attendance of students is
at least 3.
1. State H0
H0 : µ ≥ 3.0
2. State H1
H1 : µ < 3.0
3. Choose α
α = .05
4. Choose n
n = 25
5. Choose Test:
t Test (or p Value)
Econometrics
Prof. Monica Roman
19
Hypothesis Testing: Steps
(continued)
6. Set Up Critical Value(s)
7. Collect Data
8. Compute Test Statistic
t = -1.7
25 students sampled, mean=2.7, s=0.75
Computed Test Stat.= -2
(computed P value=.04, two-tailed test)
9. Make Statistical Decision
10. Express Decision
Reject Null Hypothesis
The true mean is less than 3.0
Econometrics
Prof. Monica Roman
20
Hypothesis Testing on σ2
Two-tailed test
Ho: σ2 = σ2o
Ha: σ2 ≠ σo2
(n - 1)s2
Test statistic: χ2 =
σo2
One tail test
2
Ho: σ2 >= σ
o
Ho: σ2 <= σ2o
Ha: σ2 > σo2
Ha: σ2 < σo2
reject Ho if χ2* > χα2 , n-1
reject Ho if χ2* < χ2α, n-1
Econometrics
Prof. Monica Roman
21
Homework & References
1. Voineagu, V. si colectiv- Teorie si practica econometrica,
Ed. Meteor Press, 2007, pages 85-98
Read the text and solve the exercises.
2. David Ray Anderson,Dennis J. Sweeney,Thomas Arthur
Williams,Thomas A. Williams - Statistics for business and
economics, Chapter 9.
http://books.google.ro/books?id=_sOSAXTuy8cC&pg=PA388&l
pg=PA388&dq=hypothesis+testing+textbook+sweeney&sou
rce=bl&ots=24jbOAksnM&sig=xfPsCcF1GCwaLIZYA7bDJsP
z80k&hl=ro&ei=1s29TK2oMoOhOsSGyGY&sa=X&oi=book_r
esult&ct=result&resnum=1&ved=0CBQQ6AEwAA#v=onepag
e&q&f=false
Econometrics
Prof. Monica Roman
22