Download Pearson Goodness of Fit Test for a Poisson Distribution

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

History of statistics wikipedia , lookup

Statistics wikipedia , lookup

Transcript
Pearson Goodness of Fit Test for a Poisson Distribution1
The table below presents count data on the number of Larrea divaricata plants found in each of
48 sampling quadrants, as reported in the paper, “Some sampling characteristics of plants and
arthropods of the Arizona desert,” (Ecology, 1962, 567-571).
Observed Counts
Cell
1
2
3
4
5
Number of Plants
0
1
2
3
4
Frequency
9
9
10
14
6
For cells 1, 2, 3, 4, and 5, the respective observed cell counts are 9, 9, 10, 14, and 6.
Let Y = number of plants in a quadrant. Assume that Y has a Poisson distribution. We will
assume for the moment that the six counts in cell 5 were actually 4, 4, 5, 5, 6, 6. Based on the
observed data, the M.L.E. for the Poisson parameter is the mean of the sample data,
09  19  210  314  42  52  62  2.10 . We want to test whether a
ˆ 
9  9  10  14  2  2  2
Poisson distribution is a good fit to the data, using Pearson’s chi-squared test and the estimated
value of the Poisson parameter. We have n = 48, and the expected cell counts are  j  n j ˆ ,
for j = 1, 2, 3, 4, 5. From the Poisson distribution, we have
0
1
e 2.10 2.10
e 2.10 2.10
 1 ˆ  P(Y  0) 
 e  2.10 ,  2 ˆ  P(Y  1) 
 2.10e  2.10 ,
0!
1!
2
3
e 2.10 2.10
e 2.10 2.10
 2.10
ˆ
ˆ
 5   P(Y  2) 
 2.205e
 1.5435e  2.10 ,
,  4   P(Y  3) 
2!
3!
2.10
ˆ
and  5   P(Y  4)  1  PY  3  1  e
1  2.10  2.205  1.5435.
Then the expected cell frequencies are
1  n 1 ˆ  5.8779 ,  2  n 2 ˆ  12.3436 , 3  n 3 ˆ  12.9608 ,  4  n 4 ˆ  9.0726 ,
  n ˆ  7.7451.








5



5
We want to test the null hypothesis that the cell probabilities are those found from a
Poisson(2.10) distribution v. the alternative hypothesis that they are not; we will use  = 0.05.
ˆ 2
5 n  n 
j
j
2
The test statistic is Pearson’s chi-squared statistic: X  
, which under the
n j ˆ
j 1

 

null hypothesis has a  2 3 distribution. The critical value for the test is  02.05 3  7.815 . The
value of the test statistic calculated from the data is
ˆ 2 9  5.87792 9  12.34362 10  12.96082 14  9.07262
5 n  n 
j
j
2
X 




5.8779
12.3436
12.9608
9.0726
n j ˆ
j 1


 

6  7.74512
 6.3097 . Since this value is less than the critical value of the test, we fail
7.7451
to reject the null hypothesis at the 0.05 level of significance. We do not have sufficient evidence
to conclude that the data do not fit a Poisson(2.10) distribution.
1
Probability and Statistics for Engineering and the Sciences, 7th Edition, by Jay L. Devore.