When the first proof of 392 pages of a book of 1200 pages were read, the distribution of printing mistakes were found to be as follows:
No. of mistakes in a page (x):No. of pages (f):027517223037455261
Fit a Poisson distribution to the above data and test the goodness of fit at 5 percent Level of significance.
The null hypothesis is H0:X∼Poisson
The alternative hypothesis is H1:X does not follow a Poisson distribution.
The mean of the (assumed) Poisson distribution is unknown so must be estimated from the data by the sample mean:
μ^=3920(275)+1(72)+2(30)+3(7)+4(5)+5(2)+6(1)≈0.482Using the Poisson distribution with μ=0.482 we can compute pi, the hypothesised probabilities associated with each class. From these we can calculate the expected frequencies (under the null hypothesis):
p0=P(X=0)=0!e−0.482⋅(0.482)0≈0.6145470.614547⋅392≈240.90
p1=P(X=1)=1!e−0.482⋅(0.482)1≈0.2976580.297658⋅392≈116.68
p2=P(X=2)=2!e−0.482⋅(0.482)2≈0.0717360.071736⋅392≈28.12
p3=P(X=3)=3!e−0.482⋅(0.482)3≈0.0115260.011526⋅392≈4.52
p4=P(X=4)=4!e−0.482⋅(0.482)4≈0.0013890.001389⋅392≈0.54
p5=P(X=5)=5!e−0.482⋅(0.482)5≈0.0001340.000134⋅392≈0.05
p6=P(X=0)=6!e−0.482⋅(0.482)6≈0.0000110.000011⋅392≈0.004
x0123456Sum=Observed,fO27572307521392Expected,fE240.90116.6828.124.520.540.050.004392fE(fO−fE)24.82717.1090.1261.36136.83676.05248.004384.313 Null and Alternative Hypotheses
The following null and alternative hypotheses need to be tested:
H0:p0=0.614547,p1=0.297658,p2=0.071736,
p3=0.011526,p4=0.001389,p5=0.000134,p6=0.000011
H1: Some of the population proportions differ from the values stated in the null hypothesis.
This corresponds to a Chi-Square test for Goodness of Fit.
Based on the information provided, the significance level is α=0.05, the number of degrees of freedom is equal to the number of levels (k) of the categorical variable minus 1: df=k−1=7−1=6, so then the rejection region for this test is
R={χ2:χ2>12.592}
The Chi-Squared statistic is computed as follows:
χ2=i=1∑nEi(Oi−Ei)2=4.827+17.109+0.126++1.361+36.836+76.05+248.004=384.313 Since it is observed that χ2=384.313>χc2=12.592, it is then concluded that the null hypothesis is rejected. Therefore, there is enough evidence to claim that some of the population proportions differ from those stated in the null hypothesis, at the α=0.05 significance level.
We conclude that there is enough evidence to claim that the data do not follow a Poisson distribution.
Comments