The following table refers to two properties: age and traffic violations of residents in Gauteng over 12 months.
(i) If a Gauteng resident is selected at random, what is the probability that
(a) he is under 21 and has one or more violations during the past 12 months?
(b) he is under 21?
(c) he has one or more violations during the past 12 months?
(ii) Are events “under 21” and “one or more violations during the past 12 months” independent?
(iii) Test at a 5% level of significance if age and traffic violations are independent of each other:
a.
The sample size is
There are 3 rows and 2 columns.
The row totals are given as,
and the column totals are given as,
i)
a.
The probability that he is under 21 and has one or more violations during the past 12 months is given as, 25/100=1/4
Thus the probability that a resident is under 21 and has one or more violations during the past 12 months is 1/4.
b.
The probability that he is under 21 years is, (15+25)/100=40/100=2/5
c.
The probability that he has one or more violations during the past 12 months is (25+12+10)/100=47/100=0.47
Therefore, the probability that a resident has one or more violations is 0.47
ii)
Let A denote the event that the resident is under 21 and B denote the event that he has one or more violations.
If A and B are independent then we expect the condition
We have,
, and
Now,
Therefore the events “under 21” and “one or more violations during the past 12 months” are not independent.
iii)
The hypotheses tested are,
age and traffic violations are independent.
age and traffic violations are not independent.
To conduct this test, we use the Chi-square test for independence as follows,
We first determine the Expected counts for each cell using the formula,
where row and column
They are given as,
The test statistic is given as,
is compared with the table value at with degrees of freedom.
The table value is,
The null hypothesis is rejected if
Since , we reject the null hypothesis and conclude that there is no enough evidence to show that age and traffic violations are independent at 5% level of significance.
Comments