Answer on Question #46755 – Math – Statistics and Probability
Associated with a job are two random variables: CPU time required (Y) and number of disk I/O Operations (X).
i Time(sec) yi Number xi
1 40 398
2 38 390
3 42 410
4 50 502
5 60 590
6 30 305
7 20 210
8 25 252
9 40 398
10 39 392
(i) Find the regression equation.
(ii) What is the estimate increase in CPU time for one additional disk operation.
(iii) Comment on the goodness of the model.
Solution
(i) Find the regression equation.
y=ax+b.b=n(∑ixi2)−(∑ixi)2(∑iyi)(∑ixi2)−(∑ixi)(∑ixiyi);a=n(∑ixi2)−(∑ixi)2n(∑ixiyi)−(∑ixi)(∑iyi).n=10;(i∑xi)=3847;(i∑yi)=384;(i∑xiyi)=159318;(i∑xi2)=1591405.a=10(1591405)−(3847)2(384)(1591405)−(3847)(159318)=1114641−1796826=−1.61.b=10(1591405)−(3847)210(159318)−(3847)(384)=1114641115932=0.10.
So
y=0.10⋅x−1.61.
(ii) The estimate increase in CPU time for one additional disk operation is
Δy=y(x+1)−y(x)=(0.10⋅(x+1)−1.61)−(0.10⋅x−1.61)=0.10 s.
(iii) Comment on the goodness of the model.
The value r2 is a fraction between 0.0 and 1.0, and has no units. An r2 value of 0.0 means that knowing X does not help you predict Y. There is no linear relationship between X and Y, and the best-fit line is a horizontal line going through the mean of all Y values. When r2 equals 1.0, all points lie exactly on a straight line with no scatter. Knowing X lets you predict Y perfectly.
r2=(∑iyi2)−n(∑iyi)2(∑i(fi−n(∑iyi))2),
where fi=y(xi).
(i∑yi2)=15954;n(∑iyi)=10384=38.4;(i∑(fi−n(∑iyi))2)=1138.357.r2=15954−10(384)21138.357=0.94.
Our value r2 is near 1.0. Therefore the model is in good agreement with the experimental data.
www.AssignmentExpert.com
Comments