Question #46755

Associated with a job are two random variables: CPU time required (Y) and number of disk I/O
Operations (X).
i Time
(sec)
yi Number
xi
1 40 398
2 38 390
3 42 410
4 50 502
5 60 590
6 30 305
7 20 210
8 25 252
9 40 398
10 39 392

(i) Find the regression equation.
(ii) What is the estimate increase in CPU time for one additional disk operation.
(iii) Comment on the goodness of the model.
1

Expert's answer

2014-09-26T10:04:09-0400

Answer on Question #46755 – Math – Statistics and Probability

Associated with a job are two random variables: CPU time required (Y) and number of disk I/O Operations (X).

i Time(sec) yiy_i Number xix_i

1 40 398

2 38 390

3 42 410

4 50 502

5 60 590

6 30 305

7 20 210

8 25 252

9 40 398

10 39 392

(i) Find the regression equation.

(ii) What is the estimate increase in CPU time for one additional disk operation.

(iii) Comment on the goodness of the model.

Solution

(i) Find the regression equation.


y=ax+b.y = a x + b.b=(iyi)(ixi2)(ixi)(ixiyi)n(ixi2)(ixi)2;a=n(ixiyi)(ixi)(iyi)n(ixi2)(ixi)2.b = \frac{(\sum_i y_i)(\sum_i x_i^2) - (\sum_i x_i)(\sum_i x_i y_i)}{n(\sum_i x_i^2) - (\sum_i x_i)^2}; \quad a = \frac{n(\sum_i x_i y_i) - (\sum_i x_i)(\sum_i y_i)}{n(\sum_i x_i^2) - (\sum_i x_i)^2}.n=10;(ixi)=3847;(iyi)=384;(ixiyi)=159318;(ixi2)=1591405.n = 10; \quad \left(\sum_i x_i\right) = 3847; \quad \left(\sum_i y_i\right) = 384; \quad \left(\sum_i x_i y_i\right) = 159318; \quad \left(\sum_i x_i^2\right) = 1591405.a=(384)(1591405)(3847)(159318)10(1591405)(3847)2=17968261114641=1.61.a = \frac{(384)(1591405) - (3847)(159318)}{10(1591405) - (3847)^2} = \frac{-1796826}{1114641} = -1.61.b=10(159318)(3847)(384)10(1591405)(3847)2=1159321114641=0.10.b = \frac{10(159318) - (3847)(384)}{10(1591405) - (3847)^2} = \frac{115932}{1114641} = 0.10.


So


y=0.10x1.61.y = 0.10 \cdot x - 1.61.


(ii) The estimate increase in CPU time for one additional disk operation is


Δy=y(x+1)y(x)=(0.10(x+1)1.61)(0.10x1.61)=0.10 s.\Delta y = y (x + 1) - y (x) = (0.10 \cdot (x + 1) - 1.61) - (0.10 \cdot x - 1.61) = 0.10 \text{ s}.


(iii) Comment on the goodness of the model.

The value r2r^2 is a fraction between 0.0 and 1.0, and has no units. An r2r^2 value of 0.0 means that knowing X does not help you predict Y. There is no linear relationship between X and Y, and the best-fit line is a horizontal line going through the mean of all Y values. When r2r^2 equals 1.0, all points lie exactly on a straight line with no scatter. Knowing X lets you predict Y perfectly.


r2=(i(fi(iyi)n)2)(iyi2)(iyi)2n,r^2 = \frac{\left(\sum_{i} \left(f_{i} - \frac{\left(\sum_{i} y_{i}\right)}{n}\right)^{2}\right)}{\left(\sum_{i} y_{i}^{2}\right) - \frac{\left(\sum_{i} y_{i}\right)^{2}}{n}},


where fi=y(xi)f_{i} = y(x_{i}).


(iyi2)=15954;(iyi)n=38410=38.4;(i(fi(iyi)n)2)=1138.357.\left(\sum_{i} y_{i}^{2}\right) = 15954; \quad \frac{\left(\sum_{i} y_{i}\right)}{n} = \frac{384}{10} = 38.4; \quad \left(\sum_{i} \left(f_{i} - \frac{\left(\sum_{i} y_{i}\right)}{n}\right)^{2}\right) = 1138.357.r2=1138.35715954(384)210=0.94.r^2 = \frac{1138.357}{15954 - \frac{(384)^{2}}{10}} = 0.94.


Our value r2r^2 is near 1.0. Therefore the model is in good agreement with the experimental data.

www.AssignmentExpert.com


Need a fast expert's response?

Submit order

and get a quick answer at the best price

for any assignment or question with DETAILED EXPLANATIONS!

Comments

No comments. Be the first!
LATEST TUTORIALS
APPROVED BY CLIENTS