Answer to Question #121078 in Statistics and Probability for Tay Gideon

Question #121078
A computer manager needs to know how the efficiency of her new computer program depends
on the size of incoming data. Efficiency will be measured by the number of processed
requests per hour. Applying the program to data sets of different sizes, she obtains the
following results:
Data size (gigabytes) 6 7 7 8 10 10 15
Processed requests 40 55 50 41 17 26 16
i. Draw the scatterplot for the data. Be sure to label your axes.
ii. Is there any correlation between the number of processed requests and the size of incoming data?
What is the correlation coefficient?
iii. By what percentage does the number of processed requests depend on the size of incoming data?
iv. Compute a least squares regression line for regressing the number of processed requests on the size of
incoming data.
v. Use your regression equation to predict the number of processed requests for incoming data
of size 17.0 gigabytes.
vi. Is the slope statistically significant at α = 5%?
Expert's answer
2020-06-10T17:55:41-0400
  i    X    Y     XY   X^2    Y^2
  1    6   40    240    36   1600
  2    7   55    385    49   3025
  3    7   50    350    49   2500
  4    8   41    328    64   1681
  5   10   17    170   100    289
  6   10   26    260   100    676
  7   15   16    240   225    256
Sum   63  245   1973   623  10027

"\\bar{x}={1\\over n}\\displaystyle\\sum_{i=1}^nx_i=9, \\bar{y}={1\\over n}\\displaystyle\\sum_{i=1}^ny_i=35"

"S_{xx}=\\displaystyle\\sum_{i=1}^nx_i^2 -{1\\over n}(\\displaystyle\\sum_{i=1}^nx_i)^2=56"

"S_{yy}=\\displaystyle\\sum_{i=1}^ny_i^2 -{1\\over n}(\\displaystyle\\sum_{i=1}^ny_i)^2=1452"

"S_{xy}=\\displaystyle\\sum_{i=1}^nx_iy_i -{1\\over n}(\\displaystyle\\sum_{i=1}^nx_i)(\\displaystyle\\sum_{i=1}^ny_i)=-232"

The slope "m" and the intercept "b" of the least squares line are

"m={S_{xy}\\over S_{xx}}={-232\\over 56}=-4.142857"

"b=\\bar{y}-m\\cdot\\bar{x}=72.285714"

"y=72.285714-4.142857x"



ii.


"r=\\dfrac{S_{xy}}{\\sqrt{S_{xx}}\\sqrt{S_{yy}}}=-0.8136"

Thus, there is a strong negative correlation between the number of processed requests and the size of the incoming data.
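
The sums, the fitted line, and the correlation coefficient above can be cross-checked with a short script. This is a minimal sketch in plain Python; the variable names (s_xx, slope, intercept, and so on) are illustrative and do not come from the original answer.

```python
import math

# Data from the question: x = data size (GB), y = processed requests per hour.
x = [6, 7, 7, 8, 10, 10, 15]
y = [40, 55, 50, 41, 17, 26, 16]
n = len(x)

# Corrected sums of squares and cross-products (shortcut formulas).
s_xx = sum(v * v for v in x) - sum(x) ** 2 / n
s_yy = sum(v * v for v in y) - sum(y) ** 2 / n
s_xy = sum(a * b for a, b in zip(x, y)) - sum(x) * sum(y) / n

# Slope and intercept of the least squares line.
slope = s_xy / s_xx
intercept = sum(y) / n - slope * sum(x) / n

# Pearson correlation coefficient.
r = s_xy / math.sqrt(s_xx * s_yy)

print(f"S_xx={s_xx:.0f}  S_yy={s_yy:.0f}  S_xy={s_xy:.0f}")
print(f"y = {intercept:.6f} + ({slope:.6f})x")
print(f"r = {r:.4f}")
```

The shortcut formulas are used here because they match the column sums in the table; computing the same quantities from deviations about the means gives identical results.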

iii.

The coefficient of determination is


"r^2=(-0.8136)^2=0.6619"

Thus "66.19 \\ \\%" of the variance in y (processed requests) is explained by the linear relationship between x (data size) and y.
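
The reading of the coefficient of determination as a share of explained variance can be checked directly: it should equal one minus the ratio of the residual sum of squares to the total sum of squares. The sketch below assumes the same data as the question and uses illustrative variable names (sse, r_squared) that are not part of the original answer.

```python
# Data from the question: x = data size (GB), y = processed requests per hour.
x = [6, 7, 7, 8, 10, 10, 15]
y = [40, 55, 50, 41, 17, 26, 16]
n = len(x)
mean_x = sum(x) / n
mean_y = sum(y) / n

# Sums of squares computed from deviations about the means.
s_xx = sum((xi - mean_x) ** 2 for xi in x)
s_yy = sum((yi - mean_y) ** 2 for yi in y)
s_xy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))

# Least squares fit.
slope = s_xy / s_xx
intercept = mean_y - slope * mean_x

# Residual (unexplained) sum of squares about the fitted line.
sse = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))

# Explained share of the total variation in y.
r_squared = 1 - sse / s_yy

print(f"SSE = {sse:.4f}, S_yy = {s_yy:.0f}, r^2 = {r_squared:.4f}")
```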


iv. The regression equation is:


"y=72.285714-4.142857x"

v. "x=17"


"y=72.285714-4.142857(17)\\approx1.857\\approx2"

The predicted rate is about 2 processed requests per hour.

vi. If there is a significant linear relationship between the independent variable x and the dependent variable y, the slope will not equal zero.

"H_0:m=0"

"H_1:m\\not=0"

The standard error of the slope is computed from the residual sum of squares about the fitted line:

"\\sum_i(y_i-\\hat{y}_i)^2=S_{yy}-\\dfrac{S_{xy}^2}{S_{xx}}=1452-\\dfrac{(-232)^2}{56}\\approx490.8571"

"s(m)=\\sqrt{\\dfrac{\\sum_i(y_i-\\hat{y}_i)^2}{(n-2)\\sum_i(x_i-\\bar{x})^2}}="

"=\\sqrt{\\dfrac{490.8571}{(7-2)(56)}}\\approx1.3240"

This corresponds to a two-tailed t-test for the regression slope with "df=n-2=5" degrees of freedom.

Based on the information provided, the significance level is "\\alpha=0.05," and the critical value for a two-tailed test with "df=n-2=5" is "t_c\\approx2.5706."

The t-statistic is computed as follows:


"t={m-0\\over s(m)}={-4.142857\\over 1.3240}\\approx-3.129"

Since "|t|=3.129>2.5706," the null hypothesis is rejected. Using the p-value approach: the p-value is "p\\approx0.026," and since "p<0.05," the conclusion is the same. Therefore, there is enough evidence to claim that the slope "m" is different from 0 at the 0.05 significance level.
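
The slope test can also be sketched numerically. This is an illustrative check in plain Python, assuming the usual standard error of the slope computed from the residuals about the fitted line. The helper names t_pdf and two_sided_p are hypothetical, and the p-value is approximated with Simpson's rule rather than taken from a statistics library.

```python
import math

# Data from the question: x = data size (GB), y = processed requests per hour.
x = [6, 7, 7, 8, 10, 10, 15]
y = [40, 55, 50, 41, 17, 26, 16]
n = len(x)
mean_x = sum(x) / n
mean_y = sum(y) / n

s_xx = sum((xi - mean_x) ** 2 for xi in x)
s_xy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
slope = s_xy / s_xx
intercept = mean_y - slope * mean_x

# Residual sum of squares about the fitted line, with n - 2 degrees of freedom.
sse = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
df = n - 2

# Standard error of the slope and the t-statistic for H0: slope = 0.
se_slope = math.sqrt(sse / df / s_xx)
t_stat = slope / se_slope


def t_pdf(t, df):
    """Density of Student's t distribution with df degrees of freedom."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + t * t / df) ** (-(df + 1) / 2)


def two_sided_p(t, df, steps=2000):
    """Two-sided p-value, integrating the density on [0, |t|] by Simpson's rule."""
    b = abs(t)
    h = b / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(b, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    central = 2 * (h / 3) * total  # P(|T| <= |t|), using symmetry about 0
    return 1 - central


p_value = two_sided_p(t_stat, df)
t_crit = 2.5706  # tabulated two-tailed critical value for alpha = 0.05, df = 5

print(f"s(m) = {se_slope:.4f}, t = {t_stat:.4f}, p = {p_value:.4f}")
print("reject H0" if abs(t_stat) > t_crit else "do not reject H0")
```

Comparing the statistic with the tabulated critical value and comparing the p-value with the significance level are equivalent decisions; both are shown so that either approach can be checked.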


