"i) \\; bar{x}= \\frac{\\sum^n_{i=1} x_i}{n} \\\\\n\n= \\frac{408}{7}= 58.285 \\\\\n\n\\bar{y} = \\frac{\\sum^n_{i=1} y_i}{n} \\\\\n\n= \\frac{497}{7}=71 \\\\\n\nSS_{xx}= \\sum^n_{i=1} x^2_i - \\frac{( \\sum^n_{i=1} x_i )^2}{n} \\\\\n\n= 24200 - \\frac{408^2}{7} \\\\\n\n= 419.429 \\\\\n\nSS_{yy}= \\sum^n_{i=1} y^2_i - \\frac{( \\sum^n_{i=1} y_i )^2}{n} \\\\\n\n= 36617 - \\frac{497^2}{7} \\\\\n\n= 1330 \\\\\n\nSS_{xy}= \\sum^n_{i=1} x_iy_i - \\frac{( \\sum^n_{i=1} x_i )( \\sum^n_{i=1} y_i )}{n} \\\\\n\n= 29206 - \\frac{408 \\times 497}{7} \\\\\n\n= 238"
The correction coefficient is:
"r = \\frac{SS_{xy}}{\\sqrt{ SS_{xx} \\times SS_{yy} }} \\\\\n\n= \\frac{238}{\\sqrt{419.429 \\times 1330}} \\\\\n\n= 0.319"
r=0.319 so retirement age and death age are low positive correlated.
ii) In b part one is mistakes they say that where y is the retirement age, X is the age at which one dies and, is a logical mistake.
As per data we consider Retirement age is(x) and Death age is (y)"
The regression coefficient (the slope m, and the y-intercept n) are:
"m = \\frac{SS_{xy}}{SS_{xx}} \\\\\n\n= \\frac{238}{419.429} \\\\\n\n= 0.5674 \\\\\n\nn = \\bar{y} - \\bar{x} \\times m \\\\\n\n= 71 -58.285 \\times 0.5674 \\\\\n\n= 37.9264"
The regression equation is:
"y = 37.9264 +0.5674x \\\\\n\niii) \\; y(78) = 37.9264 + 0.5674 \\times 78 \\\\\n\n= 82.18 = 82"
iv) The coefficient of determination
"r^2 = 0.319^2 = 0.101"
So, 0.1 R-square means that your model explains 10% of the variation within the data.
Comments
Leave a comment