1. Let us first determine ∑ x i , ∑ x i 2 , ∑ y i , ∑ y i 2 , ∑ x i y i \sum x_i , \sum x^2_i , \sum y_i , \sum y^2_i , \sum x_iy_i ∑ x i , ∑ x i 2 , ∑ y i , ∑ y i 2 , ∑ x i y i
∑ x i = 213 ∑ x i 2 = 4625 ∑ y i = 3422 ∑ y i 2 = 1196690 ∑ x i y i = 73169 \sum x_i = 213 \\
\sum x^2_i = 4625 \\
\sum y_i = 3422 \\
\sum y^2_i = 1196690 \\
\sum x_iy_i = 73169 ∑ x i = 213 ∑ x i 2 = 4625 ∑ y i = 3422 ∑ y i 2 = 1196690 ∑ x i y i = 73169
n represents the sample size and thus n is equal to the number of ordered pairs.
n = 10
We can then determine the covariance using the formula
s x y = ∑ x i y i − ( ∑ x i ) ( ∑ y i ) n n − 1 = 73169 − ( 213 ) ( 3422 ) 10 10 − 1 = 31.15 s_{xy} = \frac{\sum x_iy_i - \frac{(\sum x_i)(\sum y_i)}{n}}{n -1} \\
= \frac{ 73169 - \frac{(213)(3422)}{10}}{10 -1} \\
= 31.15 s x y = n − 1 ∑ x i y i − n ( ∑ x i ) ( ∑ y i ) = 10 − 1 73169 − 10 ( 213 ) ( 3422 ) = 31.15
Let us next determine the sample variance s2 using the formula:
s 2 = ∑ x i 2 − ( ∑ x i ) 2 n n − 1 s x 2 = 4625 − ( 213 ) 2 10 10 − 1 = 9.78 s y 2 = 1196690 − ( 3422 ) 2 10 10 − 1 = 2568.16 s^2 = \frac{\sum x^2_i - \frac{(\sum x_i)^2}{n}}{n-1} \\
s^2_x = \frac{ 4625 - \frac{(213)^2}{10}}{10-1} = 9.78 \\
s^2_y = \frac{ 1196690 - \frac{(3422)^2}{10}}{10-1} = 2568.16 s 2 = n − 1 ∑ x i 2 − n ( ∑ x i ) 2 s x 2 = 10 − 1 4625 − 10 ( 213 ) 2 = 9.78 s y 2 = 10 − 1 1196690 − 10 ( 3422 ) 2 = 2568.16
The sample standard deviation is the square root of the population sample:
s x = s x 2 = 9.78 = 3.12 s y = s y 2 = 2568.16 = 50.67 s_x = \sqrt{s^2_x} = \sqrt{9.78} = 3.12 \\
s_y = \sqrt{s^2_y} = \sqrt{2568.16} = 50.67 s x = s x 2 = 9.78 = 3.12 s y = s y 2 = 2568.16 = 50.67
We can then determine the correlation coefficient r using the formula:
r = s x y s x s y r = 31.15 3.12 × 50.67 = 0.197 r = \frac{s_{xy}}{s_x s_y} \\
r = \frac{31.15}{3.12 \times 50.67} = 0.197 r = s x s y s x y r = 3.12 × 50.67 31.15 = 0.197
2. The relationship between the variables is weak, positive, linear relationship.
3. Next, we can determine the slope b using the formula:
b = r s y s x b = 0.197 × 50.67 3.12 = 3.199 b = r\frac{s_y}{s_x} \\
b = 0.197 \times \frac{50.67}{3.12} = 3.199 b = r s x s y b = 0.197 × 3.12 50.67 = 3.199
Next, we can determine the y-intercept a using the formula a = y ˉ − b x ˉ a = \bar{y} -b \bar{x} a = y ˉ − b x ˉ , where the sample mean is the sum of all values divided by the number of values.
a = y ˉ − b x ˉ = ∑ y i n − b ∑ x i n a = 3422 10 − 3.199 213 10 = 342.2 − 68.138 = 274.062 a = \bar{y} -b \bar{x} = \frac{\sum y_i}{n} -b \frac{\sum x_i}{n} \\
a = \frac{3422}{10} -3.199 \frac{213}{10} \\
= 342.2 -68.138 = 274.062 a = y ˉ − b x ˉ = n ∑ y i − b n ∑ x i a = 10 3422 − 3.199 10 213 = 342.2 − 68.138 = 274.062
Finally, we then obtain the regression line:
y = a + b x = 274.062 + 3.199 x y ( 22 ) = 274.062 + 3.199 × 22 = 344.44 y ( 23 ) = 274.062 + 3.199 × 23 = 347.64 y = a + bx = 274.062 + 3.199x \\
y(22) = 274.062 + 3.199 \times 22 = 344.44 \\
y(23) = 274.062 + 3.199 \times 23 = 347.64 y = a + b x = 274.062 + 3.199 x y ( 22 ) = 274.062 + 3.199 × 22 = 344.44 y ( 23 ) = 274.062 + 3.199 × 23 = 347.64
Comments