i x y x y x 2 y 2 1 42 5 210 1764 25 2 27 10 270 729 100 3 36 8 288 1296 64 4 25 12 300 625 144 5 22 13 286 484 169 6 39 7 273 1521 49 ∑ x i = 191 ∑ y i = 55 ∑ x i y i = 1627 ∑ x i 2 = 6419 ∑ y i 2 = 551 \def\arraystretch{1.5}
\begin{array}{c:c:c:c:c:c}
i & x & y & xy & x^2 & y^2 \\ \hline
1 & 42 & 5 & 210 & 1764 & 25 \\
\hdashline
2 & 27 & 10 & 270 & 729 & 100 \\
\hdashline
3 & 36 & 8 & 288 & 1296 & 64 \\
\hdashline
4 & 25 & 12 & 300 & 625 & 144 \\
\hdashline
5 & 22 & 13 & 286 & 484 & 169 \\
\hdashline
6 & 39 & 7 & 273 & 1521 & 49 \\
\hdashline
& \sum x_i=191 & \sum y_i=55 & \sum x_iy_i=1627 & \sum x_i^2= 6419& \sum y_i^2=551
\end{array} i 1 2 3 4 5 6 x 42 27 36 25 22 39 ∑ x i = 191 y 5 10 8 12 13 7 ∑ y i = 55 x y 210 270 288 300 286 273 ∑ x i y i = 1627 x 2 1764 729 1296 625 484 1521 ∑ x i 2 = 6419 y 2 25 100 64 144 169 49 ∑ y i 2 = 551 a) Compute the line of regression
Calculating the mean ( x ˉ , y ˉ ) (\bar{x}, \bar{y}) ( x ˉ , y ˉ )
x ‾ = ∑ x i n = 191 6 , y ‾ = ∑ x i n = 55 6 \overline{x}={\sum x_i \over n}={191 \over 6}, \overline{y}={\sum x_i \over n}={55 \over 6} x = n ∑ x i = 6 191 , y = n ∑ x i = 6 55 The equation of a simple linear regression line (the line of best fit) is y = mx + b ,
m = s l o p e = n ∑ x i y i − ∑ x i ∑ y i n ∑ x i 2 − ( ∑ x i ) 2 m=slope={n\sum x_iy_i-\sum x_i \sum y_i \over n\sum x_i^2-(\sum x_i)^2} m = s l o p e = n ∑ x i 2 − ( ∑ x i ) 2 n ∑ x i y i − ∑ x i ∑ y i
m = 6 ⋅ 1627 − 191 ⋅ 55 6 ⋅ 6419 − ( 191 ) 2 ≈ − 0.365470 m={6\cdot 1627-191\cdot 55 \over 6\cdot 6419-(191)^2}\approx -0.365470 m = 6 ⋅ 6419 − ( 191 ) 2 6 ⋅ 1627 − 191 ⋅ 55 ≈ − 0.365470
b = y ‾ − m x ‾ b=\overline{y}-m\overline{x} b = y − m x
b ≈ 55 6 − ( − 0.365470 ) ⋅ 191 6 ≈ 20.800787 b\approx {55 \over 6}-(-0.365470)\cdot {191 \over 6}\approx 20.800787 b ≈ 6 55 − ( − 0.365470 ) ⋅ 6 191 ≈ 20.800787 The line of regression
y = − 0.365470 x + 20.800787 y= -0.365470x+20.800787 y = − 0.365470 x + 20.800787
b) Calculate coefficient of correlation (r)
r = n ∑ x i y i − ∑ x i ∑ y i n ∑ x i 2 − ( ∑ x i ) 2 ⋅ n ∑ y i 2 − ( ∑ y i ) 2 r={n\sum x_iy_i-\sum x_i \sum y_i \over \sqrt{n\sum x_i^2-(\sum x_i)^2}\cdot \sqrt{n\sum y_i^2-(\sum y_i)^2}} r = n ∑ x i 2 − ( ∑ x i ) 2 ⋅ n ∑ y i 2 − ( ∑ y i ) 2 n ∑ x i y i − ∑ x i ∑ y i
r = 6 ⋅ 1627 − 191 ⋅ 55 6 ⋅ 6419 − ( 191 ) 2 6 ⋅ 551 − ( 55 ) 2 ≈ − 0.983030 r={6\cdot 1627-191\cdot 55 \over \sqrt{6\cdot 6419-(191)^2} \sqrt{6\cdot 551-(55)^2}}\approx -0.983030 r = 6 ⋅ 6419 − ( 191 ) 2 6 ⋅ 551 − ( 55 ) 2 6 ⋅ 1627 − 191 ⋅ 55 ≈ − 0.983030
r = − 0.983030 r=-0.983030 r = − 0.983030
Comments