Lines of Regression 📉
In Linear Regression, we fit straight lines to the data. But did you know there are generally two regression lines, not one?
Why Two Lines? ✌️
You might think one "best fit" line is enough. However, we minimize errors differently depending on what we want to predict.
- Line of Y on X: Used to predict Y when X is known.
- Line of X on Y: Used to predict X when Y is known.
> [!IMPORTANT]
> We always pick the line that corresponds to the variable we want to predict (the dependent variable).
1. Regression Line of Y on X ➡️
- Objective: Minimize errors in Y (vertical distances).
- Dependent Variable: Y
- Independent Variable: X
- Equation: Y = a + b_yx X
- Key Statistic: b_yx (Regression Coefficient of Y on X). It tells us how much Y changes for a unit change in X.
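As a minimal sketch of the idea, the slope b_yx can be computed as Cov(X, Y) / Var(X), which is exactly the least-squares slope that minimizes vertical errors. The data below is made up purely for illustration:

```python
import numpy as np

# Illustrative (made-up) data
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 3.0, 5.0, 4.0, 6.0])

# b_yx = Cov(X, Y) / Var(X): the slope that minimizes vertical (Y) errors
b_yx = np.cov(x, y, bias=True)[0, 1] / np.var(x)

# The intercept is chosen so the line passes through the point of means
a = y.mean() - b_yx * x.mean()

print(f"Y = {a:.2f} + {b_yx:.2f}X")
```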
2. Regression Line of X on Y ⬆️
- Objective: Minimize errors in X (horizontal distances).
- Dependent Variable: X
- Independent Variable: Y
- Equation: X = a + b_xy Y
- Key Statistic: b_xy (Regression Coefficient of X on Y). It tells us how much X changes for a unit change in Y.
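By symmetry, b_xy is Cov(X, Y) / Var(Y) — the same covariance, but divided by the variance of the *other* variable. A useful check (with the same made-up data as before): the product of the two coefficients equals r², the square of the correlation coefficient.

```python
import numpy as np

# Same illustrative (made-up) data
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 3.0, 5.0, 4.0, 6.0])

cov_xy = np.cov(x, y, bias=True)[0, 1]
b_yx = cov_xy / np.var(x)   # slope of Y on X
b_xy = cov_xy / np.var(y)   # slope of X on Y (note Var(Y) in the denominator)

# The product of the two regression coefficients equals r²
r = np.corrcoef(x, y)[0, 1]
print(b_yx * b_xy, r**2)
```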
Intersection on the Graph ❌
If you plot both regression lines on the same graph:
- Intersection: They intersect at the point of means (X̄, Ȳ).
- Coincidence: If r = ±1 (perfect correlation), both lines overlap and become one single line.
- Perpendicular: If r = 0 (no correlation), the lines cut each other at 90 degrees. Y on X becomes horizontal, X on Y becomes vertical.
(Imagine a graph where two lines cross at the average values of X and Y)
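The intersection property can be verified numerically: write both lines in intercept form and solve them simultaneously. A sketch with the same made-up data as earlier:

```python
import numpy as np

# Illustrative (made-up) data
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 3.0, 5.0, 4.0, 6.0])
x_bar, y_bar = x.mean(), y.mean()

cov_xy = np.cov(x, y, bias=True)[0, 1]
b_yx = cov_xy / np.var(x)   # slope of Y on X
b_xy = cov_xy / np.var(y)   # slope of X on Y

# Write both lines in intercept form and solve simultaneously:
#   Y = a1 + b_yx·X   and   X = a2 + b_xy·Y
a1 = y_bar - b_yx * x_bar
a2 = x_bar - b_xy * y_bar
y_star = (a1 + b_yx * a2) / (1 - b_yx * b_xy)  # valid since r² ≠ 1 here
x_star = a2 + b_xy * y_star

print((x_star, y_star), (x_bar, y_bar))  # the intersection is the point of means
```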
Differences Summary 📝
| Feature | Line of Y on X | Line of X on Y |
|---|---|---|
| Purpose | Prediction of Y | Prediction of X |
| Slope Coefficient | b_yx | b_xy |
| Minimizes Error in | Y direction (Vertical) | X direction (Horizontal) |
| Format | Y - Ȳ = b_yx(X - X̄) | X - X̄ = b_xy(Y - Ȳ) |
Why not just one line? 🧠
Mathematically, the "Line of Best Fit" minimizes the sum of squared deviations (Least Squares Method).
- If we minimize vertical deviations, we get Y on X.
- If we minimize horizontal deviations, we get X on Y.
Unless the correlation is perfect (r = ±1), these two minimization processes yield different lines.
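The perfect-correlation case is easy to check: if Y is an exact linear function of X, the two slopes come out as exact reciprocals, so rearranging X = a + b_xy·Y gives back the same line as Y on X. A small sketch with fabricated data where Y = 2X + 1:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = 2 * x + 1               # perfectly correlated: r = 1

cov_xy = np.cov(x, y, bias=True)[0, 1]
b_yx = cov_xy / np.var(x)   # slope of Y on X
b_xy = cov_xy / np.var(y)   # slope of X on Y

# With r = ±1 the slopes are exact reciprocals (b_yx·b_xy = r² = 1),
# so both regression lines coincide as a single line.
print(b_yx, b_xy, b_yx * b_xy)
```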
Summary
- Two lines exist to ensure the best prediction accuracy for either variable.
- They meet at the means (X̄, Ȳ).
- The closer the angle between them is to 0, the higher the correlation.