Karl Pearson’s Coefficient of Correlation (r)
Karl Pearson’s method is the most widely used numerical measure of correlation. It tells us both the direction and strength of the linear relationship between two variables.
It is also called the Product Moment Correlation Coefficient.
Meaning
Karl Pearson’s r measures the degree of linear relationship between two variables (X and Y). It indicates:
- Positive correlation → r > 0
- Negative correlation → r < 0
- Zero correlation → r = 0
The value of r always lies between –1 and +1.
Formula (Ungrouped Data)
Σ(X − X̄)(Y − Ȳ)
r = --------------------------------
√[ Σ(X − X̄)² × Σ(Y − Ȳ)² ]
Where:
- X̄ = Mean of X
- Ȳ = Mean of Y
Shortcut (Coding) Formula
nΣXY − (ΣX)(ΣY)
r = ---------------------------------------------
√[ nΣX² − (ΣX)² ] × √[ nΣY² − (ΣY)² ]
This method is preferred because it saves time in exams.
Properties of Pearson’s r
- r lies between –1 and +1.
- r = +1 → Perfect positive linear correlation.
- r = –1 → Perfect negative linear correlation.
- r = 0 → No linear correlation.
- r is unit-free (no measurement units).
- Sensitive to extreme values (outliers).
Steps to Calculate Pearson’s r (Shortcut Method)
- Prepare a table with X, Y, X², Y², and XY.
- Compute ΣX, ΣY, ΣX², ΣY², and ΣXY.
- Substitute values into the shortcut formula.
- Compute r and interpret.
Solved Example
Calculate Pearson’s r using the following data:
| X | 10 | 20 | 30 | 40 | 50 |
|---|---|---|---|---|---|
| Y | 15 | 25 | 35 | 45 | 55 |
Step 1: Prepare Table
| X | Y | X² | Y² | XY |
|---|---|---|---|---|
| 10 | 15 | 100 | 225 | 150 |
| 20 | 25 | 400 | 625 | 500 |
| 30 | 35 | 900 | 1225 | 1050 |
| 40 | 45 | 1600 | 2025 | 1800 |
| 50 | 55 | 2500 | 3025 | 2750 |
Step 2: Totals
- ΣX = 150
- ΣY = 175
- ΣX² = 5500
- ΣY² = 7125
- ΣXY = 6250
- n = 5
Step 3: Apply Formula
nΣXY − (ΣX)(ΣY)
r = ---------------------------------------------
√[ nΣX² − (ΣX)² ] × √[ nΣY² − (ΣY)² ]
Substitute:
r = [5(6250) − 150×175] ÷ √[(5×5500 − 150²)(5×7125 − 175²)]
r = [31250 − 26250] ÷ √[(27500 − 22500)(35625 − 30625)]
r = 5000 ÷ √[(5000)(5000)]
r = 5000 ÷ 5000
r = +1
Interpretation
Perfect positive correlation.
Interpretation Guidelines
| Value of r | Interpretation |
|---|---|
| +0.70 to +1.00 | Strong positive |
| +0.30 to +0.69 | Moderate positive |
| 0 to +0.29 | Weak positive |
| –0.29 to 0 | Weak negative |
| –0.69 to –0.30 | Moderate negative |
| –1.00 to –0.70 | Strong negative |
Key InsightA high correlation does not imply causation. It only shows linear association.
Advantages of Pearson’s Method
- Simple and widely used.
- Measures both direction and strength.
- Based on all observations.
- Mathematically sound.
Limitations
- Only for linear relationships.
- Affected by outliers.
- Requires quantitative data.
- Cannot establish cause-effect.
Summary
- Pearson’s r measures linear correlation between X and Y.
- Value lies between –1 and +1.
- Shortcut formula is used in exams.
- Provides direction (positive/negative) and strength.
- Does not imply causation.
Quiz Time! 🎯
Test Your Knowledge
Question 1 of 5
1. Pearson’s correlation coefficient is also called:
