Understanding Correlation: Types, Examples & How to Measure It (2025 Guide)
📌 1. What is Correlation?
Correlation is a statistical measure that describes the strength and direction of a relationship between two variables.
Simply put, it helps us understand whether—and how—two variables move together. For instance:
Example: As a child’s height increases, their weight tends to increase too. This shows a positive correlation.
The value of correlation, known as the correlation coefficient, always lies between -1 and +1:
-
+1 means a perfect positive relationship.
-
0 means no relationship.
-
-1 means a perfect negative relationship.
📊 2. Types of Correlation
Correlation can be classified in several ways:
✅ A. Positive vs. Negative Correlation
-
Positive Correlation: Both variables move in the same direction.
Example: The more time you run on a treadmill, the more calories you burn.
-
Negative Correlation: One variable increases while the other decreases.
Example: As a student’s number of absences increases, their grades tend to decrease.
✅ B. Linear vs. Non-Linear Correlation
-
Linear Correlation: The change in one variable leads to a constant change in another.
Example: Doubling the number of workers doubles the factory’s output.
-
Non-Linear (Curvilinear) Correlation: The relationship between variables changes at different rates.
Example: Increasing the radius of a sphere doesn’t result in a proportionate increase in volume.
✅ C. Simple, Multiple, and Partial Correlation
-
Simple Correlation: Involves two variables only.
Example: Correlation between study time and exam scores.
-
Multiple Correlation: Examines the relationship between one dependent variable and two or more independent variables.
Example: Exam scores related to study time, sleep, and class attendance.
-
Partial Correlation: Measures the relationship between two variables while controlling for the effect of other variables.
Example: Study time vs. grades, while controlling for
sleep.
📐 3. Pearson’s Correlation Coefficient (r)
The Pearson correlation coefficient (denoted as r) measures the linear relationship between two continuous variables. It evaluates both the strength and direction of this relationship.
🔢 Formula:
-
= individual sample values
-
= means of x and y
📍 When to Use:
-
When both variables are continuous.
-
When the relationship is linear.
-
When data is normally distributed (or nearly so).
🧮 4. Spearman’s Rank Correlation Coefficient (ρ or rho)
Spearman's rank correlation is a non-parametric test that measures the strength and direction of the monotonic relationship between two ranked variables.
🔢 Formula:
-
= difference between the ranks of each pair
-
= number of observations
📍 When to Use:
-
When your data is ordinal (ranked).
-
When the relationship is not linear but still monotonic.
-
When you have outliers or non-normal data.
✅ Summary Table: Pearson vs. Spearman
Feature Pearson’s r Spearman’s ρ Type of Data Continuous Ordinal or Ranked Measures Linear Relationship Monotonic Relationship Assumes Normality Yes No Sensitive to Outliers? Yes No Use Case Example Height vs. Weight Student rank in Math vs. S