Pearson Correlation
Calculate linear dependence ($r$)
[Image of pearson correlation graph]Pearson Correlation Calculator ($r$): Formula, P-Value & Interpretation
The Pearson Correlation Calculator is the gold standard statistical tool for measuring the strength and direction of the linear relationship between two continuous variables. The result, known as Pearson's $r$, ranges from -1 to +1.
Whether you are analyzing the link between study hours and exam scores, or marketing spend and revenue, this tool provides the three critical metrics you need: the Correlation Coefficient ($r$), the P-Value (Significance), and the Coefficient of Determination ($R^2$).
[Image of correlation coefficient plots]1. The Pearson Correlation Formula ($r$)
Think of Pearson's $r$ as "Covariance divided by the product of Standard Deviations." Covariance tells you the direction; dividing by standard deviations standardizes it so the result is always between -1 and 1.
The mathematical formula for the Pearson Product-Moment Correlation Coefficient is:
2. Interpretation Guide: How Strong is Your $r$?
You calculated $r = 0.65$. Is that strong? Weak? Here is the standard rubric used in social sciences and business analytics.
Strong Negative
Moderate Negative
Weak / None
Moderate Positive
Strong Positive
| r Value | Strength | Real-World Example |
|---|---|---|
| +1.0 | Perfect Positive | Temperature in Celsius vs Fahrenheit. |
| +0.8 | Strong Positive | Height vs. Shoe Size. |
| 0.0 | No Correlation | IQ Scores vs. Zip Code. |
| -0.6 | Moderate Negative | Hours of TV watched vs. GPA. |
| -1.0 | Perfect Negative | Speed vs. Time taken to travel distance. |
3. How to Calculate Pearson's r (Step-by-Step)
Let's verify the "Study Hours vs. Exam Score" relationship manually.
X (Hours): $\{1, 2, 3, 4, 5\}$
Y (Score): $\{50, 60, 70, 80, 90\}$
$\bar{x} = 3$
$\bar{y} = 70$
Ex: $1-3=-2$, $50-70=-20$.
Result: 200.
4. The Big Debate: Pearson vs. Spearman
This is the #1 question in my advanced classes. "Professor, which correlation should I use?"
- Type: Parametric Test.
- Relationship: Measures Linear (Straight Line) relationships only.
- Requirements: Data must be normally distributed. Sensitive to outliers.
- Best For: Physical measurements (Height vs Weight).
- Type: Non-Parametric (Rank) Test.
- Relationship: Measures Monotonic relationships (consistently increasing/decreasing, even if curved).
- Requirements: Can handle outliers and non-normal data.
- Best For: Survey data (Likert scales), Rankings.
5. Beyond r: P-Value & R-Squared
Getting an $r$ value is just the start. You need context.
The P-Value (Significance)
The P-Value answers: "Could this correlation have happened by random luck?"
• If $p < 0.05$: The correlation is statistically significant.
• If $p > 0.05$: The correlation might be random noise. (A high $r$ with a tiny sample size often has a high P-value).
The Coefficient of Determination ($R^2$)
If you square $r$, you get $R^2$. This tells you the percentage of variance explained.
Example: If Correlation $r = 0.9$ between Advertising and Sales:
$R^2 = 0.81$. This means 81% of the fluctuation in Sales can be explained by Advertising. The other 19% is other factors.
6. Professor's FAQ Corner
=CORREL(array1, array2) or =PEARSON(array1, array2). Both return the same result. To get $R^2$, use =RSQ(array1, array2).
References
- Cohen, J. (1988). Statistical Power Analysis for the Behavioral Sciences. (Standard for interpreting r effect size).
- Pearson, K. (1895). "Note on Regression and Inheritance in the Case of Two Parents."
- NIST/SEMATECH e-Handbook of Statistical Methods. "Correlation Coefficient."
Find Your Relationship
Enter your X and Y datasets to detect correlations instantly.
Calculate Pearson Correlation