9.4: Significance of Correlation
- Page ID
- 65715
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\dsum}{\displaystyle\sum\limits} \)
\( \newcommand{\dint}{\displaystyle\int\limits} \)
\( \newcommand{\dlim}{\displaystyle\lim\limits} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\(\newcommand{\longvect}{\overrightarrow}\)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)We now know how to calculate \( r \) and how to interpret its sign and magnitude. But before we use a correlation to draw conclusions — or build a regression model — we need to ask a more fundamental question:
Could this correlation have occurred by random chance, even if the true relationship in the population is zero?
This is a question about statistical significance, and it calls for a hypothesis test.
Why This Matters
Suppose we sample 8 students and calculate a correlation of \( r = 0.61 \) between study hours and exam scores. That sounds meaningful — but with only 8 data points, could a value that large have appeared just by chance, even in a population where study hours and exam scores are completely unrelated?
The answer is: possibly yes. Small samples are noisy. A moderate \( r \) from a small sample may not be strong evidence of a real relationship in the population. We need a formal test to decide.
This connects directly to the five-step hypothesis testing process from Chapter 8. The only thing that changes is the test statistic we use.
Setting Up the Hypotheses
We use \( \rho \) (the Greek letter rho) to denote the true population correlation coefficient — the correlation we would find if we could measure every individual in the population. Our sample value \( r \) is an estimate of \( \rho \).
- \( H_0: \rho = 0 \) (no linear relationship exists in the population)
- \( H_A: \rho \neq 0 \) (a linear relationship does exist)
This is almost always a two-tailed test — we are asking whether the relationship is real, not specifically whether it is positive or negative. (One-tailed versions exist but are less common.)
The Test Statistic
When \( H_0: \rho = 0 \) is true, the sample correlation \( r \) follows a predictable pattern. We can convert it into a t-test statistic using the formula below — and that t-statistic follows the Student t-distribution with \( n - 2 \) degrees of freedom.
Test statistic for the significance of a correlation:
\[ t = r \sqrt{\frac{n - 2}{1 - r^2}} \]
- \( r \) = the sample correlation coefficient
- \( n \) = the number of data pairs
- Degrees of freedom: \( df = n - 2 \)
Notice that \( r^2 \) appears in the denominator — the coefficient of determination from Section 9.3.1 is built right into this formula. The larger \( |r| \) is, and the larger \( n \) is, the larger \( |t| \) will be, and the stronger the evidence against \( H_0 \).
Technical note: We lose two degrees of freedom (rather than one) because estimating a linear relationship requires fixing both the slope and the intercept.
Two Ways to Perform a Hypothesis Test for Correlation
Using the test statistic \( t \) above, we can reach a conclusion using either of two approaches in the next table. Since Desmos does not perform hypothesis testing calculations for correlation in its inference tool, we can choose one of these two methods:
| p-value method | Critical value method |
|---|---|
Best when you have access to a t-distribution calculator that gives exact probabilities. |
Best when technology is limited. The \(r \) critical value table on the following page can also be used directly with \( r \) — see below. |
Shortcut: Critical Values for \( r \) Directly
Because the test statistic formula always uses the same inputs (\( r \) and \( n \)), statisticians often use pre-computed minimum values of \( |r| \) needed to reject \( H_0 \) at common significance levels. This gives us a table of critical values for \( r \) — a convenient shortcut that skips the test statistic calculation entirely.
How to use it: Find the row for your sample size \( n \) and the column for your chosen \( \alpha \). If \( |r| \) exceeds the table value, the correlation is statistically significant.
| Sample size \( n \) | Critical Value for \(r\) at \( \alpha = 0.05 \) significance level |
|---|---|
| 4 | 0.950 |
| 5 | 0.878 |
| 6 | 0.811 |
| 7 | 0.754 |
| 8 | 0.707 |
| 9 | 0.666 |
| 10 | 0.632 |
| 12 | 0.576 |
| 15 | 0.514 |
| 20 | 0.444 |
| 25 | 0.396 |
| 30 | 0.361 |
| 40 | 0.312 |
| 50 | 0.279 |
| 60 | 0.254 |
| 80 | 0.220 |
| 100 | 0.197 |
Reading the table: To be statistically significant at \( \alpha = 0.05 \) with \( n = 10 \) pairs, you need \( |r| > 0.632 \). Our dataset from Section 9.3 had \( n = 10 \) and \( r \approx 0.99 \) — well above this threshold.
Notice how the required \( |r| \) decreases as \( n \) increases. With a large enough sample, even a small correlation can be statistically significant. This is why we always consider both statistical significance and practical significance.
Worked Example: Study Hours and Exam Scores
Let's apply the full five-step process to our familiar dataset from Section 9.3: \( n = 10 \), \( r = 0.99 \).
Step 1: State the Hypotheses
- \( H_0: \rho = 0 \) (no linear relationship between study hours and exam scores)
- \( H_A: \rho \neq 0 \) (a linear relationship exists)
Step 2: Calculate the Test Statistic
Plug in \( r = 0.99 \) and \( n = 10 \):
\[ t = r\sqrt{\frac{n-2}{1-r^2}} = 0.99 \times \sqrt{\frac{10-2}{1-(0.99)^2}} = 0.99 \times \sqrt{\frac{8}{1-0.9801}} = 0.99 \times \sqrt{\frac{8}{0.0199}} \]
\[ = 0.99 \times \sqrt{401.99} \approx 0.99 \times 20.05 \approx 19.85 \]
Degrees of freedom: \( df = 10 - 2 = 8 \).
Step 3: Choose a Significance Level
Use \( \alpha = 0.05 \).
Step 4: Make the Decision
Using the critical value method: From a t-table at \( df = 8 \) and \( \alpha = 0.05 \) (two-tailed), \( t^* = 2.306 \). Since \( |t_{obs}| = 19.85 \gg 2.306 \), we reject \( H_0 \).
Using the \( r \) critical value table: At \( n = 10 \), \( \alpha = 0.05 \), the critical value is \( 0.632 \). Since \( |r| = 0.99 > 0.632 \), we reject \( H_0 \).
Both methods agree — as they always will.
Step 5: State the Conclusion
There is strong statistical evidence of a linear relationship between study hours and exam scores (\( r = 0.99, t(8) = 19.85, p < 0.05 \)). We reject the hypothesis that no linear relationship exists in the population.
Reflect: A test statistic of 19.85 is enormous — the correlation is so strong that there is virtually no chance it arose by random sampling variation. This gives us confidence that the linear model we build in Section 9.6 is describing something real.
A Caution: Significance Is Not the Whole Story
Statistical significance tells us the correlation is unlikely to be zero — but it does not tell us the correlation is large or meaningful. With a big enough sample, even a tiny correlation like \( r = 0.05 \) can be statistically significant. Always report \( r \) itself (and \( r^2 \)) alongside the significance test so readers can judge both the reality and the size of the relationship.
| It tells us… | It does NOT tell us… |
|---|---|
| The linear relationship is unlikely to be zero in the population | The relationship is strong or practically important |
| We have enough evidence to proceed with a linear model | One variable causes the other |
| The direction of the relationship (positive or negative) is real | The relationship is linear rather than curved |
- In our study hours example, \( n = 10 \) and \( r = 0.99 \). What if we had the same \( r = 0.99 \) but only \( n = 4 \) data points? Use the critical value table to check whether the correlation would still be significant at \( \alpha = 0.05 \).
- A study of 500 people finds \( r = 0.09 \) between daily steps and resting heart rate. The p-value is 0.044. Is this significant? Is it important? What would you tell a friend who read the headline "Step count linked to heart rate"?
- Why do we use \( df = n - 2 \) rather than \( df = n - 1 \) for this test?
Now that we have established that the linear relationship in our data is real and not simply a product of chance, we are ready to quantify how much of the variation in \( y \) that relationship explains. That is the focus of the next section: the coefficient of determination, \( r^2 \).

