Skip to main content

Registration is now open for this year's LibreFest! Join us virtually the week of July 13.

Register here
Statistics LibreTexts

9.4: Significance of Correlation

  • Page ID
    65715
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\dsum}{\displaystyle\sum\limits} \)

    \( \newcommand{\dint}{\displaystyle\int\limits} \)

    \( \newcommand{\dlim}{\displaystyle\lim\limits} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \(\newcommand{\longvect}{\overrightarrow}\)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    We now know how to calculate \( r \) and how to interpret its sign and magnitude. But before we use a correlation to draw conclusions — or build a regression model — we need to ask a more fundamental question:

    Could this correlation have occurred by random chance, even if the true relationship in the population is zero?

    This is a question about statistical significance, and it calls for a hypothesis test.


    Why This Matters

    Suppose we sample 8 students and calculate a correlation of \( r = 0.61 \) between study hours and exam scores. That sounds meaningful — but with only 8 data points, could a value that large have appeared just by chance, even in a population where study hours and exam scores are completely unrelated?

    The answer is: possibly yes. Small samples are noisy. A moderate \( r \) from a small sample may not be strong evidence of a real relationship in the population. We need a formal test to decide.

    This connects directly to the five-step hypothesis testing process from Chapter 8. The only thing that changes is the test statistic we use.


    Setting Up the Hypotheses

    We use \( \rho \) (the Greek letter rho) to denote the true population correlation coefficient — the correlation we would find if we could measure every individual in the population. Our sample value \( r \) is an estimate of \( \rho \).

    • \( H_0: \rho = 0 \)   (no linear relationship exists in the population)
    • \( H_A: \rho \neq 0 \)   (a linear relationship does exist)

    This is almost always a two-tailed test — we are asking whether the relationship is real, not specifically whether it is positive or negative. (One-tailed versions exist but are less common.)


    The Test Statistic

    When \( H_0: \rho = 0 \) is true, the sample correlation \( r \) follows a predictable pattern. We can convert it into a t-test statistic using the formula below — and that t-statistic follows the Student t-distribution with \( n - 2 \) degrees of freedom.

    Test statistic for the significance of a correlation:

    \[ t = r \sqrt{\frac{n - 2}{1 - r^2}} \]

    • \( r \) = the sample correlation coefficient
    • \( n \) = the number of data pairs
    • Degrees of freedom: \( df = n - 2 \)

    Notice that \( r^2 \) appears in the denominator — the coefficient of determination from Section 9.3.1 is built right into this formula. The larger \( |r| \) is, and the larger \( n \) is, the larger \( |t| \) will be, and the stronger the evidence against \( H_0 \).

    Technical note: We lose two degrees of freedom (rather than one) because estimating a linear relationship requires fixing both the slope and the intercept.


    Two Ways to Perform a Hypothesis Test for Correlation

    Using the test statistic \( t \) above, we can reach a conclusion using either of two approaches in the next table. Since Desmos does not perform hypothesis testing calculations for correlation in its inference tool, we can choose one of these two methods:

    Two approaches to testing the significance of a correlation.
    p-value method Critical value method
    1. Calculate \( t \) using the formula above.
    2. Look up or compute \( P(|t_{df}| > |t_{obs}|) \) — the two-tailed p-value from the t-distribution with \( df = n-2 \).  You can use Excel's =T.DIST.2T function or tdist( ) in Desmos in addition to other calculators.
    3. If \( p < \alpha \), reject \( H_0 \).

    Best when you have access to a t-distribution calculator that gives exact probabilities.

    1. Calculate \( t \) using the formula above.
    2. Find the critical value \( t^* \) from a t-table at your chosen \( \alpha \) with \( df = n-2 \).
    3. If \( |t_{obs}| > t^* \), reject \( H_0 \).

    Best when technology is limited. The \(r \) critical value table on the following page can also be used directly with \( r \) — see below.


    Shortcut: Critical Values for \( r \) Directly

    Because the test statistic formula always uses the same inputs (\( r \) and \( n \)), statisticians often use pre-computed minimum values of \( |r| \) needed to reject \( H_0 \) at common significance levels. This gives us a table of critical values for \( r \) — a convenient shortcut that skips the test statistic calculation entirely.

    How to use it: Find the row for your sample size \( n \) and the column for your chosen \( \alpha \). If \( |r| \) exceeds the table value, the correlation is statistically significant.

    Critical Values of \( r \) for a Two-Tailed Test (\( H_A: \rho \neq 0 \))
    Sample size \( n \) Critical Value for \(r\) at \( \alpha = 0.05 \) significance level
    4 0.950
    5 0.878
    6 0.811
    7 0.754
    8 0.707
    9 0.666
    10 0.632
    12 0.576
    15 0.514
    20 0.444
    25 0.396
    30 0.361
    40 0.312
    50 0.279
    60 0.254
    80 0.220
    100 0.197

    Reading the table: To be statistically significant at \( \alpha = 0.05 \) with \( n = 10 \) pairs, you need \( |r| > 0.632 \). Our dataset from Section 9.3 had \( n = 10 \) and \( r \approx 0.99 \) — well above this threshold.

    Notice how the required \( |r| \) decreases as \( n \) increases. With a large enough sample, even a small correlation can be statistically significant. This is why we always consider both statistical significance and practical significance.


    Worked Example: Study Hours and Exam Scores

    Let's apply the full five-step process to our familiar dataset from Section 9.3: \( n = 10 \), \( r = 0.99 \).

    Step 1: State the Hypotheses

    • \( H_0: \rho = 0 \)   (no linear relationship between study hours and exam scores)
    • \( H_A: \rho \neq 0 \)   (a linear relationship exists)

    Step 2: Calculate the Test Statistic

    Plug in \( r = 0.99 \) and \( n = 10 \):

    \[ t = r\sqrt{\frac{n-2}{1-r^2}} = 0.99 \times \sqrt{\frac{10-2}{1-(0.99)^2}} = 0.99 \times \sqrt{\frac{8}{1-0.9801}} = 0.99 \times \sqrt{\frac{8}{0.0199}} \]

    \[ = 0.99 \times \sqrt{401.99} \approx 0.99 \times 20.05 \approx 19.85 \]

    Degrees of freedom: \( df = 10 - 2 = 8 \).

    Step 3: Choose a Significance Level

    Use \( \alpha = 0.05 \).

    Step 4: Make the Decision

    Using the critical value method: From a t-table at \( df = 8 \) and \( \alpha = 0.05 \) (two-tailed), \( t^* = 2.306 \). Since \( |t_{obs}| = 19.85 \gg 2.306 \), we reject \( H_0 \).

    Using the \( r \) critical value table: At \( n = 10 \), \( \alpha = 0.05 \), the critical value is \( 0.632 \). Since \( |r| = 0.99 > 0.632 \), we reject \( H_0 \).

    Both methods agree — as they always will.

    Step 5: State the Conclusion

    There is strong statistical evidence of a linear relationship between study hours and exam scores (\( r = 0.99, t(8) = 19.85, p < 0.05 \)). We reject the hypothesis that no linear relationship exists in the population.

    Reflect: A test statistic of 19.85 is enormous — the correlation is so strong that there is virtually no chance it arose by random sampling variation. This gives us confidence that the linear model we build in Section 9.6 is describing something real.


    A Caution: Significance Is Not the Whole Story

    Statistical significance tells us the correlation is unlikely to be zero — but it does not tell us the correlation is large or meaningful. With a big enough sample, even a tiny correlation like \( r = 0.05 \) can be statistically significant. Always report \( r \) itself (and \( r^2 \)) alongside the significance test so readers can judge both the reality and the size of the relationship.

    What significance of \( r \) does and does not tell us.
    It tells us… It does NOT tell us…
    The linear relationship is unlikely to be zero in the population The relationship is strong or practically important
    We have enough evidence to proceed with a linear model One variable causes the other
    The direction of the relationship (positive or negative) is real The relationship is linear rather than curved

    Think About It:
    • In our study hours example, \( n = 10 \) and \( r = 0.99 \). What if we had the same \( r = 0.99 \) but only \( n = 4 \) data points? Use the critical value table to check whether the correlation would still be significant at \( \alpha = 0.05 \).
    • A study of 500 people finds \( r = 0.09 \) between daily steps and resting heart rate. The p-value is 0.044. Is this significant? Is it important? What would you tell a friend who read the headline "Step count linked to heart rate"?
    • Why do we use \( df = n - 2 \) rather than \( df = n - 1 \) for this test?

    Now that we have established that the linear relationship in our data is real and not simply a product of chance, we are ready to quantify how much of the variation in \( y \) that relationship explains. That is the focus of the next section: the coefficient of determination, \( r^2 \).


    9.4: Significance of Correlation is shared under a not declared license and was authored, remixed, and/or curated by LibreTexts.

    • Was this article helpful?