Skip to main content
Statistics LibreTexts

4.5: Variance

  • Page ID
    50011
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    Another important measure of variability is the variance. Variance is a descriptive statistic that summarizes how far raw scores tended to fall from the mean in squared units. It can also be thought of as the squared estimate of the error that occurs when the mean is used to estimate the value of a raw score. Thus, the variance is just the standard deviation left in squared units.

    However, we generally best understand and, thus, report results in standard units. For this reason, variances are rarely reported as final results. Instead, variances are often used as important parts of other formulas (such as in an independent samples t-test which we will cover in Chapter 8).

    Decoding the Symbols

    The symbols within the variance formulas are the same as those within the respective standard deviation formulas: \(x\) refers to an individual raw score, \(\mu \) refers to the population mean, \(\bar{x}\) refers to the sample mean, \(N\) refers to the population size, and \(n\) refers to the sample size. The variance is just the standard deviation left in squared units. For the sake of simplicity and to keep this clear, the symbols for variance are just the standard deviation symbols adjusted to reflect this. Specifically, σ² is used to refer to variances of populations and SD² or are used to refer to the variances of samples.

    Calculating the Variance

    Just as was true for standard deviation, there are two formulas for variance: one for populations and one for samples. The formulas use the same symbols and many of the same steps as the standard deviation.

    Population Variance

    Here is the formula and required steps for finding the variance of a population:

    Variance for a Population
    Formula Calculation Steps
    \(\sigma^2=\dfrac{\sum(x-\mu)^2}{N}\)
    1. Find the mean.
    2. Subtract the mean from each raw score to find each deviation.
    3. Square each deviation.
    4. Sum the squared deviations to find SS.
    5. Divide SS by \(N\).

    Notice that these are the same as the first five steps to the population standard deviation formula. This leaves the result for variance in squared units, as is indicated by the symbol for variance which is \(\sigma^2\).

    Sample Variance Formula

    You can see the sample variance formula below. This formula is very similar to the population formula with two exceptions:

    1. It uses sample symbols such as \(\bar{x}\) in place of \(\mu \) and
    2. It uses an adjusted sample size in its denominator.

    These are the same exceptions we saw when we moved from the formula for population standard deviations to the formula for sample standard deviations and they serve the same purpose here.

    Variance for a Sample
    Formula Calculation Steps
    \(s^2=\dfrac{\sum(\mathrm{x}-\overline{\mathrm{x}})^2}{n-1}\)
    1. Find the mean.
    2. Subtract the mean from each raw score to find each deviation.
    3. Square each deviation.
    4. Sum the squared deviations to find SS.
    5. Find the adjusted sample size by subtracting 1 from \(n\).
    6. Divide SS by the adjusted sample size.

    Note that because the variance is calculated using the first six steps to finding a standard deviation, we have already calculated the variance for Data Set 4.1 when we found its standard deviation. Take a look at Table 2 which shows calculations for variation. Everything is the same as in Table 1 which showed the calculations for standard deviation for Data Set 4.1 until the last step. Specifically, instead of using the symbol s for standard deviation the symbol for variance appears and the square root sign is missing. Therefore, the result, when rounded to the hundredths place, is simply 88.10. This is the same value we had under the square root sign in the penultimate step for calculating standard deviation in Table 4.1. By comparing the two tables we can see how the variance was found in Table 4.1 in the process of ultimately calculating the standard deviation.

    Table 2 Standard Deviation Calculations for Data Set 4.1 (n = 22)
    Raw Score Mean Deviation Squared Deviation
    36 29 7 49
    34 29 5 25
    33 29 4 16
    33 29 4 16
    32 29 3 9
    29 29 0 0
    29 29 0 0
    29 29 0 0
    28 29 -1 1
    27 29 -2 4
    25 29 -4 16
    23 29 -6 36
    20 29 -9 81
    19 29 -10 100
    19 29 -10 100
    18 29 -11 121
    16 29 -13 169
    14 29 -15 225
        Sum of Squared
        Sum of Deviations = 0 Deviations (SS) = 1,850
    \[s^2=\dfrac{1,850}{22-1}=\dfrac{1,850}{21}=88.0952381 \ldots \approx 88.10 \nonumber \]

    This page titled 4.5: Variance is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by Christina R. Peter.

    • Was this article helpful?