Statistics LibreTexts

1.7: Mathematical Notation


    As noted above, statistics is not math. It does, however, use math as a tool. Many statistical formulas involve summing numbers. Fortunately, there is a convenient notation for expressing summation. This section covers the basics of summation notation.

    Let's say we have a variable \(\mathrm{X}\) that represents the weights (in grams) of 4 grapes:

    Table \(\PageIndex{1}\)
    Grape    \(\mathrm{X}\) (grams)
    1        4.6
    2        5.1
    3        4.9
    4        4.4

    We label Grape 1's weight \(\mathrm{X}_{1}\), Grape 2's weight \(\mathrm{X}_{2}\), etc. The following formula means to sum up the weights of the four grapes:

    \[\sum_{i=1}^{4} X_{i} \]

    The Greek letter \(\Sigma\) indicates summation. The “\(i = 1\)” at the bottom indicates that the summation is to start with \(\mathrm{X}_{1}\), and the 4 at the top indicates that the summation will end with \(\mathrm{X}_{4}\). The “\(\mathrm{X}_{i}\)” indicates that \(\mathrm{X}\) is the variable to be summed as \(i\) goes from 1 to 4. Therefore,

    \[\sum_{i=1}^{4} X_{i}=X_{1}+X_{2}+X_{3}+X_{4}=4.6+5.1+4.9+4.4=19 \nonumber \]
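
    The same summation can be sketched in Python. This is only an illustration: the names `weights` and `total` are assumptions introduced here, and the values are the grape weights from Table 1.

```python
# Grape weights in grams from Table 1 (X_1 through X_4).
weights = [4.6, 5.1, 4.9, 4.4]

# Sum X_i for i = 1 to 4. Python lists are 0-indexed, so X_i is weights[i - 1].
total = sum(weights[i - 1] for i in range(1, 5))

# Floating-point addition can leave a tiny rounding error, so round for display.
print(round(total, 1))  # 19.0
```

    The explicit index \(i\) mirrors the notation; in everyday code, `sum(weights)` gives the same result.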

    The symbol

    \[\sum_{i=1}^{3} X_{i} \nonumber \]

    indicates that only the first 3 scores are to be summed. The index variable \(i\) goes from 1 to 3.
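
    A partial sum such as this one can be sketched with a small helper. The function name `partial_sum` and its 1-based, inclusive arguments are hypothetical, chosen here to match the lower and upper limits written below and above \(\Sigma\).

```python
def partial_sum(values, start, end):
    """Sum values[start] through values[end], using 1-based inclusive indices."""
    # Slicing is 0-based and excludes the stop index, hence start - 1 and end.
    return sum(values[start - 1:end])


# Grape weights in grams from Table 1.
weights = [4.6, 5.1, 4.9, 4.4]

# Sum of X_i for i = 1 to 3: 4.6 + 5.1 + 4.9
first_three = partial_sum(weights, 1, 3)
print(round(first_three, 1))  # 14.6
```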

    When all the scores of a variable (such as \(\mathrm{X}\)) are to be summed, it is often convenient to use the following abbreviated notation:

    \[\sum \mathrm{X} \nonumber \]

    Thus, when no limits for \(i\) are shown, the notation means to sum all the values of \(\mathrm{X}\).

    Many formulas involve squaring numbers before they are summed. This is indicated as

    \[\begin{array}{l}{\sum X^{2}= 4.6^{2}+5.1^{2}+4.9^{2}+4.4^{2}} \\ {\quad \quad=21.16+26.01+24.01+19.36=90.54}\end{array} \nonumber \]

    Notice that:

    \[\left(\sum \mathrm{X} \right)^{2} \neq \sum \mathrm{X}^{2} \]

    because the expression on the left means to sum up all the values of \(\mathrm{X}\) and then square the sum (\(19^{2} = 361\)), whereas the expression on the right means to square the numbers and then sum the squares (90.54, as shown).
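
    The difference between the two expressions is easy to check numerically. The sketch below uses the grape weights from Table 1; the variable names are assumptions made for illustration.

```python
# Grape weights in grams from Table 1.
weights = [4.6, 5.1, 4.9, 4.4]

# (sum of X) squared: add first, then square the total.
square_of_sum = sum(weights) ** 2

# sum of X squared: square each value first, then add.
sum_of_squares = sum(x ** 2 for x in weights)

# Round for display, since floating-point arithmetic is not exact.
print(round(square_of_sum, 2))   # 361.0
print(round(sum_of_squares, 2))  # 90.54
```

    The order of operations is the only thing that differs between the two lines, yet the results are far apart.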

    Some formulas involve the sum of cross products. Below are the data for variables \(\mathrm{X}\) and \(\mathrm{Y}\). The cross products (\(\mathrm{XY}\)) are shown in the third column. The sum of the cross products is 3 + 4 + 21 = 28.

    Table \(\PageIndex{2}\)
    \(\mathrm{X}\)    \(\mathrm{Y}\)    \(\mathrm{XY}\)
    1    3    3
    2    2    4
    3    7    21

    In summation notation, this is written as:

    \[\sum \mathrm{XY} = 28 \nonumber \]
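
    As a rough sketch, the sum of cross products can be computed by pairing the values row by row. The list names `x`, `y`, and `cross_products` are assumptions; the data come from Table 2.

```python
# Paired values of X and Y from Table 2.
x = [1, 2, 3]
y = [3, 2, 7]

# Multiply each X by the Y in the same row to get the cross products.
cross_products = [xi * yi for xi, yi in zip(x, y)]

# Sum of XY: add up the cross products.
sum_xy = sum(cross_products)

print(cross_products)  # [3, 4, 21]
print(sum_xy)          # 28
```

    Note that this is different from multiplying the two column sums, which would give \(6 \times 12 = 72\).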

    1.7: Mathematical Notation is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by Foster et al. (University of Missouri’s Affordable and Open Access Educational Resources Initiative) via source content that was edited to conform to the style and standards of the LibreTexts platform; a detailed edit history is available upon request.