0: Notation and Symbols Used in Statistics
Statistics (and mathematics) uses symbols to make things easier and clearer. Symbols help express complex ideas quickly and efficiently, allowing us to communicate universally without language barriers. They ensure precision and avoid ambiguity, making understanding and working with mathematical concepts easier. Using symbols, we can manage complexity, represent abstract ideas, and recognize patterns more effectively. This way, math becomes a powerful tool for exploring and understanding the world around us. By the end of this section, readers will be equipped with the notation skills necessary to navigate through statistical literature and confidently perform their own analyses.
Formal definitions for population and sample will be provided in 1.2: Definitions of Statistics, Probability, and Key Terms.
In statistics, a population refers to the entire group of individuals or items that we are interested in studying. It includes all possible subjects that fit the criteria of the research.
For example, suppose we are studying the heights of adult women in Fresno. In 2024, Fresno, California had a population of 641,528 people, of whom 190,584 were adult women. The population for our study is therefore the 190,584 adult women living in Fresno.
A sample, on the other hand, is a subset of the population selected for the actual study. It represents a smaller group chosen from the population, ideally in a way that accurately reflects the larger group.
Continuing with the same example, a sample might include a smaller group of 400 adult women from different parts of Fresno.
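The relationship between a population and a sample can be sketched in Python. This is only a minimal illustration, assuming each woman in the population is identified by a hypothetical ID number and that the sample is a simple random sample; the names `POPULATION_SIZE`, `SAMPLE_SIZE`, and `draw_sample` are our own and do not come from the text.

```python
import random

POPULATION_SIZE = 190_584  # adult women in Fresno (figure from the example)
SAMPLE_SIZE = 400          # size of the sample in the example


def draw_sample(population_size, sample_size, seed=None):
    """Return a simple random sample of hypothetical ID numbers 1..population_size."""
    rng = random.Random(seed)
    # rng.sample draws without replacement, so nobody is selected twice.
    return rng.sample(range(1, population_size + 1), sample_size)


sample_ids = draw_sample(POPULATION_SIZE, SAMPLE_SIZE, seed=0)
print(len(sample_ids))  # 400
```

Every ID in `sample_ids` also belongs to the population, which is what makes the sample a subset of it.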
Statistics uses Greek and Latin letters for clarity, tradition, and effectively distinguishing different types of data. Typically, Greek letters describe population information, while Latin (or Latin-looking) letters represent sample information.
Latin Alphabet
Aa, Bb, Cc, Dd, Ee, Ff, Gg, Hh, Ii, Jj, Kk, Ll, Mm, Nn, Oo, Pp, Qq, Rr, Ss, Tt, Uu, Vv, Ww, Xx, Yy, Zz
Greek Alphabet
The highlighted letters are letters we use in this book.
Uppercase, Lowercase | Name | Uppercase, Lowercase | Name | Uppercase, Lowercase | Name |
---|---|---|---|---|---|
\(A\), \(\alpha\) | alpha | \(I\), \(\iota\) | iota | \(P\), \(\rho\) | rho |
\(B\), \(\beta\) | beta | \(K\), \(\kappa\) | kappa | \(\Sigma\), \(\sigma\) | sigma |
\(\Gamma\), \(\gamma\) | gamma | \(\Lambda\), \(\lambda\) | lambda | \(T\), \(\tau\) | tau |
\(\Delta\), \(\delta\) | delta | \(M\), \(\mu\) | mu | \(\Upsilon\), \(\upsilon\) | upsilon |
\(E\), \(\epsilon\) | epsilon | \(N\), \(\nu\) | nu | \(\Phi\), \(\phi\) | phi |
\(Z\), \(\zeta\) | zeta | \(\Xi\), \(\xi\) | xi | \(X\), \(\chi\) | chi |
\(H\), \(\eta\) | eta | \(O\), \(o\) | omicron | \(\Psi\), \(\psi\) | psi |
\(\Theta\), \(\theta\) | theta | \(\Pi\), \(\pi\) | pi | \(\Omega\), \(\omega\) | omega |
Lower and uppercase letters are used to distinguish between different types of quantities and concepts. This notation helps provide clarity and consistency in statistical communication.
- Lowercase Letters:
  - Typically used for sample statistics, individual data points, and variables. Lowercase Greek letters are the exception: they denote population parameters.
  - Examples: \(x\) - a single data point, \(n\) - sample size, \(\bar x\) - sample mean, \(s\) - sample standard deviation, \(\mu\) - population mean, \(\sigma\) - population standard deviation
- Uppercase Letters:
  - Typically used for population parameters, random variables, and distributions.
  - Examples: \(X\) - random variable, \(N\) - normal distribution, \(S\) - may represent the sample space (the set of all possible outcomes of an experiment).
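To connect this notation to a computation, here is a small Python sketch. The five data values are made up purely for illustration, and the variable names `x`, `n`, `x_bar`, and `s` are our own choices, following the common convention that \(\bar x\) is the sample mean and \(s\) is the sample standard deviation.

```python
import statistics

# Hypothetical sample of five heights in inches (made-up values for illustration).
x = [62.0, 64.5, 63.0, 66.0, 64.5]

n = len(x)                   # n: sample size
x_bar = statistics.mean(x)   # x-bar: sample mean
s = statistics.stdev(x)      # s: sample standard deviation (divides by n - 1)

print(n, x_bar, round(s, 3))
```

The population counterparts \(\mu\) and \(\sigma\) cannot be computed from a sample alone; \(\bar x\) and \(s\) serve as estimates of them.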
Distribution notation in mathematics and statistics is used to describe how values of a random variable are spread or distributed. This notation conveys information about the probability distribution that a random variable follows, allowing us to understand its behavior and make predictions based on it.
Common Distribution Notations
- Discrete Distributions:
  - Binomial Distribution: \(X \sim B(n, p)\)
    - Meaning: Random variable \(X\) follows a binomial distribution with \(n\) trials and probability of success \(p\) in each trial.
  - Poisson Distribution: \(X \sim P(\mu)\)
    - Meaning: Random variable \(X\) follows a Poisson distribution with mean \(\mu\).
- Continuous Distributions:
  - Normal Distribution: \(X \sim N(\mu, \sigma)\)
    - Meaning: Random variable \(X\) follows a normal distribution with mean \(\mu\) and standard deviation \(\sigma\).
  - Exponential Distribution: \(X \sim \text{Exp}(\lambda)\)
    - Meaning: Random variable \(X\) follows an exponential distribution with rate parameter \(\lambda\).
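Each of these notations names a family of distributions together with its parameters. The Python sketch below shows one way to turn the four notations into computations using only the standard library. The function names `binomial_pmf`, `poisson_pmf`, and `exponential_pdf` and the parameter values in the printed examples are our own illustrative choices.

```python
import math
from statistics import NormalDist


def binomial_pmf(k, n, p):
    """P(X = k) when X ~ B(n, p)."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)


def poisson_pmf(k, mu):
    """P(X = k) when X ~ P(mu)."""
    return math.exp(-mu) * mu**k / math.factorial(k)


def exponential_pdf(x, lam):
    """Density at x when X ~ Exp(lam), with lam the rate parameter."""
    return lam * math.exp(-lam * x) if x >= 0 else 0.0


# X ~ N(mu, sigma): NormalDist takes the mean and the standard deviation.
standard_normal = NormalDist(mu=0.0, sigma=1.0)

print(binomial_pmf(2, 10, 0.5))   # probability of exactly 2 successes in 10 trials
print(poisson_pmf(0, 3.0))        # probability of 0 events when the mean is 3
print(standard_normal.cdf(0.0))   # P(X <= 0) for the standard normal
print(exponential_pdf(1.0, 2.0))  # density at x = 1 with rate 2
```

Note that some references write the normal distribution with the variance \(\sigma^2\) as the second parameter; the sketch follows the convention used here, where the second parameter is the standard deviation.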
Index or Subscript: An index or subscript on variables is a notation used to distinguish between multiple related variables, typically in contexts involving sequences, arrays, or matrices. The notation of the index (subscript) is a small number, letter, or symbol written slightly below and to the right of a variable.
Example: \(x_1, x_2, x_3, \ldots, x_n\) where \(x_i\) represents the \(i^{\text{th}}\) element in a sequence.
Example: If we want to add up \(x_1, x_2, x_3, x_4, x_5, x_6, x_7, x_8, x_9, x_{10}\), we can use summation notation, where the index \(n\) starts at 1 and goes up to 10.
\[\sum_{n=1}^{10} x_{n}=x_{1}+x_{2}+x_{3}+x_{4}+x_{5}+x_{6}+x_{7}+x_{8}+x_{9}+x_{10}\nonumber\]
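The same sum can be sketched in Python. The ten data values are made up for illustration, and the helper `x_sub` is a hypothetical name of our own; it translates the 1-based subscripts used in the text into Python's 0-based list positions.

```python
# Hypothetical data values x_1, ..., x_10 (made up for illustration).
x = [4, 8, 15, 16, 23, 42, 7, 1, 9, 3]


def x_sub(i):
    """Return x_i using the 1-based subscript from the text (Python lists are 0-based)."""
    return x[i - 1]


# Sum of x_n for n = 1, ..., 10; range(1, 11) stops before 11.
total = sum(x_sub(n) for n in range(1, 11))

print(total == sum(x))  # True
```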
Given a sequence \(\left\{ a_{n} \right\}_{n=k}^{\infty}\) and numbers \(m\) and \(p\) satisfying \(k \leq m \leq p\), the summation from \(m\) to \(p\) of the sequence \(\left\{a_{n}\right\}\) is written
\[\sum_{n=m}^{p} a_{n}=a_{m}+a_{m+1}+\ldots+a_{p}\nonumber\]
The variable \(n\) is called the index of summation. The number \(m\) is called the lower limit of summation while the number \(p\) is called the upper limit of summation.
In plain English, the definition above simply introduces a shorthand notation for adding up the terms of the sequence \(\left\{ a_{n} \right\}_{n=k}^{\infty}\) from \(a_{m}\) through \(a_{p}\). The symbol \(\Sigma\) is the capital Greek letter sigma and is shorthand for ‘sum’. The lower and upper limits of the summation tell us which term to start with and which term to end with, respectively. For example, using the sequence \(a_{n} = 2n-1\) for \(n \geq 1\), we can write the sum \(a_{3} +a_{4} + a_{5} + a_{6}\) as
\[\begin{array}{rcl} \displaystyle{\sum_{n=3}^{6}(2n-1) } & = & (2(3)-1) + (2(4)-1) + (2(5)-1) + (2(6)-1) \\ & = & 5 + 7 + 9 + 11 \\ & = & 32 \\ \end{array}\nonumber\]
The index variable is considered a ‘dummy variable’ in the sense that it may be changed to any letter without affecting the value of the summation. For instance,
\[\displaystyle{\sum_{n=3}^{6}(2n-1)} = \displaystyle{\sum_{k=3}^{6}(2k-1)} = \displaystyle{\sum_{j=3}^{6}(2j-1)}\nonumber\]
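As a quick check, the sum of \(2n-1\) from 3 to 6 can be reproduced in Python. This is a minimal sketch; the function names `summation` and `a` are our own, with `a` standing in for the sequence \(a_{n} = 2n-1\).

```python
def summation(a, m, p):
    """Add a(m) + a(m + 1) + ... + a(p), mirroring the sum from n = m to p."""
    return sum(a(n) for n in range(m, p + 1))


def a(n):
    """The sequence a_n = 2n - 1 from the example."""
    return 2 * n - 1


print(summation(a, 3, 6))  # 32

# The index is a dummy variable: renaming it does not change the value.
print(sum(2 * k - 1 for k in range(3, 7)) == sum(2 * j - 1 for j in range(3, 7)))  # True
```

Because Python's `range` excludes its upper end, the upper limit \(p\) appears as `p + 1` in the code.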
One place you may encounter summation notation is in mathematical definitions. For example, summation notation allows us to define polynomials as functions of the form
\[f(x) = \displaystyle{\sum_{k=0}^{n} a_{k} x^{k}}\nonumber\]
for real numbers \(a_{k}\), \(k = 0, 1, \ldots, n\). The reader is invited to compare this with what is given in Definition 3.1. Summation notation is particularly useful when talking about matrix operations. For example, we can write the product of the \(i^{\text{th}}\) row \(R_{i}\) of a matrix \(A = [a_{ij}]_{m \times n}\) and the \(j^{\text{th}}\) column \(C_{j}\) of a matrix \(B = [b_{ij}]_{n \times r}\) as
\[R_{i} \cdot C_{j} = \displaystyle{\sum_{k=1}^{n} a_{ik}b_{kj}}\nonumber\]
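Both of these uses of summation notation translate directly into code. The sketch below is illustrative only: the function names `poly_eval` and `row_times_column`, the coefficient list, and the two small matrices are hypothetical choices of our own, with matrices stored as lists of rows.

```python
def poly_eval(coeffs, x):
    """Evaluate f(x) = sum over k of a_k * x**k, where coeffs[k] holds a_k."""
    return sum(a_k * x**k for k, a_k in enumerate(coeffs))


def row_times_column(A, B, i, j):
    """Return the sum over k of a_ik * b_kj, using the 1-based i and j of the text."""
    n = len(B)  # number of rows of B, which must equal the number of columns of A
    return sum(A[i - 1][k] * B[k][j - 1] for k in range(n))


# Hypothetical inputs, chosen only for illustration.
coeffs = [1, 0, 2]     # f(x) = 1 + 0x + 2x^2
A = [[1, 2], [3, 4]]   # 2 x 2 matrix
B = [[5, 6], [7, 8]]   # 2 x 2 matrix

print(poly_eval(coeffs, 3))          # 1 + 2(9) = 19
print(row_times_column(A, B, 1, 2))  # 1(6) + 2(8) = 22
```

In the code the index `k` runs from 0 to `n - 1` rather than from 1 to \(n\), since Python lists are 0-based; the terms being added are the same.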