4.1: Normal Distribution
- Page ID
- 56921
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\dsum}{\displaystyle\sum\limits} \)
\( \newcommand{\dint}{\displaystyle\int\limits} \)
\( \newcommand{\dlim}{\displaystyle\lim\limits} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\(\newcommand{\longvect}{\overrightarrow}\)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)Among all the distributions we see in practice, one is overwhelmingly the most common. The symmetric, unimodal, bell curve is ubiquitous throughout statistics. Indeed it is so common, that people often know it as the normal curve or normal distribution1 shown in Figure 4.1. Variables such as SAT scores and heights of US adult males closely follow the normal distribution.
Many variables are nearly normal, but none are exactly normal. Thus the normal distribution, while not perfect for any single problem, is very useful for a variety of problems. We will use it in data exploration and to solve important problems in statistics.
Normal Distribution Model
The normal distribution always describes a symmetric, unimodal, bell-shaped curve. However, these curves can look different depending on the details of the model. Specifically, the normal distribution model can be adjusted using two parameters: mean and standard deviation. As you can probably guess, changing the mean shifts the bell curve to the left or right, while changing the standard deviation stretches or constricts the curve. Figure 4.2 shows the normal distribution with mean 0 and standard deviation 1 in the left panel and the normal distributions with mean 19 and standard deviation 4 in the right panel. Figure 4.3 shows these distributions on the same axis.
If a normal distribution has mean \(\mu\) and standard deviation \(\sigma\), we may write the distribution as \(N(\mu, \sigma)\). The two distributions in Figure 4.3 may be written as
\[\begin{aligned} N(\mu=0,\sigma=1) \quad \text{and} \quad N(\mu=19,\sigma=4)\end{aligned}\]
Because the mean and standard deviation describe a normal distribution exactly, they are called the distribution’s parameters. The normal distribution with mean \(\mu = 0\) and standard deviation \(\sigma = 1\) is called the standard normal distribution.
Write down the short-hand for a normal distribution with
- mean 5 and standard deviation 3,
- mean -100 and standard deviation 10, and
- mean 2 and standard deviation 9.
- Answer
-
- \(N(\mu=5, \sigma=3)\)
- \(N(\mu=-100, \sigma=10)\)
- \(N(\mu=2, \sigma=9)\)
Standardizing with Z-scores
We often want to put data onto a standardized scale, which can make comparisons more reasonable.
Table 4.4 shows the mean and standard deviation for total scores on the SAT and ACT. The distribution of SAT and ACT scores are both nearly normal. Suppose Ann scored 1300 on her SAT and Tom scored 24 on his ACT. Who performed better?
Solution
We use the standard deviation as a guide. Ann is 1 standard deviation above average on the SAT: 1100 + 200 = 1300. Tom is 0.5 standard deviations above the mean on the ACT: 21 + 0.5 × 6 = 24. In Figure 4.5, we can see that Ann tends to do better with respect to everyone else than Tom did, so her score was better.
| SAT | ACT | |
|---|---|---|
| Mean | 1100 | 21 |
| SD | 200 | 6 |
Example 4.2 used a standardization technique called a Z-score, a method most commonly employed for nearly normal observations but that may be used with any distribution. The of an observation is defined as the number of standard deviations it falls above or below the mean. If the observation is one standard deviation above the mean, its Z-score is 1. If it is 1.5 standard deviations below the mean, then its Z-score is -1.5. If \(x\) is an observation from a distribution \(N(\mu, \sigma)\), we define the Z-score mathematically as
\[\begin{aligned} Z = \frac{x - \mu}{\sigma}\end{aligned}\]
Using \(\mu_{SAT} = 1100{}\), \(\sigma_{SAT} = 200{}\), and \(x_{_{\text{Ann}}} = 1300{}\), we find Ann’s Z-score:
\[\begin{aligned} Z_{_{\text{Ann}}} = \frac{x_{_{\text{Ann}}} - \mu_{_{\text{SAT}}}} {\sigma_{_{\text{SAT}}}} = \frac{1300{} - 1100{}}{200{}} = 1{}\end{aligned}\]
The Z-score of an observation is the number of standard deviations it falls above or below the mean. We compute the Z-score for an observation \(x\) that follows a distribution with mean \(\mu\) and standard deviation \(\sigma\) using
\[\begin{aligned} Z = \frac{x - \mu}{\sigma} \end{aligned}\]
Use Tom’s ACT score, 24, along with the ACT mean and standard deviation to find his Z-score.
- Answer
-
Add texts here. Do not delete this text first.
Observations above the mean always have positive Z-scores, while those below the mean always have negative Z-scores. If an observation is equal to the mean, such as an SAT score of 1100, then the Z-score is \(0\).
Let \(X\) represent a random variable from \(N(\mu=3, \sigma=2)\), and suppose we observe \(x=5.19\).
- Find the Z-score of \(x\).
- Use the Z-score to determine how many standard deviations above or below the mean \(x\) falls.
- Answer
-
Add texts here. Do not delete this text first.
Head lengths of brushtail possums follow a normal distribution with mean 92.6 mm and standard deviation 3.6 mm. Compute the Z-scores for possums with head lengths of 95.4 mm and 85.8 mm.
- Answer
-
Add texts here. Do not delete this text first.
We can use Z-scores to roughly identify which observations are more unusual than others. An observation \(x_1\) is said to be more unusual than another observation \(x_2\) if the absolute value of its Z-score is larger than the absolute value of the other observation’s Z-score: \(|Z_1| > |Z_2|\). This technique is especially insightful when a distribution is symmetric.
Add exercises text here.
- Answer
-
Add texts here. Do not delete this text first.
Which of the observations in Guided Practice [headLZScore] is more unusual?
Finding tail areas
It’s very useful in statistics to be able to identify tail areas of distributions. For instance, what fraction of people have an SAT score below Ann’s score of 1300? This is the same as the percentile Ann is at, which is the percentage of cases that have lower scores than Ann. We can visualize such a tail area like the curve and shading shown in Figure 4.6.
There are many techniques for doing this, and we’ll discuss three of the options.
-
The most common approach in practice is to use statistical software. For example, in the program , we could find the area shown in Figure [satBelow1300] using the following command, which takes in the Z-score and returns the lower tail area:
.....> pnorm(1)
.....[1] 0.8413447
According to this calculation, the region shaded that is below 1300 represents the proportion 0.841 (84.1%) of SAT test takers who had Z-scores below \(Z = 1\). More generally, we can also specify the cutoff explicitly if we also note the mean and standard deviation:
.....> pnorm(1300, mean = 1100, sd = 200)
.....[1] 0.8413447There are many other software options, such as Python or SAS; even spreadsheet programs such as Excel and Google Sheets support these calculations.
- A common strategy in classrooms is to use a graphing calculator, such as a TI or Casio calculator. These calculators require a series of button presses that are less concisely described. You can find instructions on using these calculators for finding tail areas of a normal distribution in the OpenIntro video library:
- The last option for finding tail areas is to use what’s called a ; these are occasionally used in classrooms but rarely in practice. Appendix [normalProbabilityTable] contains such a table and a guide for how to use it.
We will solve normal distribution problems in this section by always first finding the Z-score. The reason is that we will encounter close parallels called beginning in Chapter [ch_foundations_for_inf]; these are, in many instances, an equivalent of a Z-score.
Normal probability examples
Cumulative SAT scores are approximated well by a normal model, \(N(\mu = 1100{}, \sigma = 200{})\).
Shannon is a randomly selected SAT taker, and nothing is known about Shannon’s SAT aptitude. What is the probability Shannon scores at least 1190 on her SATs?
Solution
First, always draw and label a picture of the normal distribution. (Drawings need not be exact to be useful.) We are interested in the chance she scores above 1190, so we shade this upper tail:

The picture shows the mean and the values at 2 standard deviations above and below the mean. The simplest way to find the shaded area under the curve makes use of the Z-score of the cutoff value. With \(\mu = 1100{}\), \(\sigma = 200{}\), and the cutoff value \(x = 1190{}\), the Z-score is computed as
\[\begin{aligned} Z = \frac{x - \mu}{\sigma} = \frac{1190{} - 1100{}}{200{}} = \frac{90}{200{}} = 0.45{} \end{aligned}\]
Using statistical software (or another preferred method), we can find the area left of \(Z = 0.45{}\) as 0.6736. To find the area above \(Z = 0.45{}\), we compute one minus the area of the lower tail:

The probability Shannon scores at least 1190 on the SAT is 0.3264.
For any normal probability situation, always always always draw and label the normal curve and shade the area of interest first. The picture will provide an estimate of the probability. After drawing a figure to represent the situation, identify the Z-score for the value of interest.
If the probability of Shannon scoring at least 1190 is 0.3264, then what is the probability she scores less than 1190? Draw the normal curve representing this exercise, shading the lower region instead of the upper one.
Solution
Add example text here.
Edward earned a 1030 on his SAT. What is his percentile?
Solution
First, a picture is needed. Edward’s is the proportion of people who do not get as high as a 1030. These are the scores to the left of 1030.
Identifying the mean \(\mu=1100{}\), the standard deviation \(\sigma=200{}\), and the cutoff for the tail area \(x=1030{}\) makes it easy to compute the Z-score:
\[\begin{aligned} Z = \frac{x - \mu}{\sigma} = \frac{1030{} - 1100{}}{200{}} = -0.35{}\end{aligned}\]
Using statistical software, we get a tail area of 0.3632. Edward is at the \(36^{th}\) percentile.
Use the results of Example 4.9 to compute the proportion of SAT takers who did better than Edward. Also draw a new picture.
- Answer
-
Add texts here. Do not delete this text first.
Many software programs return the area to the left when given a Z-score. If you would like the area to the right, first find the area to the left and then subtract this amount from one.
Stuart earned an SAT score of 1500. Draw a picture for each part.
- What is his percentile?
- What percent of SAT takers did better than Stuart?
- Answer
-
Add texts here. Do not delete this text first.
Based on a sample of 100 men, the heights of male adults in the US is nearly normal with mean 70.0” and standard deviation 3.3”.
Mike is 5’7” and Jose is 6’4”, and they both live in the US.
- What is Mike’s height percentile?
- What is Jose’s height percentile?
Also draw one picture for each part.
- Answer
-
Add texts here. Do not delete this text first.
The last several problems have focused on finding the percentile (lower tail) or the upper tail for a particular observation. What if you would like to know the observation corresponding to a particular percentile?
Erik’s height is at the \(40^{th}\) percentile. How tall is he?
Solution
As always, first draw the picture.
In this case, the lower tail probability is known (0.40), which can be shaded on the diagram. We want to find the observation that corresponds to this value. As a first step in this direction, we determine the Z-score associated with the \(40^{th}\) percentile. Using software, we can obtain the corresponding Z-score of about -0.25.
Knowing \(Z_{Erik} = -0.25\) and the population parameters \(\mu = 70\) and \(\sigma = 3.3\) inches, the Z-score formula can be set up to determine Erik’s unknown height, labeled \(x_{_{\text{Erik}}}\):
\[\begin{aligned} -0.25 = Z_{_{\text{Erik}}} = \frac{x_{_{\text{Erik}}} - \mu}{\sigma} = \frac{x_{_{\text{Erik}}} - 70}{3.3} \end{aligned}\]
Solving for \(x_{_{\text{Erik}}}\) yields a height of 69.18 inches. That is, Erik is about 5’9”.
What is the adult male height at the \(82^{nd}\) percentile?
Solution
Again, we draw the figure first.
Next, we want to find the Z-score at the \(82^{nd}\) percentile, which will be a positive value and can be found using software as \(Z = 0.92\). Finally, the height \(x\) is found using the Z-score formula with the known mean \(\mu\), standard deviation \(\sigma\), and Z-score \(Z = 0.92\):
\[\begin{aligned} 0.92 = Z = \frac{x-\mu}{\sigma} = \frac{x - 70}{3.3} \end{aligned}\]
This yields 73.04 inches or about 6’1” as the height at the \(82^{nd}\) percentile.
The SAT scores follow \(N(1100{}, 200{})\).
- What is the \(95^{th}\) percentile for SAT scores?
- What is the \(97.5^{th}\) percentile for SAT scores?
- Answer
-
Short answers:
- \(Z_{95} = 1.6449 → 1429\) SAT score.
- \(Z_{97.5} = 1.96 → 1492\) SAT score.
Adult male heights follow \(N(70.0\)”\(, 3.3\)”\()\).
- What is the probability that a randomly selected male adult is at least 6’2” (74 inches)?
- What is the probability that a male adult is shorter than 5’9” (69 inches)?
- Answer
-
Short answers:
- Z = 1.21 → 0.8869, then subtract this value from 1 to get 0.1131.
- Z = −0.30 → 0.3821.
What is the probability that a random adult male is between 5’9” and 6’2”?
Solution
These heights correspond to 69 inches and 74 inches. First, draw the figure. The area of interest is no longer an upper or lower tail.
The total area under the curve is 1. If we find the area of the two tails that are not shaded (from Guided Practice [more74Less69], these areas are \(0.3821\) and \(0.1131\)), then we can find the middle area:
That is, the probability of being between 5’9” and 6’2” is 0.5048.
SAT scores follow \(N(1100{}, 200{})\). What percent of SAT takers get between 1100 and 1400?
- Answer
-
This is an abbreviated solution. (Be sure to draw a figure!) First find the percent who get below 1100 and the percent that get above 1400: \(Z_{1100} = 0.00 → 0.5000\) (area below), \(Z_{1400} = 1.5 → 0.0668\) (area above). Final answer: 1.0000 − 0.5000 − 0.0668 = 0.4332.
Adult male heights follow \(N(70.0\)”\(, 3.3\)”\()\). What percent of adult males are between 5’5” and 5’7”?
- Answer
-
5’5” is 65 inches (Z = −1.52). 5’7” is 67 inches (Z = −0.91). Numerical solution: 1.000−0.0643−0.8186 = 0.1171, i.e. 11.71%.
68-95-99.7 Rule
Here, we present a useful rule of thumb for the probability of falling within 1, 2, and 3 standard deviations of the mean in the normal distribution. This will be useful in a wide range of practical settings, especially when trying to make a quick estimate without a calculator or Z-table.
Use software, a calculator, or a probability table to confirm that about 68%, 95%, and 99.7% of observations fall within 1, 2, and 3, standard deviations of the mean in the normal distribution, respectively. For instance, first find the area that falls between \(Z=-1\) and \(Z=1\), which should have an area of about 0.68. Similarly there should be an area of about 0.95 between \(Z=-2\) and \(Z=2\).
- Answer
-
First draw the pictures. Using software, we get 0.6827 within 1 standard deviation, 0.9545 within 2 standard deviations, and 0.9973 within 3 standard deviations.
It is possible for a normal random variable to fall 4, 5, or even more standard deviations from the mean. However, these occurrences are very rare if the data are nearly normal. The probability of being further than 4 standard deviations from the mean is about 1-in-15,000. For 5 and 6 standard deviations, it is about 1-in-2 million and 1-in-500 million, respectively.
SAT scores closely follow the normal model with mean \(\mu = 1100{}\) and standard deviation \(\sigma = 200{}\).
- About what percent of test takers score 700 to 1500?
- What percent score between 1100 and 1500?
- Answer
-
- 700 and 1500 represent two standard deviations below and above the mean, which means about 95% of test takers will score between 700 and 1500.
- We found that 700 to 1500 represents about 95% of test takers. These test takers would be evenly split by the center of the distribution, 1100, so \(\frac{95%}{2} = 47.5%\) of all test takers score between 1100 and 1500.


