Skip to main content
Statistics LibreTexts

6.1: Introduction

  • Page ID
    4575
  • This photo shows many different pairs of shoes in various colors. The shoes appear to be hanging from a wall by cords.

    Figure 6.1 If you ask enough people about their shoe size, you will find that your graphed data is shaped like a bell curve and can be described as normally distributed. (credit: Ömer Ünlϋ)

    The normal probability density function, a continuous distribution, is the most important of all the distributions. It is widely used and even more widely abused. Its graph is bell-shaped. You see the bell curve in almost all disciplines. Some of these include psychology, business, economics, the sciences, nursing, and, of course, mathematics. Some of your instructors may use the normal distribution to help determine your grade. Most IQ scores are normally distributed. Often real-estate prices fit a normal distribution.

    The normal distribution is extremely important, but it cannot be applied to everything in the real world. Remember here that we are still talking about the distribution of population data. This is a discussion of probability and thus it is the population data that may be normally distributed, and if it is, then this is how we can find probabilities of specific events just as we did for population data that may be binomially distributed or Poisson distributed. This caution is here because in the next chapter we will see that the normal distribution describes something very different from raw data and forms the foundation of inferential statistics.

    The normal distribution has two parameters (two numerical descriptive measures): the mean (\(\mu\)) and the standard deviation (\(\sigma\)). If X is a quantity to be measured that has a normal distribution with mean (\(\mu\)) and standard deviation (\(\sigma\)), we designate this by writing the following formula of the normal probability density function:

    This is a frequency curve for a normal distribution. It shows a single peak in the center with the curve tapering down to the horizontal axis on each side. The distribution is symmetrical; it represents the random variable X having a normal distribution with a mean, m, and standard deviation, s.

    Figure 6.2

    The probability density function is a rather complicated function. Do not memorize it. It is not necessary.

    \[f(x)=\frac{1}{\sigma \cdot \sqrt{2 \cdot \pi}} \cdot \mathrm{e}^{-\frac{1}{2} \cdot\left(\frac{x-\mu}{\sigma}\right)^{2}}\nonumber\]

    The curve is symmetric about a vertical line drawn through the mean, \(\mu\). The mean is the same as the median, which is the same as the mode, because the graph is symmetric about \(\mu\). As the notation indicates, the normal distribution depends only on the mean and the standard deviation. Note that this is unlike several probability density functions we have already studied, such as the Poisson, where the mean is equal to \(\mu\)\(\mu\) and the standard deviation simply the square root of the mean, or the binomial, where p is used to determine both the mean and standard deviation. Since the area under the curve must equal one, a change in the standard deviation, \(\sigma\), causes a change in the shape of the normal curve; the curve becomes fatter and wider or skinnier and taller depending on \(\sigma\). A change in \(\mu\)causes the graph to shift to the left or right. This means there are an infinite number of normal probability distributions. One of special interest is called the standard normal distribution.