Skip to main content
Statistics LibreTexts

4.3: Binomial Distribution

  • Page ID
    56924
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\dsum}{\displaystyle\sum\limits} \)

    \( \newcommand{\dint}{\displaystyle\int\limits} \)

    \( \newcommand{\dlim}{\displaystyle\lim\limits} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \(\newcommand{\longvect}{\overrightarrow}\)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    The is used to describe the number of successes in a fixed number of trials. This is different from the geometric distribution, which described the number of trials we must wait before we observe a success.

    The binomial distribution

    Let’s again imagine ourselves back at the insurance agency where 70% of individuals do not exceed their deductible.

    Example \(\PageIndex{1}\)

    Suppose the insurance agency is considering a random sample of four individuals they insure. What is the chance exactly one of them will exceed the deductible and the other three will not? Let’s call the four people Ariana (\(A\)), Brittany (\(B\)), Carlton (\(C\)), and Damian (\(D\)) for convenience.

    Solution

    Let’s consider a scenario where one person exceeds the deductible:

    \[\begin{aligned} &P(A=\text{\resp{exceed}{}}, \text{ }B=\text{\resp{not}{}}, \text{ }C=\text{\resp{not}{}}, \text{ }D=\text{\resp{not}{}}) \\ &\quad = P(A=\text{\resp{exceed}{}})\ P(B=\text{\resp{not}{}})\ P(C=\text{\resp{not}{}})\ P(D=\text{\resp{not}{}}) \\ &\quad = (0.3{}) (0.7{}) (0.7{}) (0.7{}) \\ &\quad = (0.7{})^3 (0.3{})^1 \\ &\quad = 0.103{} \end{aligned}\]

    But there are three other scenarios: Brittany, Carlton, or Damian could have been the one to exceed the deductible. In each of these cases, the probability is again \((0.7{})^3 (0.3{})^1\). These four scenarios exhaust all the possible ways that exactly one of these four people could have exceeded the deductible, so the total probability is \(4 \times (0.7{})^3 (0.3{})^1 = 0.412{}\).

    Exercise \(\PageIndex{1}\)

    Verify that the scenario where Brittany is the only one to exceed the deductible has probability \((0.7{})^3 (0.3{})^1\).

    Answer

    Add texts here. Do not delete this text first.

    The scenario outlined in Example [insureOneOfFourExceedsDeductible] is an example of a binomial distribution scenario. The describes the probability of having exactly \(k\) successes in \(n\) independent Bernoulli trials with probability of a success \(p\) (in Example [insureOneOfFourExceedsDeductible], \(n=4\), \(k=3\), \(p=0.7{}\)). We would like to determine the probabilities associated with the binomial distribution more generally, i.e. we want a formula where we can use \(n\), \(k\), and \(p\) to obtain the probability. To do this, we reexamine each part of Example [insureOneOfFourExceedsDeductible].

    There were four individuals who could have been the one to exceed the deductible, and each of these four scenarios had the same probability. Thus, we could identify the final probability as

    \[\begin{aligned} \times P(\text{single scenario})\end{aligned}\]

    The first component of this equation is the number of ways to arrange the \(k=3\) successes among the \(n=4\) trials. The second component is the probability of any of the four (equally probable) scenarios.

    Consider \(P(\)single scenario\()\) under the general case of \(k\) successes and \(n-k\) failures in the \(n\) trials. In any such scenario, we apply the Multiplication Rule for independent events:

    \[\begin{aligned} p^k (1 - p)^{n - k}\end{aligned}\]

    This is our general formula for \(P(\)single scenario\()\).

    Secondly, we introduce a general formula for the number of ways to choose \(k\) successes in \(n\) trials, i.e. arrange \(k\) successes and \(n - k\) failures:

    \[\begin{aligned} {n\choose k} = \frac{n!}{k! (n - k)!}\end{aligned}\]

    The quantity \({n\choose k}\) is read .2 The exclamation point notation (e.g. \(k!\)) denotes a expression.[factorial_defined]

    \[\begin{aligned} & 0! = 1 \\ & 1! = 1 \\ & 2! = 2\times1 = 2 \\ & 3! = 3\times2\times1 = 6 \\ & 4! = 4\times3\times2\times1 = 24 \\ & \vdots \\ & n! = n\times(n-1)\times...\times3\times2\times1\end{aligned}\]

    Using the formula, we can compute the number of ways to choose \(k = 3\) successes in \(n = 4\) trials:

    \[\begin{aligned} {4 \choose 3} = \frac{4!}{3!(4-3)!} = \frac{4!}{3!1!} = \frac{4\times3\times2\times1}{(3\times2\times1) (1)} = 4\end{aligned}\]

    This result is exactly what we found by carefully thinking of each possible scenario in Example [insureOneOfFourExceedsDeductible].

    Substituting \(n\) choose \(k\) for the number of scenarios and \(p^k(1-p)^{n-k}\) for the single scenario probability yields the general binomial formula.

    Binomial Distribution

    Suppose the probability of a single trial being a success is \(p\). Then the probability of observing exactly \(k\) successes in \(n\) independent trials is given by

    \[\begin{aligned} {n\choose k}p^k(1-p)^{n-k} = \frac{n!}{k!(n-k)!}p^k(1-p)^{n-k} \end{aligned}\]

    The mean, variance, and standard deviation of the number of observed successes are

    \[\begin{aligned} \mu &= np &\sigma^2 &= np(1-p) &\sigma&= \sqrt{np(1-p)} \end{aligned}\]

    Is it binomial? Four conditions to check. 
    1. The trials are independent.
    2. The number of trials, \(n\), is fixed.
    3. Each trial outcome can be classified as a success or failure.
    4. The probability of a success, \(p\), is the same for each trial.
    Example \(\PageIndex{1}\)

    What is the probability that 3 of 8 randomly selected individuals will have exceeded the insurance deductible, i.e. that 5 of 8 will not exceed the deductible? Recall that 70% of individuals will not exceed the deductible.

    Solution

    We would like to apply the binomial model, so we check the conditions. The number of trials is fixed (\(n = 8\)) (condition 2) and each trial outcome can be classified as a success or failure (condition 3). Because the sample is random, the trials are independent (condition 1) and the probability of a success is the same for each trial (condition 4).

    In the outcome of interest, there are \(k = 5\) successes in \(n = 8\) trials (recall that a success is an individual who does not exceed the deductible), and the probability of a success is \(p = 0.7{}\). So the probability that 5 of 8 will not exceed the deductible and 3 will exceed the deductible is given by

    \[\begin{aligned} { 8 \choose 5}(0.7{})^5 (1-0.7{})^{8-5} &= \frac{8!}{5!(8-5)!} (0.7{})^5(1-0.7{})^{8-5} \\ &= \frac{8!}{5!3!} (0.7{})^5(0.3{})^3 \end{aligned}\]

    Dealing with the factorial part:

    \[\begin{aligned} \frac{8!}{5!3!} = \frac{8\times7\times6\times5\times4\times3\times2\times1} {(5\times4\times3\times2\times1)(3\times2\times1)} = \frac{8\times7\times6}{3\times2\times1} = 56 \end{aligned}\]

    Using \((0.7{})^5(0.3{})^3 \approx 0.00454{}\), the final probability is about \(56 \times 0.00454{} \approx 0.254{}\).

    Computing binomial probabilities

    The first step in using the binomial model is to check that the model is appropriate. The second step is to identify \(n\), \(p\), and \(k\). As the last stage use software or the formulas to determine the probability, then interpret the results.

    If you must do calculations by hand, it’s often useful to cancel out as many terms as possible in the top and bottom of the binomial coefficient.

    Exercise \(\PageIndex{1}\)

    If we randomly sampled 40 case files from the insurance agency discussed earlier, how many of the cases would you expect to not have exceeded the deductible in a given year? What is the standard deviation of the number that would not have exceeded the deductible?

    Answer

    Add texts here. Do not delete this text first.

    Exercise \(\PageIndex{1}\)

    The probability that a random smoker will develop a severe lung condition in his or her lifetime is about \(0.3\). If you have 4 friends who smoke, are the conditions for the binomial model satisfied?

    Answer

    Add texts here. Do not delete this text first.

    Exercise \(\PageIndex{1}\)

    Suppose these four friends do not know each other and we can treat them as if they were a random sample from the population. Is the binomial model appropriate? What is the probability that

    1. None of them will develop a severe lung condition?
    2. One will develop a severe lung condition?
    3. That no more than one will develop a severe lung condition?
    Answer

    Add texts here. Do not delete this text first.

    Exercise \(\PageIndex{1}\)

    What is the probability that at least 2 of your 4 smoking friends will develop a severe lung condition in their lifetimes?

    Answer

    Add texts here. Do not delete this text first.

    Exercise \(\PageIndex{1}\)

    Suppose you have 7 friends who are smokers and they can be treated as a random sample of smokers.

    1. How many would you expect to develop a severe lung condition, i.e. what is the mean?
    2. What is the probability that at most 2 of your 7 friends will develop a severe lung condition.
    Answer

    Add texts here. Do not delete this text first.

    Next we consider the first term in the binomial probability, \(n\) choose \(k\) under some special scenarios.

    Exercise \(\PageIndex{1}\)

    Why is it true that \({n \choose 0}=1\) and \({n \choose n}=1\) for any number \(n\)?

    Answer

    Add texts here. Do not delete this text first.

    Exercise \(\PageIndex{1}\)

    How many ways can you arrange one success and \(n-1\) failures in \(n\) trials? How many ways can you arrange \(n-1\) successes and one failure in \(n\) trials?

    Answer

    Add texts here. Do not delete this text first.

    Normal approximation to the binomial distribution

    The binomial formula is cumbersome when the sample size (\(n\)) is large, particularly when we consider a range of observations. In some cases we may use the normal distribution as an easier and faster way to estimate binomial probabilities.

    Example \(\PageIndex{1}\)

    Approximately 15% of the US population smokes cigarettes. A local government believed their community had a lower smoker rate and commissioned a survey of 400 randomly selected individuals. The survey found that only 42 of the 400 participants smoke cigarettes. If the true proportion of smokers in the community was really 15%, what is the probability of observing 42 or fewer smokers in a sample of 400 people?

    Solution

    We leave the usual verification that the four conditions for the binomial model are valid as an exercise.

    The question posed is equivalent to asking, what is the probability of observing \(k=0\), 1, 2, ..., or 42 smokers in a sample of \(n = 400{}\) when \(p=0.15{}\)? We can compute these 43 different probabilities and add them together to find the answer:

    \[\begin{aligned} &P(k=0\text{ or }k=1\text{ or }\cdots\text{ or } k=42{}) \\ &\qquad = P(k=0) + P(k=1) + \cdots + P(k=42{}) \\ &\qquad = 0.0054{} \end{aligned}\]

    If the true proportion of smokers in the community is \(p=0.15{}\), then the probability of observing 42 or fewer smokers in a sample of \(n=400{}\) is 0.0054.

    The computations in Example [exactBinomSmokerExSetup] are tedious and long. In general, we should avoid such work if an alternative method exists that is faster, easier, and still accurate. Recall that calculating probabilities of a range of values is much easier in the normal model. We might wonder, is it reasonable to use the normal model in place of the binomial distribution? Surprisingly, yes, if certain conditions are met.

    Exercise \(\PageIndex{1}\)

    Here we consider the binomial model when the probability of a success is \(p = 0.10\). Figure [fourBinomialModelsShowingApproxToNormal] shows four hollow histograms for simulated samples from the binomial distribution using four different sample sizes: \(n = 10, 30, 100, 300\). What happens to the shape of the distributions as the sample size increases? What distribution does the last hollow histogram resemble?

    Answer

    Add texts here. Do not delete this text first.

    Normal approximation of the binomial Distribution

    The binomial distribution with probability of success \(p\) is nearly normal when the sample size \(n\) is sufficiently large that \(np\) and \(n(1-p)\) are both at least 10. The approximate normal distribution has parameters corresponding to the mean and standard deviation of the binomial distribution:

    \[\begin{aligned} \mu &= np &\sigma& = \sqrt{np(1 - p)} \end{aligned}\]

    The normal approximation may be used when computing the range of many possible successes. For instance, we may apply the normal distribution to the setting of Example [exactBinomSmokerExSetup].

    clipboard_ef6eaaba99d79e2512124bdeb38b5c92c.png
    Figure 4.9: Hollow histograms of samples from the binomial model when p = 0.10. The sample sizes for the four plots are n = 10, 30, 100, and 300, respectively.
    Example \(\PageIndex{1}\)

    How can we use the normal approximation to estimate the probability of observing 42 or fewer smokers in a sample of 400, if the true proportion of smokers is \(p = 0.15{}\)?

    Solution

    Showing that the binomial model is reasonable was a suggested exercise in Example [exactBinomSmokerExSetup]. We also verify that both \(np\) and \(n(1-p)\) are at least 10:

    \[\begin{aligned} np &= 400{} \times 0.15{} = 60{} &n (1 - p) &= 400{} \times 0.85{} = 340{} \end{aligned}\]

    With these conditions checked, we may use the normal approximation in place of the binomial distribution using the mean and standard deviation from the binomial model:

    \[\begin{aligned} \mu &= np = 60{} &\sigma &= \sqrt{np(1 - p)} = 7.14{} \end{aligned}\]

    We want to find the probability of observing 42 or fewer smokers using this model.

    Exercise \(\PageIndex{1}\)

    Use the normal model \(N(\mu = 60{}, \sigma = 7.14{})\) to estimate the probability of observing 42 or fewer smokers. Your answer should be approximately equal to the solution of Example [exactBinomSmokerExSetup]: 0.0054.

    Answer

    Add texts here. Do not delete this text first.

    The normal approximation breaks down on small intervals

    The normal approximation to the binomial distribution tends to perform poorly when estimating the probability of a small range of counts, even when the conditions are met.

    Suppose we wanted to compute the probability of observing 49, 50, or 51 smokers in 400 when \(p = 0.15{}\). With such a large sample, we might be tempted to apply the normal approximation and use the range 49 to 51. However, we would find that the binomial solution and the normal approximation notably differ:

    \[\begin{aligned} \text{Binomial:}&\ 0.0649{} &\text{Normal:}&\ 0.0421{}\end{aligned}\]

    We can identify the cause of this discrepancy using Figure [normApproxToBinomFail], which shows the areas representing the binomial probability (outlined) and normal approximation (shaded). Notice that the width of the area under the normal distribution is 0.5 units too slim on both sides of the interval.

    clipboard_e4cb553afdbd3f8d84b0cea030f606f7e.png
    Figure 4.10: A normal curve with the area between 49 and 51 shaded. The outlined area represents the exact binomial probability.
    Improving the normal approximation for the binomial distribution

    The normal approximation to the binomial distribution for intervals of values is usually improved if cutoff values are modified slightly. The cutoff values for the lower end of a shaded region should be reduced by 0.5, and the cutoff value for the upper end should be increased by 0.5.

    The tip to add extra area when applying the normal approximation is most often useful when examining a range of observations. In the example above, the revised normal distribution estimate is 0.0633, much closer to the exact value of 0.0649. While it is possible to also apply this correction when computing a tail area, the benefit of the modification usually disappears since the total interval is typically quite wide.


    This page titled 4.3: Binomial Distribution is shared under a CC BY-SA 3.0 license and was authored, remixed, and/or curated by David Diez, Christopher Barr, & Mine Çetinkaya-Rundel via source content that was edited to the style and standards of the LibreTexts platform.