Skip to main content
Statistics LibreTexts

5.5: Hypergeometric Distribution

  • Page ID
    26587
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \) \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)\(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\) \(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\)\(\newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    The hypergeometric distribution is a discrete probability distribution used to find the probability of success when there are two outcomes to each trial, and there are a set number of dependent trials. A hypergeometric probability distribution results from a random experiment that meets all of the following requirements. 12

    1. The procedure has a fixed sample size n taken from a population N.
    2. The trials are taken without replacement (dependent).
    3. Each trial has exactly two outcomes, the number of successes = a and the number of failures = b. Note a + b = N.

    If a random experiment satisfies all of the above, the distribution of the random variable X, where X counts the number of successes, is called a hypergeometric distribution, we write X ~ H(n, a, N).

    The hypergeometric distribution is P(X = x) = \(\frac{ {}_{a} C_{x} \cdot {}_{b} C_{n-x}}{ {}_{N} C_{n}}\), x = 0, 1, 2, … , n or a, whichever is smaller. Where n is the sample size taken without replacement from the population size N, x is the number of successes out of n that you are trying to find the probability for, a is the number of a successes and b = Na is the number of failures out of the population N.

    A committee of 5 people is to be formed from 15 volunteers and 6 appointed positions. Compute the probability that the committee will consist of 4 randomly selected volunteers.

    Solution

    There are a total of N = 15 + 6 = 21 people. A sample of n = 5 people are selected without replacement. The success is choosing a volunteer, so a = 15. Of the 15 volunteers, we are asked to find the probability of selecting x = 4. Substitute in each value: P(X = 4) = \(\frac{ {}_{15} C_{4} \cdot { }_{6} C_{1}}{ {}_{21} C_{5}} = 0.4025\).

    There is a pattern to the numbers in the formula. The top two numbers on the left of the C’s add up to the bottom number on the left of C, and the top two numbers on the right of the C’s add up to the bottom number on the right of C.

    The waiting room for jury selection has 40 people with a college degree and 30 people without. A jury of 12 is selected from this waiting room. What is the probability that exactly 5 of the jury members will have a college degree?

    Solution

    There are a total of N = 40 + 30 = 70 people. A sample of n = 12 people are selected without replacement. The success is choosing someone with a college degree, so a = 40. Of the 12 jurors we are asked to find the probability of selecting x = 5.

    Substitute in each value: P(X = 5) = \(\frac{ {}_{40} C_{5} \cdot { }_{30} C_{7}}{ {}_{70} C_{12}} = 0.1259\).

    A wallet contains three $100 bills and five $1 bills. You randomly choose four bills. What is the probability that you will choose exactly two $100 bills?

    Solution

    There are a total of N = 3 + 5 = 8 bills. A sample of n = 4 bills are selected without replacement. The success is choosing a $100 bill, so a = 3. Of the 3 $100 bills we are asked to find the probability of selecting x = 2. Substitute in each value: P(X = 2) = \(\frac{ {}_{3} C_{2} \cdot { }_{5} C_{2}}{ {}_{8} C_{4}} = 0.4286\).

    Note: the calculator does not have the hypergeometric distribution shortcut. The following online calculator will calculate the probability: https://homepage.divms.uiowa.edu/~mbognar/applets/hg.html. The M = a is the number of successes a from our example, or use Excel.

    Figure 5-7 may help you decide when to use the “True” for cumulative in Excel.

    clipboard_ec0d30c812e7dd111fc76242a3d84c93b.png

    Figure 5-7


    This page titled 5.5: Hypergeometric Distribution is shared under a CC BY-SA 4.0 license and was authored, remixed, and/or curated by Rachel Webb via source content that was edited to the style and standards of the LibreTexts platform; a detailed edit history is available upon request.

    • Was this article helpful?