Skip to main content
Statistics LibreTexts

3.1: Defining Probability

  • Page ID
    56915
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\dsum}{\displaystyle\sum\limits} \)

    \( \newcommand{\dint}{\displaystyle\int\limits} \)

    \( \newcommand{\dlim}{\displaystyle\lim\limits} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \(\newcommand{\longvect}{\overrightarrow}\)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    Statistics is based on probability, and while probability is not required for the applied techniques in this book, it may help you gain a deeper understanding of the methods and set a better foundation for future courses.

    Introductory Examples

    Before we get into technical ideas, let’s walk through some basic examples that may feel more familiar.

    Example \(\PageIndex{1}\)

    A “die”, the singular of dice, is a cube with six faces numbered 1, 2, 3, 4, 5, and 6. What is the chance of getting 1 when rolling a die?

    Solution

    If the die is fair, then the chance of a is as good as the chance of any other number. Since there are six outcomes, the chance must be 1-in-6 or, equivalently, \(1/6\).

    Example \(\PageIndex{2}\)

    What is the chance of getting a 1 or 2 in the next roll?

    Solution

    1 and 2 constitute two of the six equally likely possible outcomes, so the chance of getting one of these two outcomes must be \(2/6 = 1/3\).

    Example \(\PageIndex{3}\)

    What is the chance of getting either 1, 2, 3, 4, 5, or 6 on the next roll?

    Solution

    100%. The outcome must be one of these numbers.

    Example \(\PageIndex{4}\)

    What is the chance of not rolling a 2?

    Solution

    Since the chance of rolling a is \(1/6\) or \(16.\bar{6}\%\), the chance of not rolling a must be \(100\% - 16.\bar{6}\%=83.\bar{3}\%\) or \(5/6\). Alternatively, we could have noticed that not rolling a 2 is the same as getting a 1, 3, 4, 5, or 6, which makes up five of the six equally likely outcomes and has probability \(5/6\).

    Example \(\PageIndex{5}\)

    Consider rolling two dice. If 1/6 of the time the first die is a 1 and 1/6 of those times the second die is a 1, what is the chance of getting two 1s?

    Solution

    If \(16.\bar{6}\)% of the time the first die is a and \(1/6\) of those times the second die is also a , then the chance that both dice are is \((1/6)\times (1/6)\) or \(1/36\).

    Probability

    We use probability to build tools to describe and understand apparent randomness. We often frame probability in terms of a random process giving rise to an outcome.

    \[\begin{array}{ll}\text { Roll a die } & \rightarrow 1,2,3,4,5, \text { or } 6 \\ \text { Flip a coin } & \rightarrow \mathrm{H} \text { or } \mathrm{T}\end{array} \nonumber\]

    Rolling a die or flipping a coin is a seemingly random process and each gives rise to an outcome.

    Probability

    The Probability ​​​​​​ of an outcome is the proportion of times the outcome would occur if we observed the random process an infinite number of times.

    Probability is defined as a proportion, and it always takes values between 0 and 1 (inclusively). It may also be displayed as a percentage between 0% and 100%.

    Probability can be illustrated by rolling a die many times. Let \(\hat{p}_n\) be the proportion of outcomes that are after the first \(n\) rolls. As the number of rolls increases, \(\hat{p}_n\) will converge to the probability of rolling a , \(p = 1/6\). Figure 3.1 shows this convergence for 100,000 die rolls. The tendency of \(\hat{p}_n\) to stabilize around \(p\) is described by the Law of Large Numbers.

    clipboard_ee2e008d29511ac192fa803947288f0bd.png
    Figure 3.1: The fraction of die rolls that are 1 at each stage in a simulation. The proportion tends to get closer to the probability 1/6 ≈ 0.167 as the number of rolls increases.
    Law of Large Numbers

    As more observations are collected, the proportion \(\hat{p}_n\) of occurrences with a particular outcome converges to the probability \(p\) of that outcome.

    Occasionally the proportion will veer off from the probability and appear to defy the Law of Large Numbers, as \(\hat{p}_n\) does many times in Figure 3.1. However, these deviations become smaller as the number of rolls increases.

    Above we write \(p\) as the probability of rolling a 1. We can also write this probability as

    \[\begin{aligned} P(\text{rolling a {1}})\end{aligned}\]

    As we become more comfortable with this notation, we will abbreviate it further. For instance, if it is clear that the process is “rolling a die”, we could abbreviate \(P(\text{rolling a 1})\) as \(P(1)\).

    Guided Practice \(\PageIndex{3.6}\)

    Random processes include rolling a die and flipping a coin.

    1. Think of another random process.
    2. Describe all the possible outcomes of that process. For instance, rolling a die is a random process with possible outcomes 1, 2, ..., 6.
    Answer

    Here are four examples. (i) Whether someone gets sick in the next month or not is an apparently random process with outcomes sick and not. (ii) We can generate a random process by randomly picking a person and measuring that person’s height. The outcome of this process will be a positive number. (iii) Whether the stock market goes up or down next week is a seemingly random process with possible outcomes up, down, and no change. Alternatively, we could have used the percent change in the stock market as a numerical outcome. (iv) Whether your roommate cleans her dishes tonight probably seems like a random process with possible outcomes cleans_dishes and leaves_dishes.

    What we think of as random processes are not necessarily random, but they may just be too difficult to understand exactly. The fourth example in the footnote solution to Guided Practice \(\PageIndex{3.6}\) suggests a roommate’s behavior is a random process. However, even if a roommate’s behavior is not truly random, modeling her behavior as a random process can still be useful.

    Disjoint or mutually exclusive outcomes

    Two outcomes are called disjoint or mutually exclusive if they cannot both happen. For instance, if we roll a die, the outcomes 1 and 2 are disjoint since they cannot both occur. On the other hand, the outcomes 1 and “rolling an odd number” are not disjoint since both occur if the outcome of the roll is a 1. The terms disjoint and mutually exclusive are equivalent and interchangeable.

    Calculating the probability of disjoint outcomes is easy. When rolling a die, the outcomes and are disjoint, and we compute the probability that one of these outcomes will occur by adding their separate probabilities:

    \[\begin{aligned} P(\text{ {1} or {2}}) &= P(\text{ {1}})+P(\text{ {2}}) \\[4pt] &= 1/6 + 1/6 = 1/3\end{aligned}\]

    What about the probability of rolling a 1, 2, 3, 4, 5, or 6? Here again, all of the outcomes are disjoint so we add the probabilities

    \[\begin{aligned} P(\text{ {1} or {2} or {3} or {4} or {5} or {6}}) & = P(\text{ {1}})+P(\text{ {2}}) + P(\text{ {3}})+P(\text{ {4}}) + P(\text{ {5}})+P(\text{ {6}}) \\[4pt] & = 1/6 + 1/6 + 1/6 + 1/6 + 1/6 + 1/6 = 1\end{aligned}\]

    The Addition Rule guarantees the accuracy of this approach when the outcomes are disjoint.guarantees the accuracy of this approach when the outcomes are disjoint.

    Addition Rule of Disjoint Outcomes

    If \(A_1\) and \(A_2\) represent two disjoint outcomes, then the probability that one of them occurs is given by

    \[\begin{aligned} P(A_1\text{ or } A_2) = P(A_1) + P(A_2) \end{aligned}\]

    If there are many disjoint outcomes \(A_1\), ..., \(A_k\), then the probability that one of these outcomes will occur is

    \[\begin{aligned} P(A_1) + P(A_2) + \cdots + P(A_k) \end{aligned}\]

    Exercise \(\PageIndex{1}\)

    We are interested in the probability of rolling a 1, 4, or 5.

    1. Explain why the outcomes 1, 4, and 5 are disjoint.
    2. Apply the Addition Rule for disjoint outcomes to determine P(1 or 4 or 5).
    Answer

    The random process is a die roll, and at most one of these outcomes can come up. This means they are disjoint outcomes.

    \[P(1 \text { or } 4 \text { or } 5)=P(1)+P(4)+P(5)=\frac{1}{6}+\frac{1}{6}+\frac{1}{6}=\frac{3}{6}=\frac{1}{2} \nonumber\]

    Exercise \(\PageIndex{1}\)

    In the loans data set in Chapter 2, the homeownership variable described whether the borrower rents, has a mortgage, or owns her property. Of the 10,000 borrowers, 3858 rented, 4789 had a mortgage, and 1353 owned their home.

    1. Are the outcomes rent, mortgage, and own disjoint?
    2. Determine the proportion of loans with value mortgage and own separately.
    3. Use the Addition Rule for disjoint outcomes to compute the probability a randomly selected loan from the data set is for someone who has a mortgage or owns her home.
    Answer
    1. Yes. Each loan is categorized in only one level of homeownership.
    2. Mortgage: \(\frac{4789}{10000}=0.479\)Own: \(\frac{1353}{10000}=0.135 .\)
    3. \(P(\text { mortgage or own })=P(\text { mortgage })+P(\text { own })=0.479+0.135=0.614 .\)

    Data scientists rarely work with individual outcomes and instead consider sets or collections of outcomes. Let A represent the event where a die roll results in 1 or 2 and B represent the event that the die roll is a 4 or a 6. We write A as the set of outcomes {1, 2} and B = {4, 6}. These sets are commonly called events. Because A and B have no elements in common, they are disjoint events. A and B are represented in Figure 3.2.

    clipboard_efcb912a8736279dab687bb4c08167a6c.png
    Figure 3.2: Three events, A, B, and D, consist of outcomes from rolling a die. A and B are disjoint since they do not have any outcomes in common.

    The Addition Rule applies to both disjoint outcomes and disjoint events. The probability that one of the disjoint events \(A\) or \(B\) occurs is the sum of the separate probabilities:

    \[\begin{aligned} P(A\text{ or }B) = P(A) + P(B) = 1/3 + 1/3 = 2/3\end{aligned}\]

    Exercise \(\PageIndex{9}\)
    1. Verify the probability of event A, P(A), is 1/3 using the Addition Rule.
    2. Do the same for event B.
    Answer

    \[P(A)=P(1 \text { or } 2)=P(1)+P(2)=\frac{1}{6}+\frac{1}{6}=\frac{2}{6}=\frac{1}{3} .\] Similarly, \(P(B)=1 / 3 .\)

    Exercise \(\PageIndex{10}\)
    1. Using Figure 3.2 as a reference, what outcomes are represented by event D?
    2. Are events B and D disjoint?
    3. Are events A and D disjoint?
    Answer
    1. Outcomes 2 and 3.
    2. Yes, events B and D are disjoint because they share no outcomes.
    3. The events A and D share an outcome in common, 2, and so are not disjoint.
    Exercise \(\PageIndex{11}\)

    In Guided Practice 3.10, you confirmed B and D from Figure 3.2 are disjoint. Compute the probability that event B or event D occurs.

    Answer

    Since B and D are disjoint events, use the Addition Rule:

    \[P(B \text { or } D)=P(B)+P(D)=\frac{1}{3}+\frac{1}{3}=\frac{2}{3} . \noumber\]

    Probabilities when events are not disjoint

    Let’s consider calculations for two events that are not disjoint in the context of a , represented in Figure [deckOfCards]. If you are unfamiliar with the cards in a regular deck: The 52 cards are split into four suits: ♣ (club), ♢ (diamond), ♡ (heart), ♠ (spade). Each suit has its 13 cards labeled: 2, 3, ..., 10, J (jack), Q (queen), K (king), and A (ace). Thus, each card is a unique combination of a suit and a label, e.g. 4♡ and J♣. The 12 cards represented by the jacks, queens, and kings are called face cards. The cards that are ♢ or ♡ are typically colored red while the other two suits are typically colored black.

    clipboard_ea6832a7e48badc29badca098bc2310e8.png
    Figure 3.3: Representations of the 52 unique cards in a deck.
    Exercise \(\PageIndex{1}\)
    1. What is the probability that a randomly selected card is a diamond?
    2. What is the probability that a randomly selected card is a face card?
    Answer
    1. There are 52 cards and 13 diamonds. If the cards are thoroughly shuffled, each card has an equal chance of being drawn, so the probability that a randomly selected card is a diamond is \(P(♢) = \frac{13}{52} = 0.250.\)
    2. Likewise, there are 12 face cards, so \(P(\text{face card}) = \frac{12}{52} = \frac{3}{13} = 0.231.\)

    Venn diagrams are useful when outcomes can be categorized as “in” or “out” for two or three variables, attributes, or random processes. The Venn diagram in Figure 3.4 uses a circle to represent diamonds and another to represent face cards. If a card is both a diamond and a face card, it falls into the intersection of the circles. If it is a diamond but not a face card, it will be in part of the left circle that is not in the right circle (and so on). The total number of cards that are diamonds is given by the total number of cards in the diamonds circle: \(10+3=13\). The probabilities are also shown (e.g. \(10/52 = 0.1923\)).

    clipboard_ed460bf281768b84a75fc128340ea2340.png
    Figure 3.4: A Venn diagram for diamonds and face cards.

    Let \(A\) represent the event that a randomly selected card is a diamond and \(B\) represent the event that it is a face card. How do we compute \(P(A\) or \(B)\)? Events \(A\) and \(B\) are not disjoint – the cards \(J\diamondsuit\), \(Q\diamondsuit\), and \(K\diamondsuit\) fall into both categories – so we cannot use the Addition Rule for disjoint events. Instead we use the Venn diagram. We start by adding the probabilities of the two events:

    \[\begin{aligned} P(A) + P(B) = P({\color{redcards}\diamondsuit}) + P(\text{face card}) = 13/52 + 12/52\end{aligned}\]

    However, the three cards that are in both events were counted twice, once in each probability. We must correct this double counting:

    \[\begin{aligned} P(A\text{ or } B) &= P({\color{redcards}\diamondsuit}\text{ or face card}) \\ &= P({\color{redcards}\diamondsuit}) + P(\text{face card}) - P({\color{redcards}\diamondsuit}\text{ and face card}) \\ &= 13/52 + 12/52 - 3/52 \\ &= 22/52 = 11/26\end{aligned}\]

    This equation is an example of the General Addition Rule.

    General Addition Rule

    If \(A\) and \(B\) are any two events, disjoint or not, then the probability that at least one of them will occur is

    \[\begin{aligned} P(A\text{ or }B) = P(A) + P(B) - P(A\text{ and }B) \end{aligned}\]

    where \(P(A\) and \(B)\) is the probability that both events occur.

    TIP: “or” is inclusive

    When we write “or” in statistics, we mean “and/or” unless we explicitly state otherwise. Thus, \(A\) or \(B\) occurs means \(A\), \(B\), or both \(A\) and \(B\) occur.

    Exercise \(\PageIndex{1}\)
    1. If \(A\) and \(B\) are disjoint, describe why this implies \(P(A\) and \(B) = 0\).
    2. Using part (a), verify that the General Addition Rule simplifies to the simpler Addition Rule for disjoint events if \(A\) and \(B\) are disjoint.
    Answer
    1. If A and B are disjoint, A and B can never occur simultaneously.
    2. If A and B are disjoint, then the last \(P(A \text{ and } B)\) term of in the General Addition Rule formula is 0 (see part (a)) and we are left with the Addition Rule for disjoint events.
    Exercise \(\PageIndex{1}\)

    In the loans data set describing 10,000 loans, 1495 loans were from joint applications (e.g. a couple applied together), 4789 applicants had a mortgage, and 950 had both of these characteristics. Create a Venn diagram for this setup.

    Answer

    Both the counts and corresponding probabilities (e.g. 3839/10000 = 0.384) are shown. Notice that the number of loans represented in the left circle
    corresponds to 3839 + 950 = 4789, and the number represented in the right circle is 950 + 545 = 1495.

    Exercise \(\PageIndex{1}\)
    1. Use your Venn diagram from Guided Practice 3.14 to determine the probability a randomly drawn loan from the loans data set is from a joint application where the couple had a mortgage.
    2. What is the probability that the loan had either of these attributes?
    Answer

    The solution is represented by the intersection of the two circles: 0.095.

    This is the sum of the three disjoint probabilities shown in the circles: 0.384 + 0.095 + 0.055 = 0.534 (off by 0.001 due to a rounding error).

    chem_full.png

    Probability distributions

    A probability distribution is a table of all disjoint outcomes and their associated probabilities. Figure 3.5 shows the probability distribution for the sum of two dice.

    Figure 3.5: Probability distribution for the sum of two dice.
    Dice sum 2 3 4 5 6 7 8 9 10 11 12
    Probability \(\frac{1}{36}\) \(\frac{2}{36}\) \(\frac{3}{36}\) \(\frac{4}{36}\) \(\frac{5}{36}\) \(\frac{6}{36}\) \(\frac{5}{36}\) \(\frac{4}{36}\) \(\frac{3}{36}\) \(\frac{2}{36}\) \(\frac{1}{36}\)
    Rules for probability distributions

    A probability distribution is a list of the possible outcomes with corresponding probabilities that satisfies three rules:

    1. The outcomes listed must be disjoint.
    2. Each probability must be between 0 and 1.
    3. The probabilities must total 1.
    Exercise \(\PageIndex{1}\)

    Figure 3.6 suggests three distributions for household income in the United States.

    1. Only one is correct. Which one must it be?
    2. What is wrong with the other two?
    Figure 3.6: Proposed distributions of US household incomes (Guided Practice 3.16).
    Income Range $0-25k $25k-50k $50k-100k $100k+
    (a) 0.18 0.39 0.33 0.16
    (b) 0.38 -0.27 0.52 0.37
    (c) 0.28 0.27 0.29 0.16
    Answer
    1. The probabilities of (a) do not sum to 1. The second probability in (b) is negative. This leaves (c), which sure enough satisfies the requirements of a distribution.
    2. One of the three was said to be the actual distribution of US household incomes, so it must be (c).

    Chapter 1 emphasized the importance of plotting data to provide quick summaries. Probability distributions can also be summarized in a bar plot. For instance, the distribution of US household incomes is shown in Figure 3.7 as a bar plot. The probability distribution for the sum of two dice is shown in Figure 3.5 and plotted in Figure 3.8.

    clipboard_eb75f13ec73386e81967d6bce61d8d160.png
    Figure 3.7: The probability distribution of US household income.

    In these bar plots, the bar heights represent the probabilities of outcomes. If the outcomes are numerical and discrete, it is usually (visually) convenient to make a bar plot that resembles a histogram, as in the case of the sum of two dice. Another example of plotting the bars at their respective locations is shown in Figure 3.18.

    clipboard_eaad0d83084e531c6292cd5d94ba10b7f.png
    Figure 3.8: The probability distribution of the sum of two dice.

    Complement of an event

    Rolling a die produces a value in the set \(\{1, 2, 3, 4, 5, 6\}\). This set of all possible outcomes is called the sample space (\(S\)) for rolling a die. We often use the sample space to examine the scenario where an event does not occur.

    Let \(D = \{2, 3\}\) represent the event that the outcome of a die roll is 2 or 3. Then the complement of \(D\) represents all outcomes in our sample space that are not in \(D\), which is denoted by Dc = {1, 4, 5, 6}. That is, \(D^c\) is the set of all possible outcomes not already included in \(D\). Figure 3.9 shows the relationship between \(D\), \(D^c\), and the sample space \(S\).

    clipboard_ed159a966f06055471d7828a40469f4f3.png
    Figure 3.9: Event \(D = {2, 3}\) and its complement, \(D^c = {1, 4, 5, 6}\). \(S\) represents the sample space, which is the set of all possible outcomes.
    Exercise \(\PageIndex{1}\)
    1. Compute \(P(D^c) = P(\text{rolling a 1, 4, 5, or 6})\).
    2. What is \(P(D) + P(D^c)\)?
    Answer

    The outcomes are disjoint and each has probability 1/6, so the total probability is 4/6 = 2/3.

    We can also see that \(P(D) = \frac{1}{6} + \frac{1}{6} = \frac{1}{3}\). Since \(D\) and \(D^c\) are disjoint, \(P(D) + P(Dc) = 1\).

    Exercise \(\PageIndex{1}\)

    Events \(A = {1, 2}\) and \(B = {4, 6}\) are shown in Figure 3.2.

    1. Write out what \(A^c\) and \(B^c\) represent.
    2. Compute \(P(Ac)\) and \(P(Bc)\).
    3. Compute \(P(A) + P(Ac)\) and \(P(B) + P(Bc)\).
    Answer

    Brief solutions:

    1. \(A^c = {3, 4, 5, 6}\) and \(B^c = {1, 2, 3, 5}\).
    2. Noting that each outcome is disjoint, add the individual outcome probabilities to get \(P(Ac) = 2/3\) and \(P(Bc) = 2/3\).
    3. \(A\) and \(A^c\) are disjoint, and the same is true of \(B\) and \(B^c\). Therefore, \(P(A) + P(Ac) = 1\) and \(P(B) + P(Bc) = 1\).

    A complement of an event \(A\) is constructed to have two very important properties:

    1. every possible outcome not in \(A\) is in \(A^c\), and
    2. \(A\) and \(A^c\) are disjoint.

    Property (i) implies

    \[\begin{aligned} P(A\text{ or }A^c) = 1\end{aligned}\]

    That is, if the outcome is not in \(A\), it must be represented in \(A^c\). We use the Addition Rule for disjoint events to apply Property (ii):

    \[\begin{aligned} P(A\text{ or }A^c) = P(A) + P(A^c)\end{aligned}\]

    Combining the last two equations yields a very useful relationship between the probability of an event and its complement.

    Complement

    The complement of event \(A\) is denoted \(A^c\), and \(A^c\) represents all outcomes not in \(A\). \(A\) and \(A^c\) are mathematically related:

    \[\begin{aligned} P(A) + P(A^c) = 1, \quad\text{i.e.}\quad P(A) = 1-P(A^c) \end{aligned}\]

    In simple examples, computing \(A\) or \(A^c\) is feasible in a few steps. However, using the complement can save a lot of time as problems grow in complexity.

    Exercise \(\PageIndex{1}\)

    Let \(A\) represent the event where we roll two dice and their total is less than 12.

    1. What does the event \(A^c\) represent?
    2. Determine \(P(A^c)\) from Figure 3.5 on page .
    3. Determine \(P(A)\).
    Answer

    The complement of \(A\): when the total is equal to 12.

    \(P(Ac) = 1/36\).

    Use the probability of the complement from part (b), \(P(Ac) = 1/36\), and the equation for the complement: \[P(\text{less than 12}) = 1 − P(12) =
    1 − 1/36 = 35/36. \nonumber\]

    Exercise \(\PageIndex{1}\)

    Find the following probabilities for rolling two dice:

    1. The sum of the dice is not 6.
    2. The sum is at least 4. That is, determine the probability of the event B = {4, 5, ..., 12}.
    3. The sum is no more than 10. That is, determine the probability of the event D = {2, 3, ..., 10}.
    Answer
    1. First find P(6) = 5/36, then use the complement: P(not 6) = 1 − P(6) = 31/36.
    2. First find the complement, which requires much less effort: P(2 or 3) = 1/36 + 2/36 = 1/12. Then calculate \[P(B) = 1 − P(Bc) = 1 − 1/12 = 11/12. \nonumber\]
    3. As before, finding the complement is the clever way to determine P(D). First find \(P(Dc) = P(11 or 12) = 2/36 + 1/36 = 1/12.\) Then calculate \[P(D) = 1 − P(Dc) = 11/12. \nonumber\]

    Independence

    Just as variables and observations can be independent, random processes can be independent, too. Two processes are independent if knowing the outcome of one provides no useful information about the outcome of the other. For instance, flipping a coin and rolling a die are two independent processes – knowing the coin was heads does not help determine the outcome of a die roll. On the other hand, stock prices usually move up or down together, so they are not independent.

    Example 3.5 provides a basic example of two independent processes: rolling two dice. We want to determine the probability that both will be 1. Suppose one of the dice is red and the other white. If the outcome of the red die is a 1, it provides no information about the outcome of the white die. We first encountered this same question in Example 3.5 (page 81), where we calculated the probability using the following reasoning: 1/6 of the time the red die is a 1, and 1/6 of those times the white die will also be 1. This is illustrated in Figure 3.10. Because the rolls are independent, the probabilities of the corresponding outcomes can be multiplied to get the final answer: (1/6)×(1/6) = 1/36. This can be generalized to many independent processes.

    clipboard_ec0852c8db004e7f2dc5742d596dc1a51.png
    Figure 3.10: 1/6 of the time, the first roll is a 1. Then 1/6 of those times, the second roll will also be a 1.
    Example \(\PageIndex{1}\)

    What if there was also a blue die independent of the other two? What is the probability of rolling the three dice and getting all 1s?

    Solution

    The same logic applies from Example 3.5. If \(1/36\) of the time the white and red dice are both , then \(1/6\) of those times the blue die will also be , so multiply:

    \[\begin{aligned} P(white=\text{\small {1} and } red=\text{\small {1} and } blue=\text{\small {1}}) &= P(white=\text{\small {1}})\times P(red=\text{\small {1}})\times P(blue=\text{\small {1}}) \\ &= (1/6)\times (1/6)\times (1/6) = 1/216\end{aligned}\]

    Example 3.21 illustrates what is called the Multiplication Rule for independent processes.

    Multiplication Rule for independent processes

    If \(A\) and \(B\) represent events from two different and independent processes, then the probability that both \(A\) and \(B\) occur can be calculated as the product of their separate probabilities:

    \[ P(A \text{ and }B) = P(A) \times P(B) \nonumber \]

    Similarly, if there are \(k\) events \(A_1\), ..., \(A_k\) from \(k\) independent processes, then the probability they all occur is

    \[ P(A_1) \times P(A_2)\times \cdots \times P(A_k) \nonumber \]

    Exercise \(\PageIndex{1}\)

    About 9% of people are left-handed. Suppose 2 people are selected at random from the U.S. population. Because the sample size of 2 is very small relative to the population, it is reasonable to assume these two people are independent.

    1. What is the probability that both are left-handed?
    2. What is the probability that both are right-handed?
    Answer

    It is reasonable to assume the proportion of people who are ambidextrous (both right- and left-handed) is nearly 0, which results in P(right-handed) = 1 − 0.09 = 0.91. Using the same reasoning as in part (a), the probability that both will be right-handed is 0.91 × 0.91 = 0.8281.

    Exercise \(\PageIndex{1}\)

    Suppose 5 people are selected at random.

    1. What is the probability that all are right-handed?
    2. What is the probability that all are left-handed?
    3. What is the probability that not all of the people are right-handed?
    Answer
    1. The abbreviations RH and LH are used for right-handed and left-handed, respectively. Since each are independent, we apply the Multiplication Rule for independent processes:  \[\begin{aligned} P(\text { all five are } \mathrm{RH}) & =P(\text { first }=\mathrm{RH}, \text { second }=\mathrm{RH}, \ldots, \text { fifth }=\mathrm{RH}) \\ & =P(\text { first }=\mathrm{RH}) \times P(\text { second }=\mathrm{RH}) \times \cdots \times P(\text { fifth }=\mathrm{RH}) \\ & =0.91 \times 0.91 \times 0.91 \times 0.91 \times 0.91=0.624\end{aligned}\]
    2. Using the same reasoning as in (a): \(0.09 × 0.09 × 0.09 × 0.09 × 0.09 = 0.0000059\)
    3. Use the complement, P(all five are RH), to answer this question: \[P(\text { not all } \mathrm{RH})=1-P(\text { all } \mathrm{RH})=1-0.624=0.376 \nonumber\]

    Suppose the variables handedness and sex are independent, i.e. knowing someone’s sex provides no useful information about their handedness and vice-versa. Then we can compute whether a randomly selected person is right-handed and female using the Multiplication Rule:

    \[\begin{aligned} P(\text{right-handed and female}) &= P(\text{right-handed}) \times P(\text{female}) \\ &= 0.91 \times 0.50 = 0.455\end{aligned}\]

    Note: The actual proportion of the U.S. population that is female is about 50%, and so we use 0.5 for the probability of sampling a woman. However, this probability does differ in other countries.

    Exercise \(\PageIndex{1}\)

    Three people are selected at random.

    1. What is the probability that the first person is male and right-handed?
    2. What is the probability that the first two people are male and right-handed?.
    3. What is the probability that the third person is female and left-handed?
    4. What is the probability that the first two people are male and right-handed and the third person is female and left-handed?
    Answer

    Brief answers are provided.

    1. This can be written in probability notation as \(P(\text{a randomly selected person is male and right-handed}) = 0.455\).
    2. 0.207.
    3. 0.045.
    4. 0.0093.

    Sometimes we wonder if one outcome provides useful information about another outcome. The question we are asking is, are the occurrences of the two events independent? We say that two events \(A\) and \(B\) are independent if they satisfy \(P(A \text{ and }B) = P(A) \times P(B)\).

    Example \(\PageIndex{1}\)

    If we shuffle up a deck of cards and draw one, is the event that the card is a heart independent of the event that the card is an ace?

    Solution

    The probability the card is a heart is \(1/4\) and the probability that it is an ace is \(1/13\). The probability the card is the ace of hearts is \(1/52\). We check whether \(P(A \text{ and }B) = P(A) \times P(B)\) is satisfied:

    \[\begin{aligned} P({\color{redcards}\heartsuit})\times P(\text{ace}) = \frac{1}{4}\times \frac{1}{13} = \frac{1}{52} = P({\color{redcards}\heartsuit}\text{ and ace})\end{aligned}\]

    Because the equation holds, the event that the card is a heart and the event that the card is an ace are independent events.


    This page titled 3.1: Defining Probability is shared under a CC BY-SA 3.0 license and was authored, remixed, and/or curated by David Diez, Christopher Barr, & Mine Çetinkaya-Rundel via source content that was edited to the style and standards of the LibreTexts platform.