3.4: Probability and Compound Events
- Page ID
- 41780
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)- Compute probability in compound events involving "and"
- Compute probability in compound events involving "or"
- Compute probability in compound events involving "and" and "or"
- Use complements for computing the probability of compound events
- Determine probability from two-way tables
- Extend the use of probability rules to other compound events
Review and Preview Probability and Compound Events
In Chapter \(3,\) we have discussed basic probability and counting concepts to prepare us for our future work with inferential statistics. To use the classical approach for determining the probability of an event \(A,\) we need to know the size of the sampe space, \(n,\) and we need to know that each of those outcomes in the sample space is equally likely to occur. Then, if event \(A \) can occur in \(x\) ways in that sample space, \(P\left( A \right)\) \(= \frac{x}{n}.\) We also discussed the empirical/experimental approach where we collect data (preferably a large data set to satisfy our Law of Large Numbers more completely) related to our situation of interest. Then, in that data set of size \(n,\) if \(x\) of the data satisfy the description of event \(A,\) we state \(P\left( A \right)\) \(= \frac{x}{n}.\) However, we must remember that the empirically based probability value only estimates the actual probability when the data is from a sample set. For example, in Section \(2.1,\) we had a relative frequency table on M&M colors built on data from a package of \(55\) M&M's, as shown below.
Color | Relative Frequency |
---|---|
Brown | \(\frac{17}{55}\approx0.309\) |
Red | \(\frac{18}{55}\approx0.327\) |
Yellow | \(\frac{7}{55}\approx0.127\) |
Green | \(\frac{7}{55}\approx0.127\) |
Blue | \(\frac{2}{55}\approx0.036\) |
Orange | \(\frac{4}{55}\approx0.073\) |
From this empirical evidence, we have probability estimates for all M&M colors. If we picked a single M&M from the population of all M&M's ever made, we would estimate, empirically, \(P(\small\text{RED }\)\(\small\text{M&M})\) \(\approx0.327\) and \(P(\small\text{BLUE }\)\(\small\text{M&M})\) \(\approx0.073.\)
We have also discussed several counting rules to help us determine the size of an entire sample space and the number of outcomes matching an event description. We established various multiplication rules for counting and an addition rule for counting. Each of these rules has conditions for their use. For example, to use the general permutation rule, we had \(n\) distinct objects in which we were interested in how many ways we could select \(r\) of those objects where the order of selection mattered. Knowing the conditions allowed us to quickly determine if a given event could be counted by a specific counting method. We now extend the multiplication and addition rules of counting to similar multiplication and addition rules for probabilities of compound events.
Probability with Compound Events Involving "and"
As discussed, some event descriptions can be compound: descriptions involving multiple events. We have discussed two phrases used to combine events: "and" as well as "or." For example, in rolling two dice, the event of \(\small{\text{ROLLING }}\)\(\small{\text{A }}\)\(\small{\text{TWO }}\)\(\small{\text{ON }}\)\(\small{\text{THE }}\)\(\small{\text{FIRST }}\)\(\small{\text{DIE}}\)\(\normalsize{\text{ and }}\)\(\small{\text{ROLLING }}\)\(\small{\text{A }}\)\(\small{\text{FIVE }}\)\(\small{\text{ON }}\)\(\small{\text{THE }}\)\(\small{\text{SECOND }}\)\(\small{\text{DIE}}\) is a compound "and" description. As another example, the event of \(\small{\text{PERSON }}\)\(\small{\text{IS }}\)\(\small{\text{OVER }}\)\(\small{\text{SIXTY }}\)\(\small{\text{YEARS }}\)\(\small{\text{OLD}}\)\(\normalsize{\text{ or }}\)\( \small{\text{IS }}\)\(\small{\text{UNDER }}\)\(\small{\text{TWELVE }}\)\(\small{\text{YEARS }}\)\(\small{\text{OLD}}\) is a compound "or" description. The event of \(\small{\text{THREE }}\)\(\small{\text{MANAGERS}}\)\( \normalsize{\text{ and }}\)\( \small{\text{SEVEN }}\)\(\small{\text{HOURLY }}\)\(\small{\text{EMPLOYEES}}\)\( \normalsize{\text{ or }}\)\( \small{\text{FOUR }}\)\(\small{\text{MANAGERS}}\)\(\normalsize{\text{ and }}\)\( \small{\text{SIX }}\)\(\small{\text{HOURLY }}\)\(\small{\text{EMPLOYEES}}\) is a compound description involving both "and" as well as "or" descriptions. Although compound event descriptions may get complicated, we aim to simplify determining the probability of such compound events by developing computation methods similar to our counting methods.
We will deal first with the "and" type of event descriptions. Our previous work with probability established that with fair dice, \(P(\small{\text{TWO }}\)\(\small{\text{ON }}\)\(\small{\text{FIRST }}\)\(\small{\text{DIE}})\) \(=\frac{1}{6}\) and \(P(\small{\text{FIVE }}\)\(\small{\text{ON }}\)\(\small{\text{SECOND }}\)\(\small{\text{DIE}})\) \(=\frac{1}{6}.\) Our work with the sample space of all outcomes of two fair dice showed \(P(\small{\text{TWO }}\)\(\small{\text{ON }}\)\(\small{\text{FIRST }}\)\(\small{\text{DIE}}\)\( \normalsize{\text{ and }}\)\( \small{\text{FIVE }}\)\(\small{\text{ON }}\)\(\small{\text{SECOND }}\)\(\small{\text{DIE}})\) \(=\frac{1}{36}.\) We notice that the product of our two event probabilities, \(\frac{1}{6} \cdot \frac{1}{6}\) \( =\frac{1}{36},\) is the same value as the compound "and" event probability. A multiplication operation happens within probability, just as in counting. Also shown in Section \(3.3,\) we must carefully check if the events separated by "and" depend on each other. Remember the example of the deck of cards \(\small{\text{SPADE }}\)\(\small{\text{FIRST}}\)\(\normalsize{\text{ and }}\)\(\small{\text{BLACK }}\)\(\small{\text{CARD }}\)\(\small{\text{SECOND}}.\) When the events are dependent, the count changes for the second event causing the probability to change. \(P( \small{\text{SPADE }}\)\(\small{\text{FIRST}}\)\(\normalsize{\text{ and }}\)\(\small{\text{BLACK }}\)\(\small{\text{CARD }}\)\(\small{\text{SECOND}})\) \( = \frac{13}{52} \cdot \frac{25}{51}.\) We must notice if the probability of the second event changes when the first event has occurred, as in the card example. With this added condition, we still see multiplication between the two event probabilities. We have a useful general Probability Multiplication Rule. Given events \(A\) and \(B,\) \[\nonumber \begin{align*} P\left( A \text{ and }B\right) &= P \left( A \right) \cdot P \left( B | A \right) \\ &=P \left(B\right) \cdot P\left( A|B\right)\end{align*} \]Recall from Section \(3.3\) that \(B|A\) is read as "event \(B\) given event \(A\) has occurred." Such a probability involving the "given" condition, \( P\left(B|A\right)\) is called a Conditional Probability.
There are events in which \( P\left( B|A\right)\) \(=P\left(B\right)\) or equivalently in which \( P\left( A|B\right)\) \(=P\left(A\right)\); the occurrence of one event does not affect the occurrence of the other event. For example, the results of the first die roll do not impact the second die roll. In events where \( P\left( B|A\right)\) \(=P\left(B\right),\) we say that the two events \(A\) and \(B\) are independent of each other. But, in our example of selecting a spade card first and then a black card second, the occurrence of the first event did impact the occurrence of the second event. In events where \( P\left( B|A\right) \ne P\left(B\right),\) we say that the two events \(A\) and \(B\) are dependent of each other. The conditional probability analysis is not needed if the independence of the events is already known, we need only multiply the two simple event probabilities together, a Probability Multiplication Rule for Known Independent Events: \[\nonumber \begin{align*} P\left( A \text{ and }B\right) &= P \left( A \right) \cdot P \left( B \right) \\ &=P \left(B\right) \cdot P\left( A \right).\end{align*}\]We reiterate, use of this rule requires us to know the independence of the events involved before computing our compound probability value. We see in later content that the independence of events can be required to apply specific statistical analyses. Generally, it is best to assume events are dependent and be pleasantly surprised when we can show the events are independent.
Answer the following compound probability questions. Recognize that the given event description is a compound event that can be broken down into multiple events.
- We throw a fair die and randomly select a card from a standard deck. What is the probability of getting a \(6\) on the die and an ace on the card draw?
- Answer
-
This is a compound "and" event description. We also notice that the two simple events are independent of each other \[ \nonumber \begin{align*} P\left( A \text{ and } B \right) &= P\left( \small{\text{SIX ON DIE}}\normalsize\text{ and }\small{\text{ACE ON CARD DRAW | SIX ON DIE}}\right) \\ &= P\left( \small{\text{SIX ON DIE}}\normalsize\text{ and }\small{\text{ACE ON CARD DRAW}}\right) \\ &= P\left( \small{\text{SIX ON DIE}}\right) \cdot P\left(\small{\text{ACE ON CARD DRAW}}\right) \\ &=\frac{1}{6} \cdot \frac{4}{52} \\ &= \frac{1}{78} \approx 1.2821\% \end{align*} \]
- We shuffle a standard deck of playing cards so the cards are randomly placed through the deck.
- We draw two cards. What is the probability of getting two face cards if the first card is replaced randomly in the deck before drawing the second?
- We draw two cards. What is the probability of getting two face cards if the first card is not replaced in the deck before drawing the second?
- Answer
-
- We do not want to build the sample space in this situation: there are \(52 \cdot 52\) \(= 2,704\) different outcomes. We notice that if the first card is replaced randomly in the deck before drawing the second, the two events are independent. We also notice that the deck has \(12\) face cards. By our multiplication rule: \[ \nonumber \begin{align*}P\left( A \text{ and } B \right) &= P \left( \small{\text{FACE CARD ON FIRST DRAW}}\normalsize\text{ and }\small{\text{FACE CARD ON SECOND DRAW}}\right) \\ &= P \left( \small{\text{FACE ON FIRST FIRST}} \right) \cdot P \left( \small{\text{FACE ON SECOND | FACE CARD ON FIRST}} \right) \\ &=P \left( \small{\text{FACE ON FIRST FIRST}} \right) \cdot P \left( \small{\text{FACE ON SECOND}} \right) \\ &\frac{12}{52} \cdot \frac{12}{52} \\ &= \frac{144}{2,704} = \frac{9}{169} \\ &\approx 5.3254\% \end{align*} \]We notice this is not an unusual event.
- If the first card is not replaced in the deck before drawing the second, the second event's probability depends on the first event occurring because the number of face cards still in the deck will be down to \(11.\) The number of cards in the entire deck will be \(51\) By our multiplication rule:\[\nonumber\begin{align*}P\left( A \text{ and } B \right) &= P\left(\small{\text{FACE CARD ON FIRST DRAW}}\normalsize\text{ and }\small{\text{FACE CARD ON SECOND DRAW}}\right) \\ &=P\left(\small{\text{FACE ON FIRST FIRST}}\right)\cdot P\left(\small{\text{FACE ON SECOND | FACE ON FIRST}}\right)\\&=\frac{12}{52}\cdot \frac{11}{51}\\&=\frac{132}{2,652}=\frac{11}{221}\\ &\approx 4.9774\%\end{align*}\]Although the probability measure does not differ from the first situation drastically, it is different. The event is a little less likely to occur if the first card is not replaced; this is something we might have expected.
- Suppose we have our bag of \(55\) M&M candies with the color distribution as given previously:
Table \(\PageIndex{2}\): Frequencies and Relative Frequencies of Sampled M&M's Color Relative Frequency Brown \(\frac{17}{55}\) Red \(\frac{18}{55}\) Yellow \(\frac{7}{55}\) Green \(\frac{7}{55}\) Blue \(\frac{2}{55}\) Orange \(\frac{4}{55}\) We randomly grab four candies, one at a time, without replacement. What is the probability that we have a red candy first, a green candy second, a brown candy third, and a brown candy fourth?
- Answer
-
These events are not independent. For example, the probability of getting a green candy second will vary depending on whether we had a red candy first. By our multiplication rule:\[\nonumber\begin{align*} P\left(\small{\text{RED }}\normalsize1^{st}\text{ and }\small{\text{GREEN }} \normalsize2^{nd}\text{ and }\small{\text{BROWN }}\normalsize3^{rd} \text{ and } \small{\text{BROWN }}\normalsize 4^{th}\right) &=P\left(\small{\text{RED}}\right)\cdot P\left(\small{\text{GREEN | RED }}\normalsize1^{st} \right)\cdot P \left(\small{\text{BROWN | RED}}\normalsize \text{ and } \small{\text{ GREEN}}\normalsize{\text{ previously}}\right)\\ &~~~\cdot P\left(\small{\text{BROWN | RED}}\normalsize \text{ and } \small{\text{ GREEN}}\normalsize \text{ and } \small{\text{BROWN}}\normalsize{\text{ previously}}\right) \\[5pt]&=\frac{18}{55}\cdot \frac{7}{54} \cdot \frac{17}{53} \cdot \frac{16}{52} \\&=\frac{34,272}{8,185,320}=\frac{476}{113,685} \\ &\approx 0.4187\%\end{align*}\]
- Senior citizens make up about \(18\%\) of the U.S. adult population, according to the Pew Research Center website.
- What is the probability of randomly selecting two U.S. adults who are both senior citizens?
- What is the probability of randomly selecting two U.S. adults, the first not a senior citizen and the second is a senior citizen?
- What is the probability of randomly selecting three U.S. adults such that the first two are not senior citizens and the third is a senior citizen
- Answer
-
- As the U.S. population of senior citizens is very large, we will assume independence in the events, realizing our probability measures are approximations. By our multiplication rule: \[ \nonumber \begin{align*}P\left( A \text{ and } B \right) &= P \left( \small{\text{SENIOR CITIZEN}}\normalsize\text{ and }\small{\text{SENIOR CITIZEN}}\right) \\ &= P \left( \small{\text{SENIOR CITIZEN}} \right) \cdot P \left( \small{\text{SENIOR CITIZEN}} \right) \\ &= 18\% \cdot 18\% \\ &\approx 3.24\% \end{align*} \]We do emphasize that assumption of independence is not always reasonable, but is used at times when dealing with large population/sample sizes as probability measures do not get impacted significantly in value. We can see this in an example of comparison on, say, a dependence measure of \(\frac{2456783}{34595200} \cdot \frac{2456782}{34595199}\) \(\approx 0.504314831996\% \) versus assumed independence measure of \(\frac{2456783}{34595200} \cdot \frac{2456783}{34595200}\) \(\approx 0.50431502269\%.\) Although the probability values are technically different, the practical interpretation for real-life application would not usually have practical meaning in the difference. We do note that we reflect carefully before assuming independence as it can lead to drastic consequences.
- Again, as the U.S. population is very large, we will assume independence in the events. By our complement rule, we note that about \(82\%\) of the U.S. adult population is less than \(65\) years old. By our multiplication rule: \[ \nonumber \begin{align*} P \left( \small{\text{NOT A SENIOR CITIZEN}}\normalsize\text{ and }\small{\text{SENIOR CITIZEN}}\right) &= P \left( \small{\text{NOT A SENIOR CITIZEN}} \right) \cdot P \left( \small{\text{SENIOR CITIZEN}} \right) \\ &= 82\% \cdot 18\% \\ &\approx 14.76\% \end{align*} \]
- Again, as the U.S. population is very large, we will assume independence in the events. By our complement rule, we note that about \(82\%\) of the U.S. adult population is less than \(65\) years old. By our multiplication rule: \[ \nonumber \begin{align*} P \left( \small{\text{NOT A SENIOR CITIZEN}}\normalsize\text{ and }\small{\text{NOT A SENIOR CITIZEN}}\normalsize\text{ and }\small{\text{SENIOR CITIZEN}}\right) &= 82\% \cdot 82\% \cdot 18\% \\ &\approx 12.10\% \end{align*} \]
Probability with Compound Events Involving "or"
Now that we have handled probability measures on events separated by "and," we turn to probability and events separated by "or." For example, we might ask, "In rolling a single fair die, what is the probability of rolling a two or a three?" In the case of our M&M's, we might ask, "What is the probability of the next M&M we randomly remove from a bag being a blue or an orange M&M?" As discussed in Section \(3.3,\) the "or" compound description is tied to addition. As a reminder, our developed Addition Rule for Counting was given as\[\nonumber \text{# of outcomes in }(A \text{ or }B)=\text{# of outcomes in }A+\text{# of outcomes in }B-\text{# of outcomes in }(A \text{ and }B)\] Our developed rule reminds us to check for the outcomes that matched the descriptions of both events and to subtract the set of the twice-counted outcomes.
Similar to how the Multiplication Rule for Counting extends to the Multiplication Rule for Probability, the Addition Rule for Counting will also extend to an Addition Rule for Probability.\[\nonumber P \left( A \text{ or }B \right)=P \left( A \right) + P \left( B \right) - P \left( A \text{ and }B \right) \]In answering the question, "In rolling a single fair die, what is the probability of rolling a two or a three?", we can apply this addition rule to produce\[ \nonumber \begin{align*} &\quad \, P \left( \small{\text{ROLLING A TWO}} \normalsize\text{ or } \small{\text{ROLLING A THREE}} \right) \\ &= P \left( \small{\text{ROLLING A TWO}} \right) + P \left( \small{\text{ROLLING A THREE}} \right) - P \left( \small{\text{ROLLING A TWO}} \normalsize\text{ and } \small{\text{ROLLING A THREE}} \right) \\ &= \frac{1}{6} + \frac{1}{6} - \frac{0}{6} \\ &= \frac{2}{6} = \frac{1}{3} \approx 33.3333\% \end{align*} \]Since the sample space of rolling a single fair die is small in size, one could likely answer this question much faster by knowing that only two of the six outcomes in the sample space meet the description of being "\(\small\text{EITHER }\)\(\small\text{A }\)\(\small\text{TWO }\)\(\normalsize\text{or }\)\(\small\text{A }\)\(\small\text{THREE}.\) But as we see in the text exercises, this addition rule can be very useful when the sample spaces are large.
Answer the following compound probability questions. Recognize that the given event description is a compound event that can be broken down into multiple events.
- A standard deck of playing cards is well-shuffled for randomness. Determine the following probabilities about a single card draw.
- Find the probability that a randomly drawn card is a king or a queen.
- Find the probability that a randomly drawn card is black or an ace.
- Find the probability that a randomly drawn card is a spade or a face card.
- Answer
-
- This is a compound "or" event description, as the event of interest is a king or queen card.\[ \nonumber \begin{align*} &\quad \, P \left( \small{\text{CARD IS A KING}} \normalsize\text{ or } \small{\text{CARD IS A QUEEN}} \right) \\ &= P \left( \small{\text{KING}} \right) + P \left( \small{\text{QUEEN}} \right) - P \left( \small{\text{KING}} \normalsize\text{ and } \small{\text{QUEEN}} \right) \\ &= \frac{4}{52} + \frac{4}{52} - \frac{0}{52} \\ &= \frac{8}{52} = \frac{2}{13} \approx 15.3846\% \end{align*} \]We notice that, because \( P \left( \small{\text{KING}} \normalsize \text{ and } \small{\text{QUEEN}} \right)\) \(= 0\%,\) these two events are mutually exclusive. There is a little over \(15\%\) probability that a random card draw will produce a king or a queen. Although not highly probable, we would not consider this outcome unusual.
- This is a compound "or" event description, as the event of interest is a black card or an ace.\[ \nonumber \begin{align*} &\quad \, P \left( \small{\text{CARD IS BLACK}} \normalsize \text{ or } \small{\text{CARD IS AN ACE}} \right) \\ &= P \left( \small{\text{BLACK}} \right) + P \left( \small{\text{ACE}} \right) - P \left( \small{\text{BLACK}} \normalsize\text{ and } \small{\text{ACE}} \right) \\ &= \frac{26}{52} + \frac{4}{52} - \frac{2}{52} \\ &= \frac{28}{52} = \frac{7}{13} \approx 53.8462\% \end{align*} \]We notice that, because \( P ( \small{\text{BLACK}}\)\( \normalsize \text{ and }\)\( \small{\text{ACE}})\) \( \ne 0\%\), these two events are not mutually exclusive. There is almost a \(54\%\) chance that a random card draw will produce a black or an ace card; such an outcome is reasonably probable.
- This is a compound "or" event description, as the event of interest is a card that is a spade or a face card. We note "\(\small{\text{FACE CARD}}\)" is also an "or" event since "\(\small{\text{FACE CARD}}\)" means "\(\small{\text{KING}} \normalsize \text{ or } \small{\text{QUEEN}} \normalsize \text{ or } \small{\text{JACK}}\)". Sometimes, event descriptions can be rephrased to make a compound event more evident as an "and" or an "or" type event\[ \nonumber \begin{align*} &\quad \, P \left( \small{\text{CARD IS SPADE}} \normalsize\text{ or } \small{\text{FACE CARD}} \right) \\ &= P \left( \small{\text{SPADE}} \right) + \left( P \left( \small{\text{KING}} \right) + P \left( \small{\text{QUEEN}} \right) + P \left( \small{\text{JACK}} \right) \right)- P \left( \small{\text{SPADE}} \normalsize \text{ and } \small{\text{FACE CARD}} \right) \\ &= \frac{13}{52} + \left( \frac{4}{52} + \frac{4}{52} + \frac{4}{52} \right)- \frac{3}{52} \\ &= \frac{22}{52}\\ &= \frac{11}{26} \approx 42.3077\% \end{align*} \]We again notice, because \( P( \small{\text{SPADE}}\)\( \normalsize \text{ and }\)\( \small{\text{FACE }}\)\(\small{\text{CARD}} )\) \( \ne 0\%\), these two events are not mutually exclusive. There is a little over \(42\%\) probability that a random card draw will produce a spade or a face card.
- An online clothing store has a liberal return policy since customers cannot try on the items before purchasing. From survey data gathered in the return process from their customers, \(21\%\) of all purchased items are returned due to the item being too small/tight, while \(4\%\) are returned due to the item being too big/loose. If a customer-purchased item is randomly selected from all purchases, what is the probability that the item will be returned due to being too big/loose or too small/tight?
- Answer
-
This is a compound "or" event description as the purchased item must be returned due to being too big/loose or too small/tight. We notice that these two events are mutually exclusive; we assume that a clothing item cannot simultaneously be too big and too small\[ \nonumber \begin{align*} &\quad \, P \left( \small{\text{TOO BIG/LOOSE}} \normalsize\text{ or } \small{\text{TOO SMALL/TIGHT}} \right) \\ &= P \left( \small{\text{TOO BIG/LOOSE}} \right) + P \left( \small{\text{TOO SMALL/TIGHT}} \right) - P \left( \small{\text{TOO BIG/LOOSE}} \normalsize\text{ and } \small{\text{TOO SMALL/TIGHT}} \right) \\ &= 21\% + 4\% - 0\% = 25\% \end{align*} \]Twenty-five percent of all customer-purchased items are returned due to being either too big/loose or too small/tight. Such a significant return rate for these two issues will likely require the store to find ways to help customers find a better fit for their orders.
- We return to our bag of \(55\) M&M candies.
Table \(\PageIndex{3}\): Frequencies and Relative Frequencies of Sampled M&M's Color Relative Frequency Brown \(\frac{17}{55}\) Red \(\frac{18}{55}\) Yellow \(\frac{7}{55}\) Green \(\frac{7}{55}\) Blue \(\frac{2}{55}\) Orange \(\frac{4}{55}\)
- What is the probability of randomly drawing one candy of a primary color, red, yellow, or blue?
- What is the probability of randomly drawing one candy that is not of one of those primary colors?
- Answer
-
- By our addition rule and noticing the events are mutually exclusive, we compute\[ \nonumber \begin{align*} P \left( \small{\text{PRIMARY COLOR M&M}} \right) &=P \left( \small{\text{RED M&M}}\normalsize\text{ or }\small{\text{YELLOW M&M}}\normalsize\text{ or }\small{\text{BLUE M&M}}\right) \\ &= P \left( \small{\text{RED}} \right) + P \left( \small{\text{YELLOW}} \right) + P \left( \small{\text{YELLOW}} \right) \\ &= \frac{18}{55} + \frac{7}{55} + \frac{2}{55} \\ &= \frac{27}{55} \approx 49.0909\%. \end{align*} \]
- We could apply our addition rule again relative to the colors of brown, green, and orange. But we notice this is also a complement to the previous event.\[ \nonumber \begin{align*}P \left( \small{\text{NOT PRIMARY COLOR M&M}} \right) &= 1 - P \left( \small{\text{PRIMARY COLOR M&M}} \right) \\ &\approx 1 - 0.490909 \\ &= .509091 = 50.9091\% \end{align*} \]We notice that the probability of randomly selecting a primary color M&M is practically the same as getting a non-primary color. Stated equivalently, the proportion of M&M's is approximately \(50\%\) for each color grouping.
Extended Concepts on Compound Events
As mentioned, we sometimes have event descriptions that include multiple compounding actions. Thankfully, we do not need any extra computation rules for these. However, we do have to read very carefully to properly understand the compound event descriptions and ask for clarification if needed.
We must clarify a small detail about our "AND" rule. In our previous examples using compound "AND" event descriptions, we worked with descriptions involving "sequential trials" of two or more trials: tossing two dice, drawing four M&M candies, and randomly selecting two people. What if we are describing an event with the word "and" but in a single trial: Selecting a single M&M that is both red and brown, drawing a single card that is red and a face card? We use a compound "and," but does our multiplication rule apply in a single trial event? We can often reflect on our sample space and naturally handle the "and" within a single trial. Our general multiplication rule still works, but the two events separated by "and" will often be dependent in a single trial situation. We must carefully consider the conditional probability on the second described event as we apply the multiplication rule.
Let us first examine the probability of selecting a single M&M that is both red and brown from our bag of \(55\) M&M's. Based on the given information (and our experience eating M&M's), we can reason that there are no M&M's that meet that description, so the probability is \(\frac{0}{55}\) \(=0\%.\) By reflecting on the sample space, we determine the probability value; no special probability rule is necessary, but let us see that the multiplication rule still works.\[ \nonumber \begin{align*} P\left(\small{\text{RED }}\normalsize\text{ and }\small{\text{BROWN}} \right) &=P\left(\small{\text{RED}}\right)\cdot P\left(\small{\text{BROWN | RED }} \right) \\&=\frac{18}{55}\cdot \frac{0}{18} \\&=\frac{\cancel{18}}{55}\cdot \frac{0}{\cancel{18}} \\&=\frac{0}{55}=0\%\end{align*} \]We can see that in handling the conditional probability of selecting a single M&M that is \(\small{\text{BROWN}}\) given that we are restricted to the \(\small{\text{RED}}\) M&M's produces the probability value \(\frac{0}{18};\) none of the red M&M's are brown. Our multiplication rule for "AND" still works for even single trial cases. It is acceptable and encouraged to reflect on the sample space and not use our multiplication rule to determine such probability values. We should use sound reasoning, not blind use of formulas, to determine measures.
To see another example, find the probability of drawing a single card that is both red and a face card. By thinking about the sample space of a deck of cards, one might quickly reason that there are six cards in the deck of fifty-two that are red face cards. The probability is \(\frac{6}{52}\) \(=\frac{3}{26}\) \(\approx 11.5385\%.\) Let us check that our multiplication rule produces this value as well.\[ \nonumber \begin{align*} P\left(\small{\text{RED }}\normalsize\text{ and }\small{\text{FACE CARD}} \right) &=P\left(\small{\text{RED}}\right)\cdot P\left(\small{\text{FACE CARD | RED }} \right) \\&=\frac{26}{52}\cdot \frac{6}{26} \\&=\frac{\cancel{26}}{52}\cdot \frac{6}{\cancel{26}} \\&=\frac{6}{52}\approx 11.5385\%\end{align*} \]We can see that in handling the conditional probability of selecting a single \(\small{\text{FACE }}\) \(\small{\text{CARD}}\) given that we are restricted to the \(\small{\text{RED}}\) cards produces the probability value \(\frac{6}{52}\) since six of the red cards are face cards. Using conditional probability with the multiplication rule can be handy in more challenging event descriptions.
Let's try a few single trial "and" compound event exercises.
Answer the following single-trial compound probability questions. Recognize that the given event description is a compound event that can be broken down into multiple events.
- We draw a single card from a well-shuffled standard deck of playing cards. What is the probability of getting a black ace?
- Answer
-
This is a compound "and" event description as the card must be both black and an ace.\[ \nonumber \begin{align*} P\left( \small{\text{BLACK CARD}}\normalsize\text{ and }\small{\text{ACE CARD}}\right) &= P\left( \small{\text{BLACK CARD}}\right) \cdot P\left(\small{\text{ACE CARD | BLACK CARD}}\right) \\ &=\frac{26}{52} \cdot \frac{2}{26} \\ &= \frac{2}{52} =\frac{1}{26} \\ &\approx 3.8462\% \end{align*} \]We could have answered this by reflecting on the sample space of the \(52\) outcomes, realizing there are \(2\) black aces, but our multiplication rule does work.
- Based on \(2023-2024\) information from the U.S. Center for Disease Control website, about \(70\%\) of senior citizens (65 years and older) in the U.S. get the flu vaccine, whereas about \(40%\) of those adults under \(65\) years old get vaccinated. Senior citizens make up about \(18\%\) of the U.S. adult population, according to the Pew Research Center website.
- What is the probability of randomly selecting one U.S. adult who is a senior citizen and has had the flu shot?
- What is the probability of randomly selecting one U.S. adult who is not a senior citizen and has not had the flu shot?
- Answer
-
- We can use our multiplication rule. \[ \nonumber \begin{align*} P \left( \small{\text{SENIOR CITIZEN}}\normalsize\text{ and }\small{\text{FLU SHOT}}\right) &= P \left( \small{\text{SENIOR CITIZEN}} \right) \cdot P \left( \small{\text{FLU SHOT | SENIOR CITIZEN}} \right) \\ &= 18\% \cdot 70\% \\ &= 0.18 \cdot 0.70 \\ &= 0.126 \\ &= 12.6\% \end{align*} \]In \(2023-2024,\) about \(12.6\%\) of the U.S. adult population consisted of senior citizens who had taken the flu shot.
- We can use our multiplication rule. \[ \nonumber \begin{align*}P \left( \small{\text{NOT A SENIOR CITIZEN}}\normalsize\text{ and }\small{\text{NO FLU SHOT}}\right) &= P \left( \small{\text{NOT A SENIOR CITIZEN}} \right) \cdot P \left( \small{\text{NO FLU SHOT | NOT A SENIOR CITIZEN}} \right) \\ &= 82\% \cdot 60\%\\ &= 49.2\% \end{align*} \]In \(2023-2024,\) about \(49.2\%\) of the U.S. adult population consisted of non-senior citizens who had not taken the flu shot. Notice that this tells health officials about the demographics of those who did not have the flu shot.
Now, we focus on investigating our event descriptions with multiple compound events. No new probability calculation rules are necessary; we must apply our current understanding to slightly new contexts. For example, suppose we wish to find the probability of first throwing a fair die with a result of an even number and then drawing a card from a standard deck of playing cards that is either a red or a face card. This compound event description involves both "and" and "or." Reading carefully, we break down the description: a first event involving throwing a fair die, then a second event involving drawing a card. We notice these two events are independent. In the card draw event, we note that drawing a red card and a face card are not mutually exclusive. We have\[ \nonumber \begin{align*} &\quad \, P \left( \small{\text{DIE TOSS OF EVEN}} \normalsize\text{ and } \left[ \small{\text{CARD DRAW OF RED}}\normalsize\text{ or }\small{\text{FACE}} \right] \right)\\ &= P \left( \small{\text{DIE TOSS OF EVEN}} \right) \cdot P \left( \small{\text{CARD DRAW RED}} \normalsize\text{ or } \small{\text{FACE}} \right) \\ &= P \left( \small{\text{DIE TOSS OF EVEN}} \right) \cdot \left( P \left( \small{\text{CARD DRAW RED}} \right) + P \left( \small{\text{CARD DRAW FACE}} \right) - P \left( \small{\text{CARD DRAW RED}}\normalsize\text{ and } \small{\text{FACE}} \right) \right) \\ &= \frac{3}{6} \cdot \left( \frac{26}{52} + \frac{12}{52} - \frac{6}{52} \right) \\ &= \frac{1}{2} \cdot \left( \frac{32}{52} \right) \\ &= \frac{1}{2} \cdot \frac{8}{13} = \frac{4}{13} \approx 30.7692\%. \end{align*} \]
Determine the probabilities of each of the following.
- We draw five cards from a well-shuffled standard deck of playing cards. What is the probability of getting a flush (all cards share the same suit)?
- Answer
-
This is a compound "and" event description, as the five cards must all be of one suit. We note that the suit does not matter, so the first card drawn can be of any suit; the remaining cards must match the first. The events are not independent.\[ \nonumber \begin{align*} &\quad \, P\left( \small{\text{FLUSH}} \right) \\ &= P \left(\small{\text{ANY SUIT CARD }}\normalsize1^{st}\text{ and }\small{\text{SAME SUIT CARD }}\normalsize2^{nd} \text{ and }\small{\text{SAME SUIT CARD }}\normalsize3^{rd} \text{ and }\small{\text{SAME SUIT CARD }}\normalsize 4^{th} \text{ and }\small{\text{SAME SUIT CARD }}\normalsize 5^{th}\right) \\ &= P \left(\small{\text{ANY SUIT CARD }}\normalsize1^{st}\right) \cdot P\left(\small{\text{SAME SUIT CARD }}\normalsize2^{nd} | 1^{st}\small{\text{ CARD SUIT}}\right) \cdot P\left(\small{\text{SAME SUIT CARD }}\normalsize3^{rd} | 1^{st},2^{nd}\small{\text{ CARD SUIT}}\right) \\ &\quad \quad \quad \cdot P\left(\small{\text{SAME SUIT CARD }}\normalsize 4^{th} | 1^{st},2^{nd},3^{rd}\small{\text{ CARD SUIT}}\right) \cdot P\left(\small{\text{SAME SUIT CARD }}\normalsize 5^{th} | 1^{st},2^{nd},3^{rd},4^{th}\small{\text{ CARD SUIT}}\right) \\ &=\frac{52}{52} \cdot \frac{12}{51} \cdot \frac{11}{50} \cdot \frac{10}{49} \cdot \frac{9}{48} \\ &= \frac{33}{16660} \approx 0.1981\% \end{align*} \]It should not be surprising that flush of five cards from a well-shuffled deck is a very unlikely event.
- If we toss a fair die and then flip a fair coin, what is the probability that we get either a \(6\) on the die or a head on the coin (or both)?
- Answer
-
We approach this in two different ways. The first may initially sound easier or more natural, but the second brings us different, valuable insights. Accurate rephrasing of a given situation can make the probability calculation different yet produce the same results.
- The basic events are a toss of a fair die followed by a flip of a fair coin. Our event description is of the "or" type. Since we can toss a \(6\) and get a head on the coin in one execution of the situation, we note that our events are not mutually exclusive. By our addition rule, we obtain the following.\[ \nonumber \begin{align*} & \quad \, P \left( 6\small{\text{ ON DIE}}\normalsize\text{ or }\small{\text{HEAD ON COIN}}\right) \\ &=\normalsize P \left( 6\small{\text{ ON DIE}} \right) + P \left( \small{\text{HEAD ON COIN}} \right) - \normalsize P \left( 6\small{ \text{ ON DIE}}\normalsize\text{ and }\small{\text{HEAD ON COIN}}\right) \\ &= \frac{1}{6} + \frac{1}{2} - \left( \frac{1}{6} \cdot \frac{1}{2} \right) \\ &= \frac{7}{12} \approx 58.3333\% \end{align*} \]
- By examining complement descriptions first, the complement of "either a \(6\) on the die or a head on the coin (or both)" is "not getting a \(6\) on the die and also not a head on the coin". Now, we can use our complement and multiplication rule.\[ \nonumber \begin{align*} & \quad \, P \left( 6\small{ \text{ ON DIE}}\normalsize\text{ or }\small{\text{HEAD ON COIN}}\right) \\ &= \normalsize1 - \left( P \left( \small{\text{NOT A }\normalsize6\small\text{ ON DIE}}\normalsize \text{ and } \small{\text{NOT A HEAD ON COIN}}\right) \right) \\ &= \normalsize 1 - \left( P \left( \small{\text{NOT A }\normalsize6\small\text{ ON DIE}} \right) \cdot P \left( \small{\text{NOT A HEAD ON COIN}}\right) \right) \\ &=1 - \left( \frac{5}{6} \cdot \frac{1}{2} \right) \\ &= 1 - \frac{5}{12} = \frac{7}{12} \approx 58.3333\% \end{align*} \]We do notice we get the same answer although computed through a very different approach.
- If we roll a fair die three times, what is the probability that one or more throws will come up with a \(1?\)
- Answer
-
On the surface, this sounds easy, but that changes once we think about the possible ways to have one or more of the three approaches produce a \(1.\) We could have the first toss produce a \(1\) while the other two do not, or we can have the second produce a \(1\) and the other two not, or we can have the first two tosses both produce a \(1\) and the third not, or many other possibilities. We must determine many probabilities to use our rule with "or." However, there is an easier way. Notice the complement of "at least one of the throws will be a \(1\)" is given by "none of the three throws produce a \(1.\)"
We obtain the following by our complement and multiplication rule, utilizing the independence of the die throws.\[ \nonumber \begin{align*} &\quad \, P\left( \small{ \text{AT LEAST ONE - }}\normalsize 1 \small{\text{ AMONG THREE TOSSES}} \right) \\ &=\normalsize 1 - P \left( \small{ \text{NOT }\normalsize 1 \small \text{ ON FIRST DIE}}\normalsize\text{ and }\small{ \text{NOT }\normalsize 1\small \text{ ON SECOND DIE}}\normalsize\text{ and }\small{\text{NOT } \normalsize 1 \small \text{ ON THIRD DIE}} \right) \\ &= 1 - \left[ P \left( \small{ \text{NOT }\normalsize 1 \small \text{ ON FIRST DIE}} \right) \cdot P \left( \small{ \text{NOT }\normalsize 1 \small \text{ ON SECOND DIE}} \right) \cdot P \left( \small{ \text{NOT }\normalsize 1 \small \text{ ON THIRD DIE}} \right) \right] \\ &=1 - \left( \frac{5}{6} \cdot \frac{5}{6} \cdot \frac{5}{6} \right) \\ &= 1 - \frac{125}{216} = \frac{91}{216} \approx 42.1296\% \end{align*} \]
- Based on \(2023-2024\) information from the U.S. Center for Disease Control website, about \(70\%\) of senior citizens (65 years and older) in the U.S. get the flu vaccine, whereas about \(40%\) of those adults under \(65\) years old get vaccinated. Senior citizens make up about \(18\%\) of the U.S. adult population, according to the Pew Research Center website. What is the probability of randomly selecting two U.S. adults who are both senior citizens and recipients of the flu vaccine?
- Answer
-
Due to the large population size, we assume the selection of individuals is independent. We believe that if the first selected adult is a senior citizen, the second selected adult still has a \(18\%\) chance of being a senior citizen; likewise, we assume that the probability that the second person chosen is vaccinated is the same as the first person. We now use our multiplication rule.\[ \nonumber \begin{align*} &\quad \, P \left( \left[\small{\text{FIRST IS A SENIOR CITIZEN}}\normalsize\text{ and }\small{\text{FIRST HAS FLU SHOT}} \right] \normalsize\text{ and } \left[\ slight {\text{SECOND IS A SENIOR CITIZEN}}\normalsize\text{ and }\small{\text{SECOND HAS FLU SHOT}} \right] \right) \\ &= \left[ P \left( \small{\text{FIRST IS A SENIOR CITIZEN}} \right) \cdot P \left( \small{\text{FIRST HAS FLU SHOT | FIRST IS A SENIOR CITIZEN}} \right)\right] \cdot \left[ P \left( \small{\text{SECOND IS A SENIOR CITIZEN}} \right) \cdot P \left( \small{\text{SECOND HAS FLU SHOT | SECOND IS A SENIOR CITIZEN}} \right)\right] \\ &= \left(18\% \cdot 70\% \right) \cdot \left(18\% \cdot 70\%\right)\\ &\approx 1.5876\% \end{align*} \]So in regard to this \(2023-2024\) data, it would be unusual for us to randomly select two U.S adults both of whom were flu vaccinated senior citizens.
Two-Way Tables and Probability
Sometimes, data frequencies for a collected single variable are disaggregated, separated into mutually exclusive subgroups. For example, quiz data frequencies for a class may be separated by the frequency of those who passed the quiz versus those who did not. Alternatively, perhaps the data is separated into those students who are in extra-curricular school activities and those who are not. If done well, this allows a comparison between subgroups.
Let us go one step further in this discussion: disaggregating data in two ways. These results are displayed in a Two-Way Table or a Contingency Table. We take the quiz data of a class of \(20\) students first given in Section \(2.4\) and separate by passing, \(6\) or above, and non-passing scores, below \(6,\) then by student involvement in extra-curricular school activities resulting in the following two-way table.
Disaggregation | Passed Quiz | Failed Quiz |
Not Involved in Extra-Curricular School Activities |
\(7\) | \(1\) |
Involved in Extra-Curricular School Activities |
\(9\) | \(3\) |
When reading the table, we must pay attention to the row and column headings. The value \(3\) in the table implies that \(3\) of the \(20\) students were involved in extra-curricular activities and failed the quiz. We often include row and column totals for each disaggregation when working with two-way tables.
Disaggregation | Passed Quiz | Failed Quiz | Row Totals |
Not Involved in Extra-Curricular School Activities |
\(7\) | \(1\) | \(8\) |
Involved in Extra-Curricular School Activities |
\(9\) | \(3\) | \(12\) |
Column Totals | \(16\) | \(4\) | \(20\) |
We can see a total of \(20\) student data points. Of those, there were \(16\) that passed the quiz, while there were \(8\) not involved in extra-curricular activities.
We can now answer several probability/proportion questions concerning these results. What is the probability that a randomly selected student from the class was involved in extra-curricular school activities? By our classical probability approach, \(P( \small{\text{IN }}\)\(\small{\text{EXTRA }}\)\(\small{\text{CURRICULAR}})\) \( = \frac{12}{20}\) \( = \frac{3}{5}\) \(=60\%.\)
What is the probability that a randomly selected student from the class failed the quiz and was not involved in extra-curricular activities? Since randomly selecting a student from the twenty, \(P( \small{\text{FAILED}}\)\(\small{\text{ QUIZ}} \normalsize \text{ and } \small{\text{NOT }}\)\(\small{\text{IN }}\)\(\small{\text{EXTRA }}\)\(\small{\text{CURRICULAR}} ) \) \(= \frac{1}{20}\) \(= 5\%.\) This is because the table shows only \(1\) student in both the \(\small{\text{FAILED }}\)\(\small{\text{ QUIZ}}\) column as well as the \( \small{\text{NOT }}\)\(\small{\text{ IN }}\)\(\small{\text{ EXTRA }}\)\(\small{\text{ CURRICULAR}} \) row. We could also apply our multiplication rule, noting the events are not independent.\[ \nonumber \begin{align*} &\quad \, P \left( \small{\text{FAILED QUIZ}} \normalsize\text{ and } \small{\text{NOT IN EXTRA CURRICULAR}} \right) \\ &= P \left( \small{\text{FAILED QUIZ}} \right) \cdot P \left( \small{\text{NOT IN EXTRA CURRICULAR | FAILED QUIZ}} \right) \\ &=\frac{4}{20} \cdot \frac{1}{4} = \frac{1}{20} = 5\% \end{align*} \]In this situation, the first approach is much easier due to the table information.
Answer the following probability questions about a given two-way table.
- A polygraph device is being studied for its accuracy in lie detection. A group of \(200\) randomly drawn participants were divided into two groups: those instructed not to lie and those who were supposed to lie. The technician running the device did not know each participant's group. Each participant was tested with the device, and a positive or negative detection of a lie was returned. The results are given in the following contingency table.
Disaggregation Positive Lie
Detection ReadingNegative Lie
Detection ReadingParticipant Did Not Lie \(9\) \(71\) Participant Did Lie \(85\) \(35\)
- What is the probability that a randomly selected participant had a positive lie detection reading?
- What proportion of the participants had a false positive reading (the device detected a lie, but the participant did not lie)?
- What is the probability of randomly selecting two participants in sequence so that both had a positive lie detection reading?
- What is the probability of randomly selecting a participant who lied but had a negative detection reading or who did not lie but had a positive detection reading? In other words, for what proportion of the trials was the device inaccurate?
- What is the probability of randomly selecting two participants in sequence so that the first lied with a positive detection reading and the second did not lie with a negative detection reading?
- Answer
-
We first add column and row totals to our table.
Disaggregation Positive Lie
Detection ReadingNegative Lie
Detection ReadingRow Totals Participant Did Not Lie \(9\) \(71\) \(80\) Participant Did Lie \(85\) \(35\) \(120\) Column Totals \(94\) \(106\) \(200\) Now, we answer the probability/proportion questions using appropriate probability rules.
- From the two-way table, we notice there were \(94\) participants that had positive lie detection readings. So, \( P( \small{\text{POSITIVE }}\)\(\small{\text{ DETECTION}})\) \( = \frac{94}{200}\) \( = \frac{47}{100}\) \( = 47\%.\)
- From the table, we can notice there were \(9\) participants had a false positive reading. So, we can quickly produce results of \( P( \small{\text{FALSE }}\)\(\small{\text{POSITIVE }}\)\(\small{\text{DETECTION}})\) \( = \frac{9}{200} \) \(= 4.5\%.\)
We instead might have noticed this was an "and" compound event: \( \small{\text{DID }}\)\(\small{\text{NOT }}\)\(\small{\text{LIE}}\normalsize\text{ and } \small{\text{POSITIVE }}\)\(\small{\text{LIE }}\)\(\small{\text{DETECTION}}\). Therefore, it would also have been appropriate (though more cumbersome) to measure the proportion using our general multiplication rule.\[ \nonumber \begin{align*} &\quad \, P \left( \small{\text{DID NOT LIE}} \normalsize\text{ and } \small{\text{POSITIVE LIE DETECTION}} \right) \\ &= P \left( \small{\text{DID NOT LIE}} \right) \cdot P \left( \small{\text{POSITIVE LIE DETECTION | DID NOT LIE}} \right) \\ &=\frac{80}{200} \cdot \frac{9}{80} \\ &= \frac{9}{200} = 4.5\% \end{align*} \]
We mention two items in reflection of this exercise. First, we notice how the table is used for a conditional probability such as \( P( \small{\text{POSITIVE }}\)\(\small{\text{LIE }}\)\(\small{\text{DETECTION | DID }}\)\(\small{\text{NOT }}\)\(\small{\text{LIE}})\) \(=\frac{9}{80}.\) The given condition of \(\small{\text{DID }}\)\(\small{\text{NOT }}\)\(\small{\text{LIE}}\) requires us to use only the information in the table row of "Participants Did Not Lie."
This exercise demonstrates that multiple approaches might be taken to a probability/proportion question, but there is only one correct answer. What is important is to have sound reasoning in our approach. After that, experience is the best teacher in finding the easiest approaches without making mistakes in our reasoning.
- We notice the event description can be re-worded as \( \small{\text{FIRST }}\)\(\small{\text{PARTICIPANT }}\)\(\small{\text{HAD }}\)\(\small{\text{POSITIVE }}\)\(\small{\text{LIE }}\)\(\small{\text{DETECTION}} \normalsize \text{ and } \small{\text{SECOND }}\)\(\small{\text{PARTICIPANT }}\)\(\small{\text{HAD }}\)\(\small{\text{POSITIVE }}\)\(\small{\text{LIE }}\)\(\small{\text{DETECTION}}\), so we use the general multiplication rule.\[ \nonumber \begin{align*} &\quad \, P \left( \small{\text{FIRST PARTICIPANT HAD POSITIVE LIE DETECTION}} \normalsize \text{ and } \small{\text{SECOND PARTICIPANT HAD POSITIVE LIE DETECTION}} \right) \\ &= P \left( \small{\text{FIRST HAD POSITIVE DETECTION}} \right) \cdot P \left( \small{\text{SECOND HAD POSITIVE DETECTION | FIRST HAD POSTIVE DETECTION}} \right) \\ &=\frac{94}{200} \cdot \frac{93}{199} \\ &= \frac{8,742}{39,800} = \frac{4371}{19,900} \approx 21.9648\% \end{align*} \]We notice how these sequential events are not independent since we cannot pick the same participant twice.
- We notice the event description appears to primarily be an "or" compound event. That is, our event of interest requires selection of one participant that \( \small{\text{LIED }}\)\(\small{\text{WITH }}\)\(\small{\text{NEGATIVE }}\)\(\small{\text{DETECTION}} \normalsize \text{ or } \small{\text{DID }}\)\(\small{\text{NOT }}\)\(\small{\text{LIE }}\)\(\small{\text{WITH }}\)\(\small{\text{POSITIVE }}\)\(\small{\text{LIE }}\)\(\small{\text{DETECTION}},\) so we use the general addition rule noting these two events are mutually exclusive.\[ \nonumber \begin{align*} &\quad \, P \left( \small{\text{LIED WITH NEGATIVE DETECTION}} \normalsize \text{ or } \small{\text{DID NOT LIE WITH POSITIVE LIE DETECTION}} \right) \\ &= P \left( \small{\text{LIED WITH NEGATIVE DETECTION}} \right) + P \left( \small{\text{DID NOT LIE WITH POSITIVE LIE DETECTION}} \right) \\ &=\frac{35}{200} + \frac{9}{200} \\ &= \frac{44}{200} = \frac{11}{50} = 22\% \end{align*} \]We observe that a test with a \(22\%\) inaccuracy rate is not very dependable. Also, we excluded the subtraction in the general addition rule for possible double counting of \( \small{\text{LIED }}\)\(\small{\text{WITH }}\)\(\small{\text{NEGATIVE }}\)\(\small{\text{DETECTION}} \normalsize \text{ and } \small{\text{DID }}\)\(\small{\text{NOT }}\)\(\small{\text{LIE }}\)\(\small{\text{WITH }}\)\(\small{\text{POSITIVE }}\)\(\small{\text{LIE }}\)\(\small{\text{DETECTION}} \) since we recognized the events were mutually exclusive.
- The event description is an "and" compound type as we want sequential selection of two participants with \( \small{\text{FIRST }}\)\(\small{\text{PARTICIPANT }}\)\(\small{\text{LIED }}\)\(\small{\text{WITH }}\)\(\small{\text{POSITIVE }}\)\(\small{\text{LIE }}\)\(\small{\text{DETECTION}} \normalsize \text{ and } \small{\text{SECOND }}\)\(\small{\text{PARTICIPANT }}\)\(\small{\text{DID }}\)\(\small{\text{NOT }}\)\(\small{\text{LIE }}\)\(\small{\text{WITH }}\)\(\small{\text{NEGATIVE }}\)\(\small{\text{LIE }}\)\(\small{\text{DETECTION}}\). We use the general multiplication rule.\[ \nonumber \begin{align*} &\quad \, P \left( 1^{st}\small{\text{ PARTICIPANT LIED WITH POSITIVE LIE DETECTION}} \normalsize \text{ and }2^{nd} \small{\text{ PARTICIPANT DID NOT LIE WITH NEGATIVE LIE DETECTION}} \right) \\ &= P \left(1^{st} \small{\text{ PARTICIPANT LIED WITH POSITIVE DETECTION}} \right) \cdot P \left( 2^{nd}\small{\text{ PARTICIPANT DID NOT LIE WITH NEGATIVE DETECTION |}}\normalsize 1^{st}\small{\text{ PARTICIPANT LIED WITH POSTIVE DETECTION}} \right) \\ &=\frac{85}{200} \cdot \frac{71}{199} \\ &= \frac{6,035}{39,800} = \frac{1,207}{7,960} = 15.1633\% \end{align*} \]It would not be unusual, since the probability value is above \(5\%.\)