5.12: Base Rates

Last updated
Save as PDF

Page ID: 2368

\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

\( \newcommand{\dsum}{\displaystyle\sum\limits} \)

\( \newcommand{\dint}{\displaystyle\int\limits} \)

\( \newcommand{\dlim}{\displaystyle\lim\limits} \)

\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

\( \newcommand{\Span}{\mathrm{span}}\)

\( \newcommand{\id}{\mathrm{id}}\)

\( \newcommand{\Span}{\mathrm{span}}\)

\( \newcommand{\kernel}{\mathrm{null}\,}\)

\( \newcommand{\range}{\mathrm{range}\,}\)

\( \newcommand{\RealPart}{\mathrm{Re}}\)

\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

\( \newcommand{\Argument}{\mathrm{Arg}}\)

\( \newcommand{\norm}[1]{\| #1 \|}\)

\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)

\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)

\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vectorC}[1]{\textbf{#1}} \)

\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\(\newcommand{\longvect}{\overrightarrow}\)

\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

Learning Objectives

Compute the probability of a condition from hits, false alarms, and base rates using a tree diagram
Compute the probability of a condition from hits, false alarms, and base rates using Bayes' Theorem

Suppose that at your regular physical exam you test positive for Disease \(X\). Although Disease \(X\) has only mild symptoms, you are concerned and ask your doctor about the accuracy of the test. It turns out that the test is \(95\%\) accurate. It would appear that the probability that you have Disease \(X\) is therefore \(0.95\). However, the situation is not that simple.

For one thing, more information about the accuracy of the test is needed because there are two kinds of errors the test can make: misses and false positives. If you actually have Disease \(X\) and the test failed to detect it, that would be a miss. If you did not have Disease \(X\) and the test indicated you did, that would be a false positive. The miss and false positive rates are not necessarily the same. For example, suppose that the test accurately indicates the disease in \(99\%\) of the people who have it and accurately indicates no disease in \(91\%\) of the people who do not have it. In other words, the test has a miss rate of \(0.01\) and a false positive rate of \(0.09\). This might lead you to revise your judgment and conclude that your chance of having the disease is \(0.91\). This would not be correct since the probability depends on the proportion of people having the disease. This proportion is called the base rate.

Assume that Disease \(X\) is a rare disease, and only \(2\%\) of people in your situation have it. How does that affect the probability that you have it? Or, more generally, what is the probability that someone who tests positive actually has the disease? Let's consider what would happen if one million people were tested. Out of these one million people, \(2\%\) or \(20,000\) people would have the disease. Of these \(20,000\) with the disease, the test would accurately detect it in \(99\%\) of them. This means that \(19,800\) cases would be accurately identified. Now let's consider the \(98\%\) of the one million people (\(980,000\)) who do not have the disease. Since the false positive rate is \(0.09\), \(9\%\) of these \(980,000\) people will test positive for the disease. This is a total of \(88,200\) people incorrectly diagnosed.

To sum up, \(19,800\) people who tested positive would actually have the disease and \(88,200\) people who tested positive would not have the disease. This means that of all those who tested positive, only

\[\dfrac{19,800}{19,800 + 88,200} = 0.1833\]

of them would actually have the disease. So the probability that you have the disease is not \(0.95\), or \(0.91\), but only \(0.1833\).

These results are summarized in Table \(\PageIndex{1}\). The numbers of people diagnosed with the disease are shown in red. Of the one million people tested, the test was correct for \(891,800\) of those without the disease and for \(19,800\) with the disease; the test was correct \(91\%\) of the time. However, if you look only at the people testing positive (shown in red), only \(19,800 (0.1833)\) of the \(88,200 + 19,800 = 108,000\) testing positive actually have the disease.

Table \(\PageIndex{1}\): Diagnosing Disease \(X\)
True Condition
No Disease 980,000		Disease 20,000
Test Result		Test Result
Positive 88,200	Negative 891,800	Positive 19,800	Negative 200

Bayes' Theorem

This same result can be obtained using Bayes' theorem. Bayes' theorem considers both the prior probability of an event and the diagnostic value of a test to determine the posterior probability of the event. For the current example, the event is that you have Disease \(X\). Let's call this Event \(D\). Since only \(2\%\) of people in your situation have Disease \(X\), the prior probability of Event \(D\) is \(0.02\). Or, more formally, \(P(D) = 0.02\). If \(P(D')\) represents the probability that Event \(D\) is false, then \(P(D') = 1 - P(D) = 0.98\).

To define the diagnostic value of the test, we need to define another event: that you test positive for Disease \(X\). Let's call this Event \(T\). The diagnostic value of the test depends on the probability you will test positive given that you actually have the disease, written as \(P(T|D)\), and the probability you test positive given that you do not have the disease, written as \(P(T|D')\). Bayes' theorem shown below allows you to calculate \(P(D|T)\), the probability that you have the disease given that you test positive for it.

\[P(D|T)=\frac{P(T|D)P(D)}{P(T|D)P(D)+P(T|D')P(D')}\]

The various terms are:

\(P(T|D) = 0.99\)
\(P(T|D') = 0.09\)
\(P(D) = 0.02\)
\(P(D') = 0.98\)

Therefore,

\[P(D|T)=\frac{(0.99)(0.02)}{(0.99)(0.02)+(0.09)(0.98)}=0.1833\]

which is the same value computed previously.

Search

Text Color

Text Size

Margin Size

Font Type