Skip to main content
Statistics LibreTexts

8.2: Distributions of Differences

  • Page ID
    49044
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    In this lesson, we will begin to consider statistical methods for comparing independent samples from two populations or experimental treatments. These methods will allow us to compare population means or proportions from two independent groups. To make inferences about differences, we need to use the sampling distribution of differences.

    In a previous lesson, we considered the weights of black bears. Black bears weights tend to differ by sex. Female black bears weigh 175 pounds on average, with a standard deviation of 50 pounds, whereas, male black bears weigh 400 pounds on average, with a standard deviation of 40 pounds. Both of the populations are approximately normally distributed.

    Let’s say we are concerned with differences in male and female black bears in a small region in Colorado where the population has dwindled. The population contains only 4 male black bears and 5 female black bears.

    Below is a histogram of the 4 male black bear weights. It is very crudely bell-shaped. The mean weight is \(\mu_1=418.25\ \mathrm{lbs}\) and the standard deviation is \(\sigma_1=15.943\ \mathrm{lbs}\). Recall, the variance is the square of the standard deviation. In this case, the variance is \(\sigma_1^2=254.1875\).

    AD_4nXdXWx9vTfPRDtAODcCCkmMawxHm2NztWs4DcHx1Sme3ngmg-2PZ8W2TsUiVON2qmafpNFUk3FuCuKxRqSlbQPYoQ9fImo1vQ9-KFaPzJYQkWddA-QdP5_6zXT3h8PcgUIJsgIQwyYxhjroGTW0viBusIgokeyi1XJeTDlU718V25snr3PRQ

    Images are created with the graphing calculator, used with permission from Desmos Studio PBC.

    Next is a histogram of the 5 female black bear weights. It is also very crudely bell-shaped. The mean weight is \(\mu_2=167.6\ \mathrm{lbs}\) and the standard deviation is \(\sigma_2=36.258\ \mathrm{lbs}\). Recall, the variance is the square of the standard deviation. In this case, the variance is \(\sigma_2^2=1314.64\).

    AD_4nXcFRSz9l9QUWSS-mSkgWKdqNDNE9PLRji2xA6Q72M4XG9WrY-zkwdabWkixCNnSuiGZ8FzF6biwKPQBVXc5JJ2RRUUC3PI9i1zLNuWrlGDOZ0WGBBNM4q8rrfzjJkgVxzteelg8pQVykH1NTeCQCgXXZRc5keyi1XJeTDlU718V25snr3PRQ

    Images are created with the graphing calculator, used with permission from Desmos Studio PBC.

     

     

     

     

     

     

    We are interested in the differences in male and female black bear weights for this small population. Let m represent the male black bear weights, and f represent the female black bear weights, then \(m-f\) represents the difference in male and female black bear weights. Weight differences are listed for every combination of male and female bear in the table below.

    \(f\) values (female black bear weights)

    \(m\) values (male black bear weights)

     

    110

    164

    175

    165

    224

    418

    308

    254

    243

    253

    194

    420

    310

    256

    245

    255

    196

    395

    285

    231

    220

    230

    171

    440

    330

    276

    265

    275

    216

    The frequency distribution of weight differences in male and female black bears in the region is given below. It has mean 250.65 lbs, standard deviation 39.608 lbs, and variance 1568.8275.

    AD_4nXcTgxKq0ijFIjRNTvwBqz0YKutZsZwxxnEhi38-b2W7Nfs1Oe5bFZo3aKF-HnznB9wcggEdTWZk6hV3ABrgSdTpyzCkWLKtyvlAiWqzkuDulDrjTOefnDOUSAa3OW5HLRwIxi7pCg1AuOX5NcUV5ndKTSCzkeyi1XJeTDlU718V25snr3PRQ

    Images are created with the graphing calculator, used with permission from Desmos Studio PBC.

    1. Describe the shape of the distribution of differences.

       

       

       

       

    2. Describe the relationship between the mean difference and the means of male and female black bear weights \(\left(\mu_1 \text { and } \mu_2\right)\).

       

       

       

       

    3. Describe the relationship between the variance of differences and the variances of male and female black bear weights \(\left(\sigma_1{ }^2 \text { and } \sigma_2{ }^2\right)\).

       

       

       

       

    Summary

    • If two independent quantitative variables are approximately normal, then differences between those variables will be approximately normal as well.
    • The mean of all differences between those variables is the difference of their means, respectively.
    • The variance of all differences between those variables is the sum of their variances, respectively. The standard deviation of differences is the square root of the variance.

    Distribution of Differences Between Population Means

    • To understand this sampling distribution for the difference in sample means, we just need to think about the sampling distribution for each population. Recall, the Central Limit Theorem states that the sample size must be greater than 30 or the sample must come from a normal population. If we have two samples, then both sample sizes must be greater than 30 or both samples must come from normal populations to ensure that the distribution of differences in sample means is approximately normal.

    Distribution of Differences Between Population Proportions

    • To understand the sampling distribution of the difference in sample proportions, we just need to think about the sampling distribution for each population. Recall, the Central Limit Theorem states that there must be at least 10 expected successes and failures in a sample to ensure its sampling distribution is approximately normal. If we have two samples, then there must be at least 10 expected successes and failures in each sample to ensure that the distribution of differences in sample proportions is approximately normal.

    Consider the sampling distribution of sample proportions. It has mean \(\mu_\hat{p}=p\), the population proportion, standard error \(\sigma_{\hat{p}}=\sqrt{\dfrac{p(1-p)}{n}}\), and variance \(\sigma_{\hat{p}}{ }^2=\left(\sqrt{\dfrac{p(1-p)}{n}}\right)^2=\dfrac{p(1-p)}{n}\).

    • Extending this for two populations, with population proportions \(p_1\) and \(p_2\), and sample sizes \(n_1\) and \(n_2\) respectively, we find that the mean of the sampling distribution of differences is \(\mu_{\hat{p}_1-\hat{p}_2}=p_1-p_2\).
    • The sampling distribution of differences has variance \(\sigma_{\hat{p}_1-\hat{p}_2}^2=\dfrac{p_1\left(1-p_1\right)}{n_1}+\dfrac{p_2\left(1-p_2\right)}{n_2}\) which implies that the standard error of the sampling distribution of differences is \(\sigma_{\hat{p}_1-\hat{p}_2}=\sqrt{\dfrac{p_1\left(1-p_1\right)}{n_1}+\dfrac{p_2\left(1-p_2\right)}{n_2}}\).

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       


    This page titled 8.2: Distributions of Differences is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by Hannah Seidler-Wright.

    • Was this article helpful?