Skip to main content
Statistics LibreTexts

8.3: Inference for a Difference in Two Population Means

  • Page ID
    49046
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    Constructing a Confidence Interval for the Difference in Two Population Means

    An experiment conducted by a leadership consulting firm, Nextion, was used to investigate the unconscious biases in the workplace at a law firm.18 Researchers wrote a memo with 22 errors. The errors consisted of minor spelling and grammar errors, major technical writing errors, errors in fact, and errors in analysis of the facts. Fifty-three partners from 22 law firms received copies of the memo. 24 were told the memo was written by an African-American man named Thomas Meyer, and the remaining 29 were told the writer was a Caucasian man named Thomas Meyer. The partners gave the memo from the white Thomas Meyer an average rating of 4.1 out of 5 with a standard deviation of 0.8, while they gave the memo from the black Thomas Meyer an average rating of 3.2 out of 5 and a standard deviation of 1.12. Let’s assume that scores for each memo are approximately normal.

    1. State \(n_1, n_2, \bar{x}_1, \bar{x}_2, s_1, s_2, \text { and } \bar{x}_1-\bar{x}_2\).

       

       

       

       

       

       

       

       

    2. If the partners do have unconscious bias, which sample mean do you expect to be larger?

       

       

       

       

       

       

       

       

    • One-sample situations: you compare a statistic in _________ population against a ______________ about that population.
      • Example: Is the mean lead level in the Flint, Michigan water supply higher than the acceptable safe level of 15 parts per billion (ppb)?
         
    • Two-sample situations: you measure the ____________ __________________________ in ___________________ _____________________ and see if they are significantly different.
      • Example: Are black men assessed more harshly than white men, on average, by partners at law firms?
         

    Let \(\mu_1\) represent the mean score given to ___________ white Thomas Meyer memos by partners at law firms. Let \(\mu_2\) represent the mean score given to ___________ black Thomas Meyer memos by partners at law firms.

     

     

     

     

    Five Step Process for Constructing a Confidence Interval for the Difference in Two Population Means

    The process we use to build confidence intervals has not changed. The following is a list of familiar steps with the appropriate formulas to use for this situation.

    1. Verify that the sampling distribution is approximately normal by checking that each sample size is greater than 30 OR each sample came from a normal population.
    2. Find the critical value from the Student’s T-distribution that corresponds to the provided confidence level.
    3. Compute the margin of error \(E=T_c \cdot \sqrt{\dfrac{s_1^2}{n_1}+\dfrac{s_2^2}{n_2}}\)
    4. Compute the lower and upper limits of the interval, and write the interval in interval notation. \(\left(\bar{x}_1-\bar{x}_2-E, \bar{x}_1-\bar{x}_2+E\right)\)
    5. Write a conclusion in context. Interpret the interval.
       

    Apply this process to the following example: Researchers wrote a memo with 22 errors. The errors consisted of minor spelling and grammar errors, major technical writing errors, errors in fact, and errors in analysis of the facts. Fifty-three partners from 22 law firms received copies of the memo. 24 were told the memo was written by an African-American man named Thomas Meyer, and the remaining 29 were told the writer was a Caucasian man named Thomas Meyer. The partners gave the memo from the white Thomas Meyer an average rating of 4.1 out of 5 with a standard deviation of 0.8, while they gave the memo from the black Thomas Meyer an average rating of 3.2 out of 5 and a standard deviation of 1.12. Let’s assume that scores for each memo are approximately normal. We will construct a 95% confidence interval for the true mean difference in memo scores for memos written by white and black men at law firms. Use 51 degrees of freedom.

    1. Explain why the sampling distribution of differences in sample means is approximately normal. 

       

       

       

       

    2. The confidence level is _____% so the critical value (rounded to three decimal places) is \[T_c=\operatorname{tdist}(\underline{\ \ \ \ \ \ \ \ \ \ }) \cdot \operatorname{inversecdf}(\underline{\ \ \ \ \ \ \ \ \ \ })=\underline{\ \ \ \ \ \ \ \ \ \ }\].

       
    3. \(E=T_c \cdot \sqrt{\dfrac{s_1^2}{n_1}+\dfrac{s_2^2}{n_2}}=\underline{\ \ \ \ \ \ \ \ \ \ }\cdot\sqrt{\dfrac{(\underline{\ \ \ \ \ \ \ \ \ \ })^2}{\underline{\ \ \ \ \ \ \ \ \ \ }}+\dfrac{(\underline{\ \ \ \ \ \ \ \ \ \ })^2}{\underline{\ \ \ \ \ \ \ \ \ \ }}}=\underline{\ \ \ \ \ \ \ \ \ \ }\)
       
    4. \(\left(\bar{x}_1-\bar{x}_2-E, \bar{x}_1-\bar{x}_2+E\right)=\)

       
    5. We are _____% confident that the true _____________________ _______________________ in memo scores for memos written by white and black men at law firms is between ______ points out of 5 and ______ points out of 5.


      Does the interval suggest there is a difference? Why or why not?




       

    Testing a Claim about the Difference in Two Population Means

    Four Step Process for Testing a Claim about the Difference between Two Population Means

    The process we use to test a claim about a population parameter has not changed. The following is a list of familiar steps with the appropriate formulas to use for this situation.

    1. State the hypotheses. Define \(\mu_1\) and \(\mu_2\). The null hypothesis will always be \(H_0: \mu_1=\mu_2\). The alternative will be one of the following \(H_a: \mu_1<\mu_2, H_a: \mu_1>\mu_2 \text {, or } H_a: \mu_1 \neq \mu_2\) based on the statement of the claim in the problem.
       
    2. Verify that the sampling distribution is approximately normal by checking that each sample size is greater than 30 OR each sample came from a normal population.
       
    3. Compute the T-score: \(T=\dfrac{\bar{x}_1-\bar{x}_2}{\sqrt{\dfrac{s_1^2}{n_1}+\dfrac{s_2^2}{n_2}}}\). The degrees of freedom will be provided. Use the Student’s T-distribution to find the P-value (the probability of observing a sample difference as extreme or more extreme than the calculated sample difference just by chance).
       
    4. Make a decision about the null and alternative hypotheses and state a conclusion in context.

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

       

    Apply this process to the following example: Researchers wrote a memo with 22 errors. The errors consisted of minor spelling and grammar errors, major technical writing errors, errors in fact, and errors in analysis of the facts. Fifty-three partners from 22 law firms received copies of the memo. 24 were told the memo was written by an African-American man named Thomas Meyer, and the remaining 29 were told the writer was a Caucasian man named Thomas Meyer. The partners gave the memo from the white Thomas Meyer an average rating of 4.1 out of 5 with a standard deviation of 0.8, while they gave the memo from the black Thomas Meyer an average rating of 3.2 out of 5 and a standard deviation of 1.12. Let’s assume that scores for each memo are approximately normal. Do the results give convincing statistical evidence that partners at law firms assess black men more harshly than white men on average? Use a 5% level of significance and 51 degrees of freedom.

    1. Let \(\mu_1\) represent:


      Let \(\mu_2\) represent:


      \(H_0:\)


      \(H_a:\)


      Use a _____________-tailed test because:
       

    2. Explain why the sampling distribution of differences in sample means is approximately normal.

       

       

       

       

    3. \(T=\dfrac{\bar{x}_1-\bar{x}_2}{\sqrt{\dfrac{s_1^2}{n_1}+\dfrac{s_2^2}{n_2}}}=\dfrac{\underline{\ \ \ \ \ \ \ \ \ \ }-\underline{\ \ \ \ \ \ \ \ \ \ }}{\sqrt{\dfrac{( \underline{\ \ \ \ \ \ \ \ \ \ } )^2}{ \underline{\ \ \ \ \ \ \ \ \ \ } }+\dfrac{( \underline{\ \ \ \ \ \ \ \ \ \ } )^2}{ \underline{\ \ \ \ \ \ \ \ \ \ } }}}=\)   
       


      P-value is _____________.
       

    4. We _______________________ the null hypothesis, we _____________________ the alternative hypothesis.

      Conclusion in context: 

       

       

       

       

    Reference

    18Debra Cassens Weiss. "Partners in study gave legal memo a lower rating when told author wasn’t white" ABA Journal, April 21, 2014. Accessed July 5, 2022. https://www.abajournal.com/news/article/hypothetical_legal_memo_demonstrates_unconscious_biases


    This page titled 8.3: Inference for a Difference in Two Population Means is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by Hannah Seidler-Wright.

    • Was this article helpful?