7.4: Hypothesis Tests for a Single Population Mean
- Page ID
- 49002
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)
In previous lessons, we have learned that there are two fundamental forms of statistical inference: confidence intervals and hypothesis tests. In the last section, we used confidence intervals to estimate a single population mean. We will now apply the four step hypothesis testing process to test hypotheses about the value of a single population mean. To explore this concept, we will examine examples of water contamination across the US.
The Flint Water Crisis
The water crisis in Flint, Michigan, is an example of environmental injustice that took place beginning in 2014. The city decided to switch its drinking water supply from Detroit’s system to the water from the Flint River to save money. This switch was made despite inadequate treatment and testing of the water from the river. As a result, the community was deeply impacted. Their water supply turned yellow and began to smell and taste of sewage. Members of the community suffered from skin rashes and hair loss. Later, independent research would reveal that the contaminated water was contributing to dangerous increases of blood lead levels in the city’s youth, and the lead level in the water greatly exceeded the EPAs standard.
Eventually, in 2016, with the efforts of community activists and scientists, a federal judge sided with Flint residents and ordered clean bottled water to be delivered to citizens. The following year, the city was ordered to replace the city’s lead pipes with funding from the state and the community was provided with resources to support their health and well-being. However, the fight to bring safe access to the water supply is ongoing.
Read more about the water crisis in Flint, Michigan: https://www.nrdc.org/stories/flint-water-crisis-everything-you-need-know
Use Statistics to Defend Environmental Justice
To test whether the water was safe, the scientists likely conducted hypothesis tests. Now you will conduct a hypothesis test, like those scientists who fought for the citizens in Flint.
- A mean lead level of 15 parts per billion (ppb) is considered safe, while anything above 15 ppb is considered dangerous. You believe the Flint water is unsafe, so you take a random sample of 200 households. You find that the average lead level of the sample is 16.5 ppb, with a standard deviation of 8 ppb. Do we have reason to believe that the true mean lead level for the entire water supply is dangerous, at a significance level of 1%?
Step 1 Determine the hypothesesLet \(\mu\) represent the _____________ lead level for the entire water supply.
\(H_0:\mu=\underline{\ \ \ \ \ \ \ \ \ \ }\text{ ppb}\)
Select the appropriate alternative hypothesis:- \(H_a: \mu<15 \text{ ppb}\)
- \(H_a: \mu>15 \text{ ppb}\)
- \(H_a: \mu \neq 15 \text{ ppb}\)
Select the appropriate test, and explain why:- Left-tailed
- Right-tailed
- Two-tailed
Step 2 Collect sample dataThe sample mean, \(\bar{x}\), is __________ ppb.
The sample standard deviation, \(s\), is __________ ppb.
The sample size, \(n\), is _________ households.
There are __________ degrees of freedom.
Explain why the conditions of the Central Limit Theorem are met:Step 3 Assess the evidence
Which distribution will we use to find the P-value?
- The normal distribution because the population standard deviation is given.
- The student’s T-distribution because the population standard deviation is unknown.
The test statistic, rounded to three decimal places, is
\[T=\dfrac{\bar{x}-\mu}{\dfrac{s}{\sqrt{n}}}=\dfrac{\underline{\ \ \ \ \ \ \ \ \ \ }-\underline{\ \ \ \ \ \ \ \ \ \ }}{\dfrac{\underline{\ \ \ \ \ \ \ \ \ \ }}{\sqrt{\underline{\ \ \ \ \ \ \ \ \ \ }}}}=\underline{\ \ \ \ \ \ \ \ \ \ }\nonumber\]
Below is the graph of the T-distribution with _______ degrees of freedom. Label the T-statistic on the horizontal axis. Then shade the region that represents the P-value.
Images are created with the graphing calculator, used with permission from Desmos Studio PBC.
Use https://www.desmos.com/calculator to find the P-value:
- Enter tdist(199) in the first line.
- Check the box for Find Cumulative Probability (CDF)
- The minimum and maximum will default to \(-\infty\) and \(\infty\) respectively. Enter the T-score in the min or max so that the graph matches the graph above.
Step 4 State a conclusion in context
The level of significance is \(\alpha=\underline{\ \ \ \ \ \ \ \ \ \ }\). The P-value (rounded to three decimal places) is 0.004. Fill in the blank with \(\leq\) or \(>\):
\[0.004\ \underline{\ \ \ \ \ \ \ \ \ \ }\ 0.01\nonumber\]
Defend the citizens of Flint:The evidence supports the claim that the true _____________ lead level for the entire water supply in Flint is ___________________ _______________________ 15 ppb. Therefore, by the current standards, the lead level in Flint’s water supply is dangerously high.
- \(H_a: \mu<15 \text{ ppb}\)
Summary of Hypothesis Testing Process for a Single Population Mean
Step 1: Determine the Hypotheses
In order to test a claim about a population parameter, we create two opposing hypotheses. We call these the null hypothesis, \(H_0\), and the alternative hypothesis, \(H_a\). Let \(\mu\) represent a given population mean.
The Null Hypothesis
In every hypothesis test, we assume that the null hypothesis is true. The null hypothesis is always a statement of equality and therefore, should always contain an equal symbol (=). When a test involves a single population mean, the null hypothesis will be
\[H_0: \mu=value\nonumber\]
The Alternative Hypothesis
The alternative hypothesis is a claim implied by the research question and is an inequality. The alternative hypothesis states that population mean is greater than (>), less than (<), or not equal (≠) to the assumed value in the null hypothesis.
When a test involves a single population mean, alternative hypothesis will be one of the following:
\[\begin{aligned}
& H_a: \mu>\text { value } \\
& H_a: \mu<\text { value } \\
& H_a: \mu \neq \text { value }
\end{aligned}\]
Step 2: Collect Sample Data
During a hypothesis test, we work to know if a sample statistic is unusual or not. Therefore, we must think about probabilities from a sampling distribution.
In a previous lesson, we learned about the sampling distribution of sample means. The Central Limit Theorem says that a sampling distribution of sample means is approximately normal if either the sample size, n, is greater than 30 or sampling was performed from a normally distributed population. In the second step of a hypothesis test, we verify that the sampling distribution is approximately normal and we identify or compute any sample statistics.
Step 3: Assess the Evidence
This step is all about probability. Since the sampling distribution is approximately normal (as determined in step 2), and the population standard deviation is likely unknown, we can compute a T-score and use the student’s T-distribution to find probabilities. The sampling distribution of sample means has mean
\[\mu_\bar{x}=\mu\nonumber\]
and standard error
\[\sigma_{\bar{x}}=\frac{s}{\sqrt{n}}\nonumber\]
where \(\mu\) is the assumed population mean, s is the sample standard deviation, and n is the sample size. The test statistic is
\[T=\frac{x-\mu}{\sigma} \text { which translates to } T=\frac{\bar{x}-\mu_{\bar{x}}}{\sigma_{\bar{x}}}=\frac{\bar{x}-\mu}{\frac{s}{\sqrt{n}}}\nonumber\]
when looking at the sampling distribution of sample means.
- When the alternative hypothesis is \(H_a: \mu>\text { value}\), we are conducting a right‐tailed test. The P‐value is the probability of observing a sample mean at least as extreme as the one we observed. In this case at least as extreme means “as high or higher”. The P‐value is the area to the right of the test statistic (T.S.).
Images are created with the graphing calculator, used with permission from Desmos Studio PBC.
- When the alternative hypothesis is \(H_a: \mu<\text { value }\), we are conducting a left‐tailed test. The P‐value is the probability of observing a sample mean at least as extreme as the one we observed. In this case at least as extreme means “as low or lower”. The P‐value is the area to the left of the test statistic (T.S.).
Images are created with the graphing calculator, used with permission from Desmos Studio PBC.
- When the alternative hypothesis is \(H_a: \mu \neq \text { value }\), we are conducting a two‐tailed test, and the P‐value is twice the area of either the tail to the right of a positive test statistic (T.S.), or the tail to the left of a negative test statistic (T.S.).
Images are created with the graphing calculator, used with permission from Desmos Studio PBC.
Step 4: State a Conclusion
Hypothesis tests are all about making decisions. We use the P-value to make a decision about the null and alternative hypotheses.
We compare our P-value to a level of significance. The level of significance, denoted \(\alpha\) (the greek letter “alpha”), is how unlikely a sample statistic needs to be to convince us about a claim. It is also the level of risk we accept in being wrong.
We have only two possible conclusions:
- If the P-value \(\leq \alpha\), we reject the null hypothesis and support the alternative hypothesis.
- If the P-value \(>\alpha\), we fail to reject the null hypothesis and cannot support the alternative hypothesis.
- This does not make the null hypothesis true—we cannot prove the null hypothesis because sample data cannot reveal the true value of the population mean.
- This does not make the null hypothesis true—we cannot prove the null hypothesis because sample data cannot reveal the true value of the population mean.
You try!
- A machine is designed to fill jars with 16 ounces of coffee. A consumer suspects that the machine is not filling the jars completely. They randomly sampled 12 jars shown below. Is there enough evidence to support the consumer’s claim at a 10% significance level? Assume that the population of volumes in each jar fill is approximately normal.
15
15.4
16.2
16.1
15.8
16.2
15.7
15.6
16
16.3
15.3
15.9
Step 1.Let \(\mu\) represent:
\(H_0\):
\(H_a\):
Which type of test and why?
Step 2.The sampling distribution is approximately normal because:
Go to https://www.desmos.com/calculator. Type \(C=\) into the first line and copy and paste the data.
\(\bar{x}=\operatorname{mean}(C)=\underline{\ \ \ \ \ \ \ \ \ \ }\text { ounces }\)
\(s=\operatorname{stdev}(C)=\underline{\ \ \ \ \ \ \ \ \ \ }\text { ounces }\)
\(n=\underline{\ \ \ \ \ \ \ \ \ \ }\text{ and }df=\underline{\ \ \ \ \ \ \ \ \ \ }\)
Step 3.\(T=\)
Images are created with the graphing calculator, used with permission from Desmos Studio PBC.
P-value is _________________________
Step 4.Compare the P-value and the significance level. Make a decision, and state a conclusion in context: