6.5: Conclusions (1)
- Page ID
- 48856
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)When we estimate a population parameter or conduct a hypothesis test, our last step is to state a conclusion in context. In this section, we will focus on conclusions of a hypothesis test for a population proportion.
Example 1
A Gallup poll11 indicates that LGBT (lesbian, gay, bisexual, or transgender) identification has increased significantly for Hispanic adults between 2020 and 2022. In 2020, 8.4% of Hispanic adults identified as LGBT. In a random sample of 500 Hispanic adults, 55 identified as LGBT. Test the claim at a 5% level of significance.
Solution:
Step 1. Let \(p\) represent the proportion of Hispanic adults that identify as LGBT in 2022.
\(H_0: p=0.084\)
\(H_a: p>0.084\)
Right-tailed test because the inequality in the alternative hypothesis is greater than.
Step 2. The number of expected successes in the sample is 500(0.084)=42. The number of expected failures in the sample is 500-42=458. Both are greater than or equal to 10 so the sampling distribution of sample proportions is approximately normal.
\(\hat{p}=\dfrac{55}{500}=0.11\)
Step 3. \(Z=\dfrac{0.11-0.084}{\sqrt{\dfrac{0.084+(1-0.084)}{500}}} \approx 2.10\)
P-value \(=0.0179\)
Images are created with the graphing calculator, used with permission from Desmos Studio PBC.
Step 4. \(\text{P-value }=0.0179 \leq 0.05=5 \%=\alpha\)
We reject the null hypothesis
We support the alternative hypothesis
There is evidence to support the claim that the proportion of all Hispanic adults who identify as LGBT has increased since 2020.
Errors in Hypothesis Tests
In the above example, we rejected the null hypothesis, meaning we were convinced by the data that the null hypothesis is false. This decision is based on sample data and sometimes, there is a small possibility that we made the wrong decision.
Even if we make no errors in the hypothesis testing process, our conclusion might still be wrong. At the conclusion of a test, we cannot know if we have made an error because we don’t know what is actually true. This is why we conduct hypothesis tests, because it is difficult to know the true values of population parameters due to the large size of populations.
There are two possible conclusions to a hypothesis test, which means there are two possible errors:
Type I. We reject the null hypothesis in support of the alternative hypothesis.
- If we made the wrong decision here, what is actually true?
We decide that the null hypothesis is ____________, when actually, it is _____________.
Type II. We fail to reject the null hypothesis in support of the alternative hypothesis.
- If we made the wrong decision here, what is actually true?
We decide that the null hypothesis is ____________, when actually, it is _____________.
Notice that Type I and II errors are exclusively about the null hypothesis.
The Level of Significance
The level of significance, , is related to the likelihood of these errors. Consider cases where the null hypothesis is true. When \(\alpha\)=5%, we reject a null hypothesis for sample data that occur less than 5% of the time by random chance. Thus, when the null hypothesis is true, we reject it 5% of the time, committing a Type I error. Note, this does not mean that a Type I error occurs in 5% of all hypothesis tests because a Type 1 error can only occur if the null hypothesis is true. This probability is conditional in nature.
The level of significance is the probability of rejecting the null hypothesis given that the null hypothesis is true.
\[\alpha=P\left(\text { reject } H_0 \mid H_0 \text { is true }\right)\nonumber\]
Lower significance levels indicate that you require stronger evidence to reject the null hypothesis.
You try!
- In example 1, \(p\) represented the proportion of Hispanic adults that identify as LGBT in 2022. The null hypothesis was that the proportion of Hispanic adults who identify as LGBT in 2022 is \(8.4 \%\left(H_0: p=0.084\right)\).
- Based on the conclusion of example 1, what type of error might have occurred? Explain.
- Describe Type I error in context.
- Describe Type II error in context.
- Based on the conclusion of example 1, what type of error might have occurred? Explain.
Example 2
According to a Gallup poll12, in 2021, 77% of U.S. adults were generally dissatisfied with the total cost of healthcare in the country. A researcher wants to know if the dissatisfaction rate is different this year. They randomly sampled 1200 U.S. adults and find that the sample proportion was \(\hat{p}=0.78\), so 78% of the adults in the sample were dissatisfied with the total cost of healthcare in the US. This proportion is slightly different from 77%, but with a P-value of around 0.21 (from the Z-score of Z=0.82), the difference is not statistically significant.
- The null hypothesis in this example is \(H_0: p=0.77\), and the alternative hypothesis is \(H_a: p \neq 0.77\).
- Based on the conclusion of example 2, what type of error might have occurred? Explain. Describe the error in context.
- What if we had observed the same difference (\(\hat{p}=0.78\) and \(p=0.77\)) from a sample of 10,000 U.S. adults? Compute the Z-score for the revised sample size.
\(Z=\dfrac{\hat{p}-p}{\sqrt{\dfrac{p(1-p)}{n}}}=\dfrac{\underline{ \ \ \ \ \ \ \ }-\underline{ \ \ \ \ \ \ \ }}{\sqrt{\frac{\underline{ \ \ \ \ \ \ \ }(1-\underline{ \ \ \ \ \ \ \ })}{\underline{ \ \ \ \ \ \ \ }} }}\)
- Use https://www.desmos.com/calculator to find the P-value for this two-tailed test.
- Is the sample proportion statistically significant with this new sample size (using a 5% level of significance)?
- In one scenario above, the difference between the sample statistic and the population parameter is statistically significant, but in another, it is not. Statistical significance is different from the sort of significance we assign to events through our own experiences and values. Do you feel that the difference between the proportion of U.S. adults in 2021 and the proportion of U.S. adults in the 2022 sample is significant in a real-world or practical sense? Explain.
- Based on the conclusion of example 2, what type of error might have occurred? Explain. Describe the error in context.
Statistical and Practical Significance
With very large samples, even extremely small differences can be statistically significant. With this in mind, remember that statistical significance is different from practical significance. Statistical significance is measured with probability. Practical significance is measured through our own system of personal values and is therefore difficult to measure.
Reference
11Values based on summary statistics given in Jeffrey M. Jones, “Growing LGBT ID Seen Across Major U.S. Racial, Ethnic Groups,” Gallup.com, June 8, 2022, accessed June 21, 2022, https://news.gallup.com/poll/393464/growing-lgbt-seen-across-major-racial-ethnic-groups.aspx
12Values based on summary statistics given in Gallup.com, accessed June 21, 2022, https://news.gallup.com/poll/4708/Healthcare-System.aspx