Skip to main content
Statistics LibreTexts

7.4: Reporting Results

  • Page ID
    50044

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    Reporting results for inferential statistics are much more complex than for descriptive statistics. Where descriptive statistics are usually reported in one sentence using one to two symbols at a time, inferential statistics include both descriptive statistics as a foundation and the results of inferential tests which require additional values and their related symbols. American Psychological Association (APA) format provides useful guidance to ensure that these are reported uniformly across studies by different researchers and statisticians. This helps to both make it easy for other researchers and statisticians to quickly find the information they are looking for, but also facilitates comparisons of results across studies and reduces the chances of misinterpretation caused by formatting differences.

    Structure for APA-Formatted Results

    In your APA write up for a one samples t-test you should state:

    1. Which test was used and the hypothesis which warranted its use.
    2. Whether the aforementioned hypothesis was supported or not. To do so properly, three components must be reported:
      1. The mean and standard deviation for the sample
      2. The test results as t(\(d f\)) = obtained value
      3. The significance as \(p\) < .05 if significant or \(p\) > .05 if not significant

    I will refer to component 2a as the descriptive statistics and components 2b and 2c together as the evidence string with the determination of significance in the following sections. Let’s take a detailed look at both parts of an APA write-up and the components of part 2 of the write-up in detail.

    Statement of Test Used and Hypothesis Tested

    Your job in creating a results paragraph is to tell your reader the story of those results in a clear and succinct manner. The hypothesis is the catalyst for all the work which was done; thus, the start of the story is the hypothesis and the specific inferential test that went with it. It can be tempting to jump right to the results, but that is like starting a movie at the climax and expecting someone to know what is going on and why it is happening. Therefore, the first sentence of a results section should state these two things together: the hypothesis and statistical test used. Though it is possible and permissible to use two sentences to report this information, there are two drawbacks to doing so: 1. You miss the opportunity to emphasize their connection and 2. You are using more words and punctuation than needed to get the job done well. Therefore, let’s report the hypothesis and test used for Data Set 7.1 together in a single sentence. Here is an example of how that can be done succinctly:

    A one sample t-test was used to test the hypothesis that the mean test score for the school sample would be different from the population mean of 58.00.

    Evidence String for t-Tests: Results and Significance

    When reporting the results of an inferential test, more than just the obtained value is needed. We must also provide two additional summaries relevant to the test: the degrees of freedom and the significance using something called a \(p\)-value. These details are used together in a prescribed format to create what I will refer to as the evidence string. The evidence string goes at the end of the sentence stating the main results and incudes, in this order: the symbol for the inferential test used, the degrees of freedom for the test inside parentheses, the obtained value, and the significance reported using a \(p\)-value.

    Anatomy of an Evidence String

    Symbol for the test

    Degrees of Freedom

    Obtained Value

    \(p\)-Value

    t

    (24)

    = 3.43,

    \(p\) < .05.

    Symbol for the test

    The start of the evidence string provides the symbol for formula used. When a t-test is used, a lowercase letter “t” is written in italics to start the evidence string.

    Degrees of Freedom

    The degrees of freedom (\(d f\)) are reported inside parentheses. The \(d f\) summarize how many pieces of evidence (or data points) are able to either help support of refute a hypothesis. The \(d f\) is calculated as the amount of data (which is equal to the sample size) minus the number of constraints placed on those data when the one sample t-test formula is used. Thus, the \(d f\) can be thought of as an adjusted version of the sample size. For the one sample t-test there is one constraint so the \(d f\) is equal to \(n – 1\). Thus, the degrees of freedom also allow the reader to deduce the sample size used for the test (because \(d f\) is an adjusted version of the sample size). For a more detailed review of \(d f\) and t-tests, see the section titled “Deeper Dive: What are Degrees of Freedom?” later in this chapter.

    Obtained value

    The main result is known as the test statistic or obtained value. It is what is yielded by using a formula to analyze the data. The obtained value is the result of step 4 of hypothesis testing. It should be rounded and shown to the hundredths place.

    p-values

    \(p\)-values summarize the chance of a Type I Error when concluding that a research hypothesis is supported based on both the evidence (i.e. the data and corresponding obtained value) and the alpha level chosen. The \(p\)-value tells two things: which alpha level was used and whether the result was significant when using that alpha level. Think of the letter p standing for “probability of a Type I Error.” A Type I Error refers to when a hypothesis is supported based on sample evidence but is actually incorrect (see Chapter 6 for a detailed review of Type I Error). When a result is significant the following symbols are used: \(p\) < .05. Let’s translate each symbol and value then put it together: \(p\) stands for “the probability of a Type I Error”; < translates to “less than”; .05 is the 5% alpha level written in decimal format. Taken together, \(p\) < .05 can be read as “The probability of a Type I Error is less than 5%.” This is just another way of saying that a result is significant. This set of symbols gets attached to the end of any sentence of a results section which declares that a hypothesis was significantly supported. Conversely, when a result is not significant, the symbols are changed and reported as \(p\) > .05; this version translates to “The probability of a Type I Error is greater than 5%.”

    These symbols are used anytime a test for significance (i.e. an inferential statistic) is performed. The fact that these symbols are always used and that they are reported at the end of sentences makes it very easy for researchers and statisticians to scan through a results section and identify which results were or were not significant and, thus, which hypotheses were or were not supported. This is especially helpful when a study includes many hypotheses and tests, which can sometimes span a full page or more. Luckily for us, this is an introductory book, so we get to focus on just one hypothesis and its results at a time. This allows us to practice the use of symbols and where they would be located in a results sentence or paragraph one at a time until they are familiar and comfortable. After familiarity and comfort comes utility and ease and, for the focused statistician, potentially a rewarding future career in research with a nice paycheck.

    Reporting \(p\)-Values

    When a result is significant, the following symbols are used: \(p\) < .05.

    When a result is not significant, the following symbols are used: \(p\) > .05.

    The symbols are stating whether, based on the strength of the evidence, the probability of a Type I Error is less than (significant) or greater than (not significant) the generally accepted threshold of 5%.

    Deeper Dive: What are Degrees of Freedom?

    Degrees of Freedom (\(d f\)) tell how much information you have that is free to vary. We are measuring things that can vary (i.e. variables) and are looking for specific patterns in how they vary. We use the information from a sample to estimate what is true in the population. Therefore, each case or person provides another piece of information we can use to estimate a parameter in the population.

    When we estimate based on these variables, however, the variables are no longer fully independent. Each time we estimate a parameter using our data, the data are constrained. Each test that we use requires us to make at least one estimate about a population parameter using a sample statistic; each time we estimate a parameter we lose one degree of freedom.

    When the constraints are groups (which is the case for t-tests), the number of independent groups is summarized with the symbol \(k\). Degrees of freedom for each t-test are calculated as the total sample size minus the number of groups. This can be summarized as \(n – k\). However, because each t-test has a different but stable number of groups, the \(d f\) formula is often written with the specifics for \(k\) indicated as follows:

    In the one sample t-test, there is only one sample so \(k\) = 1. Thus, our df for the one sample t-tests is calculated as \(n – 1\).

    In the dependent samples t-test, there is only one sample being tested two times so \(k\) = 1. Thus, our \(d f\) for the dependent samples t-test is calculated as \(n – 1\).

    In the independent samples t-test we estimate two parameters: the mean for group 1 and the mean for group 2. Thus, our \(d f\) is calculated as \(n – 2\). You may also see this specified by group so that \(d f\) is calculated as \(n_1-1+n_2-1\). Each of these will yield the same number.

    Reading Review 7.2

    1. Which two things should be stated first in a results summary paragraph?
    2. What are the four parts of the evidence string and what does each one report?
    3. Which set of symbols should be used at the end of a sentence to indicate that a result was not significant?
    4. What does reporting Cohen’s \(d\) add to a results summary?

    This page titled 7.4: Reporting Results is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by .