Skip to main content
Statistics LibreTexts

2.3: The Value of Visual Summaries

  • Page ID
    49991

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    Visual summaries such as tables and graphs can reduce how overwhelming it is to look at all the raw data. Let’s use the variable salary for example. Take a moment to look at the data in the column for the variable Annual Salary in Data Set 2.1. You might have to look at the data several times to start to get a sense of how high, low, common, or rare the various salaries were. Though it is possible to get a sense of the variable and how scores are distributed from Data Set 2.1, it is harder than it needs to be. For comparison, take a look at Salary Reordered Data Set 2.1 below (see Table 2). You will likely find that it is easier to get a sense of the data when they are reordered for the focal variable; reordering the scores from highest to lowest reduces the effort it takes to understand the data. The amount of mental effort we exert to understand something is known as cognitive load. If you find it faster and easier to get a sense of the data when they are in order rather than in disorder, it is because the cognitive load has been reduced. Notice that when salary was reordered so that salaries were presented in order form highest at the top to lowest at the bottom, the data for the other variables were rearranged as well. Recall that each row represents a case. Therefore, when a value for one variable gets moved, the whole row moves with it. Compare the data for the case whose ID number is 1. They were the top row in Raw Data Set 2.1 but are the second row in Salary Ordered Data Set 2.1; despite this movement, however, all their data stayed the same. The data for this case still indicate the person is a UX researcher with 4 years of experience who makes $184,600 annually.

    Table 2 Salary Reordered Data Set 2.1

    ID Number

    Major

    Job

    Years of

    Experience

    Annual

    Salary

    Morale

    3

    Statistics

    Sr. Research Fellow

    7

    190,000

    8

    1

    Statistics

    UX Researcher

    4

    184,600

    7

    13

    Statistics

    Data Scientist

    7

    181,000

    8

    5

    Statistics

    Biostatistician

    5

    167,000

    4

    4

    Statistics

    Research Analyst

    4

    138,000

    6

    14

    Statistics

    UX Researcher

    6

    123,700

    10

    6

    Statistics

    Psychometrician

    6

    120,000

    3

    7

    Statistics

    Data Scientist

    5

    110,240

    6

    9

    Statistics

    Biostatistician

    3

    102,000

    6

    16

    Statistics

    UX Researcher

    1

    88,950

    5

    12

    Statistics

    Research Analyst

    2

    74,500

    9

    2

    Statistics

    Science Journalist

    6

    74,490

    5

    11

    Statistics

    Data Scientist

    4

    65,500

    5

    15

    Statistics

    Research Analyst

    5

    65,000

    4

    8

    Statistics

    Research Assistant

    5

    57,750

    7

    18

    Statistics

    Research Analyst

    2

    52,680

    2

    10

    Statistics

    Research Assistant

    3

    48,200

    3

    17

    Statistics

    Research Assistant

    1

    34,000

    7

    Reordering the data is especially helpful when trying to understand quantitative data because it allows the statistician to easily see the highest and lowest values. For example, when the data are reordered for annual salary, it is easy to quickly discern that the lowest salary was $34,000 and the highest was $190,000. That’s quite a large range. We could do the same by reordering data based on years of experience to quickly and easily see that the least experience in the data set was 1 year and the highest was 7 years (see Table 3).

    Table 3 Experience Reordered Data Set 2.1

    ID Number

    Major

    Job

    Years of

    Experience

    Annual

    Salary

    Morale

    3

    Statistics

    Sr. Research Fellow

    7

    190,000

    7

    13

    Statistics

    Data Scientist

    7

    181,000

    8

    14

    Statistics

    UX Researcher

    6

    123,700

    10

    6

    Statistics

    Psychometrician

    6

    120,000

    3

    2

    Statistics

    Science Journalist

    6

    74,490

    5

    5

    Statistics

    Biostatistician

    5

    167,000

    4

    7

    Statistics

    Data Scientist

    5

    110,240

    6

    15

    Statistics

    Research Analyst

    5

    65,000

    4

    8

    Statistics

    Research Assistant

    5

    57,750

    7

    1

    Statistics

    UX Researcher

    4

    184,600

    8

    4

    Statistics

    Research Analyst

    4

    138,000

    6

    11

    Statistics

    Data Scientist

    4

    65,500

    5

    9

    Statistics

    Biostatistician

    3

    102,000

    6

    10

    Statistics

    Research Assistant

    3

    48,200

    3

    12

    Statistics

    Research Analyst

    2

    74,500

    9

    18

    Statistics

    Research Analyst

    2

    52,680

    2

    16

    Statistics

    UX Researcher

    1

    88,950

    5

    17

    Statistics

    Research Assistant

    1

    34,000

    7

    Statistics can be complicated and sometimes feel difficult. However, we can use visual displays to ease the mental effort it takes to understand the data even further. Therefore, statisticians follow some basic strategies and rules for organizing, summarizing, and presenting data that are meant to reduce cognitive load, thereby making things easier to understand. So far we have seen the efficiency gained simply putting quantitative values in descending order. When statisticians are interested in summarizing how frequently various data occurred, however, reordering is not sufficient. Therefore, specific tables have been designed and used to make it easy to deduce the frequency with which various raw scores occurred.


    This page titled 2.3: The Value of Visual Summaries is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by .

    • Was this article helpful?