Skip to main content
Statistics LibreTexts

2.16: Dotplots (2 of 2)

  • Page ID
    14029
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    Learning Objectives

    • Describe the distribution of quantitative data using a dot plot.

    Now we will give more specific advice on how to describe the distribution of a quantitative variable.

    When we describe patterns in data, we use descriptions of shape, center, and spread. We also describe exceptions to the pattern. We call these exceptions outliers.

    Flow chart with three levels. The first level is "Graph the distribution of a quantitiative variable" which points to two different boxes on the second level, "Overall pattern" and "Deviations from the pattern". Overall pattern points to three options, "Shape", "Center", and "Spread." "Deviations from the pattern" points to one option, "Outliers."

    Shape: To describe the shape of a distribution, imagine sketching the outline of the data to emphasize the general trend.

    Two pairs of dotplots showing sugar content in various adult cereals. The first set of dot plots show the data without a line tracing the rise and fall of the dot plot distribution and the other set show the same data but with a line to illustrate the distribution. One data set shows the line gradually going up from 0 to 3 on the x axis and then gradually decreasing until it platues from the values 9 to 16. The result is an asymmetrical distribution with a longer tail on the right of the curve. The second set of data shows a platau from the values 70 to 76 and then a sharp increase at values 79 to 80, followed by a similarly sharp decrease with the line again plateauing from values 85 to 90. The result is a semi-symmetrical distribution.

    Example

    Some Common Descriptions of Shape Used to Categorize Distributions

    Right skewed: A cluster of data on the left with a tail of data tapering off to the right. A right-skewed distribution has a lot of data at lower variable values with smaller amounts of data at higher variable values.

    • The distribution of sugar in adult cereals is skewed to the right.
    Dotplot showing right-skewed distribution of sugar content in adult cereals

    Left skewed: A cluster of data on the right with a tail of data tapering off to the left. A left skewed distribution has a lot of data at higher variable values with smaller amounts of data at lower variable values.

    • The distribution of sugar in children’s cereals is skewed to the left.
    Dotplot showing left-skewed distribution of sugar content in children's cereals

    Symmetric with a central peak (also called bell-shaped): A central peak with a tail in both directions. A bell-shaped distribution has a lot of data in the center with smaller amounts of data tapering off in each direction.

    • The distribution of calories in children’s cereals is symmetric with a central peak. It is bell-shaped. The distribution of calories in adult cereals is also roughly bell-shaped.
    Dotplot showing bell-shaped distribution of calories in adult cereals

    Uniform: A rectangular shape, the same amount of data for each variable value.

    • Here is the last digit from 47 students’ telephone numbers. The distribution of the digits is roughly uniform.
    Dotplot showing uniform distribution of the last digit of 47 random telephone numbers

    Center and Spread

    To describe the pattern in a distribution of a quantitative variable, we describe more than the shape. We also describe the center and spread. Later in this module, we develop more precise ways to identify the center of a distribution and to measure the spread. For now, we discuss these concepts informally.

    When we describe a distribution of a quantitative variable, it is helpful to identify a typical value. We choose a single value of the variable to represent the entire group. This is one way to think about the center of the distribution.

    We also want to describe how much the data varies among individuals in the group. Variability is another word for spread. We describe the spread in two ways:

    • We look at the smallest value and the largest value to describe an overall range in the data.

    \mathrm{range}=\mathrm{largest}\text{}\mathrm{value}\text{}-\text{}\mathrm{smallest}\text{}\mathrm{value}

    • We also describe a range of typical values to represent common variable values for the group.

    Example

    Cereals

    Here we use shape, center, and spread to compare the distribution of sugar content in adult cereals and children’s cereals.

    Compare the shapes:

    Dotplots comparing the distribution of sugar content in adult and children's cereals. The graph showing adult sugar content is right-skewed, and the diagram showing children's sugar content is left-skewed

    The sugar content in adult cereals is skewed to the right. Many adult cereals have less than 8 grams of sugar in a serving. A smaller number of adult cereals contain high amounts of sugar. The sugar content for children’s cereals is skewed to the left. Many children’s cereals have more than 8 grams of sugar in a serving, with a smaller number of children’s cereals with low amounts of sugar.

    Comment: There is nothing special about the number 8. We chose 8 as a convenient reference point to describe the opposite trends in these two distributions.

    Compare the centers:

    A typical adult cereal has 3 grams of sugar in a serving. A typical children’s cereal has 12 grams of sugar in a serving.

    Comment: Here we looked at the most common value in each distribution. We develop more precise ways to describe the center of a distribution in the next section. For now, just choose a reasonable typical value to represent the group.

    Compare the spreads:

    Overall range: Adult cereals have 0 to 14 grams of sugar in a serving. Children’s cereals vary from 1 to 15 grams. So both types of cereal vary over a range of 14 grams.

    (Note: Overall range = highest value – lowest value. For adult cereals: 14 – 0 = 14. For children’s cereals: 15 – 1 = 14)

    Typical range: Typical adult cereals have between 0 and 6 grams of sugar in a serving, compared to 9 to 13 grams in typical children’s cereals.

    Comment: Here we looked at clumps in the data to identify a range of typical values. We develop more precise ways to describe the spread a distribution in the last two sections of this module.

    When comparing two distributions, we usually tie all of these ideas into one paragraph:

    In this sample, children’s cereals have more sugar per serving than adult cereals. A typical children’s cereal has 12 grams of sugar in a serving. It is not uncommon for children’s cereals to have 9 to 13 grams of sugar per serving, but it is unusual for a children’s cereal to have less than 8 grams of sugar. A typical adult cereal has 3 grams of sugar in a serving. It is not uncommon for adult cereals to have 0 to 6 grams of sugar in a serving. Larger amounts of sugar are less common.

    Here is a paragraph that uses more formal vocabulary to summarize the comparison:

    In this sample, children’s cereals have more sugar per serving than adult cereals. The distribution of sugar in children’s cereals is skewed left with an overall range of 14 grams. Typical children’s cereals have 9 to 13 grams of sugar per serving with 12 grams as the most common amount. The distribution of sugar in adult cereals is skewed right with the same overall range of 14 grams. Typical adult cereals have 0 to 6 grams of sugar per serving with 3 grams as the most common amount.

    Outliers: Outliers are observations that fall outside the overall pattern. We develop a more precise method for identifying outliers later in this module. For now, use your judgment to identify values that appear to be exceptions to the general trend in the data.

    Example

    Wrist Measurements

    In the distribution of wrist measurements, there are two women with unusually large wrists. These women might be outliers. They are marked in red.

    The man with the smallest wrist measurement is shown in yellow. This man is probably not an outlier.

    Dotplots showing distribution of men's and women's wrist girths. Mens are left-skewed and women's are right-skewed

    Contributors and Attributions

    CC licensed content, Shared previously

    This page titled 2.16: Dotplots (2 of 2) is shared under a CC BY 4.0 license and was authored, remixed, and/or curated by Lumen Learning.

    • Was this article helpful?