2.7: Skew and Kurtosis

Last updated
Save as PDF

Page ID: 22013

\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \) \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)\(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\) \(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\)\(\newcommand{\AA}{\unicode[.8,0]{x212B}}\)

Statisticians and researchers look at a lot of frequency charts (usually line graphs and histograms), so they know what to look for. Khan Academy has a great video previewing what will be discussed in this section.

Skew

Since it’s the more interesting of the two, let’s start by talking about the skew.

The shape of a frequency chart (line graph or histogram) can tell you a lot of about that data set. The simplest things to look for are any extreme scores (called outliers) that seem to be much higher or much lower than most of the other scores. These outliers may affect the shape of the distribution, making it skewed. A skewed distribution is one in which many scores are bunched up to one side, and there are only a few scores on the other side. A distribution can be positively skewed (meaning that the scores are bunched to the left, and the thin tail is pointing to the right) or negatively skewed (meaning that the scores are bunched to the right, and the thin tail is pointing to the left). Figure \(\PageIndex{1}\) shows examples of a positively skewed line graph (on the right) and a negatively skewed line graph on the left.

Line graph with peak towards the right labeled "negative skew" with another line graph with a peak towards the left labeled "positive skew". — Figure \(\PageIndex{1}\)- Diagram of Skew. (CC-BY-SA Rodolfo Hermans (Godot), CC BY-SA 3.0, via Wikimedia Commons)

Skewness is basically a measure of asymmetry, and the easiest way to explain it is by drawing some pictures. As Figure \(\PageIndex{2}\) illustrates in histograms, if the data tend to have a lot of extreme small values (i.e., the lower tail is “longer” than the upper tail) and not so many extremely large values (left panel), then we say that the data are negatively skewed. On the other hand, if there are more extremely large values than extremely small ones (right panel) we say that the data are positively skewed. That’s the qualitative idea behind skewness.

Three histogram showing negative skew, no skew, and positive skew. — Figure \(\PageIndex{2}\)- Diagram of Skew and No Skew (CC-BY-SA Danielle Navarro from Learning Statistics with R)

Kurtosis

For distributions that are not skewed, you can look at how short and fat the distribution is, or if it is a medium bell-shaped curve, or if it is tall and narrow. This way of describing the shape of symmetrical (not-skewed) distributions, how broad the distribution is, is called kurtosis. Put simply, kurtosis is a measure of the “tailedness” of the data. As you can see in the line graph in Figure \(\PageIndex{3}\), there are three main types: wide, medium, or tall. Wide and flat graphs are called platykurtic. Medium, bell-shaped graphs are called mesokurtic or a normal distribution. Tall and narrow graphs are called leptokurtic.

Three line graphs; one is wide and flat, one is like a bell, and one is tall and narrow. — Figure \(\PageIndex{3}\)- Diagram of Three Kurtosis Types in Line Graphs (CC-BY Larry Green)

Figure \(\PageIndex{4}\) shows the three types of kurtosis in histograms. The histogram on the left shows platykurtic data, the middle histogram shows mesokurtic data, and on the right, we have a leptokurtic data set. By mathematical calculations, the “normal curve” (black lines) has zero kurtosis.

Three histograms showing a flat distribution, a normal distribution, and a "pointy" distribution. — Figure \(\PageIndex{4}\): Diagram of Three Kurtosis Types in Histograms (CC-BY-SA Danielle Navarro from Learning Statistics with R)

The types of kurtosis are summarized in Table \(\PageIndex{1}\).

Table \(\PageIndex{1}\)- Informal Description of Types of Kurtosis
Technical Name	Informal Description
mesokurtic	just pointy enough
leptokurtic	too pointy
platykurtic	too flat

Summary

In practice, neither skew nor kurtosis is used anywhere near as frequently as the measures of central tendency and variability that will be discussed in next chapter. Skew is pretty important, though, so you do see it mentioned a fair bit; but I’ve actually never seen kurtosis reported in a scientific article to date

Contributors

Danielle Navarro (University of New South Wales)
Peter H. Westfall (Paul Whitfield Horn Professor and James and Marguerite Niver Professor, Texas Tech University)
Dr. MO (Taft College)
Figure \(\PageIndex{1}\) Image Attribution: Rodolfo Hermans (Godot), CC BY-SA 3.0, via Wikimedia Commons
Figure \(\PageIndex{3}\) Image Attribution: CC-BY Larry Green from Lake Tahoe Community College