5.40: The Zeta Distribution
- Page ID
- 10473
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)The zeta distribution is used to model the size or ranks of certain types of objects randomly chosen from certain types of populations. Typical examples include the frequency of occurrence of a word randomly chosen from a text, or the population rank of a city randomly chosen from a country. The zeta distribution is also known as the Zipf distribution, in honor of the American linguist George Zipf.
Basic Theory
The Zeta Function
The Riemann zeta function \(\zeta\), named after Bernhard Riemann, is defined as follows: \[ \zeta(a) = \sum_{n=1}^\infty \frac{1}{n^a}, \quad a \in (1, \infty) \]
You might recall from calculus that the series in the zeta function converges for \(a \gt 1\) and diverges for \(a \le 1\).

The zeta function satifies the following properties:
- \(\zeta\) is decreasing.
- \(\zeta\) is concave upward.
- \(\zeta(a) \downarrow 1\) as \(a \uparrow \infty\)
- \(\zeta(a) \uparrow \infty\) as \(a \downarrow 1\)
The zeta function is transcendental, and most of its values must be approximated. However, \(\zeta(a)\) can be given explicitly for even integer values of \(a\); in particular, \(\zeta(2) = \frac{\pi^2}{6}\) and \(\zeta(4) = \frac{\pi^4}{90}\).
The Probability Density Function
The zeta distribution with shape parameter \( a \in (1, \infty) \) is a discrete distribution on \( \N_+ \) with probability density function \( f \) given by. \[ f(n) = \frac{1}{\zeta(a) n^a}, \quad n \in \N_+ \]
- \( f \) is decreasing with mode \( n = 1 \).
- When smoothed, \( f \) is concave upward.
Proof
Clearly \( f \) is a valid PDF, since by definition, \( \zeta(a) \) is the normalizing constant for the function \( n \mapsto \frac{1}{n^a} \) on \( \N_+ \). Part (a) is clear. For part (b), note that the function \( x \mapsto x^{-a} \) on \( [1, \infty) \) has a positive second derivative.
Open the special distribution simulator and select the zeta distribution. Vary the shape parameter and note the shape of the probability density function. For selected values of the parameter, run the simulation 1000 times and compare the empirical density function to the probability density function.
The distribution function and quantile function do not have simple closed forms, except in terms of other special functions.
Open the special distribution calculator and select the zeta distribution. Vary the parameter and note the shape of the distribution and probability density functions. For selected values of the parameter, compute the median and the first and third quartiles.
Moments
Suppose that \( N \) has the zeta distribution with shape parameter \( a \in (1, \infty) \). The moments of \( X \) can be expressed easily in terms of the zeta function.
If \( k \ge a - 1 \), \( \E(X) = \infty \). If \( k \lt a - 1 \), \[\E\left(N^k\right) = \frac{\zeta(a - k)}{\zeta(a)}\]
Proof
Note that \[ \E\left(N^k\right) = \sum_{n=1}^\infty n^k \frac{1}{\zeta(a) n^a} = \frac{1}{\zeta(a)} \sum_{n=1}^\infty \frac{1}{n^{a - k}}\] If \( a - k \le 1 \), the last sum diverges to \( \infty \). If \( a - k \gt 1 \), the sum converges to \( \zeta(a - k) \)
The mean and variance of \(N\) are as follows:
- If \( a \gt 2 \), \[\E(N) = \frac{\zeta(a - 1)}{\zeta(a)}\]
- If \( a \gt 3 \), \[\var(N) = \frac{\zeta(a - 2)}{\zeta(a)} - \left(\frac{\zeta(a - 1)}{\zeta(a)}\right)^2\]
Open the special distribution simulator and select the zeta distribution. Vary the parameter and note the shape and location of the mean \( \pm \) standard deviation bar. For selected values of the parameter, run the simulation 1000 times and compare the empirical mean and standard deviation to the distribution mean and standard deviation.
The skewness and kurtosis of \(N\) are as follows:
- If \( a \gt 4 \), \[ \skw(N) = \frac{\zeta(a - 3) \zeta^2(a) - 3 \zeta(a - 1) \zeta(a - 2) \zeta(a) + 2 \zeta^3(a - 1)}{[\zeta(a - 2) \zeta(a) - \zeta^2(a - 1)]^{3/2}} \]
- If \( a \gt 5 \), \[ \kur(N) = \frac{\zeta(a - 4) \zeta^3(a) - 4 \zeta(a - 1) \zeta(a - 3) \zeta^2(a) + 6 \zeta^2(a - 1) \zeta(a - 2) \zeta(a) - 3 \zeta^4(a - 1)}{\left[\zeta(a - 2) \zeta(a) - \zeta^2(a - 1)\right]^2} \]
Proof
These results follow from the general moment result above and standard computational formulas for skewness and kurtosis.
The probability generating function of \( N \) can be expressed in terms of the polylogarithm function \( \Li \) that was introduced in the section on the exponential-logarithmic distribution. Recall that the polylogarithm of order \( s \in \R \) is defined by \[ \Li_s(x) = \sum_{k=1}^\infty \frac{x^k}{k^s}, \quad x \in (-1, 1) \]
\( N \) has probability generating function \( P \) given by \[ P(t) = \E\left(t^N\right) = \frac{\Li_a(t)}{\zeta(a)}, \quad t \in (-1, 1) \]
Proof
Note that \[ \E\left(t^N\right) = \sum_{n=1}^\infty t^n \frac{1}{n^a \zeta(a)} = \frac{1}{\zeta(a)} \sum_{n=1}^\infty \frac{t^n}{n^a} \] The last sum is \( \Li_a(t) \).
Related Distributions
In an algebraic sense, the zeta distribution is a discrete version of the Pareto distribution. Recall that if \( a \gt 1 \), the Pareto distribution with shape parameter \( a - 1 \) is a continuous distribution on \( [1, \infty) \) with probability density function \[ f(x) = \frac{a - 1}{x^a}, \quad x \in [1, \infty) \]
Naturally, the limits of the zeta distribution with respect to the shape parameter \( a \) are of interest.
The zeta distribution with shape parameter \( a \in (1, \infty) \) converges to point mass at 1 as \( a \to \infty \).
Proof
For the PDF \( f \) above, note that \( f(1) = \zeta(a) \to 1 \) as \( a \to \infty \) and for \( n \in \{2, 3, \ldots\} \), \( f(n) = 1 \big/ n^a \zeta(a) \to 0 \) as \( a \to \infty \)
Finally, the zeta distribution is a member of the family of general exponential distributions.
Suppose that \(N\) has the zeta distribution with parameter \(a\). Then the distribution is a one-parameter exponential family with natural parameter \(a\) and natural statistic \(-\ln N\).
Proof
This follows from the definition of the general exponential distribution, since the zeta PDF can be written in the form \[ f(n) = \frac{1}{\zeta(a)} \exp(-a \ln n), \quad n \in \N_+ \]
Computational Exercises
Let \(N\) denote the frequency of occurrence of a word chosen at random from a certain text, and suppose that \(X\) has the zeta distribution with parameter \(a = 2\). Find \(\P(N \gt 4)\).
Answer
\(\P(N \gt 4) = 1 - \frac{49}{6 \pi^2} \approx 0.1725\)
Suppose that \(N\) has the zeta distribution with parameter \(a = 6\). Approximate each of the following:
- \(\E(N)\)
- \(\var(N)\)
- \( \skw(N) \)
- \( \kur(N) \)
Answer
- \(\E(N) \approx 1.109\)
- \(\var(N) \approx 0.025\)
- \( \skw(N) \approx 11.700 \)
- \( \kur(N) \approx 309.19 \)