5.27: The Sine Distribution

\(\newcommand{\R}{\mathbb{R}}\) \(\newcommand{\N}{\mathbb{N}}\) \(\newcommand{\E}{\mathbb{E}}\) \(\newcommand{\P}{\mathbb{P}}\) \(\newcommand{\var}{\text{var}}\) \(\newcommand{\sd}{\text{sd}}\) \(\newcommand{\cov}{\text{cov}}\) \(\newcommand{\cor}{\text{cor}}\) \(\newcommand{\skw}{\text{skew}}\) \(\newcommand{\kur}{\text{kurt}}\)

The sine distribution is a simple probability distribution based on a portion of the sine curve. It is also known as Gilbert's sine distribution, named for the American geologist Grove Karl (GK) Gilbert, who used the distribution in 1892 to study craters on the Moon.

The Standard Sine Distribution

Distribution Functions

The standard sine distribution is a continuous distribution on \( [0, 1] \) with probability density function \(g\) given by \[g(z) = \frac{\pi}{2} \sin(\pi z), \quad z \in [0, 1] \]

\(g\) is symmetric about \( z = \frac 1 2 \).
\(g\) increases and then decreases with mode at \( z = \frac 1 2 \).
\( g \) is concave downward.

Proof

From simple calculus, \( g \) is a probability density function: \( \sin(\pi z) \ge 0 \) for \( z \in [0, 1] \) and \[ \int_0^1 \sin(\pi z) \, dz = \frac{2}{\pi} \] so \( g \) integrates to 1. The properties follow from basic calculus since \begin{align} g^\prime(z) & = \frac{\pi^2}{2} \cos(\pi z), \quad z \in [0, 1] \\ g^{\prime \prime}(z) & = -\frac{\pi^3}{2} \sin(\pi z), \quad z \in [0, 1] \end{align}
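As a quick numerical sanity check of the proof above, the following Python sketch approximates the integral of \( g \) with a midpoint rule and evaluates the density at a few points. The names `sine_pdf` and `midpoint_integral` are illustrative choices, not part of the text.

```python
import math

def sine_pdf(z):
    """Standard sine density g(z) = (pi/2) sin(pi z) on [0, 1], zero elsewhere."""
    if not 0.0 <= z <= 1.0:
        return 0.0
    return 0.5 * math.pi * math.sin(math.pi * z)

def midpoint_integral(f, lo, hi, n=10_000):
    """Midpoint-rule approximation of the integral of f over [lo, hi]."""
    h = (hi - lo) / n
    return h * sum(f(lo + (i + 0.5) * h) for i in range(n))

total = midpoint_integral(sine_pdf, 0.0, 1.0)
print(total)                           # close to 1
print(sine_pdf(0.25), sine_pdf(0.75))  # equal by symmetry about 1/2
print(sine_pdf(0.5))                   # the maximum value, pi / 2
```

The midpoint rule is used only because it needs nothing beyond the standard library; any quadrature routine would serve the same purpose.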

Open the Special Distribution Simulator and select the sine distribution. Run the simulation 1000 times and compare the empirical density function to the probability density function.

The distribution function \(G\) is given by \( G(z) = \frac{1}{2} [1 - \cos(\pi z)]\) for \( z \in [0, 1] \).

Proof

This follows from the PDF above and simple calculus.

The quantile function \(G^{-1}\) is given by \( G^{-1}(p) = \frac{1}{\pi} \arccos(1 - 2 p) \) for \( p \in [0, 1] \).

The first quartile is \(q_1 = \frac{1}{3} \).
The median is \( \frac{1}{2} \).
The third quartile is \(q_3 = \frac{2}{3} \).

Proof

The formula for the quantile function follows immediately from the CDF above by solving \(p = G(z)\) for \(z\) in terms of \(p \in [0, 1]\).
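The distribution function and quantile function translate directly into code. The sketch below, with the assumed names `sine_cdf` and `sine_quantile`, evaluates the three quartiles and checks that the two functions invert each other at a few arbitrarily chosen probabilities.

```python
import math

def sine_cdf(z):
    """Standard sine CDF G(z) = (1 - cos(pi z)) / 2, extended to all real z."""
    if z <= 0.0:
        return 0.0
    if z >= 1.0:
        return 1.0
    return 0.5 * (1.0 - math.cos(math.pi * z))

def sine_quantile(p):
    """Standard sine quantile function G^{-1}(p) = arccos(1 - 2p) / pi."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must lie in [0, 1]")
    return math.acos(1.0 - 2.0 * p) / math.pi

quartiles = [sine_quantile(p) for p in (0.25, 0.5, 0.75)]
print(quartiles)  # approximately 1/3, 1/2, 2/3

# Round trip: G(G^{-1}(p)) should return p up to rounding error.
round_trip_error = max(
    abs(sine_cdf(sine_quantile(p)) - p) for p in (0.1, 0.25, 0.5, 0.9)
)
print(round_trip_error)
```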

Open the Special Distribution Calculator and select the sine distribution. Compute a few quantiles.

Moments

Suppose that \( Z \) has the standard sine distribution. The moment generating function can be given in closed form.

The moment generating function \( m \) of \( Z \) is given by \[ m(t) = \E\left(e^{t Z}\right) = \frac{\pi^2 (1 + e^t)}{2(t^2 + \pi^2)}, \quad t \in \R \]

Proof

Note first that \[ m(t) = \frac{\pi}{2} \int_0^1 e^{t z} \sin(\pi z) \, dz \] Integrating by parts with \( u = e^{t z} \) and \( dv = \sin(\pi z) dz \) gives \[ m(t) = \frac{1}{2} (1 + e^t) + \frac{t}{2} \int_0^1 e^{t z} \cos(\pi z) \, dz \] Integrating by parts again with \( u = e^{t z} \) and \( dv = \cos(\pi z) dz \) gives \[ m(t) = \frac{1}{2} (1 + e^t) - \frac{t^2}{\pi^2} m(t) \] Solving for \( m(t) \) gives the result.
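The closed form can be checked against a direct numerical evaluation of \( \E\left(e^{t Z}\right) \). The sketch below compares the two at a few arbitrary values of \( t \); the function names and the midpoint rule are illustrative choices.

```python
import math

def sine_mgf(t):
    """Closed-form MGF m(t) = pi^2 (1 + e^t) / (2 (t^2 + pi^2))."""
    return math.pi ** 2 * (1.0 + math.exp(t)) / (2.0 * (t ** 2 + math.pi ** 2))

def sine_mgf_numeric(t, n=20_000):
    """Midpoint-rule approximation of the integral of e^{tz} g(z) over [0, 1]."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        z = (i + 0.5) * h
        total += math.exp(t * z) * 0.5 * math.pi * math.sin(math.pi * z)
    return h * total

for t in (-2.0, 0.0, 1.0, 3.0):
    # The two columns should agree to several decimal places.
    print(t, sine_mgf(t), sine_mgf_numeric(t))
```

Note that \( m(0) = 1 \), as must be the case for any moment generating function.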

The moments of all orders exist, but a general formula is complicated and involves special functions. However, the mean and variance are easy to compute.

The mean and variance of \( Z \) are

\(\E(Z) = 1/2 \)
\(\var(Z) = 1/4 - 2 / \pi^2\)

Proof

We know that the mean exists since the PDF is continuous on a bounded interval. By symmetry, the mean must be \( 1/2 \).
Integration by parts (twice) gives \[ \E(Z^2) = \int_0^1 z^2 \frac{\pi}{2} \sin(\pi z) \, dz = \frac{1}{2} - \frac{2}{\pi^2} \] The variance then follows from the usual computational formula \( \var(Z) = \E(Z^2) - [\E(Z)]^2 \).

Of course, the mean and variance could also be obtained by differentiating the MGF.

Numerically, \( \sd(Z) \approx 0.2176 \).
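To illustrate the remark about differentiating the MGF, the following sketch approximates \( m^\prime(0) = \E(Z) \) and \( m^{\prime\prime}(0) = \E(Z^2) \) with central differences. The step size `h` is an arbitrary small value chosen for this illustration, so the results are approximations rather than exact values.

```python
import math

def sine_mgf(t):
    """Closed-form MGF m(t) = pi^2 (1 + e^t) / (2 (t^2 + pi^2))."""
    return math.pi ** 2 * (1.0 + math.exp(t)) / (2.0 * (t ** 2 + math.pi ** 2))

h = 1e-3  # step size for the finite differences (an assumption of this sketch)

# Central difference for the first derivative at 0: approximates E(Z).
mean = (sine_mgf(h) - sine_mgf(-h)) / (2.0 * h)

# Central difference for the second derivative at 0: approximates E(Z^2).
second_moment = (sine_mgf(h) - 2.0 * sine_mgf(0.0) + sine_mgf(-h)) / h ** 2

variance = second_moment - mean ** 2
sd = math.sqrt(variance)
print(mean, variance, sd)  # near 1/2, 1/4 - 2/pi^2, and 0.2176
```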

Open the Special Distribution Simulator and select the sine distribution. Note the position and size of the mean \(\pm \) standard deviation bar. Run the simulation 1000 times and compare the empirical mean and standard deviation to the distribution mean and standard deviation.

The skewness and kurtosis of \(Z\) are

\(\skw(Z) = 0\)
\(\kur(Z) = (384 - 48 \pi^2 + \pi^4) / (\pi^2 - 8)^2\)

Proof

The skewness is 0 by the symmetry of the distribution.
The formula for the kurtosis follows from the usual computational formula and the first four moments: \( \E(Z) = 1/2 \), \( \E(Z^2) = 1/2 - 2 / \pi^2 \), \( \E(Z^3) = 1/2 - 3 / \pi^2 \), \( \E(Z^4) = 1/2 + 24 / \pi^4 - 6 / \pi^2 \).

Numerically, \( \kur(Z) \approx 2.1938 \).
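The computational formulas mentioned in the proof can be carried out numerically. The sketch below starts from the four raw moments listed above, expands the third and fourth central moments, and compares the resulting kurtosis with the closed form. Variable names are illustrative.

```python
import math

pi2 = math.pi ** 2

# Raw moments E(Z^k) for k = 1, 2, 3, 4, as listed in the proof.
m1 = 0.5
m2 = 0.5 - 2.0 / pi2
m3 = 0.5 - 3.0 / pi2
m4 = 0.5 + 24.0 / pi2 ** 2 - 6.0 / pi2

variance = m2 - m1 ** 2

# Third and fourth central moments expanded in terms of the raw moments.
central3 = m3 - 3.0 * m1 * m2 + 2.0 * m1 ** 3
central4 = m4 - 4.0 * m1 * m3 + 6.0 * m1 ** 2 * m2 - 3.0 * m1 ** 4

skewness = central3 / variance ** 1.5
kurtosis = central4 / variance ** 2
closed_form = (384.0 - 48.0 * pi2 + pi2 ** 2) / (pi2 - 8.0) ** 2
print(skewness, kurtosis, closed_form)  # skewness is 0 up to rounding error
```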

Related Distributions

Since the distribution function and the quantile function have closed form representations, the standard sine distribution has the usual connection to the standard uniform distribution.

If \( U \) has the standard uniform distribution then \( Z = G^{-1}(U) = \frac{1}{\pi} \arccos(1 - 2 U) \) has the standard sine distribution.
If \( Z \) has the standard sine distribution then \( U = G(Z) = \frac{1}{2} [1 - \cos(\pi Z)] \) has the standard uniform distribution.

The first result of course leads to the random quantile method of simulation.
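A minimal sketch of the random quantile method follows, using Python's standard `random` module as the source of uniform variates. The function name `sample_standard_sine`, the seed, and the sample size are assumptions made for illustration.

```python
import math
import random
import statistics

def sample_standard_sine(n, seed=0):
    """Draw n values by applying G^{-1}(u) = arccos(1 - 2u) / pi to uniform draws."""
    rng = random.Random(seed)  # seeded only so that the sketch is reproducible
    return [math.acos(1.0 - 2.0 * rng.random()) / math.pi for _ in range(n)]

sample = sample_standard_sine(20_000)
print(statistics.fmean(sample))   # should be near 1/2
print(statistics.pstdev(sample))  # should be near 0.2176
```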

Open the random quantile simulator and select the sine distribution. Note the shape of the distribution and density functions. Run the simulation 1000 times and note the random quantiles. Compare the empirical density function to the probability density function.

Since the probability density function is continuous and is defined on a closed, bounded interval, the standard sine distribution can also be simulated using the rejection method.
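One common form of the rejection method can be sketched as follows: generate a point uniformly in the rectangle \( [0, 1] \times [0, \pi / 2] \) and keep its horizontal coordinate if the point falls under the graph of \( g \). This is a sketch under that assumption, not necessarily the exact scheme used by the app; the function name, seed, and sample size are illustrative.

```python
import math
import random

def sample_by_rejection(n, seed=0):
    """Return n accepted values and the total number of proposals used."""
    rng = random.Random(seed)  # seeded only so that the sketch is reproducible
    peak = math.pi / 2.0       # maximum of g, attained at z = 1/2
    accepted, proposals = [], 0
    while len(accepted) < n:
        z = rng.random()         # candidate point, uniform on [0, 1)
        y = peak * rng.random()  # uniform height in [0, peak)
        proposals += 1
        # Keep the candidate when (z, y) lies under the density curve.
        if y <= 0.5 * math.pi * math.sin(math.pi * z):
            accepted.append(z)
    return accepted, proposals

sample, proposals = sample_by_rejection(20_000)
sample_mean = sum(sample) / len(sample)
acceptance_rate = len(sample) / proposals
print(sample_mean)      # should be near 1/2
print(acceptance_rate)  # should be near 2 / pi
```

The expected acceptance rate is the area under \( g \) divided by the area of the rectangle, which is \( 2 / \pi \approx 0.64 \), so roughly a third of the proposals are discarded.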

Open the rejection method app and select the sine distribution. Run the simulation 1000 times and compare the empirical density function to the probability density function.

The General Sine Distribution

As with so many other standard distributions, the standard sine distribution is generalized by adding location and scale parameters.

Suppose that \(Z\) has the standard sine distribution. For \(a \in \R\) and \( b \in (0, \infty) \), random variable \( X = a + b Z \) has the sine distribution with location parameter \(a\) and scale parameter \(b\).

Distribution Functions

Analogues of the results above for the standard sine distribution follow easily from basic properties of the location-scale transformation. Suppose that \( X \) has the sine distribution with location parameter \( a \in \R \) and scale parameter \( b \in (0, \infty) \). So \( X \) has a continuous distribution on the interval \( [a, a + b] \).

The probability density function \( f \) of \( X \) is given by \[ f(x) = \frac{\pi}{2 b} \sin\left(\pi \frac{x - a}{b}\right), \quad x \in [a, a + b] \]

\( f \) is symmetric about \( x = a + b / 2 \).
\( f \) increases and then decreases, with mode \( x = a + b / 2 \).
\( f \) is concave downward.

Proof

Recall that \[ f(x) = \frac{1}{b} g\left(\frac{x - a}{b}\right), \quad x \in \R \] where \( g \) is the standard PDF.

Pure scale transformations (\( a = 0 \) and \( b \gt 0 \)) are particularly common, since \( X \) often represents a random angle. The scale transformation with \( b = \pi \) gives the angle in radians. In this case the probability density function is \( f(x) = \frac{1}{2} \sin(x) \) for \( x \in [0, \pi] \). Since the radian is the standard angle unit, this distribution could also be considered the standard one. The scale transformation with \( b = 90 \) gives the angle in degrees. In this case, the probability density function is \( f(x) = \frac{\pi}{180} \sin\left(\frac{\pi}{90} x\right) \) for \( x \in [0, 90] \). This was Gilbert's original formulation.
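The two special cases above can be checked against the general density. In the sketch below, `sine_pdf` is an assumed name for a function implementing \( f \), and the evaluation points `x` and `angle` are arbitrary.

```python
import math

def sine_pdf(x, a=0.0, b=1.0):
    """Sine density with location a and scale b > 0; zero outside [a, a + b]."""
    if b <= 0.0:
        raise ValueError("scale parameter b must be positive")
    z = (x - a) / b
    if not 0.0 <= z <= 1.0:
        return 0.0
    return math.pi / (2.0 * b) * math.sin(math.pi * z)

# Radians: with b = pi the density reduces to sin(x) / 2 on [0, pi].
x = 1.0
print(sine_pdf(x, b=math.pi), 0.5 * math.sin(x))

# Degrees: with b = 90 the density is (pi / 180) sin(pi x / 90) on [0, 90].
angle = 30.0
print(sine_pdf(angle, b=90.0), math.pi / 180.0 * math.sin(math.pi * angle / 90.0))
```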

In the special distribution simulator, select the sine distribution. Vary the parameters and note the shape and location of the probability density function. For selected values of the parameters, run the simulation 1000 times and compare the empirical density function to the probability density function.

The distribution function \( F \) of \( X \) is given by \[ F(x) = \frac{1}{2}\left[1 - \cos\left(\pi \frac{x - a}{b}\right)\right], \quad x \in [a, a + b] \]

Proof

Recall that \[ F(x) = G\left(\frac{x - a}{b}\right), \quad x \in \R \] where \( G \) is the standard CDF.

The quantile function \( F^{-1} \) of \( X \) is given by \[ F^{-1}(p) = a + \frac{b}{\pi} \arccos(1 - 2 p), \quad p \in (0, 1) \]

The first quartile is \( a + b / 3\).
The median is \( a + b / 2 \).
The third quartile is \( a + 2 b / 3\).

Proof

Recall that \( F^{-1}(p) = a + b G^{-1}(p) \) for \( p \in (0, 1) \), where \( G^{-1} \) is the standard quantile function.
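The general distribution function and quantile function can be sketched in the same way. The parameter values \( a = 2 \) and \( b = 3 \) below are arbitrary, chosen only to illustrate the quantiles of order 0.1 and 0.9.

```python
import math

def sine_cdf(x, a=0.0, b=1.0):
    """CDF F(x) = G((x - a) / b), extended to all real x."""
    z = min(max((x - a) / b, 0.0), 1.0)  # clamp so F is 0 below a and 1 above a + b
    return 0.5 * (1.0 - math.cos(math.pi * z))

def sine_quantile(p, a=0.0, b=1.0):
    """Quantile function F^{-1}(p) = a + b arccos(1 - 2p) / pi."""
    return a + b * math.acos(1.0 - 2.0 * p) / math.pi

a, b = 2.0, 3.0  # illustrative parameters, not taken from the text
q10 = sine_quantile(0.1, a, b)
q90 = sine_quantile(0.9, a, b)
print(q10, q90)                                  # symmetric about a + b / 2
print(sine_cdf(q10, a, b), sine_cdf(q90, a, b))  # recovers 0.1 and 0.9
```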

In the special distribution calculator, select the sine distribution. Vary the parameters and note the shape and location of the probability density function and the distribution function. For selected values of the parameters, find the quantiles of order 0.1 and 0.9.

Moments

Suppose again that \( X \) has the sine distribution with location parameter \( a \in \R \) and scale parameter \( b \in (0, \infty) \).

The moment generating function \( M \) of \( X \) is given by \[ M(t) = \frac{\pi^2 \left(e^{a t} + e^{(a + b) t}\right)}{2 \left(b^2 t^2 + \pi^2\right)}, \quad t \in \R \]

Proof

Recall that \( M(t) = e^{a t} m(b t) \) where \( m \) is the standard MGF.

The mean and variance of \( X \) are

\(\E(X) = a + b / 2\)
\(\var(X) = b^2 (1 / 4 - 2 / \pi^2)\)

Proof

By definition we can assume \( X = a + b Z \) where \( Z \) has the standard sine distribution. Using the mean and variance of \( Z \) we have

\( \E(X) = a + b \E(Z) = a + b / 2\)
\( \var(X) = b^2 \var(Z) = b^2 (1 / 4 - 2 / \pi^2) \)

In the special distribution simulator, select the sine distribution. Vary the parameters and note the shape and location of the mean \( \pm \) standard deviation bar. For selected values of the parameters, run the simulation 1000 times and compare the empirical mean and standard deviation to the distribution mean and standard deviation.

The skewness and kurtosis of \( X \) are

\( \skw(X) = 0 \)
\(\kur(X) = (384 - 48 \pi^2 + \pi^4) / (\pi^2 - 8)^2 \)

Proof

Recall that skewness and kurtosis are defined in terms of the standard score, and hence are invariant under location-scale transformations. So the skewness and kurtosis of \( X \) are the same as the skewness and kurtosis of \( Z \).

Related Distributions

The general sine distribution is a location-scale family, so it is trivially closed under location-scale transformations.

Suppose that \( X \) has the sine distribution with location parameter \( a \in \R \) and scale parameter \( b \in (0, \infty) \), and that \( c \in \R \) and \( d \in (0, \infty) \). Then \( Y = c + d X \) has the sine distribution with location parameter \( c + a d \) and scale parameter \( b d \).

Proof

Again by definition we can take \( X = a + b Z \) where \( Z \) has the standard sine distribution. Then \( Y = c + d X = (c + a d) + (b d) Z \).
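The closure property can be illustrated with quantile functions: since \( d \gt 0 \), the map \( x \mapsto c + d x \) is increasing, so the quantile of order \( p \) for \( Y \) is \( c + d F^{-1}(p) \). The sketch below checks that this agrees with the quantile function of the sine distribution with parameters \( c + a d \) and \( b d \). The helper `transform_parameters` and the numerical values are hypothetical.

```python
import math

def sine_quantile(p, a, b):
    """Quantile function of the sine distribution with location a and scale b."""
    return a + b * math.acos(1.0 - 2.0 * p) / math.pi

def transform_parameters(a, b, c, d):
    """Location and scale of Y = c + d X when X has location a and scale b."""
    if d <= 0.0:
        raise ValueError("d must be positive")
    return c + a * d, b * d

a, b, c, d = 1.0, 2.0, -3.0, 0.5  # arbitrary illustrative values
new_a, new_b = transform_parameters(a, b, c, d)
print(new_a, new_b)  # -2.5 1.0

# Largest gap between c + d F^{-1}(p) and the quantile with the new parameters.
max_gap = max(
    abs(c + d * sine_quantile(p, a, b) - sine_quantile(p, new_a, new_b))
    for p in (0.1, 0.25, 0.5, 0.75, 0.9)
)
print(max_gap)  # zero up to rounding error
```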