Skip to main content
Statistics LibreTexts

14.4: The Gaussian Distribution

  • Page ID
    57771
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\dsum}{\displaystyle\sum\limits} \)

    \( \newcommand{\dint}{\displaystyle\int\limits} \)

    \( \newcommand{\dlim}{\displaystyle\lim\limits} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \(\newcommand{\longvect}{\overrightarrow}\)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    To illustrate what we did in the previous sections, let us apply what we know to the Gaussian distribution, and show that it is exponential class. This will allow us to determine the canonical link, the expected value, and the variance. Hopefully, the results will not surprise us.

     

    Learning Objectives

    By the end of this section, you will be able to:

    1. Demonstrate that the Gaussian (normal) distribution is a member of the exponential family by rewriting its probability density function in the standard GLM form.
    2. Identify the components of the Gaussian distribution within this framework: the natural parameter, the dispersion parameter, the cumulant function, and the term.
    3. Derive the canonical link function for the Gaussian distribution as the identity link (\(g(\mu) = \mu\)), and confirm that the expected value and the variance match the known properties of the Normal distribution.
    4. Recognize that the GLM framework with the Gaussian distribution and identity link reproduces the Classical Linear Model, illustrating how GLM generalizes the CLM to handle other types of response variables.

     

    ✦•················• ✦ •··················•✦

     

    The Gaussian is Exponential Class

    We start with the probability density function (pdf) of the Gaussian.

    \begin{equation}
    f(y) = \frac{1}{\sqrt{2 \pi \sigma^2}}\ \exp \left[ - \frac{(y - \mu)^2}{2 \sigma^2} \right] \\[1em]
    \end{equation}

    Now, to write this in standard form, we use algebra and some logarithm rules.

    \begin{align}
    f(y) & = \exp \left[ - \frac{(y - \mu)^2}{2 \sigma^2} + \log \left( \frac{1}{\sqrt{2 \pi \sigma^2}} \right) \right] \\[2em]
    & = \exp \left[ - \frac{y^2 -2y\mu + \mu^2}{2 \sigma^2} + \log \left( \frac{1}{\sqrt{2 \pi \sigma^2}} \right) \right] \\[2em]
    & = \exp \left[ - \frac{y^2}{2 \sigma^2} + \frac{y\mu}{\sigma^2} - \frac{\mu^2}{2 \sigma^2} + \log \left( \frac{1}{\sqrt{2 \pi \sigma^2}} \right) \right] \\[2em]
    & = \exp \left[ \frac{y\mu -\frac{1}{2} \mu^2}{\sigma^2} + \log \left( \frac{1}{\sqrt{2 \pi \sigma^2}} \right) - \frac{y^2}{2 \sigma^2} \right]
    \end{align}

    Recall from the previous section that the "standard form" is

    \begin{equation}
    f(y) = \exp \left[ \frac{y \theta - b(\theta)}{a(\phi)} + c(y,\phi) \right]
    \end{equation}

    Comparing the two distributions gives us the following:

    • \(y = y\)
    • \(\theta = \mu\)
    • \(a(\phi)=\sigma^2\)
    • \(b(\theta) = \frac{1}{2} \mu^2 = \frac{1}{2} \theta^2\)
    • \(c(y, \phi)= \log \left( \frac{1}{\sqrt{2 \pi \sigma^2}} \right) - \frac{y^2}{2 \sigma^2}\)

     

    Thus, according to this list, the canonical link is \(g(\mu) = \mu\), also known as the identity function.

     

    The dispersion parameter is \(a(\phi) = \sigma^2\).

     

    Also note that the expected value is

    \begin{align}
    E[Y] & = b^\prime\!(\theta) \\[1em]
    &= \frac{\text{d}}{\text{d}\theta} \left( \frac{1}{2} \theta^2 \right) \\[1em]
    &= \theta \\[1em]
    &= \mu
    \end{align}

    Hopefully, this is as we expect.

     

    Finally, note that the variance is

    \begin{align}
    V[Y] & = b^{\prime\prime}\!(\theta) a(\phi) \\[1em]
    &= \frac{\text{d}^2}{\text{d}\theta^2} \left( \frac{1}{2} \theta^2 \sigma^2 \right)\\[1em]
    &= \frac{\text{d}}{\text{d}\theta} \left( \theta \sigma^2 \right)\\[1em]
    &= \sigma^2
    \end{align}

    Also as we expect, hopefully.

     

    Other Link Functions

    While the canonical link is the identity function (\(\eta = \mu\)), it is not the only allowable link function. In Section 8.1: The Issue of Boundedness, we transformed the continuous dependent variable because it was bounded below by (but never equaled) zero. In such a case, the logarithm is an appropriate link function: The dependent variable has a restricted range. The link function converts that range to an unbounded range. The same is true under the GLM framework. Similarly, the logit function is frequently an appropriate link function, as it was in Section 8.1: The Issue of Boundedness.

    With that, we start to see that for continuous dependent variables, what we did under the CLM paradigm we can do under the GLM paradigm. This is always true; the GLM paradigm extends the CLM paradigm to handle different classes of dependent variables.

     

     

     


    This page titled 14.4: The Gaussian Distribution is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by Ole Forsberg.

    • Was this article helpful?