7.2: Sums of Continuous Random Variables

Last updated
Save as PDF

Page ID: 3150

Charles M. Grinstead & J. Laurie Snell
Swarthmore College and Dartmouth College via American Mathematical Society

\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \) \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)\(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\) \(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\)\(\newcommand{\AA}{\unicode[.8,0]{x212B}}\)

In this section we consider the continuous version of the problem posed in the previous section: How are sums of independent random variables distributed?

Definition \(\PageIndex{1}\): convolution

Let \(X\) and \(Y\) be two continuous random variables with density functions \(f(x)\) and \(g(y)\), respectively. Assume that both \(f(x)\) and \(g(y)\) are defined for all real numbers. Then the convolution \(f ∗ g\) of \(f\) and \(g\) is the function given by

\[ \begin{align*} (f*g) &= \int_{-\infty}^\infty f(z-y)g(y)dy \\[4pt] &= \int_{-\infty}^\infty g(z-x)f(x)dx \end{align*} \]

This definition is analogous to the definition, given in Section 7.1, of the convolution of two distribution functions. Thus it should not be surprising that if X and Y are independent, then the density of their sum is the convolution of their densities. This fact is stated as a theorem below, and its proof is left as an exercise (see Exercise 1).

Theorem \(\PageIndex{1}\)

Let X and Y be two independent random variables with density functions fX (x) and fY (y) defined for all x. Then the sum Z = X + Y is a random variable with density function \(f_Z(z)\), where \(f_X\) is the convolution of \(f_X\) and \(f_Y\)

To get a better understanding of this important result, we will look at some examples.

Sum of Two Independent Uniform Random Variables

Example \(\PageIndex{1}\):

Suppose we choose independently two numbers at random from the interval [0, 1] with uniform probability density. What is the density of their sum? Let X and Y be random variables describing our choices and \(Z = X + Y\) their sum. Then we have

\[f_X(x) = f_Y(y) = \begin{array}{cc} 1 & \text{if } 0 \leq x \leq 1 \\ 0 & \text{otherwise} \end{array} \nonumber \]

and the density function for the sum is given by

\[f_Z(z) = \int_{-\infty}^\infty f_X(z-y)f_Y(y)dy. \nonumber \]

Since \(f_Y (y) = 1 if 0 ≤ y ≤ 1\) and 0 otherwise, this becomes

\[f_Z(z) = \int_{0}^1 f_X(z-y)dy. \nonumber \]

Now the integrand is 0 unless 0 ≤ z − y ≤ 1 (i.e., unless z − 1 ≤ y ≤ z) and then it is 1. So if 0 ≤ z ≤ 1, we have

\[f_Z (z) = \int_0^z dy = z , \nonumber \]

while if 1 < z ≤ 2, we have

\[f_Z(z) = \int_{z-1}^1 dy = 2-z, \nonumber \]

and if \(z < 0\) or \(z > 2\) we have \(_fZ(z) = 0\) (see Figure 7.2). Hence,

\[f_Z(z) = \Bigg\{ \begin{array}{cc} z & \text{if } 0 \leq z \leq 1 \\ 2-z, & \text{if} 1 < z \leq 2 \\ 0, & \text{otherwise} \end{array} \nonumber \]

Note that this result agrees with that of Example 2.4.

Sum of Two Independent Exponential Random Variables

Example \(\PageIndex{2}\):

Suppose we choose two numbers at random from the interval [0, ∞) with an exponential density with parameter λ. What is the density of their sum? Let X, Y , and Z = X + Y denote the relevant random variables, and \(f_X , f_Y , \)and \(f_Z\) their densities. Then

\[ f_X(x) = f_Y(x) = \bigg\{ \begin{array}{cc} \lambda e^{-\lambda x}, & \text{if } x \geq 0 \\ 0, & \text{otherwise} \end{array} \nonumber \]

Figure \(\PageIndex{1}\): Convolution of two uniform densities

Figure \(\PageIndex{2}\): Convolution of two exponential densities with λ = 1.

and so, if z > 0,

\[ \begin{align*} f_Z(z) & = \int_{-\infty}^\infty f_X(z-y)f_Y(y)dy \\[4pt] &= \int_0^z \lambda e^{-\lambda (z-y)} \lambda e^{-\lambda y} dy \\[4pt] &= \int_0^z \lambda^2 e^{-\lambda z} dy \\[4pt] &= \lambda^2 z e^{-\lambda z} \end{align*} \]

while if z < 0, \(f_Z(z) = 0\) (see Figure 7.3). Hence,

\[ f_Z(z) = \bigg\{ \begin{array}{cc} \lambda^2 z^{-\lambda z}, & \text{if } z \geq 0, \\ 0, & \text{otherwise} \end{array} \nonumber \]

Sum of Two Independent Normal Random Variables

Example \(\PageIndex{3}\)

It is an interesting and important fact that the convolution of two normal densities with means \(µ_1 and µ_2\) and variances \(σ_1 and σ_2\) is again a normal density, with mean \(µ_1 + µ_2\) and variance \( \sigma_1^2 + \sigma_2^2\). We will show this in the special case that both random variables are standard normal. The general case can be done in the same way, but the calculation is messier. Another way to show the general result is given in Example 10.17.

Suppose X and Y are two independent random variables, each with the standard normal density (see Example 5.8). We have

\[ f_X(x) = f_Y(y) = \frac{1}{\sqrt{2\pi}}e^{-x^2/2} \nonumber \]

and so

\[\begin{align*} f_Z(z) & = f_X * f_Y(z) \\[4pt] &= \frac{1}{2\pi} \int_{-\infty}^\infty e^{-(z-y)^2/2} e^{-y^2/2}dy \\[4pt] &= \frac{1}{2\pi} e^{-z^2/4} \int_{-\infty}^\infty e^{-(y-z/2)^2}dy \\[4pt] &= \frac{1}{2\pi} e^{-z^2/4}\sqrt{\pi} \left[ \frac{1}{\sqrt{\pi}} \int_{-\infty}^{\infty} e^{-(y-z/2)^2dy} \right] \end{align*}\]

The expression in the brackets equals 1, since it is the integral of the normal density function with \( \mu =0\) and \(\sigma = \sqrt{2}\) So, we have

\[f_Z(z) = \frac{1}{\sqrt{4\pi}}e^{-z^2/4} \nonumber \]

Sum of Two Independent Cauchy Random Variables

Example \(\PageIndex{4}\):

Choose two numbers at random from the interval \((-\infty, \infty\) with the Cauchy density with parameter \(a = 1\) (see Example 5.10). Then

\[ f_X(x) = f_Y(y) = \frac{1}{\pi(1+x^2)} \nonumber \]

and \(Z = X +Y\) has density

\[f_Z(z) = \frac{1}{\pi^2} \int_{-\infty}^\infty \frac{1}{1+(z-y)^2} \frac{1}{1+y^2}dy. \nonumber \]

This integral requires some effort, and we give here only the result (see Section 10.3, or Dwass\(^3\) ):

\[fZ(z) =\frac{2}{\pi (4+z^2)} \nonumber \]

Now, suppose that we ask for the density function of the average

\[A = (1/2)(X + Y ) \nonumber \]

of X and Y . Then A = (1/2)Z. Exercise 5.2.19 shows that if U and V are two continuous random variables with density functions \(f_U(x)\) and \(f_V(x)\), respectively, and if \(V = aU\), then

\[f_V (x) = \bigg( \frac{1}{a}\bigg) f_U \bigg( \frac{x}{a} \bigg). \nonumber \]

Thus, we have

\[f_A(z) = 2f_Z(2z) = \frac{1}{\pi(1+z^2)} \nonumber \]

Hence, the density function for the average of two random variables, each having a Cauchy density, is again a random variable with a Cauchy density; this remarkable property is a peculiarity of the Cauchy density. One consequence of this is if the error in a certain measurement process had a Cauchy density and you averaged a number of measurements, the average could not be expected to be any more accurate than any one of your individual measurements!

Rayleigh Density

Example \(\PageIndex{5}\):

Suppose X and Y are two independent standard normal random variables. Now suppose we locate a point \(P\) in the xy-plane with coordinates (X, Y ) and ask: What is the density of the square of the distance of P from the origin? (We have already simulated this problem in Example 5.9.) Here, with the preceding notation, we have

\[\begin{align*} f_X(x) &= f_V(x) \\[4pt] &= \frac{1}{\sqrt{2\pi}} e^{-x^2/2} \end{align*} \]

Moreover, if \(X^2\) denotes the square of \(X\), then (see Theorem 5.1 and the discussion following)

\[ \begin{align*} f_{X^2}(r) &= \begin{array}{} \dfrac{1}{\sqrt{2\pi r}}(e^{-r/2}) & \text{if } r>0\\ 0 & \text{otherwise}.\end{array} \\[4pt] &= \begin{array}{cc} \dfrac{1}{2\sqrt{r}}(f_X\sqrt{r}) + f_X(-\sqrt{r})) & \text{if } r>0\\ 0 & \text{otherwise}.\end{array} \end{align*} \]

This is a gamma density with \(\lambda = 1/2\), \(\beta = 1/2\) (see Example 7.4). Now let \(R^2 = X^2 + Y^2\)

Then

Hence, \(R^2\) has a gamma density with λ = 1/2, β = 1. We can interpret this result as giving the density for the square of the distance of P from the center of a target if its coordinates are normally distributed. The density of the random variable R is obtained from that of \(R^2\) in the usual way (see Theorem 5.1), and we find

\[ f_R(r) = \Bigg\{ \begin{array}{cc} \frac{1}{2}e^{-r^2/2} \cdot 2r = re^{-r^2/2}, & \text{if } r \geq 0 \\ 0, & \text{otherwise} \end{array} \nonumber \]

Physicists will recognize this as a Rayleigh density. Our result here agrees with our simulation in Example 5.9.

Chi-Squared Density

More generally, the same method shows that the sum of the squares of n independent normally distributed random variables with mean 0 and standard deviation 1 has a gamma density with λ = 1/2 and β = n/2. Such a density is called a chi-squared density with n degrees of freedom. This density was introduced in Chapter 4.3. In Example 5.10, we used this density to test the hypothesis that two traits were independent.

Another important use of the chi-squared density is in comparing experimental data with a theoretical discrete distribution, to see whether the data supports the theoretical model. More specifically, suppose that we have an experiment with a finite set of outcomes. If the set of outcomes is countable, we group them into finitely many sets of outcomes. We propose a theoretical distribution that we think will model the experiment well. We obtain some data by repeating the experiment a number of times. Now we wish to check how well the theoretical distribution fits the data.

Let \(X\) be the random variable that represents a theoretical outcome in the model of the experiment, and let \(m(x)\) be the distribution function of X. In a manner similar to what was done in Example 5.10, we calculate the value of the expression

\[ V = \sum_x \frac{(\sigma_x - n\cdot m(x))^2}{n\cdot m(x)} \nonumber \]

where the sum runs over all possible outcomes x, n is the number of data points, and ox denotes the number of outcomes of type x observed in the data. Then

Table \(\PageIndex{1}\): Observed data.

Outcome	Observed Frequency
i	15
2	8
3	7
4	5
5	7
6	18

for moderate or large values of n, the quantity V is approximately chi-squared distributed, with ν−1 degrees of freedom, where ν represents the number of possible outcomes. The proof of this is beyond the scope of this book, but we will illustrate the reasonableness of this statement in the next example. If the value of V is very large, when compared with the appropriate chi-squared density function, then we would tend to reject the hypothesis that the model is an appropriate one for the experiment at hand. We now give an example of this procedure.

Example \(\PageIndex{6}\): DieTest

Suppose we are given a single die. We wish to test the hypothesis that the die is fair. Thus, our theoretical distribution is the uniform distribution on the integers between 1 and 6. So, if we roll the die n times, the expected number of data points of each type is n/6. Thus, if \(o_i\) denotes the actual number of data points of type \(i\), for \(1 ≤ i ≤ 6\), then the expression

\[ V = \sum_{i=1}^6 \frac{(\sigma_i - n/6)^2}{n/6} \nonumber \]

is approximately chi-squared distributed with 5 degrees of freedom.

Now suppose that we actually roll the die 60 times and obtain the data in Table 7.1. If we calculate V for this data, we obtain the value 13.6. The graph of the chi-squared density with 5 degrees of freedom is shown in Figure 7.4. One sees that values as large as 13.6 are rarely taken on by V if the die is fair, so we would reject the hypothesis that the die is fair. (When using this test, a statistician will reject the hypothesis if the data gives a value of V which is larger than 95% of the values one would expect to obtain if the hypothesis is true.)

In Figure 7.5, we show the results of rolling a die 60 times, then calculating V , and then repeating this experiment 1000 times. The program that performs these calculations is called DieTest. We have superimposed the chi-squared density with 5 degrees of freedom; one can see that the data values fit the curve fairly well, which supports the statement that the chi-squared density is the correct one to use.

So far we have looked at several important special cases for which the convolution integral can be evaluated explicitly. In general, the convolution of two continuous densities cannot be evaluated explicitly, and we must resort to numerical methods. Fortunately, these prove to be remarkably effective, at least for bounded densities.

Figure \(\PageIndex{3}\): Chi-squared density with 5 degrees of freedom.

Figure \(\PageIndex{4}\): : Rolling a fair die.

Figure \(\PageIndex{5}\): Convolution of n uniform densities.

Independent Trials

We now consider briefly the distribution of the sum of n independent random variables, all having the same density function. If \(X_1, X_2, . . . , X_n\) are these random variables and \(S_n = X_1 + X_2 + · · · + X_n\) is their sum, then we will have

\[f_{S_n}(x) = (f_X, \times f_{x_2} \times \cdots \times f_{X_n}(x), \nonumber \]

where the right-hand side is an n-fold convolution. It is possible to calculate this density for general values of n in certain simple cases.

Example \(\PageIndex{7}\)

Suppose the \(X_i\) are uniformly distributed on the interval [0,1]. Then

\[f_{X_i}(x) = \Bigg{\{} \begin{array}{cc} 1, & \text{if } 0\leq x \leq 1\\ 0, & \text{otherwise} \end{array} \nonumber \]

and \(f_{S_n}(x)\) is given by the formula \(^4\)

\[f_{S_n}(x) = \Bigg\{ \begin{array}{cc} \frac{1}{(n-1)!}\sum_{0\leq j \leq x}(-1)^j(\binom{n}{j}(x-j)^{n-1}, & \text{if } 0\leq x \leq n\\ 0, & \text{otherwise} \end{array} \nonumber \]

The density \(f_{S_n}(x)\) for \(n = 2, 4, 6, 8, 10\) is shown in Figure 7.6.

If the Xi are distributed normally, with mean 0 and variance 1, then (cf. Example 7.5)

\[f_{X_i}(x) = \frac{1}{\sqrt{2pi}} e^{-x^2/2}, \nonumber \]

Figure \(\PageIndex{6}\): Convolution of n standard normal densities.

and

\[f_{S_n}(x) = \frac{1}{\sqrt{2\pi n}}e^{-x^2/2n} \nonumber \]

Here the density \(f_Sn\) for \(n=5,10,15,20,25\) is shown in Figure 7.7.

If the \(X_i\) are all exponentially distributed, with mean \(1/\lambda\), then

\[f_{X_i}(x) = \lambda e^{-\lambda x}. \nonumber \]

and

\[f_{S_n} = \frac{\lambda e^{-\lambda x}(\lambda x)^{n-1}}{(n-1)!} \nonumber \]

In this case the density \(f_{S_n}\) for \(n = 2, 4, 6, 8, 10\) is shown in Figure 7.8.

Exercises

Exercise \(\PageIndex{1}\): Let \(X\) and \(Y\) be independent real-valued random variables with density functions \(f_{X}(x)\) and \(f_{Y}(y)\), respectively. Show that the density function of the sum \(X+Y\) is the convolution of the functions \(f_{X}(x)\) and \(f_{Y}(y)\). Hint: Let \(\bar{X}\) be the joint random variable \((X, Y)\). Then the joint density function of \(\bar{X}\) is \(f_{X}(x) f_{Y}(y)\), since \(X\) and \(Y\) are independent. Now compute the probability that \(X+Y \leq z\), by integrating the joint density function over the appropriate region in the plane. This gives the cumulative distribution function of \(Z\). Now differentiate this function with respect to \(z\) to obtain the density function of \(z\).

Exercise \(\PageIndex{2}\): Let \(X\) and \(Y\) be independent random variables defined on the space \(\Omega\), with density functions \(f_{X}\) and \(f_{Y}\), respectively. Suppose that \(Z=X+Y\). Find the density \(f_{Z}\) of \(Z\) if

Figure \(\PageIndex{7}\): Convolution of \(n\) exponential densities with \(\lambda=1\).

(a)

\[
f_{X}(x)=f_{Y}(x)= \begin{cases}1 / 2, & \text { if }-1 \leq x \leq+1 \\ 0, & \text { otherwise }\end{cases}
\]

(b)

\[
f_{X}(x)=f_{Y}(x)= \begin{cases}1 / 2, & \text { if } 3 \leq x \leq 5 \\ 0, & \text { otherwise }\end{cases}
\]

(c)

\[
\begin{gathered}
f_{X}(x)= \begin{cases}1 / 2, & \text { if }-1 \leq x \leq 1, \\
0, & \text { otherwise }\end{cases} \\
f_{Y}(x)= \begin{cases}1 / 2, & \text { if } 3 \leq x \leq 5 \\
0, & \text { otherwise }\end{cases}
\end{gathered}
\]

(d) What can you say about the set \(E=\left\{z: f_{Z}(z)>0\right\}\) in each case?

Exercise \(\PageIndex{3}\): Suppose again that \(Z=X+Y\). Find \(f_{Z}\) if

(a)

\[
f_{X}(x)=f_{Y}(x)= \begin{cases}x / 2, & \text { if } 0<x<2 \\ 0, & \text { otherwise. }\end{cases}
\]

(b)

\[
f_{X}(x)=f_{Y}(x)= \begin{cases}(1 / 2)(x-3), & \text { if } 3<x<5 \\ 0, & \text { otherwise }\end{cases}
\]

(c)

\[
f_{X}(x)= \begin{cases}1 / 2, & \text { if } 0<x<2 \\ 0, & \text { otherwise }\end{cases}
\]

\[
f_{Y}(x)= \begin{cases}x / 2, & \text { if } 0<x<2, \\ 0, & \text { otherwise. }\end{cases}
\]

(d) What can you say about the set \(E=\left\{z: f_{Z}(z)>0\right\}\) in each case?

Exercise \(\PageIndex{4}\): Let \(X, Y\), and \(Z\) be independent random variables with

\[
f_{X}(x)=f_{Y}(x)=f_{Z}(x)= \begin{cases}1, & \text { if } 0<x<1 \\ 0, & \text { otherwise }\end{cases}
\]

Suppose that \(W=X+Y+Z\). Find \(f_{W}\) directly, and compare your answer with that given by the formula in Example 7.9. Hint: See Example 7.3.

Exercise \(\PageIndex{5}\): Suppose that \(X\) and \(Y\) are independent and \(Z=X+Y\). Find \(f_{Z}\) if

(a)

\[
\begin{aligned}
& f_{X}(x)= \begin{cases}\lambda e^{-\lambda x}, & \text { if } x>0, \\
0, & \text { otherwise. }\end{cases} \\
& f_{Y}(x)= \begin{cases}\mu e^{-\mu x}, & \text { if } x>0, \\
0, & \text { otherwise. }\end{cases}
\end{aligned}
\]

(b)

\[
\begin{aligned}
& f_{X}(x)= \begin{cases}\lambda e^{-\lambda x}, & \text { if } x>0, \\
0, & \text { otherwise. }\end{cases} \\
& f_{Y}(x)= \begin{cases}1, & \text { if } 0<x<1 \\
0, & \text { otherwise }\end{cases}
\end{aligned}
\]

Exercise \(\PageIndex{6}\): Suppose again that \(Z=X+Y\). Find \(f_{Z}\) if

\[
\begin{aligned}
& f_{X}(x)=\frac{1}{\sqrt{2 \pi} \sigma_{1}} e^{-\left(x-\mu_{1}\right)^{2} / 2 \sigma_{1}^{2}} \\
& f_{Y}(x)=\frac{1}{\sqrt{2 \pi} \sigma_{2}} e^{-\left(x-\mu_{2}\right)^{2} / 2 \sigma_{2}^{2}} .
\end{aligned}
\]

Exercise \(\PageIndex{7}\): Suppose that \(R^{2}=X^{2}+Y^{2}\). Find \(f_{R^{2}}\) and \(f_{R}\) if

\[
\begin{aligned}
f_{X}(x) & =\frac{1}{\sqrt{2 \pi} \sigma_{1}} e^{-\left(x-\mu_{1}\right)^{2} / 2 \sigma_{1}^{2}} \\
f_{Y}(x) & =\frac{1}{\sqrt{2 \pi} \sigma_{2}} e^{-\left(x-\mu_{2}\right)^{2} / 2 \sigma_{2}^{2}} .
\end{aligned}
\]

Exercise \(\PageIndex{8}\): Suppose that \(R^{2}=X^{2}+Y^{2}\). Find \(f_{R^{2}}\) and \(f_{R}\) if

\[
f_{X}(x)=f_{Y}(x)= \begin{cases}1 / 2, & \text { if }-1 \leq x \leq 1 \\ 0, & \text { otherwise }\end{cases}
\]

Exercise \(\PageIndex{9}\): Assume that the service time for a customer at a bank is exponentially distributed with mean service time 2 minutes. Let \(X\) be the total service time for 10 customers. Estimate the probability that \(X>22\) minutes.

Exercise \(\PageIndex{10}\): Let \(X_{1}, X_{2}, \ldots, X_{n}\) be \(n\) independent random variables each of which has an exponential density with mean \(\mu\). Let \(M\) be the minimum value of the \(X_{j}\). Show that the density for \(M\) is exponential with mean \(\mu / n\). Hint: Use cumulative distribution functions.

Exercise \(\PageIndex{11}\): A company buys 100 lightbulbs, each of which has an exponential lifetime of 1000 hours. What is the expected time for the first of these bulbs to burn out? (See Exercise 10.)

Exercise \(\PageIndex{12}\): An insurance company assumes that the time between claims from each of its homeowners' policies is exponentially distributed with mean \(\mu\). It would like to estimate \(\mu\) by averaging the times for a number of policies, but this is not very practical since the time between claims is about 30 years. At Galambos'5 suggestion the company puts its customers in groups of 50 and observes the time of the first claim within each group. Show that this provides a practical way to estimate the value of \(\mu\).

Exercise \(\PageIndex{13}\): Particles are subject to collisions that cause them to split into two parts with each part a fraction of the parent. Suppose that this fraction is uniformly distributed between 0 and 1. Following a single particle through several splittings we obtain a fraction of the original particle \(Z_{n}=X_{1} \cdot X_{2} \cdot \ldots \cdot X_{n}\) where each \(X_{j}\) is uniformly distributed between 0 and 1 . Show that the density for the random variable \(Z_{n}\) is

\[
f_{n}(z)=\frac{1}{(n-1) !}(-\log z)^{n-1}
\]

Hint: Show that \(Y_{k}=-\log X_{k}\) is exponentially distributed. Use this to find the density function for \(S_{n}=Y_{1}+Y_{2}+\cdots+Y_{n}\), and from this the cumulative distribution and density of \(Z_{n}=e^{-S_{n}}\).

Exercise \(\PageIndex{14}\): Assume that \(X_{1}\) and \(X_{2}\) are independent random variables, each having an exponential density with parameter \(\lambda\). Show that \(Z=X_{1}-X_{2}\) has density

\[
f_{Z}(z)=(1 / 2) \lambda e^{-\lambda|z|}
\]

Exercise \(\PageIndex{15}\): Suppose we want to test a coin for fairness. We flip the coin \(n\) times and record the number of times \(X_{0}\) that the coin turns up tails and the number of times \(X_{1}=n-X_{0}\) that the coin turns up heads. Now we set

\[
Z=\sum_{i=0}^{1} \frac{\left(X_{i}-n / 2\right)^{2}}{n / 2} .
\]

Then for a fair coin \(Z\) has approximately a chi-squared distribution with \(2-1=1\) degree of freedom. Verify this by computer simulation first for a fair coin \((p=1 / 2)\) and then for a biased coin \((p=1 / 3)\).

\({ }^{5}\) J. Galambos, Introductory Probability Theory (New York: Marcel Dekker, 1984), p. 159.

Exercise \(\PageIndex{16}\): Verify your answers in Exercise 2(a) by computer simulation: Choose \(X\) and \(Y\) from \([-1,1]\) with uniform density and calculate \(Z=X+Y\). Repeat this experiment 500 times, recording the outcomes in a bar graph on \([-2,2]\) with 40 bars. Does the density \(f_{Z}\) calculated in Exercise 2(a) describe the shape of your bar graph? Try this for Exercises 2(b) and Exercise 2(c), too.

Exercise \(\PageIndex{17}\): Verify your answers to Exercise 3 by computer simulation.

Exercise \(\PageIndex{18}\): Verify your answer to Exercise 4 by computer simulation.

Exercise \(\PageIndex{19}\): The support of a function \(f(x)\) is defined to be the set

\[
\{x: f(x)>0\} .
\]

Suppose that \(X\) and \(Y\) are two continuous random variables with density functions \(f_{X}(x)\) and \(f_{Y}(y)\), respectively, and suppose that the supports of these density functions are the intervals \([a, b]\) and \([c, d]\), respectively. Find the support of the density function of the random variable \(X+Y\).

Exercise \(\PageIndex{20}\): Let \(X_{1}, X_{2}, \ldots, X_{n}\) be a sequence of independent random variables, all having a common density function \(f_{X}\) with support \([a, b]\) (see Exercise 19). Let \(S_{n}=X_{1}+X_{2}+\cdots+X_{n}\), with density function \(f_{S_{n}}\). Show that the support of \(f_{S_{n}}\) is the interval \([n a, n b]\). Hint: Write \(f_{S_{n}}=f_{S_{n-1}} * f_{X}\). Now use Exercise 19 to establish the desired result by induction.

Exercise \(\PageIndex{21}\): Let \(X_{1}, X_{2}, \ldots, X_{n}\) be a sequence of independent random variables, all having a common density function \(f_{X}\). Let \(A=S_{n} / n\) be their average. Find \(f_{A}\) if

(a) \(f_{X}(x)=(1 / \sqrt{2 \pi}) e^{-x^{2} / 2}\) (normal density).

(b) \(f_{X}(x)=e^{-x}\) (exponential density).

Hint: Write \(f_{A}(x)\) in terms of \(f_{S_{n}}(x)\).