# 4.4: Exercises

- Page ID
- 7807

A basketball player shoots four free throws, and you write down the sequence of hits and misses. Write down the sample space for thinking of this whole thing as a random experiment.

In another game, a basketball player shoots four free throws, and you write down the number of baskets she makes. Write down the sample space for this different random experiment.

You take a normal, six-sided die, paint over all the sides, and then write the letter **A** on all six sides. You then roll the die. What is the sample space of this experiment? Also, list all the possible events for this experiment. *[Hint: it may help to look at Example [**4.1.9.**]*

Now you paint it over again, and write **A** on half the sides and **B** on the other half. Again, say what is the sample space and list all possible events.

One more time you paint over the sides, then write **A** on one third of the faces, **B** on one third of the other faces, and **C** on the remaining third. Again, give the sample space and all events.

Make a conjecture about how many events there will be if the sample space has \(n\) outcomes in it.

Describe a random experiment whose sample space will be the set of all points on the (standard, 2-dimensional, \(xy\)-) plane.

The most common last [family] name in the world seems to be Wang [or the variant Wong]. Approximately 1.3% of the global population has this last name.

The most common first name in the world seems to be Mohammad [or one of several variants]. Some estimates suggest that perhaps as many as 2% of the global population has this first name.

Can you tell, from the above information, what percentage of the world population has the name “Mohammad Wang?” If so, why and what would it be? If not, why not, and can you make any guess about what that percentage would be, anyway?

*[Hint: think of all the above percentages as probabilities, where the experiment is picking a random person on Earth and asking their name. Carefully describe some events for this experiment, relevant to this problem, and say what their probabilities are. Tell how combining events will or will not compute the probability of the desired event, corresponding to the desired percentage.]*

*[Note: don’t bet on the numbers given in this problem being too accurate – they might be, but there is a wide range of published values for them in public information from different sources, so probably they are only a very crude approximation.]*

Suppose that when people have kids, the chance of having a boy or a girl is the same. Suppose also that the sexes of successive children in the same family are independent. [Neither of these is exactly true in real life, but let’s pretend for this problem.]

The Wang family has two children. If we think of the sexes of these children as the result of a random experiment, what is the sample space? Note that we’re interested in birth order as well, so that should be apparent from the sample space.

What are the probabilities of each of the outcomes in your sample space? Why?

Now suppose we know that at least one of the Wang children is a boy. Given this information, what is the probability that the Wangs have two boys?

Suppose instead that we know that the Wangs’ older child is a boy. What is the probability, given this different information, that both Wang children are boys?

To solve this, clearly define events in words and with symbols, compute probabilities, and combine these to get the desired probability. Explain everything you do, of course.

Imagine you live on a street with a stop light at both ends of the block. You watch cars driving down the street and notice which ones have to stop at the \(1^{st}\) and/or \(2^{nd}\) light (or none). After counting cars and stops for a year, you have seen what a very large number – call it \(N\) – of cars did. Now imagine you decide to think about the experiment “*pick a car on this street from the last year at random and notice at which light or lights it has to stop*.”

Let \(A\) be the event “*the car had to stop at the \(1^{st}\) light*” and \(B\) be the event “*the car had to stop at the \(2^{nd}\) light*.” What else would you have to count, over your year of data collection, to estimate the probabilities of \(A\) and of \(B\)? Pick some numbers for all of these variables and show what the probabilities would then be.

Make a Venn diagram of this situation. Label each of the four connected regions of this diagram (the countries, if this were a map) with a number from to , then provide a key which gives, for each of these numbered regions, **both** a formula in terms of \(A\), \(B\), unions, intersections, and/or complements, and then **also** a description entirely in words which do not mention \(A\) or \(B\) or set operations at all. Then put a decimal number in each of the regions indicating the probability of the corresponding event.

Wait – for one of the regions, you can’t fill in the probability yet, with the information you’ve collected so far. What else would you have had to count over the data-collection year to estimate this probability? Make up a number and show what the corresponding probability would then be, and add that number to your Venn diagram.

Finally, using the probabilities you have chosen, are the events \(A\) and \(B\) independent? Why or why not? Explain in words what this means, in this context.

EXERCISE 4.7. Here is a table of the prizes for the **EnergyCube** Lottery:

Prize | Odds of winning |
---|---|

$1,000,000 | 1 in 12,000,000 |

$50,000 | 1 in 1,000,000 |

$100 | 1 in 10,000 |

$7 | 1 in 300 |

$4 | 1 in 25 |

We want to transform the above into the [probability] distribution of a random variable \(X\).

First of all, let’s make \(X\) represent the **net gain** a Lottery player would have for the various outcomes of playing – note that the ticket to play costs $2. How would you modify the above numbers to take into account the ticket costs?

Next, notice that the above table gives winning **odds**, not probabilities. How will you compute the probabilities from those odds? Recall that saying something has odds of “1 in \(n\)” means that it tends to happen about once out of \(n\) runs of the experiment. You might use the word *frequentist* somewhere in your answer here.

Finally, something is missing from the above table of outcomes. What prize – actually the most common one! – is missing from the table, and how will you figure out its probability?

After giving all of the above explanations, now write down the full, formal, probability distribution for this “net gain in **EnergyCube** Lottery plays” random variable, \(X\).

In this problem, some of the numbers are quite small and will disappear entirely if you round them. So use a calculator or computer to compute everything here and keep as much accuracy as your device shows for each step of the calculation.

EXERCISE 4.8. Continuing with the same scenario as in the previous Exercise 4.7, with the **EnergyCube **Lottery: What would be your expectation of the average gain per play of this Lottery? Explain fully, of course.

So if you were to play every weekday for a school year (so: five days a week for the 15 weeks of each semester, two semesters in the year), how much would you expect to win or lose in total?

Again, use as much accuracy as your computational device has, at every step of these calculations.

EXERCISE 4.9. Last problem in the situation of the above Exercise 4.7 about the **En- ergyCube **Lottery: Suppose your friend plays the lottery and calls you to tell you that she won ... but her cell phone runs out of charge in the middle of the call, and you don’t know how much she won. Given the information that she won, what is the probability that she won more than $1,000?

Continue to use as much numerical accuracy as you can.

EXERCISE 4.10. Let’s make a modified version of Example 4.3.18. You are again throwing darts at a dartboard, but you notice that you are very left-handed so your throws pull to the right much more than they pull to the left. What this means is that it is not a very good model of your dart throws just to notice how far they are from the center of the dartboard, it would be better to notice the x-coordinate of where the dart hits, measuring (in cm) with the center of the board at x location 0. This will be your new choice of RV, which you will still call X.

You throw repeatedly at the board, measure \(X\), and find out that you *never* hit more than \(10cm\) to the right of the center, while you are more accurate to the left and never hit more than \(5cm\) in that direction. You do hit the middle (\(X=0\)) the most often, and you guess that the probability decreases linearly to those edges where you never hit.

Explain why your \(X\) is a *continuous* RV, and what its interval \([x_{min},x_{max}]\) of values is.

Now sketch the graph of the probability density function for \(X\). *[Hint: it will be a triangle, with one side along the interval of values \([x_{min},x_{max}]\) on the \(x\)-axis, and its maximum at the center of the dartboard.]* Make sure that you put tick marks and numbers on the axes, enough so that the coordinates of the corners of the triangular graph can be seen easily. *[Another hint: it is a useful fact that the total area under the graph of any probability density function is \(1\).]*

What is the probability that your next throw will be in the bull’s-eye, whose radius, remember, is \(1.5cm\) and which therefore stretches from \(x\) coordinate \(-1.5\) to \(x\)-coordinate \(1.5\)?

EXERCISE 4.11. Here’s our last discussion of dartboards [maybe?]: One of the prob- lems with the probability density function approaches from Example 4.3.18 and Exer- cise 4.10 is the assumption that the functions were *linear *(at least in pieces). It would be much more sensible to assume they were more *bell-shaped*, maybe like the Normal distribution.

Suppose your friend Mohammad Wang is an excellent dart-player. He throws at a board and you measure the x-coordinate of where the dart goes, as in Exercise 4.10 with the center corresponding to x = 0. You notice that his darts are rarely – only 5% of the time in total! – more than 5cm from the center of the board.

Fill in the blanks: “MW’s dart hits’ x-coordinates are an RV X which is Normally distributed with mean μ_{X} = and standard deviation σ_{X}=_______.” Explain, of course.

How often does MW completely miss the dartboard? Its radius is \(10cm\).

How often does he hit the bull’s-eye? Remember its radius is \(1.5cm\), meaning that it stretches from \(x\) coordinate \(-1.5\) to \(x\)-coordinate \(1.5\).