Probability Distributions
- Continuous Probability Distributions
- Discrete Probability Distributions
- Probability Distributions
- Standard Deviation and Variance
Random Variable
The experiment is random, but the function
When we write:
We are saying "the distribution of
For example, if we ask for
Support (Sample Space)
The set of all possible outputs of a random variable is called the support or sample space (i.e the range of the function/random variable, which is also the domain of the probability function).
E.g discrete (
Independent Random Variable
Random variables
"comma pretty much means and"
(see universally quantifier)
- In the discrete case,
- 3 or more:
- 3 or more:
Suppose two dice are rolled and we define random variables:
Are
solution
Counterexample:
If
Since
Independent and Identically Distributed (IID) Random Variable
| Example | Independent? | Identically distributed? |
|---|---|---|
| Two coins are flipped, |
Yes | Yes |
| Yes | No | |
| Flip a fair coin twice, |
No | Yes |
| Flip a fair coin thrice, |
No | No |
Probability Mass Function (p.m.f.)
Probability Density Function (p.d.f)
See probability density function
Cumulative Distribution Function (c.d.f.)
The c.d.f of
In the case that
In the case that
(see summation notation, integral, improper integral, Riemann Sum)
Compare this to the definition of a p.m.f:
(increasing) (monotone) (see limit) is a step function in the discrete case- Right-continuous
In the continuous case:
- Also left continuous
Given the following p.m.f.:
| X | f |
|---|---|
| 0 | 0.2 |
| 1 | 0.5 |
| 2 | 0.3 |
Find the c.d.f
solution
| 0 | 0.2 | 0.2 |
| 1 | 0.5 | 0.7 |
| 2 | 0.3 | 1 |
A c.d.f. may exists when a p.m.f. does not. For example, in the continuous case.
Four die are rolled simultaneously. Suppose the random variable
solution
As an example, the calculation for
| 1 | ||
| 2 | ||
| 3 | ||
| 4 | ||
| 5 | ||
| 6 |
Example calculations:
Probability Distribution
A probability distribution is a table, formula, or graph that provides the probabilities of a random variable assuming any of its possible values
The probability distribution where
| 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
The theoretical probability distribution of the sum of two dice:
This is an example of a discrete probability distribution. If we were measuring the distribution of say, height, we would have a continuous probability distribution.
Expected Value
The excepted value of a random variable is the mean value after many repetitions if an experiment.
More precisely:
Let
Let
That is, the expected value of a discrete random variable,
Often you may see the following "notation" of the formula, which may help with understanding:
"Expected value" is a bit of a misnomer. It's more like "weighted average". The expectation isn't that you will actually get the expected value, and often it's impossible.
Example:
| 0 | 0.2 |
| 100 | 0.8 |
but
- Non-negativity:
- If
then
- If
- Non-degeneracy
- Linearity
- Monotonicity
- Deterministic
- Constants
- For all real constants
,
- For all real constants
- Non-multiplicity
- If
and are independent, then - If
and are dependent, then generally , although there are special cases where the equality holds
- If
Calculate the expected value,
solution
Suppose you want to select a committee using three people. The group which the committee members can be selected consists of 4 Waterloo residents and 3 Kitchener residents.
- Create a probability distribution for the number of Kitchener residents on the committee
| 0 | 1 | 2 | 3 | |
|---|---|---|---|---|
- Calculate the probability that at least one Kitchener resident is on the committee
- Calculate the expected number of Kitchener residents on the committee
Suppose that a fair coin is tossed a total of 3 times. A player will lose or win money on each coin toss as follows:
- The player will win $1 if the first toss is a head, but will lose $1 if the first toss is a tail
- The player will win $2 if the second toss is a head, but will lose $2 if the second toss is a tail
- The player will win $3 if the third toss is a head, but will lose $3 if the third toss is a tail
We define the random variable
solution
| -6 | |
| -4 | |
| -2 | |
| 0 | |
| 2 | |
| 4 | |
| 6 |
or
Let
You are developing a platform to analyze the performance of your friend’s website selling Goose-themed merchandise.
Suppose that you are working on an algorithm that will highlight two different products to a customer, product
You are trying to decide which product to display to the customer first. If a customer does not buy the first product highlighted to them, then they will not buy the second product. If you can choose which order to highlight each product, justify a strategy to display the products for maximizing your friend’s profits.
solution
Let
| 0 | 0.5 | 0 | 0.7 |
| 10 | 20 | ||
| 30 | 30 |
So it's more profitable to display product
Law of the Unconscious Statistician (LOTUS)
Why not
St. Petersburg Paradox
A fair coin is tossed until a head appears. The casino will pay:
- $2 if a head appears on the first toss
- $4 if a head appears on the second toss
- $8 if a head appears on the third toss
How much are you willing to pay to play the game? What is the expected payoff?
solution
Let
| 2 | |
| 4 | |
| 8 | |
| 16 | |
The paradox: the expected payoff is
Memorylessness
If
(see conditional event)
Only the Geometric Distribution and the Exponential Distribution with non-negative real numbers are memoryless.
Suppose
As the name suggests, we constantly "forget" the current state of the system. That is, probabilities are not influenced by the history of the process.
With Memory
Suppose
Without Memory
Suppose there's a long hallway, lined on one wall with thousands of safes. The dial on each safe has 500 positions, and each has been randomly assigned an opening position. Imagine an eccentric person walks down the hallway, stopping once at each safe to make a single random attempt to open it.
We define random variable
In this case,
If this person instead focused on one safe, they would need at most 500 attempts to open the safe (and we would expect it to take 250 attempts), since they would remember.