I want to get every color of gumball in a gumball machine (where there are 16 types of gumballs, each with a 1/16 chance of obtaining a particular color [assume there are an infinite amount of gumballs]).
I'm interested in knowing what the PMF (as a function of # of attempts) of getting every single gumball. ie. what is the probability of getting all 16 colors of gumballs in n trials?
I'd rather not have to brute-force the answer and am hoping for a somewhat elegant solution (maybe we can use the negative binomial distribution to find this?).
Here's a much simpler approach that I just thought of:
Note that $P(A)=1-P(\neg A)$ In our case, lets define $\neg A$ to be event where at least one color $C_i$is missing $\neg A :=\{ \neg C_1 \cup \neg C_2....\cup \neg C_N\}$
What is $P(\neg A)$?
Lets say you draw $N$ gumballs $G_1...G_N$. The the probability that you dont get one of a particular color in N tosses can be modeled as Binomial:
$$P(G_1 \neq C_i,...,G_N \neq C_i)=\left(\frac{15}{16}\right)^{N}$$
However, there are $16$ colors, so multiply this by $16$ and then correct for the fact that more than one color can be missing to get:
$$P(\neg A)=16\left(\frac{15}{16}\right)^{N}-P(I)$$ where $I$ is calculated as a sum of intersection probabilities as per the inclusion-exclusion principle. For 16 colors, this will be a long formula, but mathematically it is rather simple: $P(\neg C_1 \cap \neg C_2)=\left(\frac{14}{16}\right)^{N}$ since there are now two colors that must be avoided.