I have an $x$-sided dice and I roll it $y$ times.
What is the probability that I will see at least $z$ of the $x$ sides?
e.g. If I rolled a $6$-sided dice $8$ times, what is the probability I see at least $4$ of the sides?
But I want to do with much bigger numbers
Specifically, I want to know what is probability I see at least $z$ of the $160,000$ sides if I rolled a $160,000$-sided dice $25,000$ times.
Thanks in advance!
The chance of a side missing out is $$\left({x-1\over x} \right)^y$$
The chance of two sides missing out is $$\left({x-2\over x}\right)^y$$ The usual approximation by an exponential causes a large error.
I will use the binomial distribution.
On average, you will see $$\mu\approx x(1-e^{-y/x})$$ different sides. The expected value of the square is $$x(1-\left({x-1\over x}\right)^y)+ \\ (x^2-x)(1-2\left({x-1\over x}\right)^y+\left({x-2\over x}\right)^y) $$ So the variance is $$\sigma^2= x(\left({x-1\over x}\right)^y-\left({x-2\over x}\right)^y \\ +x^2(\left({x-2\over x}\right)^y-\left({x-1\over x}\right)^{2y})$$ This is about 1500 for your numbers, compared with 19000 when I neglected the x^2 term. As I mention in comments, this is close to $$\sigma^2\approx xe^{-y/z}-(x+y)e^{-2y/x}$$ For large $x$ and $y$, it will be very close to a normal distribution with that mean and variance. Sixty-eight percent of the time $z$ will be between $\mu-\sigma$ and $\mu+\sigma$ (note $\sigma$ is the square-root of $\sigma^2$ defined above), and ninety-five percent of the time it will be between $\mu-2\sigma$ and $\mu+2\sigma$.