Poisson and Binomial approximations can be used to estimate the distribution of sum of a sequence of independent $0-1$ indicators. If the number of such indicators are fixed (say $n$), how big should be the value of n?
In general, can these approximations be used for small or medium sized $n$?
Thank you in advance for your help.
Neither of the binomial nor the Poisson distributions would be good approximations if they mismatch the mean and variance, which should be $\sum p_i$ and $\sum p_i(1-p_i)$ respectively. There can be a fix for the binomial calculation by changing the $n$ parameter, but it could be seen as unnatural step
Suppose for example you have the $99$ $p_i$s $\{0.01,0.02,0.03,\ldots,0.99\}$. Then the resulting distribution of the sum of your indicators has mean $49.5$ and variance $16.665$.
A Poisson distribution with parameter $\lambda=49.5$ would have the same mean but variance $49.5$ which would be substantially overdispersed
A binomial distribution with parameters $n=99$ and $p=0.5$ would have the same mean but variance $24.75$ which would still be overdispersed
A binomial distribution with parameters $n=75$ and $p=0.66$ comes much closer: it too has the same mean and has variance $16.83$, which is quite close to what we are aiming for. But it is difficult to motivate using this value of $n$ except on empirical grounds
A Gaussian distribution can have whatever mean and variance we wish, so $\mu=49.5$ and $\sigma^2=16.665$ seem like natural choices. If we want to keep the integer values of the sum of the indicators, then we could in effect round the values and use a continuity correction (though this will make a small difference to the variance)
Let's compare the probabilities for values near $49.5$ from these different approximations. It should be fairly clear that the Gaussian approximation is really very good:
You also asked whether this works for small $n$. Now let's suppose there are $4$ $p_i$s of $\{0.2,0.4,0.6,0.8\}$, so the distribution of the sum of the indicators has mean $2$ and variance $0.8$. We might consider approximation distributions with the same mean: a Poisson distribution with $\lambda=2$ (variance $2$); a binomial distribution with $n=4$ and $p=0.5$ (variance $1$); a binomial distribution with $n=3$ and $p=\frac23$ (variance $\frac23$); and a Gaussian with $\mu=2$ and $\sigma^2=0.8$ and then continuity corrected. We would then get suggested approximate probabilities which look like
and I think the Gaussian approximation is still pretty good here. It might be possible to come up with cases which might not fit so well, but if you must have an approximation, I believe a Gaussian approximation is the way to go