Why does probability of 50/50 outcome decrease as we toss a coin more and more?

11.1k Views Asked by At

I have following formula ("n" is number of experiments, "k" is number of successes that we expect, "p" is probability of success, "q" is probability of failure):

https://imgur.com/HXxITYZ

I used it to calculate chances of getting strictly 2 heads out of 4 tosses (order doesn't matter), 5 heads of 10 tosses, 10 heads out of 20 tosses.

2 heads out of 4 tosses have around 38% probability.

5 heads out of 10 tosses have around 25% probability.

10 heads out of 20 tosses have around 18% probability.

Looks like trend for me. And this trend is paradoxical, it means that by increasing number of tries we will decrease chance for 50/50 outcome. It seems to be at odds with what I was taught before. I was taught that as we increase number of tosses we will come closer and closer to 50/50 outcome. So if I toss a coin 10 times I can get 1 head and 9 tails, while tossing a coin 1000 times is more likely to give me result much closer to 50/50 outcome.

3

There are 3 best solutions below

3
On BEST ANSWER

This is because the more you flip a coin, the less likely it will be that you get heads $exactly$ half of the time. It is true, however, that the more you flip a coin, and then compute

$$\hat{p}=\frac{\text{number of heads}}{\text{number of tosses}}$$

that $\hat{p}$ will $converge$ to $0.5$. Consider flipping a coin $10$ times and obtaining $6$ heads. Then consider flipping a coin $1000$ times and obtaining $526$ heads.

While in the second instance, we are $26$ heads away from a perfect $50/50$, proportionally, we are closer as $0.526$ is closer to $0.5$ than $0.6$.

0
On

As it's been said, the fact that there are more and more possible values is what makes $\hat p=0.5$ less and less probable as exact value ($\hat p =\tfrac{\#heads}{\#toses}$).

Consider, for instance, the probability of $\tfrac 1 2 - \tfrac 1 {10}<\hat p_n<\tfrac 1 2 + \tfrac 1 {10}$, that is $$P\left(\tfrac 1 2 - \tfrac 1 {10}<\hat p_n<\tfrac 1 2 + \tfrac 1 {10}\right)=P\left(- \tfrac {2\sqrt n} {10}<\frac{\hat p_n-\tfrac 1 2}{\frac1{2\sqrt n}} < \tfrac {2\sqrt n} {10}\right)\approx 2\cdot\Phi\left(\tfrac {\sqrt n} {5}\right)-1$$ the approximation based on the CLT, valid for $n$ large enough.

Let's supose $n=50$. In that case we have $$P\left(\tfrac 1 2 - \tfrac 1 {10}<\hat p_{50}<\tfrac 1 2 + \tfrac 1 {10}\right)\approx 2\cdot \Phi\left(\frac{\sqrt {50}}5\right)-1\approx 0.8427.$$

Now let's do the same fot $n=100$. We get $$P\left(\tfrac 1 2 - \tfrac 1 {10}<\hat p_{100}<\tfrac 1 2 + \tfrac 1 {10}\right)\approx 2\cdot \Phi\left(\frac{\sqrt {100}}5\right)-1\approx 0.9545.$$ So the probability of $\hat p_n$ taking a value at a distance less than $0.1$ from the expected value has increased by around a 13%. But the values of $X=\#heads$ corresponding to that event went from $9$ (that is $21$ to $29$) to $19$ ($41$ to $59$), around the double. And if we kept doubling $n$ (for example), we would approximately double the number of integer values of $X$ that lead to $0.4<\hat p_n<0.6$ each time, but the total probability distributed between those values won't grow that much, fundamentally because it's very close to $1$, which is a bound, no matter $n$.

So, those probabilities which stay almost constant at some point, are distributed among more and more possible values of the measured discrete variable, and so is not surprising that the probability at each individual point decreases.

0
On

If we flip a fair ($p = \frac{1}{2}$) coin $2n$ times, the Binomial Formula gives that the probability of getting exactly $n$ heads is $$P = \frac{1}{2^{2n}} {{2 n} \choose n} = \frac{(2 n)!}{2^{2n} n!^2} .$$ Stirling's approximation for the factorial gives that $$m! = \sqrt{2 \pi m} \left(\frac{m}{e}\right)^m \left[1 + O\left(\frac{1}{m}\right)\right] .$$ Substituting into our formula for $P$ and simplifying gives an explicit first-order approximation for $P$ that not only shows that $P \to 0$ as $n \to \infty$ but also tells us the (asymptotic) rate at which $P$ decreases: $$P = \frac{1}{\sqrt{\pi n}} + O\left(\frac{1}{n^{3/2}}\right) .$$ (Already for $n \geq 3$ the leading term, $\frac{1}{\sqrt{\pi n}}$, estimates $P$ with relative error $< 5\%$.)