A coin is tossed $10$ times. Find the probability that there exist $7$ consecutive coin tosses with at least $5$ out of the $7$ being heads.
So for example, $TTHHTHHTHH$ is one of the outcomes we want.
I guess the best way to treat this is as a counting problem; finding the number of outcomes we want and then dividing by $2^{10}.$
$2^{10} = 1024,$ so listing all the outcomes is time-wise expensive.
The events, "The first consecutive $7$ tosses contain at least $5$ heads", "The second consecutive $7$ tosses contain at least $5$ heads", etc. are not mutually exclusive and so the answer is not simply:
$P$(The first consecutive $7$ tosses contain at least $5$ heads) + $P$(The second consecutive $7$ tosses contain at least $5$ heads) + ... .
Similarly, the events, "The first consecutive $7$ tosses does not contain at least $5$ heads", "The second consecutive $7$ tosses does not contain at least $5$ heads", etc. are also not mutually exclusive and so the answer is not simply:
$1 - $ $[P$(The first consecutive $7$ tosses does not contain at least $5$ heads) + $P$(The second consecutive $7$ tosses does not contain at least $5$ heads) + ...$]$ .
Edit: What about reflecting the $10$ boxes down the middle? This could cut our work in half maybe? For example, $HTTHHTHTHH \equiv HHTHTHHTTH$. Perhaps figuring out the symmetries is expensive too.
I'm also interested in doing a similar problem with larger numbers, e.g.:
A coin is tossed $10^{14}$ times. Find the probability that there exist $1000$ consecutive coin tosses with at least $650$ out of the $1000$ being heads.
This is probably impractical to calculate using binomial distributions, so how would you find an answer using the Normal distribution as an approximation, or is it not possible to do this?
The reason I'm interested in this latter question is that it is the sort of calculation one might make if one wanted to gain statistical evidence that a poker site is rigged against them, although of course the latter question would not be enough evidence to prove a poker site is rigged against a particular player; it could be a reasonable starting point for further calculations. Also, it is not hard to imagine this calculation could have applications in other areas, statistical mechanics or mathematical biology for example.
Treat the following as a long-comment: Inclusion-exclusion should work with the suggested events, namely, the event that is of interest is $P(A_1 \cup A_2 \cup A_3 \cup A_4)$ where $A_i$ indicates the tosses $(x_i, ..., x_{i+6})$ contain 5 Hs.
We note that $P(A_i)$ is $(\binom{7}{5} + \binom{7}{6} + \binom{7}{7}) \frac{1}{2^7}$. Now, one has to compute $P(A_i \cap A_{i+1}), P(A_i \cap A_{i+2})$, and $P(A_i \cap A_{i+3})$. For this, the best decision seems to be to condition on the number of heads in the intersections of their intervals, so for $P(A_i \cap A_{i+1})$, the intersecting interval of size 6 has to contain 4,5, or 6 heads, giving the form $P(A_i \cap A_{i+1}) = \binom{6}{4} \frac{1}{2^8} + \binom{6}{5}\frac{1}{2^6} + \binom{6}{6}\frac{1}{2^6}$. The other two are harder, but the same idea should work.
For the three-way and the last one, looking at the number of heads in spots (4,5,6,7), and then just placing an appropriate number of heads around might be the way to go. At that point, I would count.
EDIT: Glad this was done by another answerer; it looks fairly technical to actually do. Anyway, let me answer the second half of your question; your second question can be answered with bonferroni inequalities (think of cutting inclusion-exclusion); for the one you have there, the union bound already gives you that that is a one in a million event. (the probability of a 1000 tosses having 650 heads is $10^{-22} $ so multiply by $10^{14}$ for the union bound. If you wanted to find the probability of that 650 having heads using Gaussians, you can approximate $P(X > 650)$ by $P(2(X - 500)/\sqrt{1000} > 300/ \sqrt{1000})$ and take that first one to be a Gaussian with variance 1. (This gives a probability of $10^{-41}$ though, so I could be doing something wrong.)