On average, how many times must I roll a dice until I get a $6$?
I got this question from a book called Fifty Challenging Problems in Probability.
The answer is $6$, and I understand the solution the book has given me. However, I want to know why the following logic does not work: The chance that we do not get a $6$ is $5/6$. In order to find the number of dice rolls needed, I want the probability of there being a $6$ in $n$ rolls being $1/2$ in order to find the average. So I solve the equation $(5/6)^n=1/2$, which gives me $n=3.8$-ish. That number makes sense to me intuitively, where the number $6$ does not make sense intuitively. I feel like on average, I would need to roll about $3$-$4$ times to get a $6$. Sometimes, I will have to roll less than $3$-$4$ times, and sometimes I will have to roll more than $3$-$4$ times.
Please note that I am not asking how to solve this question, but what is wrong with my logic above.
Thank you!
You can calculate the average this way also.
The probability of rolling your first $6$ on the $n$-th roll is $$\left[1-\left(\frac{5}{6}\right)^n\right]-\left[1-\left(\frac{5}{6}\right)^{n-1}\right]=\left(\frac{5}{6}\right)^{n-1}-\left(\frac{5}{6}\right)^{n}$$
So the weighted average on the number of rolls would be $$\sum_{n=1}^\infty \left(n\left[\left(\frac{5}{6}\right)^{n-1}-\left(\frac{5}{6}\right)^{n}\right]\right)=6$$
Again, as noted already, the difference between mean and median comes in to play. The distribution has a long tail way out right pulling the mean to $6$.
For those asking about this graph, it is the expression above, without the Summation. It is not cumulative. (The cumulative graph would level off at $y=6$). This graph is just $y=x\left[\left(\frac{5}{6}\right)^{x-1}-(\left(\frac{5}{6}\right)^{x}\right]$
It's not a great graph, honestly, as it is kind of abstract in what it represents. But let's take $x=4$ as an example. There is about a $0.0965$ chance of getting the first roll of a $6$ on the $4$th roll. And since we're after a weighted average, that is multiplied by $4$ to get the value at $x=4$. It doesn't mean much except to illustrate why the mean number of throws to get the first $6$ is higher than around $3$ or $4.$
You can imagine an experiment with $100$ trials. About $17$ times it will only take $1$ throw($17$ throws). About $14$ times it will take $2$ throws ($28$ throws). About $11$ times it will take $3$ throws($33$ throws). About $9$ times it will take $4$ throws($36$ throws) etc. Then you would add up ALL of those throws and divide by $100$ and get $\approx 6.$