I am currently studying about Sampling distribution of Sample means, and came across below example here.
Question:
The average male drinks 2L of water when active outdoors with a standard deviation of .7L. You're planning a full day nature trip for 50 men and will bring 110L of water. What's the probability you'll run out?
Given Answer:(transcript taken from here which is same as khan's)
The probability of running out of water is the probability of using more than 110L of water. This is the same as the probability of the average water use is greater than 2.2L (110L divided by 50 men) per man
P(average water use > 2.2L per man)
$\mu_\bar{x}$ = $\mu$ = 2L
$\sigma_\bar{x}^2 = \dfrac {\sigma^2}{n}$
$\sigma_\bar{x} = \dfrac {\sigma}{\sqrt{n}} = \dfrac {0.7}{\sqrt{50}} = 0.099$
We just need to figure out how many standard deviations 2.2L is away from the mean (known as the z-score)
$\dfrac {2.2-\mu}{\sigma} = \dfrac {2.2-2}{0.099} = 2.02$
The probability that average water us > 2.2L per man is the same as probability that the sample mean will be more than 2.02 standard deviations above the mean. Now you can use a z-table to figure out that probability.
0.9783 is the probability that we're less than 2.02 standard deviations above the mean
P(running out of water) = 1 - .9783 = .0217
My questions:
Broadly, what is the inference getting a 2.17%, how is different from 1% or 3% or even 5% practically? What realistic action or usefulness there could be out of this inference?
Average male indicates huge or even entire population of male. And sampling distribution with just 50 men isn't too small to consider as a normal distribution? (given that, we do not know about population distribution in question. Unless one assumes that also as normal)
Even if 50 is normal, it is just 1 sample (of size 50 men). Shouldn't we get a normal distribution only when we repeat this N-trials or N-number of times, to have the normal distribution effect to take place?
How is it not Sampling distribution of sample proportion?
Isn't it counter intuitive that our sampling distribution has lower SD ( so higher certainty), makes one wonder if sampling distribution is better than population distribution? How latter is more beneficial?
Kindly clarify.
The main problem with this question, I think, is that we don't know that your $50$ men are a random sample from the population of all males, or how the activity they will be doing will compare to whatever those average males were doing. If the weather is like what it is in much of North America and Europe these days, they'll need a lot more water!