Say we have some population and decide to randomly take a sample of size $n$ from this population. What does it then mean to talk about the distribution of the sample mean?
In other words, what do we mean by the distribution of the sample mean here?
I’m quite confused about this as from what I understood from my textbook, it doesn’t make sense to talk about a distribution of a sample mean in this scenario (since we are looking at one sample). Obviously, I’m wrong, but I’m not seeing why.
If you take $n$ i.i.d. samples from a population with a particular distribution, you can then take the mean of the sample. This sample mean will also be a random variable, and will have its own distribution, called the distribution of the sample mean.
In particular, the mean of the distribution of the sample mean will be the mean of the original distribution for the population, while the variance of the distribution of the sample mean will be the variance of the original distribution for the population divided by $n$ (assuming these exist).
So the sample mean is an unbiased estimator of the population mean, while the likely error will reduce as the sample size $n$ increases.
As an example, suppose the population is exponentially distributed with rate $1$ (so its mean is $1$ and its variance is $1$), and you take a sample of size $n=4$. You might get data like one of the following rows
and you may be able to see that many different values are possible, but the sample mean often tends to be closer to $1$ than the individual sample. In this example, the individual samples have an exponential distribution by construction as shown by the blue density in the chart below, while the mean has a stretched gamma distribution (which would get closer in shape to a normal distribution if the sample size were increased) as shown by the red density below, more concentrated around $1$. It is this red density line which shows the distribution of the sample mean