Why is Bessel's correction applied in t-score calculation when true population mean and variance is assumed?

251 Views Asked by At

A sample mean has a standard normal distribution $$z = \sqrt{n} \frac{\bar{x} - \mu}{\sigma}$$ . This statement seems to assume that true population mean and variance is known. If that is the case, why is t-score then defined as $$t = \sqrt{n} \frac{\bar{x} - \mu}{s}$$ where $$s = \sqrt{\frac{1}{n-1} \sum_{i=1}^n (x_i - \bar{x})^2}$$ and $\mu$ is still known presumably? Sample and population statistics are mixed here which is making it hard for me to understand why Bessel's correction is applied to variance. If it's to compensate for an unknown population distribution, how come we still use true population mean ($\mu$) in our formula?

2

There are 2 best solutions below

5
On

No, neither $\mu$ nor $\sigma$ is assumed to be known. We assume the $x_i$ are Gaussian IIDs, so $\bar{x}$ is Gaussian too. But we have to esimtate $\mu$, without bias, as the sample mean $\bar{x}$ of the $x_i\sim N(\mu,\,\sigma^2)$ with $\sigma>0$. Then $s^2$ is an unbiased estimator of $\sigma^2$, and $t$ has a $\nu=N-1$ Student's $t$-distribution. That last part can be proven with the theory of ratio distributions, allowing us to test a hypothesis about $\mu$.

1
On

With known variance, your $z$, which could be written as $\frac{\bar{x} - \mu}{\sqrt{\sigma^2/n}}$, has a standard normal distribution with mean $0$ and variance $1$. You can use it to test whether the observations are significantly different from a particular possible value for the mean or to construct confidence intervals for the population mean from the observations

With unknown variance, your $t$, which could be written as $\frac{\bar{x} - \mu}{\sqrt{\sum_i (x_i-\bar{x})^2/(n(n-1))}}$, has a Student's $t$-distribution with $n-1$ degrees of freedom. Again, you can use it to test whether the observations are significantly different from a particular possible value for the mean or to construct confidence intervals for the population mean from the observations

You need the $n(n-1)$ divisor to get the Student's $t$-distribution. Otherwise you have a scaled $t$-distribution, which is less convenient. You can regard this as first using Bessel's correction to get $s^2$ to be an unbiased estimate of unknown $\sigma^2$ and then divide by $n$; or you can use the uncorrected sample variance and then divide by $n-1$. It makes no difference which you use, but you must use one to get a $t$ distribution on the correct scale