What does level of confidence mean in statistics?

323 Views Asked by At

The thing I am talking about is the interval estimate of the population mean. Let $X\sim N(\mu,\sigma^2)$ where $\mu$ is unknown but $\sigma$ is known. To estimate $\mu$, we perform $n$ experiments and find the mean $\overline X$. On my statistics book, there is something like this:

Let $Z=\frac{\overline X-\mu}{\sigma/\sqrt n}$ so $$ P(-r<Z<r)=P(-r\sigma/\sqrt n<\overline X-\mu<r\sigma/\sqrt n)\\ =P(\overline X-r\sigma/\sqrt n<\mu<\overline X+r\sigma/\sqrt n)\\ =\alpha $$ where $\alpha$ is the given confidence level. Of course, we should choose $r=\Phi^{-1}(\frac{1+\alpha}{2})$.

The next step is what makes me worried.

Therefore, if in one experiment we get $\overline X= \bar x$, then $$P(\overline x-r\sigma/\sqrt n<\mu<\overline x+r\sigma/\sqrt n) =\alpha \text{ }(*)$$

To me, this is very strange: it's just like having a random variable $Y$ with $P(Y<3)=0.95$, and in an experiment, we observe that $Y=3.5$, so we replace $Y$ with the observed value, and write $P(3.5<3)=0.95$, which is ridiculous.

What $(*)$ is doing is to give a probability of $\mu$ lying in a certain interval - but to give $\mu$ a probability, $\mu$ must have a distribution first. So, what is the distribution of $\mu$?

2

There are 2 best solutions below

8
On

If $\mu$ is not known and $\sigma$ is known, then a 95% confidence interval for $\mu$ is of the form $\bar X \pm 1.96\sigma/\sqrt{n}.$

For a normal sample, $\bar X$ is the best estimate of $\mu,$ but it is not perfect. It is subject to variability.

  • First because there is variability in the population, expressed by $\sigma.$ Second, because each sample is is different.

  • However, large samples can overcome population variability. For example, the population standard deviation $\sigma = 25$ might be a huge population variability.

  • Specifically, if you have $n = 100$ observations in your sample, it is possible to show that the standard deviation of $\bar X$ is decreased to $25/\sqrt{100} = 25/10 = 2.5,$ which might not be so bad.

  • The confidence interval extends about two standard deviations (relative to $\bar X)$ on either side of $\bar X.$ Thus, the confidence has a 'margin of error' to account for this remaining variability. (Here 'about two' turns out to be $1.96.)$

There can be no guarantee that any one 95% confidence interval (CI) truly includes the value of the population mean $\mu.$ But if you repeatedly use the formula in my first paragraph, then over the long run 95% of your CIs will include $\mu.$

Example: Suppose that, unknown to you, $\mu = 200$ and that, known to you $\sigma = 25.$ Let's make 20 CIs by taking $n - 100$ observations for this population---for each CI. (That's 2000 observations altogether, but I'm sampling by computer so it's easy to do.)

In the figure below each of the 20 vertical bars represents a CI made with the formula above. Dots are sample means. The horizontal line at $\mu=200$ makes it is easy to see which CIs contain $\mu.$ In particular, We can see that Sample 17 yielded a CI that does not include $\mu.$

enter image description here

Notes: (1) This is a simulation so (in the background) we know $\mu=200,$ but in real life experiments, $\mu$ wouldn't be exactly known.

(2) Here is R code used to make the figure.

set.seed(1234)
m = 20;  a = lcl = ucl = numeric(m)
mu = 200; sg = 25; n = 100;  me=1.96*sg/sqrt(n)
for(i in 1:m) {
 x = rnorm(n, mu, sg)
 a[i] = mean(x)
 lcl[i] = a[i] - me;  ucl[i] = a[i] + me }
top = max(ucl);  bot = min(lcl)
plot(c(1,20), c(top,bot), col="white",
 ylab="Units", xlab="Sample", 
 main="CIs From 20 Normal Sample of Size 100")
for(i in 1:m) {
  lines(c(i,i), c(lcl[i], ucl[i]))
  points(i, a[i], pch=19, col="blue") }
 abline(h=mu, col="green3")
0
On

Refer to Wikipedia:

Various interpretations of a confidence interval can be given (taking the 90% confidence interval as an example in the following).

  • The confidence interval can be expressed in terms of samples (or repeated samples): "Were this procedure to be repeated on numerous samples, the fraction of calculated confidence intervals (which would differ for each sample) that encompass the true population parameter would tend toward 90%."

  • The confidence interval can be expressed in terms of a single sample: "There is a 90% probability that the calculated confidence interval from some future experiment encompasses the true value of the population parameter." Note this is a probability statement about the confidence interval, not the population parameter. This considers the probability associated with a confidence interval from a pre-experiment point of view, in the same context in which arguments for the random allocation of treatments to study items are made. Here the experimenter sets out the way in which they intend to calculate a confidence interval and to know, before they do the actual experiment, that the interval they will end up calculating has a particular chance of covering the true but unknown value. This is very similar to the "repeated sample" interpretation above, except that it avoids relying on considering hypothetical repeats of a sampling procedure that may not be repeatable in any meaningful sense.

  • The explanation of a confidence interval can amount to something like: "The confidence interval represents values for the population parameter for which the difference between the parameter and the observed estimate is not statistically significant at the 10% level". In fact, this relates to one particular way in which a confidence interval may be constructed.

Based on the point 2, you must not consider the probability of the population parameter $\mu$ (because it is a constant value), but consider the probability of the confidence interval that it will include the population parameter.