Let $(X, \mathcal{A}, \mu)$ be a probability space. Let $\boldsymbol{\psi} = (\psi_n: X \rightarrow \mathbb{R})_{n \ge 0}$ be an IID sequence of random variables such that $\mathbb{E}(\psi_0) \in \mathbb{R}$ and $var(\psi_0) \in \mathbb{R}$.
The law of large numbers says that the sequence of random variables $(A(\boldsymbol{\psi},n): X \rightarrow \mathbb{R})_{n \ge 0}$, where $A(\boldsymbol{\psi},n)(x) = \frac{\psi_0(x) + \ldots + \psi_{n-1}(x)}{n}$, converges to $\int \psi_0 d\mu$, $\mu$-a.e..
Now consider the following model to the average of a finite number of trials of an IID sequence of random variables: $(B(\boldsymbol{\psi},n): X^\mathbb{N} \rightarrow \mathbb{R})_{n \ge 0}$, given by $B(\boldsymbol{\psi},n)(x_0, \ldots, x_{n-1}, \ldots) = \frac{\psi_0(x_0) + \ldots + \psi_{n-1}(x_{n-1})}{n}$. Here, $X^\mathbb{N}$ is to be equipped with the product $\sigma$-algebra $\hat{\mathcal{A}}=\bigotimes_{j=0}^{\infty} \mathcal{A}$ and measure $\hat{\mu}=\bigotimes_{j=0}^{\infty} \mu$.
For me, it seems clear (if false, please let me knwon, I have not done the maths, just argued mentally) that the distributions of $A(\boldsymbol{\psi},n)$ (namely, $A(\boldsymbol{\psi},n)_* \mu)$ and $B(\boldsymbol{\psi},n)$ (namely, $B(\boldsymbol{\psi},n)_* \hat{\mu})$, are equal.
My questions are:
1) Probabilistically speaking, what are the relevant differences between $(A(\boldsymbol{\psi},n))_{n \ge 0}$ and $(B(\boldsymbol{\psi},n))_{n \ge 0}$?
2) What kind of convergence can we obtain for $(B(\boldsymbol{\psi},n))_{n \ge 0}$?
3) Does the answer to (2) involve putting the law of large number in a different clothing? Which one?
I appreciate your thoughts and hope you enjoy the question! Lucas A.
This is a good question! I think the two models you considered are actually equivalent. A natural way to construct IID random variables is to construct an infinite product space $X^\mathbb{N}$ and have the $\psi_n$'s be determined by the "$n^{th}$ copy of X". It should be intuitive that the resulting $\psi_n$'s will be independent, and we can easily make them IID by choosing their value the same way, for each copy of $X$. So we can naturally discuss convergence in the finite model you gave $(X^n, \hat{A},\hat{\mu})$ since each of these naturally embeds in the larger space $X^\mathbb{N}$ (with the product sigma-algebra and product measure). So the short answer is: you started the explicit construction of IID random variables!