Here is a question I am currently working on. I need some help on part b.
An environmental variable $X$ has value $X_i$ on day $i$. Due to budgetary constraints, $X$ is measured only every third day. If $X$ is measured on day $i$, then the observed value of $X_i$ is used to estimate the three day average $$\bar{X}=\frac{1}{3}(X_{i-1}+X_{i}+X_{i+1})$$ We're given that the variables $X_i$ have a multivariate normal distribution with common mean $m$ and common variance $\sigma^2$ and that the correlation coefficient between $X_{i}$ and $X_{j}$ is $\rho_{ij}=0.9^{|i-j|}$.
Part a) Find the distribution of $\bar{X}|X_i=x$
Part b) Based on the answer to a), suggest a better estimator of $\bar{X}$ from $X_i$
Here is how I approached part a). Notice $$E(\bar{X}|X_i=x)=\frac{2m+x}{3}$$ and $$V(\bar{X}|X_i=x)=\frac{1}{9}V(X_{i-1}+X_{i+1}+x)=\frac{181}{450}\sigma^2$$ Hence $\bar{X}|X_i=x \sim N\big(\frac{2m+x}{3},\frac{181}{450}\sigma^2\big)$. Is this correct? If it is, how I can use this result to find a better estimator for the three day average? If we don't have any other information than the observed value on day $i$, I don't see what else we can use as an estimator for $\bar{X}$.
Thank you!
For simplicity, I will call the r.v. $X_1, X_2, X_3$.
Let $a=0.9$, the correlation matrix is given by $$ \Sigma = \sigma^2\begin{bmatrix} 1& a&a^2 \\ a & 1 & a \\ a^2 & a & 1 \end{bmatrix} $$
and is positive-definite, by the spectral theorem we can write $$ \Sigma = S D^{\frac{1}{2}} (D^{\frac{1}{2}})^T S^T = AA^T$$ where $A = S D^{\frac{1}{2}}$, $S$ is orthogonal and $D$ is diagonal with $D = diag(\lambda_1, \lambda_2, \lambda_3)$ (see the actual values here).
So, $X = AZ+m$, where $Z = (Z_1, Z_2, Z_3)$, with $Z_i \sim N(0,1)$ and all independent.
Notice that if $S$ has rows $s_i$, then: $$ X_i = \langle s_i, D^{\frac{1}{2}}Z \rangle + m $$
This poses a linear relation between $Z_1$, $Z_2$ and $Z_3$. If we force $X_i = x_i$ then we can pick some $Z_i$ and write it as a linear function of the others. Suppose that $i=2$ (that is your case, since you have only the middle value), then $Z_2 = \alpha Z_1 + \beta Z_3 + \gamma$ for some coefficients $\alpha, \beta, \gamma$ that depends only on $a$, $\sigma$ and $x_2$. That gives you the distribution of $\mathbb{E}(X | X_2=x_2)$ since $X = AZ+m$ and $$ \mathbb{E}(Z|X_2=x_2) = \begin{bmatrix} Z_1 \\ \alpha Z_1 + \beta Z_3 + \gamma \\ Z_3 \end{bmatrix} $$ with $Z_1, Z_3 \in N(0,1)$ independent.
We want to compute $\bar{X}| X_2=x_2$, this is just: $$ \mathbb{E}(\bar{X}|X_2=x_2) = 3m+\sum_{i=1}^3 \langle s_i, D^{\frac{1}{2}}\mathbb{E}(Z|X_i=x_i) \rangle $$
The above relation can be used to construct an estimative, since: $$ \mathbb{E}(\mathbb{E}(\bar{X}|X_i=x_i)) = 3m+\sum_{i=1}^3 \langle s_i, D^{\frac{1}{2}} \begin{bmatrix} 0 \\ \gamma \\ 0 \end{bmatrix}\rangle = 3m + \sqrt{\lambda_2}\langle s_2, \begin{bmatrix} 0 \\ \gamma \\ 0 \end{bmatrix}\rangle $$
Now you can compute $\lambda_2$, $s_2$ and $\gamma$ (notice that $\gamma$ is a function of $x_2$) numerically or even get the closed expressions with a good CAS. And so you have a better estimative then before.
The good thing about this resolution is that if you want to consider also other measures you can adapt it to improve your estimative.