$\mu_{X_n,Y_n}$ is a sequence of discrete joint-distributions on $\mathbf{R}^2$ that converge weakly to a continuous measure $\mu_{X,Y}$. That is, for any continuous function $f:\mathbf{R}^2\rightarrow \mathbf{R},$ then $|\int f\ d\mu_{X_n,Y_n}-\int f\ d\mu|\rightarrow 0$.
Is it true that $I(X_n,Y_n)\rightarrow I(X,Y)$?
I have been trying to prove this by realizing that the sequence $I(X_n,Y_n)$ is kind of like a monte-carlo estimation of the integral $I(X,Y)$
Let n be a large even number. Take n equally spaced points along the unit interval. A probability distribution which gives equal weight to all these points converges weakly to the uniform distribution. The product of two copies of this distribution converges weakly to uniform on the unit square. Since this distribution is equal to the product of it's marginals, it has mutual information equal to 0.
Now take the same set of points, and this time discard half of them in a chequerboard pattern. Share out the probability equally between the rest. This also converges weakly to uniform for large n. However,mutual information is now the expectation of a random variable equal to constant the logarithm of two.