Given $A\in\mathbb{R}^{m\times n}$. Find a matrix $X\in\mathbb{R}^{m\times r}$ with orthonormal columns, s.t. $\|X^TA\|$ is maximized, where $\|\cdot\|$ denotes the Frobenius norm.
The answer is given by the $r$ leading left singular vectors of $A$, could anyone help to explain why is that?
My thoughts: with the (reduced) SVD $A=U\Sigma V^T$, we have $$\|X^TA\|=\|X^TU\Sigma\|.$$ If $X=U$ then the norm is $\|U^TU\Sigma\|=\|\Sigma\|$, but how can I show this is the maximum?
I will assume that $r \leq n$. Otherwise, there's some extra "bookkeeping" to be done regarding the singular values of $A$.
It's easy to show that taking $X$ to be the leading left singular vectors of $A$ (i.e. the leading columns of $U$) yields $\|X^TA\| = \sigma_1^2 + \cdots + \sigma_r^2$, where $\sigma_1 \geq \cdots \geq \sigma_n$ denote the singular values of $A$. To show that this is the maximal possible value of $\|X^TA\|$, it suffices to show that $\sigma_i(A) \geq \sigma_i(X^TA)$ for all $i = 1,\dots,r$.
One strategy is to use the Courant-Fischer (min-max) theorem, noting that $\|X^Tv\| \leq \|v\|$ holds for all vectors $v$. Another approach is to note that for any such $X$, there exists a $Y$ such that $U = [X\ \ Y]$ is square with orthonormal columns and hence unitary. Note that $A$ has the same singular values as the matrix $$ U^TA = \pmatrix{X^TA\\ Y^TA}. $$ Now, the singular values of $U^TA$ are the positive eigenvalues of the block-matrix $$ \pmatrix{0 & (U^TA)^T\\ U^TA & 0} = \pmatrix{0 & A^TX & A^TY\\X^TA & 0 & 0\\Y^TA & 0 & 0}. $$ By Cauchy's interlacing theorem, these eigenvalues interlace the positive eigenvalues of the principal submatrix $$ \pmatrix{0 & (X^TA)^T\\ X^TA & 0}, $$ whose positive eigenvalues are the singular values of $X^TA$.