Solving a variant of an orthogonal Procrustes problem

135 Views Asked by At

While reading the following paper on an optimization problem, there was a variant of an orthogonal Procrustes problem, where the solution is an element of the Stiefel manifold. The authors provided a closed-form solution, but I couldn't understand how they derived it!

Given $Y$ is an $n \times k$ matrix and $D$ is an $n \times n$ diagonal matrix of strictly positive values on its main diagonal, the following optimization problem in $n \times k$ matrix $\Psi$

$$ {\bf \Psi}^{*} = \arg\min_{{\bf \Psi}, {\bf \Psi}^{\top}{\bf \Psi} = {\bf I}} \left\Vert {\bf D}^{-\frac12} {\bf \Psi} - {\bf Y} \right\Vert_2^2 $$

The $\left\Vert\cdot\right\Vert_2$ refers to the Frobenius norm, and the solution they used is as follows:

$$ {\bf \Psi}^{*} = \bf{Y}\bf{V}\bf{W}^{-\frac12}\bf{V}^{\top} $$

where $\bf{V}$, $\bf{W}$ and $\bf{V}^{\top}$ are obtained via the SVD decomposition

$$ (\bf{D}^{\frac{1}{2}}\bf{Y})^{\top}(\bf{D}^{\frac{1}{2}}\bf{Y}) = \bf{V}\bf{W}\bf{V}^{\top} $$

I attached the screenshot of the problem from the paper (we want to solve equation 10, the equations of interest are 14, 15 and 16), any help is appreciated!


enter image description here


enter image description here