Optimization using Lagrange Multipliers for conditions with different codomain

245 Views Asked by At

I'm trying to maximize the trace of $X^TAX$ subject to the columns of $X$ being orthonormal, where $A$ is a diagonal matrix and X is not necessarily square, but does not have more columns than rows. I can represent this constraint succinctly as $X^TX = I$, but I'm not sure how I would use this with the method of Lagrange multipliers without explicitly writing out a condition for each row, column pair, as I can't simply add $\lambda(I - X^TX)$ to the derivative to form the Lagrangian. How does one go about doing this?

1

There are 1 best solutions below

0
On BEST ANSWER

Your Lagrangian is

$$L(X,Λ)=tr(X^TAX)+tr(Λ(I-X^TX))$$

with derivative

$$0=\frac{∂ L}{∂X}=X^T(A+A^T)-(Λ+Λ^T)X^T$$

Note that $tr(X^TAX)=tr((XX^T)A)$ and $P=XX^T$ is an orthogonal projector.