I realize that Lagrange multipliers are extremely useful for applied optimization problems.
However, I know that the standard analytic proof of the spectral theorem relies on them. I've also seen a few other uses/mentions of them in some pure math textbooks. (For example, Wade's An Introduction to Analysis uses an exercise on Lagrange multipliers to later prove a result due to Bernstein on the convergence of Fourier series.)
My question, then, is if Lagrange multipliers are generally a useful technique for extremal problems that arise in pure math. If so, are there some well-known proofs in this area that use them (other than those I mentioned)?
I'm simply curious as to their use outside applied optimization, since the derivation of the existence of the so-called Lagrange multiplier is really just a corollary of a very geometric fact--namely that the gradient is perpendicular to the level sets.
EDIT: To be a bit more specific, by "useful" I mean that it is in fact applicable to certain pure math problems with somewhat regular frequency.
Once, I had to solve this problem:
I solved it using Lagrange multipliers. This led me to the system$$\left\{\begin{array}{l}x+a=2\lambda x\\y+b=-2\lambda y\\x^2-y^2=a^2-b^2\\x>0\text{.}\end{array}\right.$$This, in turn, led me to the equation$$\frac{4(a^2-b^2)\lambda^4-3(a^2-b^2)\lambda^2-(a^2+b^2)\lambda}{(1-2\lambda)^2(1+2\lambda)^2}=0\text{.}$$After dividing the numerator by $4(a^2-b^2)\lambda$ (the solution $\lambda=0$ is irrelevant here), one gets a third degree polynomial:$$\lambda^3-\frac34\lambda-\frac{a^2+b^2}{4(a^2-b^2)}.$$Since there is no second degree term, Cardano's formula can be applied directly, giving$$\lambda=\frac12\left(\sqrt[3]{\frac{a-b}{a+b}}+\sqrt[3]{ \frac{a+b}{a-b}}\right)\tag1$$and therefore the point of the right branch of the hyperbola closest to $P$ is $\bigl(-\frac a{1-2\lambda},-\frac b{1+2\lambda}\bigr)$, where the value of $\lambda$ is the one given by $(1)$.