coordinate descent in very basic

180 Views Asked by At

I try to figure out how coordinate descent works from wiki https://en.wikipedia.org/wiki/Coordinate_descent
From wiki example :
the equation is $5x^2-6xy+5y^2$.

Let $x = -0.5$ and $y =-1$
For the first iteration $y$ is going up graphically and the result of equation is minimizing
then $x$ is going right and the result of equation is minimizing, too
But how do we do this mathmatically?

My guess is:
put $x = -0.5$ an equation becomes $5(-0.5)^2-6(-0.5)y+5y^2$
after partial dererative $-6(-0.5)+10y=0$ and solve for $y$
do it in the same way for $x$ with the new $y$ from above.
Can someone tell me if I am in the right track?

1

There are 1 best solutions below

2
On

Yes, this is right. Fix $x$, get optimal $y$. Then fix that $y$, get new optimal $x$, and so on.

Anyway, the coordinate descent method is in principle used for numerical minimization. This means that you use a numerical algorithm for 1D minimization (such as the Golden section method).

Using the method analytically is a little masochistic as you will have to solve an infinity of 1D minimization problems rather than directly tackle 2D minimization by cancellation of the gradient.