Why proximal gradient instead of plain subgradient methods for LASSO?


I was thinking of solving LASSO with a vanilla subgradient method, but I have read people suggesting proximal gradient descent (proximal GD) instead. Can somebody explain why proximal GD should be used for LASSO rather than a vanilla subgradient method?

Best answer

OK, since you liked my comments... :-)

  1. Better performance. That's it.
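  2. Longer, but more handwaving, answer: proximal methods let you use a lot more information about the function at each iteration. A subgradient step sees only a single subgradient, which says little about the function near a kink, so its step sizes have to shrink toward zero for the method to converge. A proximal gradient step handles the L1 term exactly, in effect taking into account a much larger neighborhood around the current point, which enables longer steps.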
  3. Third, practical answer: you should not be implementing your own LASSO solver. There are too many good implementations out there. If you need to port one from one language to another, that's one thing, but writing one from scratch at this point makes no sense unless you have a very specific problem structure you need to exploit.
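Purely as an illustration of point 2, and not as a replacement for an existing solver, here is a minimal NumPy sketch of both methods for the objective 0.5*||Ax - b||^2 + lam*||x||_1. The function names, the constant step 1/L for the proximal method, and the 1/(L*sqrt(k)) step schedule for the subgradient method are assumptions made for this example, not something taken from the question or from any particular library.

```python
import numpy as np


def lasso_objective(A, b, x, lam):
    """0.5 * ||A x - b||^2 + lam * ||x||_1."""
    r = A @ x - b
    return 0.5 * (r @ r) + lam * np.abs(x).sum()


def soft_threshold(v, t):
    """Proximal operator of t * ||.||_1 (elementwise shrinkage toward zero)."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)


def proximal_gradient(A, b, lam, n_iter=500):
    """Proximal gradient (ISTA) with the constant step 1/L."""
    # L is the Lipschitz constant of the gradient of the smooth part.
    L = np.linalg.norm(A, 2) ** 2
    x = np.zeros(A.shape[1])
    for _ in range(n_iter):
        grad = A.T @ (A @ x - b)  # gradient of the least-squares term only
        x = soft_threshold(x - grad / L, lam / L)  # exact step on the L1 term
    return x


def subgradient_method(A, b, lam, n_iter=500):
    """Subgradient method with an (assumed) diminishing step 1/(L*sqrt(k))."""
    L = np.linalg.norm(A, 2) ** 2
    x = np.zeros(A.shape[1])
    best_x, best_f = x.copy(), lasso_objective(A, b, x, lam)
    for k in range(1, n_iter + 1):
        # np.sign(0) == 0 is a valid subgradient of |.| at zero.
        g = A.T @ (A @ x - b) + lam * np.sign(x)
        x = x - g / (L * np.sqrt(k))
        f = lasso_objective(A, b, x, lam)
        if f < best_f:  # the method is not monotone, so track the best iterate
            best_x, best_f = x.copy(), f
    return best_x


if __name__ == "__main__":
    rng = np.random.default_rng(0)  # synthetic data, for illustration only
    A = rng.standard_normal((50, 100))
    x_true = np.zeros(100)
    x_true[:5] = 1.0
    b = A @ x_true
    lam = 0.1
    for name, solver in [("proximal", proximal_gradient),
                         ("subgradient", subgradient_method)]:
        x = solver(A, b, lam)
        print(name, lasso_objective(A, b, x, lam), int(np.sum(x == 0.0)))
```

The proximal step can keep a constant step size because the soft-thresholding operator deals with the nonsmooth term exactly; it also sets coordinates to exactly zero, whereas subgradient iterates are typically only close to zero. The script prints the final objective and the number of exact zeros for each method so the two can be compared on the synthetic data.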