Why my gradient descent seems to diverge "pair-wise"?

118 Views Asked by At

Why my gradient descent seems to diverge "pair-wise"?

I've checked the algorithms and they work for golden section line search and "small step parameter".

However, when trying to get the algo to diverge, I notice that the results indicate that the divergence occurs "pair-wise". The components flip on every iteration (although they're not entirely the same, they show slow improvement).

enter image description here

Function is:

$$f(x,y)=x^2+y^2+x+2y$$

Does this have some symmetry that'd explain this?


Reproducing code:

https://pastebin.com/QM4twYLy