Speeding up the convergence of the Steepest Descent Method


How would one speed up the convergence of the method of steepest descent when the minimum lies in a very long, narrow valley? I know that such a steep, narrow minimum forces the method to take many iterations to descend, but I'm unsure how to choose the descent steps more efficiently.

Any thoughts would help!


There are 4 answers below.

Answer 1

Scale the gradient by some factor at certain iterations. The gradient points you in the right direction; if it isn't getting you there fast enough, just give it a little shot of espresso. This won't always work, but there are situations in which it will.
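As a minimal sketch of this idea (the quadratic test function, the step size `gamma`, and the boost schedule below are illustrative choices, not part of the answer): every few iterations the gradient step is scaled up so the iterate covers more ground along the shallow axis of the valley.

```python
import numpy as np

def f(x):
    # Long narrow quadratic valley: shallow along x[0], steep along x[1].
    return 0.5 * (x[0]**2 + 100.0 * x[1]**2)

def grad(x):
    return np.array([x[0], 100.0 * x[1]])

def boosted_descent(x0, gamma=0.005, boost=3.0, boost_every=10, iters=2000):
    # Plain steepest descent, except that every `boost_every`-th step
    # the gradient is scaled up by `boost` (the "shot of espresso").
    # The boosted step must still keep gamma * boost below 2 / L,
    # where L is the largest curvature (here L = 100), or it diverges.
    x = np.asarray(x0, dtype=float)
    for k in range(iters):
        scale = boost if k % boost_every == 0 else 1.0
        x = x - gamma * scale * grad(x)
    return x

x_min = boosted_descent([10.0, 1.0])
```

With `boost=1.0` the same routine is ordinary steepest descent, which makes it easy to check whether the occasional boosted steps actually help on a given problem.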

Answer 2

You could try non-linear conjugate gradients (nl-CG).

The conjugate gradient method can follow narrow (ill-conditioned) valleys where the steepest descent method slows down and follows a criss-cross pattern. [wikipedia]

This is a very good introduction to CG. The nonlinear method is described in section 14.
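To make the idea concrete, here is a small sketch of nonlinear CG (the Polak-Ribiere variant with restarts, paired with a simple Armijo backtracking line search); the Rosenbrock test function and all constants are illustrative choices, not from the answer or the linked reference.

```python
import numpy as np

def rosenbrock(x):
    # Classic narrow curved valley; minimum at (1, 1).
    return (1 - x[0])**2 + 100 * (x[1] - x[0]**2)**2

def rosen_grad(x):
    return np.array([
        -2 * (1 - x[0]) - 400 * x[0] * (x[1] - x[0]**2),
        200 * (x[1] - x[0]**2),
    ])

def backtracking(f, x, p, g, t=1.0, shrink=0.5, c=1e-4):
    # Armijo backtracking line search along direction p.
    while t > 1e-12 and f(x + t * p) > f(x) + c * t * (g @ p):
        t *= shrink
    return t

def nonlinear_cg(f, gradf, x0, iters=2000, tol=1e-10):
    # Polak-Ribiere nonlinear CG with automatic restarts (PR+).
    x = np.asarray(x0, dtype=float)
    g = gradf(x)
    p = -g
    for _ in range(iters):
        if np.linalg.norm(g) < tol:
            break
        if g @ p >= 0:          # safeguard: fall back to steepest descent
            p = -g
        t = backtracking(f, x, p, g)
        x_new = x + t * p
        g_new = gradf(x_new)
        beta = max(0.0, g_new @ (g_new - g) / (g @ g))  # restart if negative
        p = -g_new + beta * p
        x, g = x_new, g_new
    return x

x_min = nonlinear_cg(rosenbrock, rosen_grad, [-1.2, 1.0])
```

Unlike steepest descent, each new direction mixes in the previous one via `beta`, which is what suppresses the criss-cross pattern in narrow valleys.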

Answer 3

Two common classes of improved algorithms are

  • (nonlinear) conjugate gradient, which chooses each new search direction to be conjugate to the previous ones, so as to avoid criss-crossing as much.
  • quasi-Newton methods such as BFGS, which work similarly to Newton's method in that they use (approximations to) the second derivative.
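To illustrate the second bullet, here is a minimal BFGS sketch (the Rosenbrock test function, the Armijo line-search constants, and the iteration limits are illustrative assumptions): it maintains an approximation `H` to the inverse Hessian and steps along the quasi-Newton direction instead of the raw negative gradient.

```python
import numpy as np

def rosenbrock(x):
    # Narrow curved valley; minimum at (1, 1).
    return (1 - x[0])**2 + 100 * (x[1] - x[0]**2)**2

def rosen_grad(x):
    return np.array([
        -2 * (1 - x[0]) - 400 * x[0] * (x[1] - x[0]**2),
        200 * (x[1] - x[0]**2),
    ])

def bfgs(f, gradf, x0, iters=500, tol=1e-10):
    # Maintain an approximation H to the *inverse* Hessian and step
    # along the quasi-Newton direction p = -H g.
    x = np.asarray(x0, dtype=float)
    n = x.size
    H = np.eye(n)
    g = gradf(x)
    for _ in range(iters):
        if np.linalg.norm(g) < tol:
            break
        p = -H @ g
        t = 1.0                                   # Armijo backtracking
        while t > 1e-12 and f(x + t * p) > f(x) + 1e-4 * t * (g @ p):
            t *= 0.5
        s = t * p
        g_new = gradf(x + s)
        y = g_new - g
        sy = s @ y
        if sy > 1e-12:                            # curvature condition
            rho = 1.0 / sy
            V = np.eye(n) - rho * np.outer(s, y)
            H = V @ H @ V.T + rho * np.outer(s, s)
        x, g = x + s, g_new
    return x

x_min = bfgs(rosenbrock, rosen_grad, [-1.2, 1.0])
```

Skipping the update when the curvature condition `s @ y > 0` fails keeps `H` positive definite, so the search direction stays a descent direction.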
Answer 4

The simplest modification would be to scale the gradient by the inverse of the diagonal of the Hessian:

$x_{k+1} = x_k - \gamma\,\mathrm{diag}\!\left(\left(\frac{\partial^2 f(x_k)}{\partial x_1^2}\right)^{-1}, \left(\frac{\partial^2 f(x_k)}{\partial x_2^2}\right)^{-1}, \dots, \left(\frac{\partial^2 f(x_k)}{\partial x_n^2}\right)^{-1}\right) \nabla f(x_k)$
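A minimal sketch of this diagonal scaling on an ill-conditioned quadratic (the test function, its constant Hessian diagonal, and the value of `gamma` are illustrative choices):

```python
import numpy as np

def f(x):
    # Narrow valley: curvature 1 along x[0], curvature 100 along x[1].
    return 0.5 * (x[0]**2 + 100.0 * x[1]**2)

def grad(x):
    return np.array([x[0], 100.0 * x[1]])

def hess_diag(x):
    # Diagonal second derivatives; constant for this quadratic.
    return np.array([1.0, 100.0])

def diag_scaled_descent(x0, gamma=0.9, iters=100):
    # x_{k+1} = x_k - gamma * diag(H)^{-1} * grad f(x_k):
    # each coordinate's step is rescaled by its own curvature, so one
    # value of gamma works well in every direction at once.
    x = np.asarray(x0, dtype=float)
    for _ in range(iters):
        x = x - gamma * grad(x) / hess_diag(x)
    return x

x_min = diag_scaled_descent([10.0, 1.0])
```

On this example the scaling makes both coordinates contract at the same rate, whereas plain steepest descent would be limited by the steep direction and crawl along the shallow one.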