There is some debate about why Newton's method is not widely used in machine learning; instead, people tend to use gradient descent. Some people claim that Newton's method is not used because it involves the second derivative. How so? Indirectly? Why? Doesn't Newton's method use only the first derivative?
Is there a name for Newton's method with cubic convergence?
Can we claim that Newton's method is a form of gradient descent?
In machine learning, the interest in root-finding problems of the form $g(x)=0$ comes from, say, minimizing $f$ by solving $\nabla f=0$. Since $\nabla f$ is already a first derivative, Newton's method ends up requiring the second derivative, the Hessian $\nabla^2 f$, which is very expensive to compute, store, and invert in high dimensions.
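To make this concrete, here is a minimal sketch of Newton's method applied to minimization: the root-finder is run on $\nabla f = 0$, so each step solves a linear system involving the Hessian. The function names and the example quadratic are my own illustration, not from the thread.

```python
import numpy as np

def newton_minimize(grad, hess, x0, tol=1e-10, max_iter=50):
    """Minimize f by applying Newton's root-finder to grad f = 0.

    Each step solves the linear system hess(x) @ dx = -grad(x),
    which is the expensive part in high dimensions.
    """
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        g = grad(x)
        if np.linalg.norm(g) < tol:
            break
        x = x - np.linalg.solve(hess(x), g)  # Newton step uses the Hessian
    return x

# Illustrative example: f(x, y) = (x - 1)^2 + 10 (y + 2)^2, minimum at (1, -2).
grad = lambda v: np.array([2 * (v[0] - 1), 20 * (v[1] + 2)])
hess = lambda v: np.array([[2.0, 0.0], [0.0, 20.0]])

x_star = newton_minimize(grad, hess, x0=[5.0, 5.0])
print(x_star)  # for an exactly quadratic f, one Newton step suffices: [1, -2]
```

Note that the linear solve costs $O(n^3)$ for a dense $n \times n$ Hessian, which is the usual argument against plain Newton in high-dimensional ML models.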
The cubic approach you linked is unfamiliar to me. I was hoping it would be Halley's method, which is the classic cubically convergent variant of Newton's method, but it seems to be something different.
Newton's method isn't considered a form of gradient descent: its step $-(\nabla^2 f)^{-1}\nabla f$ rescales the gradient by the inverse Hessian, whereas gradient descent takes a fixed (or scheduled) multiple of $-\nabla f$ rather than choosing its step to jump toward the root. Newton's method is quadratically convergent, which is a bit of a double-edged sword; gradient descent settles for slower but somewhat safer linear convergence.
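The quadratic-versus-linear distinction can be seen numerically. Below is a small illustrative comparison (my own example, not from the thread) on $f(x) = x - \log x$, whose minimum is at $x = 1$; for this function the Newton error satisfies $e_{k+1} = e_k^2$ exactly, while fixed-step gradient descent shrinks the error by a roughly constant factor per step.

```python
# Minimize f(x) = x - log(x), minimum at x = 1, two ways.
fp = lambda x: 1 - 1 / x        # f'(x)
fpp = lambda x: 1 / x ** 2      # f''(x)

x_newton = x_gd = 0.5
newton_err, gd_err = [], []
for _ in range(5):
    x_newton -= fp(x_newton) / fpp(x_newton)   # Newton step: uses curvature
    x_gd -= 0.3 * fp(x_gd)                     # gradient descent, step 0.3
    newton_err.append(abs(x_newton - 1))
    gd_err.append(abs(x_gd - 1))

print(newton_err)  # 0.25, 0.0625, ... each error is the square of the last
print(gd_err)      # shrinks by a roughly constant factor (linear convergence)
```

After five iterations Newton's error is below $10^{-8}$ while gradient descent is still at about $4\times 10^{-2}$, illustrating why quadratic convergence is attractive when the Hessian is affordable.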