I have a question from studying machine learning.
In this diagram, the local gradient of the output node is defined by:
The local gradient of a hidden node is defined by:
Epsilon is defined like this:
By the chain rule, those two equations have the same structure, except for the minus sign. I know that there are many nodes between the hidden node and the output, but that doesn't explain why the minus sign is there.
Can anyone help me understand why they are different?
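In case the equation images above don't come through, these are the standard textbook forms I believe I'm looking at (assuming a sum-of-squares error $E = \tfrac{1}{2}\sum_k (t_k - o_k)^2$ and an activation function $f$; the notation in my diagram may differ slightly):

```latex
% Error over output units k, with target t_k and output o_k
E = \frac{1}{2} \sum_k (t_k - o_k)^2

% Local gradient at an output node k:
% differentiating E directly produces the (t_k - o_k) term,
% which is where a minus sign can appear if written as -(o_k - t_k)
\delta_k = (t_k - o_k)\, f'(\mathrm{net}_k)

% Local gradient at a hidden node j:
% no direct error term; it only accumulates the downstream deltas
\delta_j = f'(\mathrm{net}_j) \sum_k w_{kj}\, \delta_k
```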
