Why do we sum the derivatives of the loss w.r.t. the weights at each time step in RNN back-propagation?


I am reading a paper explaining the derivation of the back-propagation equations in RNNs. There I read: 'Note that the weight matrix remains the same across all time steps, so we can differentiate with respect to it at each time step and sum the results together.'


My question is: why is this statement correct? What is its mathematical derivation?
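To make the claim concrete, here is a minimal numerical check I put together (my own toy example, not the paper's notation): a scalar linear RNN where I pretend the shared weight `w` is a separate variable `w_t` at each step, compute each per-step partial, sum them, and compare against a finite-difference estimate of the true total derivative.

```python
# Toy scalar linear RNN: h_t = w * h_{t-1} + x_t, with loss L = h_T.
# Claim being checked: summing the per-time-step partials dL/dw_t
# (treating the shared w as a distinct w_t at each step) gives the
# total derivative dL/dw.

def forward(w, xs):
    h = 0.0
    hs = [h]                      # hs[t] stores h_t, with h_0 = 0
    for x in xs:
        h = w * h + x
        hs.append(h)
    return hs                     # loss is L = hs[-1] = h_T

w, xs = 0.7, [1.0, -0.5, 2.0, 0.3]
T = len(xs)
hs = forward(w, xs)

# Per-step partial via the chain rule:
# dL/dw_t = (dh_T/dh_t) * (dh_t/dw_t) = w**(T - t) * h_{t-1}
per_step = [w ** (T - t) * hs[t - 1] for t in range(1, T + 1)]
summed = sum(per_step)

# Finite-difference estimate of the total derivative dL/dw
eps = 1e-6
numeric = (forward(w + eps, xs)[-1] - forward(w - eps, xs)[-1]) / (2 * eps)

print(summed, numeric)            # the two values agree (~2.77)
```

The agreement is just the multivariable chain rule: since `w` enters the computation once per time step, the total derivative is the sum of the partials taken through each occurrence.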

Your advice will be appreciated.