Confusion with the chain rule - derivative of a scalar with respect to a vector

96 Views Asked by Bumbble Comm At 11 May 2026 - 6:19

I am studying reinforcement learning and came across the following:

Here $\delta$ is a function of $\theta$, and $\phi$ is independent of $\theta$.

I am confused by the third line (annotated with a red arrow). Using the following identity:

In my derivation, the third line becomes:

$$ -\alpha\, \mathbb{E}[\phi\phi^T]^{-1}\, \mathbb{E}[\delta\phi]\, \left(\nabla_\theta \mathbb{E}[\delta\phi]\right) $$

What am I missing?
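
A sketch of the general rule that may help here, under an assumption: the objective being differentiated is taken to be a quadratic form $J(\theta) = g(\theta)^T A\, g(\theta)$, with $g(\theta) = \mathbb{E}[\delta\phi]$ and $A = \mathbb{E}[\phi\phi^T]^{-1}$, where $A$ is symmetric and does not depend on $\theta$. The symbols $J$, $g$, $A$ and the index notation are introduced only for illustration. Differentiating componentwise with the product rule:

$$ \frac{\partial J}{\partial \theta_i} = \sum_{k,l} \left( \frac{\partial g_k}{\partial \theta_i} A_{kl}\, g_l + g_k A_{kl} \frac{\partial g_l}{\partial \theta_i} \right) = 2 \sum_{k,l} \frac{\partial g_k}{\partial \theta_i} A_{kl}\, g_l $$

$$ \nabla_\theta J = 2\, (\nabla_\theta g^T)\, A\, g, \qquad (\nabla_\theta g^T)_{ik} = \frac{\partial g_k}{\partial \theta_i} $$

Under that assumption, the matrix of derivatives multiplies from the left, giving $(\nabla_\theta \mathbb{E}[\delta\phi]^T)\, \mathbb{E}[\phi\phi^T]^{-1}\, \mathbb{E}[\delta\phi]$ up to the constant factor. Placing it on the right, as the scalar chain rule suggests, produces a column vector times a matrix, and those shapes do not conform in general.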
