How to differentiate 2nd order Taylor expansion of scalar field?

100 Views Asked by At

I'm wondering how I can minimize this function with respect to $x$ (not $x_0$). This isn't for homework - I saw them give the answer in the book but they didn't explain how they did it and I'm wondering how. $$ f(x_0 ) + (x - x_0 )^T \nabla f(x_0 ) + \frac{1}{2}(x - x_0 )^T H(x - x_0 ) $$ Here's the answer in the book. $$ x^ * = x_0 - \nabla f(x_0 )H^{ - 1} $$ I was able to get to the same answer but I'm pretty sure I didn't do it using the right rules, so if someone could explain how to derive the formula, especially the tricky last term, would be greatly appreciated.

My main question surrounds the fact that there is a smaller function $x - x_0$ inside the quadratic term at the end. I know how to derive this if $x- x_0$ was just a vector, where the derivative w.r.t that vector would be ($H + H^T$) $\cdot$ vector. I got this from (here) as well as figuring it out myself.

Thanks!

1

There are 1 best solutions below

2
On BEST ANSWER

$$\begin{aligned} f({\rm x}) &= f({\rm x}_0) + ({\rm x} - {\rm x}_0 )^\top \nabla f({\rm x}_0 ) + \frac12({\rm x} - {\rm x}_0 )^\top {\rm H} \, ({\rm x} - {\rm x}_0 ) \\ &= (\color{grey}{\text{constant terms}}) + {\rm x}^\top \nabla f({\rm x}_0 ) - {\rm x}_0 ^\top {\rm H} \, {\rm x} + \frac12 {\rm x}^\top {\rm H} \, {\rm x} \end{aligned}$$

Thus,

$$\nabla f({\rm x}) = \nabla f({\rm x}_0 ) - {\rm H} {\rm x}_0 + {\rm H} \, {\rm x} = \color{blue}{\nabla f({\rm x}_0 ) + {\rm H} ({\rm x} - {\rm x}_0 )}$$