Trying to understand symmetric rank one updates and there is this like in the Wikipedia page that says...
A twice continuously differentiable function $x\mapsto f(x)$ has a gradient ($\nabla f$) and Hessian matrix $B$: The function $f$ has an expansion as a Taylor series at $x_{0}$, which can be truncated
$$ f(x_{0}+\Delta x)\approx f(x_{0})+\nabla f(x_{0})^{T}\Delta > x+{\frac {1}{2}}\Delta x^{T}{B}\Delta x $$
its gradient has a Taylor-series approximation also
$$\nabla f(x_{0}+\Delta x)\approx \nabla f(x_{0})+B\Delta x $$
which is used to update $B$. The above secant-equation need not have a unique solution $B$. The SR1 formula computes (via an update of rank 1) the symmetric solution that is closest to the current approximate-value $B_{k}$:
$$ B_{k+1}=B_{k}+{\frac {(y_{k}-B_{k}\Delta x_{k})(y_{k}-B_{k}\Delta x_{k})^{T}}{(y_{k}-B_{k}\Delta x_{k})^{T}\Delta x_{k}}} $$
Question
Why is the last formula above "the symmetric solution that is closest to the current approximate value $B$?
I can trivially see that the solution is symmetric, but how do we justify constructing it like this and why does $(y_{k}-B_{k}\Delta x_{k})^{T}$ suddenly appear in the denominator?
The approximate matrix updates as follows: $$ B_{k+1}=B_{k}+\Delta B_{k}. $$
Note that we expect $B_{k}=\nabla^{2}f(x_{k})^{-1}, B_{k+1}=\nabla^{2}f(x_{k+1})^{-1}$. Then we have $$ \Delta B_{k}\approx \nabla^{2}f(x_{k+1})^{-1}-\nabla^{2}f(x_{k})^{-1}. $$
Since $\nabla^{2}f(x_{k+1})^{-1}$ and $\nabla^{2}f(x_{k})^{-1}$ are symmetric matrices, $\Delta B_{k}$ is supposed to be symmetric. Denote $\Delta B_{k}=\beta u u^{\top}$. Then the update becomes $$ B_{k+1}=B_{k}+\beta u u^{\top}, $$ where $\beta\in\mathbb{R},u\in\mathbb{R}^{n}$. Then we have $$ B_{k+1}y_{k}=B_{k}y_{k}+\beta u u^{\top}y_{k} \Rightarrow B_{k+1}y_{k}-B_{k}y_{k}=\beta(u^{\top}y_{k})u. $$
Thus $u=(B_{k+1}y_{k}-B_{k}y_{k})/(\beta(u^{\top}y_{k}))$. Denote $r=1/(\beta(u^{\top}y_{k}))\in\mathbb{R}$. Similarly we have $B_{k+1}y_{k}-B_{k}y_{k}=\beta u u^{\top}y_{k}$. Then it holds $$B_{k+1}y_{k}-B_{k}y_{k}=\beta r^{2}(B_{k+1}y_{k}-B_{k}y_{k})(B_{k+1}y_{k}-B_{k}y_{k})^{\top}y_{k}.$$
Then we have $\beta r^{2}(B_{k+1}y_{k}-B_{k}y_{k})^{\top}y_{k}=1$. Substituting this into the previous formula, we get $$B_{k+1}=B_{k}+\frac{(B_{k+1}y_{k}-B_{k}y_{k})(B_{k+1}y_{k}-B_{k}y_{k})^{\top}}{(B_{k+1}y_{k}-B_{k}y_{k})^{\top}y_{k}}.$$
Substitute $B_{k+1}y_{k},y_{k}$ with $y_{k}, \Delta x_{k}$, and you can get the update scheme in your post.