for example:
$f(x)=(xx^T)^{-\frac{1}{2}}x$, where $x \in \mathbb R_{+}^{1\times d}$ is a row vector.
It is hoped that there will be specific theoretical basis (formula derivation and origin)
(Revised)
supplement:
for any a $||A||_{2,1}$, that is a norm from paper Efficient and Robust Feature Selection via Joint $l_{2,1}$-Norms Minimization
According to the above problem, how to solve the second derivative of this norm?
Some related work: The norm $\|\cdot\|_{2,1}$ of a matrix $A=(a_1,\ldots, a_n)\in\mathbb{R}^{m\times n}$ is defined as
$$ \Vert A \Vert_{2,1} = \sum_{j=1}^n \Vert a_{j} \Vert_2 = \sum_{j=1}^n \left( \sum_{i=1}^m |a_{ij}|^2 \right)^{1/2} $$
Thank you all for your help.
Given a column vector $x$, consider how its length $\lambda$ varies as $x$ is varied. $$\eqalign{ \lambda^2 &= x^Tx,\quad \lambda\,d\lambda = x^Tdx \cr }$$ Now consider the unit vector $f$ and its variation with $x$. $$\eqalign{ f &= \lambda^{-1}x \cr df &= \lambda^{-1}dx - x\lambda^{-2}\,d\lambda \cr &= \lambda^{-1}dx - x\lambda^{-3}\,(\lambda\,d\lambda) \cr &= \lambda^{-1}Idx - x\lambda^{-3}(x^Tdx) \cr &= \lambda^{-3}\Big(\lambda^2I -xx^T\Big)\,dx \cr \frac{\partial f}{\partial x} &= \lambda^{-3}\Big(\lambda^2I -xx^T\Big) \cr }$$