How would I go about calculating $ \frac{d\textit{$\alpha$}}{d\boldsymbol{A}}$ for $$\textit{$\alpha$} = \sqrt{\boldsymbol{x}^{\intercal}\boldsymbol{A}\boldsymbol{x}}$$ where $\alpha$ is a scalar, $\boldsymbol{x}\in\mathbb{R}^{n}$, and $\boldsymbol{A}\in\mathbb{R}^{n\times n}$.
Sorry if this question is straightforward. I'm trying to implement an algorithm and came across this equation. I'm not familiar with matrix and vector derivatives. Also, any links to a comprehensive introduction to matrix/vector calculus would be appreciated.
$x^\top A x$ can be written as $\sum_i \sum_j a_{ij} x_i x_j$. In this form, it is not hard to see what the partial derivative with respect to $a_{ij}$ is for any $i,j$. Then $\frac{d}{dA}(x^\top A x)$ can be viewed as a matrix consisting of all such partial derivatives.
To deal with $\sqrt{x^\top A x}$ you can just use the usual chain rule with the map $z \mapsto \sqrt{z}$.