Let $X$ be a positive definite matrix with positive definite matrix square root $X^{1/2}$. Define $$y = \text{trace}(AX^{1/2})$$ some known matrix $A$. What is ${\partial y}/{\partial X}$ ? I tried using this set of notes together with the square root formula from here to evaluate it in the $2 \times 2$ case using index notation, but there must be a better way? Especially to generalize it to the $n \times n$ case. I would guess it is something like $$ \frac{\partial y}{\partial X} = \frac{1}{2} A^T X^{-1/2}$$ where $X^{-1/2}$ is the square root of $X^{-1}$.
2026-04-12 16:57:56.1776013076
Matrix derivative of scalar function involving matrix square root
1.1k Views Asked by Bumbble Comm https://math.techqa.club/user/bumbble-comm/detail At
1
There are 1 best solutions below
Related Questions in MULTIVARIABLE-CALCULUS
- Equality of Mixed Partial Derivatives - Simple proof is Confusing
- $\iint_{S} F.\eta dA$ where $F = [3x^2 , y^2 , 0]$ and $S : r(u,v) = [u,v,2u+3v]$
- Proving the differentiability of the following function of two variables
- optimization with strict inequality of variables
- How to find the unit tangent vector of a curve in R^3
- Prove all tangent plane to the cone $x^2+y^2=z^2$ goes through the origin
- Holding intermediate variables constant in partial derivative chain rule
- Find the directional derivative in the point $p$ in the direction $\vec{pp'}$
- Check if $\phi$ is convex
- Define in which points function is continuous
Related Questions in MATRIX-CALCULUS
- How to compute derivative with respect to a matrix?
- Definition of matrix valued smooth function
- Is it possible in this case to calculate the derivative with matrix notation?
- Monoid but not a group
- Can it be proved that non-symmetric matrix $A$ will always have real eigen values?.
- Gradient of transpose of a vector.
- Gradient of integral of vector norm
- Real eigenvalues of a non-symmetric matrix $A$ ?.
- How to differentiate sum of matrix multiplication?
- Derivative of $\log(\det(X+X^T)/2 )$ with respect to $X$
Trending Questions
- Induction on the number of equations
- How to convince a math teacher of this simple and obvious fact?
- Find $E[XY|Y+Z=1 ]$
- Refuting the Anti-Cantor Cranks
- What are imaginary numbers?
- Determine the adjoint of $\tilde Q(x)$ for $\tilde Q(x)u:=(Qu)(x)$ where $Q:U→L^2(Ω,ℝ^d$ is a Hilbert-Schmidt operator and $U$ is a Hilbert space
- Why does this innovative method of subtraction from a third grader always work?
- How do we know that the number $1$ is not equal to the number $-1$?
- What are the Implications of having VΩ as a model for a theory?
- Defining a Galois Field based on primitive element versus polynomial?
- Can't find the relationship between two columns of numbers. Please Help
- Is computer science a branch of mathematics?
- Is there a bijection of $\mathbb{R}^n$ with itself such that the forward map is connected but the inverse is not?
- Identification of a quadrilateral as a trapezoid, rectangle, or square
- Generator of inertia group in function field extension
Popular # Hahtags
second-order-logic
numerical-methods
puzzle
logic
probability
number-theory
winding-number
real-analysis
integration
calculus
complex-analysis
sequences-and-series
proof-writing
set-theory
functions
homotopy-theory
elementary-number-theory
ordinary-differential-equations
circles
derivatives
game-theory
definite-integrals
elementary-set-theory
limits
multivariable-calculus
geometry
algebraic-number-theory
proof-verification
partial-derivative
algebra-precalculus
Popular Questions
- What is the integral of 1/x?
- How many squares actually ARE in this picture? Is this a trick question with no right answer?
- Is a matrix multiplied with its transpose something special?
- What is the difference between independent and mutually exclusive events?
- Visually stunning math concepts which are easy to explain
- taylor series of $\ln(1+x)$?
- How to tell if a set of vectors spans a space?
- Calculus question taking derivative to find horizontal tangent line
- How to determine if a function is one-to-one?
- Determine if vectors are linearly independent
- What does it mean to have a determinant equal to zero?
- Is this Batman equation for real?
- How to find perpendicular vector to another vector?
- How to find mean and median from histogram
- How many sides does a circle have?
For convenience define a new matrix $$S=X^{1/2}$$ Let's start by finding the differential of $X$ in terms of the $S$-matrix. $$\eqalign{ X &= SS \cr dX &= dS\,S + S\,dS \cr dx &= (S^T\otimes I+I^T\otimes S)\,ds \cr ds &= (S\otimes I+I\otimes S)^{-1}\,dx = M\,dx \cr }$$ where in the last steps I've vectorized the results using the notation $ds={\rm vec}(dS)$ and $dx={\rm vec}(dX)$, and taken advantage of the fact that $S$ and $I$ are symmetric.
Now we need the Kronecker decomposition of the matrix $M$. Look for the classic paper "Approximation with Kronecker Products" by van Loan and Pitsianis, or Pitsianis' 1997 dissertation (which contains Matlab code). Despite the name of the paper, the Kronecker factorization is a full decomposition, not an approximation.
Anyway, the matrix can be decomposed into $$\eqalign{ M &= \sum_{k=1}^r Y_k\otimes Z_k \cr }$$ where $r={\rm rank}(\widetilde{M})$, the rank of the so-called tilde matrix of $M$, which is an operation which doesn't do any actual calculations, it merely reshapes and shuffles the elements of the matrix, but the operation does change the rank. Note that in this case, we want the the factors $\{Y_k, Z_k\}$ to be square matrices with the same dimensions as the other matrices $\{A, S, X\}$. The desired dimensions of the factors are one of the inputs to the tilde function.
Substituting this decomposition into the previous expression $$\eqalign{ ds &= M\,dx = \sum_{k=1}^r Y_k\otimes Z_k\,dx \cr dS &= \sum_{k=1}^r Z_k\,dX\,Y_k^T \cr \cr }$$
Finally let's write your function in of the inner/Frobenius product, i.e. $$A:B={\rm tr}(A^TB)$$ and find its differential. $$\eqalign{ y &= A:S \cr dy &= A:dS \cr &= A:\Bigg(\sum_{k=1}^r Z_k\,dX\,Y_k^T\Bigg) \cr &= \Bigg(\sum_{k=1}^r Z_k^T\,A\,Y_k\Bigg): dX \cr\cr }$$ This means that the gradient is $$\eqalign{ \frac{\partial y}{\partial X} &= \sum_{k=1}^r Z_k^T\,A\,Y_k \cr\cr }$$