I came across some problems that are related to partial derivative but I haven't learnt this yet. And I looked up many online resources but couldn't find answers to my doubts. Really hope someone can help me.
Here is my problem. $y=A^TB$, where A, B are two matrices. Now I want to know what $\frac{\partial y}{\partial A}$,$\frac{\partial y}{\partial B}$ are.
2026-04-23 06:26:11.1776925571
partial derivative of transpose matrix-matrix multiplication
149 Views Asked by Bumbble Comm https://math.techqa.club/user/bumbble-comm/detail At
1
There are 1 best solutions below
Related Questions in CALCULUS
- Equality of Mixed Partial Derivatives - Simple proof is Confusing
- How can I prove that $\int_0^{\frac{\pi}{2}}\frac{\ln(1+\cos(\alpha)\cos(x))}{\cos(x)}dx=\frac{1}{2}\left(\frac{\pi^2}{4}-\alpha^2\right)$?
- Proving the differentiability of the following function of two variables
- If $f ◦f$ is differentiable, then $f ◦f ◦f$ is differentiable
- Calculating the radius of convergence for $\sum _{n=1}^{\infty}\frac{\left(\sqrt{ n^2+n}-\sqrt{n^2+1}\right)^n}{n^2}z^n$
- Number of roots of the e
- What are the functions satisfying $f\left(2\sum_{i=0}^{\infty}\frac{a_i}{3^i}\right)=\sum_{i=0}^{\infty}\frac{a_i}{2^i}$
- Why the derivative of $T(\gamma(s))$ is $T$ if this composition is not a linear transformation?
- How to prove $\frac 10 \notin \mathbb R $
- Proving that: $||x|^{s/2}-|y|^{s/2}|\le 2|x-y|^{s/2}$
Related Questions in MATRICES
- How to prove the following equality with matrix norm?
- I don't understand this $\left(\left[T\right]^B_C\right)^{-1}=\left[T^{-1}\right]^C_B$
- Powers of a simple matrix and Catalan numbers
- Gradient of Cost Function To Find Matrix Factorization
- Particular commutator matrix is strictly lower triangular, or at least annihilates last base vector
- Inverse of a triangular-by-block $3 \times 3$ matrix
- Form square matrix out of a non square matrix to calculate determinant
- Extending a linear action to monomials of higher degree
- Eiegenspectrum on subtracting a diagonal matrix
- For a $G$ a finite subgroup of $\mathbb{GL}_2(\mathbb{R})$ of rank $3$, show that $f^2 = \textrm{Id}$ for all $f \in G$
Related Questions in PARTIAL-DERIVATIVE
- Equality of Mixed Partial Derivatives - Simple proof is Confusing
- Proving the differentiability of the following function of two variables
- Partial Derivative vs Total Derivative: Function depending Implicitly and Explicitly on Variable
- Holding intermediate variables constant in partial derivative chain rule
- Derive an equation with Faraday's law
- How might we express a second order PDE as a system of first order PDE's?
- Partial derivative of a summation
- How might I find, in parametric form, the solution to this (first order, quasilinear) PDE?
- Solving a PDE given initial/boundary conditions.
- Proof for f must be a constant polynomial
Related Questions in MATRIX-CALCULUS
- How to compute derivative with respect to a matrix?
- Definition of matrix valued smooth function
- Is it possible in this case to calculate the derivative with matrix notation?
- Monoid but not a group
- Can it be proved that non-symmetric matrix $A$ will always have real eigen values?.
- Gradient of transpose of a vector.
- Gradient of integral of vector norm
- Real eigenvalues of a non-symmetric matrix $A$ ?.
- How to differentiate sum of matrix multiplication?
- Derivative of $\log(\det(X+X^T)/2 )$ with respect to $X$
Trending Questions
- Induction on the number of equations
- How to convince a math teacher of this simple and obvious fact?
- Find $E[XY|Y+Z=1 ]$
- Refuting the Anti-Cantor Cranks
- What are imaginary numbers?
- Determine the adjoint of $\tilde Q(x)$ for $\tilde Q(x)u:=(Qu)(x)$ where $Q:U→L^2(Ω,ℝ^d$ is a Hilbert-Schmidt operator and $U$ is a Hilbert space
- Why does this innovative method of subtraction from a third grader always work?
- How do we know that the number $1$ is not equal to the number $-1$?
- What are the Implications of having VΩ as a model for a theory?
- Defining a Galois Field based on primitive element versus polynomial?
- Can't find the relationship between two columns of numbers. Please Help
- Is computer science a branch of mathematics?
- Is there a bijection of $\mathbb{R}^n$ with itself such that the forward map is connected but the inverse is not?
- Identification of a quadrilateral as a trapezoid, rectangle, or square
- Generator of inertia group in function field extension
Popular # Hahtags
second-order-logic
numerical-methods
puzzle
logic
probability
number-theory
winding-number
real-analysis
integration
calculus
complex-analysis
sequences-and-series
proof-writing
set-theory
functions
homotopy-theory
elementary-number-theory
ordinary-differential-equations
circles
derivatives
game-theory
definite-integrals
elementary-set-theory
limits
multivariable-calculus
geometry
algebraic-number-theory
proof-verification
partial-derivative
algebra-precalculus
Popular Questions
- What is the integral of 1/x?
- How many squares actually ARE in this picture? Is this a trick question with no right answer?
- Is a matrix multiplied with its transpose something special?
- What is the difference between independent and mutually exclusive events?
- Visually stunning math concepts which are easy to explain
- taylor series of $\ln(1+x)$?
- How to tell if a set of vectors spans a space?
- Calculus question taking derivative to find horizontal tangent line
- How to determine if a function is one-to-one?
- Determine if vectors are linearly independent
- What does it mean to have a determinant equal to zero?
- Is this Batman equation for real?
- How to find perpendicular vector to another vector?
- How to find mean and median from histogram
- How many sides does a circle have?
$\def\p#1#2{\frac{\partial #1}{\partial #2}}$Let $\,(\alpha,\beta)\,$ be fourth-order tensors with components $$\eqalign{ \alpha_{ijk\ell} &= \delta_{ik}\,\delta_{j\ell} \\ \beta_{ijk\ell} &= \delta_{i\ell}\,\delta_{jk} \\ }$$ and properties with respect to the matrices $(F,G,H)$ $$\eqalign{ \alpha:H &= H:\alpha = H \\ \beta:F &= F:\beta = F^T \\ HFG &= H\alpha G^T:F \\ }$$ where a colon denotes a double-contraction product, i.e. $$\eqalign{ \left(\alpha:H\right)_{ij} &= \sum_k\sum_\ell\alpha_{ijk\ell}\,H_{k\ell} \\ \left(F:\beta\right)_{k\ell} &= \sum_i\sum_jF_{ij}\,\beta_{ijk\ell} \\ }$$ and juxtaposition implies a single-contraction product $$\eqalign{ \left(H\alpha\right)_{mjk\ell} &= \sum_i H_{mi}\,\alpha_{ijk\ell} \\ \left(\alpha G^T\right)_{ijkm} &= \sum_\ell\alpha_{ijk\ell}\,G^T_{\ell m} \\ }$$
With these tensors, the posted question can be answered as follows $$\eqalign{ Y &= A^TB \\&= A^T\alpha:B \quad&\implies\quad\p{Y}{B} &= A^T\alpha \\ Y &= \alpha B^T:A^T \\ &= \alpha B^T:\beta:A \quad&\implies\quad\p{Y}{A} &= \alpha B^T:\beta \\ }$$ So the gradients in question are seen to be fourth-order tensors.
An approach which avoids higher-order tensors, is to transform the relationship into a vector equation using Kronecker products. $$\eqalign{ {\rm vec}(Y) &= (I\otimes A^T)\;{\rm vec}(B) \quad&\implies\quad \p{{\,\rm vec}\,Y}{{\,\rm vec}\,B} = (I\otimes A^T) \\ &= (B^T\otimes I)K\;{\rm vec}(A) \quad&\implies\quad \p{{\,\rm vec}\,Y}{{\,\rm vec}\,A} = (B^T\otimes I)K \\ }$$ where $K$ is the commutation matrix associated with vectorization.
Another approach is to use component-wise derivatives $$\eqalign{ \p{Y}{A_{ij}} &= E_{ij}^TB \qquad\quad \p{Y}{B_{ij}} &= A^TE_{ij} \\ }$$ where $E_{ij}$ is a matrix with all components equal to zero, except the $(i,j)$ component which equals one. And any matrix with independent components satisfies the identity $$\eqalign{ \p{G}{G_{k\ell}} &= E_{k\ell} \qquad\iff\qquad \p{G^T}{G_{k\ell}} &= E_{k\ell}^T \\ }$$ Finally, to bring things full circle $$\eqalign{ \p{G_{ij}}{G_{k\ell}} &= \alpha_{ijk\ell} \qquad\iff\qquad \p{G_{ij}^T}{G_{k\ell}} &= \beta_{ijk\ell} \\ }$$