$\frac{d\bf{Ax}}{d\bf{x}'}=A$ makes sense as Ax becomes a column vector given x is a column vector, but I see elsewhere $\frac{d\bf{Ax}}{d\bf{x}}=A'$

79 Views Asked by At

$\frac{d\bf{Ax}}{d\bf{x}'} = \bf{A}$ makes sense as $\bf{Ax}$ becomes a column vector given $\bf{x}$ is a column vector and $\bf{x}'$ is a row vector and $\bf{A}$ is a $n\times n$ matrix, but I see elsewhere that states $\frac{d\bf{Ax}}{d\bf{x}} = \bf{A}'$, which doesn't makes sense (to me!) because it takes a derivative of $\bf{Ax}$ by the column vector which should place each partial column-wise. It only makes sense if somehow $Ax$ is a row vector such as $\bf{x'A'}$.

What is the explanation of this result?

1

There are 1 best solutions below

3
On

Index notation often brings clarity to this kind of problem. The only thing you need to know is $$\frac{\partial x_i}{\partial x_j} = \delta_{ij} = \begin{cases} 1 & i = j \\ 0 & i\ne j \end{cases}$$

For the current problem $y=Ax$ we have $$\eqalign{ y_i &= A_{ij}x_j \cr \frac{\partial y_i}{\partial x_k} &= A_{ij}\frac{\partial x_i}{\partial x_k} = A_{ij}\delta_{ik} = A_{ik} \cr \frac{\partial(Ax)}{\partial x} &= A \cr }$$ where summation over a repeated index is implied (aka Einstein convention).

Differentiating $w^T = x^TA^T\,$ produces $$\eqalign{ w_i &= x_jA^T_{ji} = A_{ij}x_j =y_i \cr \frac{\partial w_i}{\partial x_k} &= \frac{\partial y_i}{\partial x_k} = A_{ik} \cr \frac{\partial(x^TA^T)}{\partial x} &= A \cr }$$ You don't need to concern yourself with metric tensors or dual bases unless you're working with a non-Euclidean space (general relativity) or an oblique coordinate system (crystallography).

In standard cartesian coordinates you don't need to distinguish between $y^T$ and $y$. It just happens to be a limitation of matrix notation which creates this confusion.