So I was under the impression that the derivative of the squared L2 norm of a vector x is just 2x, but the example in the screenshot I linked to says otherwise. What gives? I can't figure out why there's an extra A transpose factor in the derivative.
2025-01-13 02:26:53
Difficulty with the derivative of $L_2$ norm
2.5k views. Asked by user2474041 (https://math.techqa.club/user/user2474041/detail)
There is 1 best solution below.
You can use the chain rule for this problem. But for matrix/vector problems, the intermediate derivatives required by the chain rule often involve complicated third- and fourth-order tensors. So my preferred approach is to use successive changes of variables within differential expressions.
Define the variable $y=Ax+b$, so that $dy=A\,dx$. Writing the norm in terms of the Frobenius product (for vectors, $a:b=a^Tb$), the function and its differential are $$\eqalign{ f &= \|y\|_F^2 \cr &= y:y \cr\cr df &= 2y:dy \cr &= 2y:A\,dx \cr &= 2A^Ty:dx \cr }$$ The last step moves $A$ to the other side of the product, since $y:A\,dx = y^TA\,dx = (A^Ty)^T dx$. This is where the extra $A^T$ comes from.

Since $df=\big(\frac{\partial f}{\partial x}:dx\big),\,$ the gradient is $$\eqalign{ \frac{\partial f}{\partial x} &= 2A^Ty \cr &= 2A^T(Ax+b) \cr }$$

Note that your initial impression is correct with respect to $y$, i.e. the gradient with respect to $y$ is simply $$\eqalign{ df &= 2y:dy \cr \frac{\partial f}{\partial y} &= 2y \cr }$$
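If you want to convince yourself numerically, here is a minimal sketch that compares the gradient $2A^T(Ax+b)$ against a central finite-difference approximation of $f(x)=\|Ax+b\|^2$. The helper names (`matvec`, `transpose`, `f`, `grad_f`, `numerical_grad`), the matrix sizes, and the random seed are all my own choices for illustration, not anything from the question. It uses only the Python standard library.

```python
import random


def matvec(M, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in M]


def transpose(M):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*M)]


def f(x, A, b):
    """Squared L2 norm of y = A x + b."""
    y = [ax_i + b_i for ax_i, b_i in zip(matvec(A, x), b)]
    return sum(y_i * y_i for y_i in y)


def grad_f(x, A, b):
    """Analytic gradient with respect to x: 2 A^T (A x + b)."""
    y = [ax_i + b_i for ax_i, b_i in zip(matvec(A, x), b)]
    return [2.0 * g_i for g_i in matvec(transpose(A), y)]


def numerical_grad(func, x, eps=1e-6):
    """Central finite-difference approximation of the gradient of func at x."""
    grad = []
    for i in range(len(x)):
        x_plus = list(x)
        x_minus = list(x)
        x_plus[i] += eps
        x_minus[i] -= eps
        grad.append((func(x_plus) - func(x_minus)) / (2.0 * eps))
    return grad


# Illustrative sizes and seed (assumptions): A is 4x3, so x has 3 entries.
rng = random.Random(0)
A = [[rng.gauss(0.0, 1.0) for _ in range(3)] for _ in range(4)]
b = [rng.gauss(0.0, 1.0) for _ in range(4)]
x = [rng.gauss(0.0, 1.0) for _ in range(3)]

analytic = grad_f(x, A, b)
numeric = numerical_grad(lambda v: f(v, A, b), x)
max_abs_diff = max(abs(a_i - n_i) for a_i, n_i in zip(analytic, numeric))
print("max |analytic - numeric| =", max_abs_diff)
```

The two gradients should agree up to floating-point rounding, because $f$ is quadratic in $x$ and the central difference has no truncation error for quadratics. Dropping the `transpose(A)` multiplication in `grad_f` (i.e. returning just $2y$) gives a vector of the wrong length whenever $A$ is not square, which is a quick way to see that $2y$ cannot be the gradient with respect to $x$.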