Are there situations where index substitution using Kronecker deltas is not allowed? I'm currently working through the differentiation of the softmax function, where I arrive at the following result: $$ \frac{\partial a_i}{\partial z_k} = a_i(\delta_{ik} - a_k). $$ Expanding the terms gives $$ \frac{\partial a_i}{\partial z_k} = a_i\delta_{ik} - a_ia_k. $$ I was then tempted to simplify this to $$ \frac{\partial a_i}{\partial z_k} = a_k - a_ia_k, $$ but that's obviously wrong: the unsimplified version drops the first term when $i \ne k$, whereas the simplified version keeps $a_k$ for every $i$. Can someone explain what's wrong? Am I missing some constraints on when the substitution can be performed?
2026-03-25 06:17:51
Kronecker delta - substitution issues
240 Views Asked by Bumbble Comm https://math.techqa.club/user/bumbble-comm/detail At
There is 1 best solution below
You may already know this, but the Einstein summation convention does not apply in this situation, and replacing an index via the delta is legitimate only when that index is summed over.
The index $i$ still appears on the left-hand side, which makes it a free index, so there is no summation over it (see the top rule on the 2nd page of these notes, which shows the difference between the summation "dummy" index and the "free" index; even though the phrase "dummy index" is not used explicitly, that is the index over which you sum). Without the sum, $a_i\delta_{ik}$ equals $a_k$ when $i = k$ and $0$ otherwise; it is only the contraction $\sum_i a_i\delta_{ik}$ that collapses to $a_k$. You may also be interested in the answer to my question on when we can use the summation convention and when we cannot.
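
The point can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the function names (`softmax`, `jacobian_correct`, `jacobian_wrong`, `jacobian_numeric`) and the test vector `z` are made up for illustration and do not come from the question. It compares the formula $a_i(\delta_{ik} - a_k)$ and the tempting "simplified" version $a_k - a_ia_k$ against a finite-difference Jacobian, and then shows the one place where the delta really does replace an index: a sum over $i$.

```python
import numpy as np

def softmax(z):
    # Shift by the maximum for numerical stability; the result is unchanged.
    e = np.exp(z - np.max(z))
    return e / e.sum()

def jacobian_correct(a):
    # J[i, k] = a_i * (delta_ik - a_k); i and k are both free indices, no sum.
    return np.diag(a) - np.outer(a, a)

def jacobian_wrong(a):
    # J[i, k] = a_k - a_i * a_k: the delta was "substituted" without a sum over i.
    return a[np.newaxis, :] - np.outer(a, a)

def jacobian_numeric(z, eps=1e-6):
    # Central finite differences, one column per perturbed input z_k.
    n = z.size
    J = np.empty((n, n))
    for k in range(n):
        dz = np.zeros(n)
        dz[k] = eps
        J[:, k] = (softmax(z + dz) - softmax(z - dz)) / (2 * eps)
    return J

z = np.array([0.5, -1.0, 2.0])  # arbitrary illustrative input
a = softmax(z)
J_num = jacobian_numeric(z)

correct_matches = np.allclose(jacobian_correct(a), J_num, atol=1e-8)
wrong_matches = np.allclose(jacobian_wrong(a), J_num, atol=1e-8)

# With an explicit sum over i, the delta does replace the index:
# sum_i a_i * delta_ik = a_k.
contracted = np.einsum("i,ik->k", a, np.eye(a.size))
contraction_ok = np.allclose(contracted, a)

print("correct formula matches finite differences:", correct_matches)
print("'simplified' formula matches finite differences:", wrong_matches)
print("sum_i a_i delta_ik equals a_k:", contraction_ok)
```

In the wrong version, every off-diagonal entry carries an extra $a_k$, which is exactly the term the delta should have zeroed out for $i \ne k$.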