How can I best improve my understanding on linear operator interactions: $df/dx$ vs $df(x)/dx$, and $df/dx$ vs $dy/dx$, etc?

161 Views Asked by At

For years now, I had always assumed that $\frac{df}{dx}$ was notational shorthand for $\frac{df(x)}{dx}$, because many teachers and authors of papers treat them interchangably. However upon getting studying deeper into vector/matrix calculus, and the general general theory of operators, I've realized that $\frac{df}{dx} \ne \frac{df(x)}{dx}$, likewise $\frac{df}{dx} \ne \frac{dy}{dx}$, and sometimes $\frac{df(x)}{dx} \ne \frac{dy}{dx}$. IE, the difference between $df$ and $df(x)$ is that the latter essentially involves a composition operator, which technically changes the logical/mathematical meaning of the expression. This is likewise further complicated by the fact that some authors use $d$ and $\partial$ interchangably, whereas others use them to indicate different properties of the differential.

While I able to observe a general pattern to the identities and operator interactions in terms of associative/commutative order of operations, like with integration by substitution and differentiation rules, they don't seem to be completely consistent, which means there's something I am not understanding.

TL;DR: I am basically trying to get a clear intuition on the interaction/relationship between the composition operator and differential operator, but cannot really find any sources that elucidate this - and such is further complicated by the fact that many authors seem to abuse notation using forms interchangably.

Note: I definitely don't know everything about operator theory, as there is still a lot I am learning, but I can't find many good learning resources about the topic.

2

There are 2 best solutions below

5
On

Generally we don't distinguish much between $\frac{\mathrm{d}f(x)}{\mathrm{d}x}$ and $\frac{\mathrm{d}f}{\mathrm{d}x}$, but formally speaking we can view $\frac{\mathrm{d}f(x)}{\mathrm{d}x}$ as the value obtained when evaluating the derivative of $f$ at a point $x$, i.e $$\frac{\mathrm{d}f(x)}{\mathrm{d}x}=f'(x) = \lim_{h\to 0} \frac{f(x+h)-f(x)}{h}$$ Whereas we can view $\frac{\mathrm{d}f}{\mathrm{d}x}$ as the function defined as the derivative of $f$, i.e, $$\frac{\mathrm{d}f}{\mathrm{d}x}=f' = \mathcal{D}(f): x \mapsto \lim_{h\to 0} \frac{f(x+h)-f(x)}{h}$$ Here $\mathcal{D}$ denotes the differential operator.

2
On

There is no unique standard for the notation employed in calculus, let alone operator theory. Different authors use different notation to refer to the same operations. This is partly due to the historical debate regarding the "correct" notation for calculus (look up the Analytical Society) - which resulted in different notations for differentiation. There is also regional variation in commonly used notation by different education systems. As maddening as it is, the only guaranteed way to know what the notation means is from context.

Still, there are some generally accepted notations which are commonly used for specific operators. The ones that immediately come to mind are $\mathbf{J}$ for the Jacobian matrix, $\mathbf{H}$ for the Hessian matrix, $\nabla$ for the gradient, and $\nabla\times$ and $\nabla\cdot$ for curl and divergence, respectively.

Additionally, $\partial$ is most commonly used to indicate the partial derivative of a multivariable function - though it can be used to denote the derivative of a function of a single variable - while $d$ is typically used for the derivative of a function of a single variable, or the total derivative of a function of multiple variables.

As others have stated, the inclusion or exclusion of a variable (e.g. $\frac{df}{dx}$ vs $\frac{df(x)}{dx}$), if significant, is used to indicate whether the derivative of a function (which is another function) or the value of the derivative at a point is being considered.

Lastly, it is common in vector calculus to treat differentiation operators as scalars and vectors of a sort. This primarily a matter of convenience, as it allow, for instance, concise definitions like...

$$\text{div}\ \mathbf{f} = \nabla\cdot \mathbf{f}$$

...to replace larger expression like...

$$\text{div}\ \mathbf{f} = \sum_{i=1}^n \frac{\partial f_i}{\partial x_i}$$