I've been reading about the shift operator $E=e^{\frac{\mathrm{d}}{\mathrm{d}x}}$, which can be represented as
$$e^{\frac{\mathrm{d}}{\mathrm{d}x}} = I + \frac{\mathrm{d}}{\mathrm{d}x} + \frac{1}{2!} \frac{\mathrm{d}^2}{\mathrm{d}x^2} + \frac{1}{3!} \frac{\mathrm{d}^3}{\mathrm{d}x^3} + \ldots$$
My understanding of this Taylor series is that the latter differential operators in this series come from the $x^k$ factors of the summation terms, $\left( \frac{\mathrm{d}}{\mathrm{d}x} \right)^0, \left( \frac{\mathrm{d}}{\mathrm{d}x} \right)^1, \left( \frac{\mathrm{d}}{\mathrm{d}x} \right)^2,\ldots$. If, generally, $\left(\frac{\mathrm{d}f}{\mathrm{d}x}\right)^2 \ne \frac{\mathrm{d}^2f}{\mathrm{d}x^2}$, how does raising the differential operator to some power, $k$ yield the $k$th differential operator, as asserted here? That is, why does $\left(\frac{\mathrm{d}}{\mathrm{d}x}\right)^k = \frac{\mathrm{d}^k}{\mathrm{d}x^k}$
$\left(\frac d{dx}\right)^k = \frac{d^k}{dx^k}$ is the definition of $\left(\frac d{dx}\right)^k$. In other words, the mathematical community at large has, collectively, come to the conclusion that among all possible interpretations of $\left(\frac d{dx}\right)^k$, $\frac{d^k}{dx^k}$ makes the most sense. So that is what we have decided that it means.
And to me it doesn't seem strange at all that $\left(\frac{d}{dx}\right)^kf^k\neq \left(\frac{df}{dx}\right)^k$. Multiplication isn't always commutative, and $\frac{d}{dx}$ and $f$ is just another example of things that do not commute.