Can we prove the chain rule without using an artificial trick?

1.7k Views Asked by At

THEOREM 9(THE CHAIN RULE)
If $g$ is differentiable at $a$, and $f$ is differentiable at $g(a)$, then $f\circ g$ is differentiable at $a$, and $$(f\circ g)^{'}(a)=f^{'}(g(a))\cdot g^{'}(a).$$

My proof of this theorem is here:

(1) First consider the case in which for any positive real number $\epsilon$, there exists $h$ such that $0<|h|<\epsilon$ and $g(a+h)-g(a)=0$.
In this case, $\lim_{h\to 0} \frac{g(a+h)-g(a)}{h}=0$ since $g'(a)$ exists and there exists $h$ such that $0<|h|<\epsilon$ and $g(a+h)-g(a)=0$ for any positive real number $\epsilon$.
Let $\phi(h):= \frac{f(g(a+h))-f(g(a))}{g(a+h)-g(a)}$ if $g(a+h)-g(a)\neq 0$.
Let $\phi(h):=f'(g(a))$ if $g(a+h)-g(a) = 0$.
Then $\phi$ is continuous at $h=0$ since $f$ is differentiable at $g(a)$.
$\frac{f(g(a+h))-f(g(a))}{h} = \phi(h) \cdot \frac{g(a+h)-g(a)}{h}\to f'(g(a))\cdot 0 = f'(g(a))\cdot g'(a) \,\,(h\to 0)$.

(2) Second consider the case in which there exists a positive real number $\epsilon$ such that $0<|h|<\epsilon\implies g(a+h)-g(a)\neq 0.$
In this case $\frac{f(g(a+h))-f(g(a))}{h} = \frac{f(g(a+h))-f(g(a))}{g(a+h)-g(a)} \cdot \frac{g(a+h)-g(a)}{h}\to f'(g(a))\cdot g'(a) \,\,(h\to 0)$.

I don't like the function $\phi$ because it is artificial.
Can we prove the chain rule without using an artificial trick?

2

There are 2 best solutions below

0
On BEST ANSWER

If $f,g$ are continuously differentiable, then it is possible to prove directly.
Here is an argument:
Using mean value theorem: $g(a+h) = g(a) + g'(x) h$ with $x \in [a,a+h]$, $$ \lim_{h \rightarrow 0} \frac{f(g(a+h))-f(g(a))}{h} = \lim_{h \rightarrow 0} \frac{f(g(a)+g'(x)h)-f(g(a))}{h} $$ Again using mean value theorem, now in $f$: $f(g(a)+h) = f(g(a)) + f'(y) h$ with $y \in [g(a),g(a)+h]$, $$ = \lim_{h \rightarrow 0} \frac{f(g(a)) + f'(y) g'(x) h - f(g(a))}{h} = \lim_{h \rightarrow 0} \frac{f'(y)g'(x)h}{h} = f'(g(a)) g'(a)$$

The last step: $\lim_{h \rightarrow 0} f'(y)g'(x) = f'(g(a)) g'(a)$ uses the fact that derivatives are continuous.

4
On

Start with a reformulation of differentiability avoiding quotients:

A function $f: I\to\mathbb R$ on a set $I\subseteq \mathbb R$ is differentiable at $a\in I$ if and only if there is $\varphi:I\to \mathbb R$ which is continuous at $a$ and satisfies $f(x)-f(a)=\varphi(x)(x-a)$. Then $\varphi(a)=f'(a)$.

If $f$ is differentiable at $a$ with correspondig function $\varphi$ and $g:f(I)\to\mathbb R$ is differentiable at $f(a)$ with corresponding function $\gamma$, we get $$g(f(x))-g(f(a))=\gamma(f(x)) (f(x)-f(a))=\gamma(f(x))\varphi(x)(x-a).$$ Since compostions and products of continuous functions are continuous we get that $g\circ f$ is differentiable at $a$ with corresponding function $\gamma(f(x))\varphi(x)$ whose value at $a$ is $g'(f(a))f'(a)$.