I've been posed a question in which I'm to differentiate with respect to $x$ a function of the form $(x+a)^k$. I've successfully completed (matches the book's answer) the question by using the chain rule, however I cannot achieve the same result using the definition of the derivative.
I would like a worked example of differentiation from first principles of a function of the form $(x+a)^k$.
For concreteness, let's first take $k=3$.
We use the factorization $x^3-y^3=(x-y)(x^2+xy+y^2)$.
Here we have $(x+a+h)^3-(x+a)^3=h((x+a+h)^2+(x+a+h)(x+a)+(x+a)^2)$.
The difference quotient is then
$$\frac{(x+a+h)^3-(x+a)^3}{h}=((x+a+h)^2+(x+a+h)(x+a)+(x+a)^2).$$
As $h$ goes to zero, this becomes $3(x+a)^2$, as desired.
How does this generalize? We have similar factorizations for all $k$:
$$x^k-y^k=(x-y)(x^k+x^{k-1}y+\cdots xy^{k-1}+ y^k)$$
where there are $k$ terms in the second parenthesis. This factorization is easy to check, because the terms all telescope and cancel.
So what I did for $k=3$ can be repeated for general $k$, and when $h$ goes to zero you get $k$ terms of $(x+a)^{k-1}$ added together, which is exactly the same derivative you get using the chain rule.