To illustrate the question that I'm asking, consider the theorem:
If the partial derivatives of the $f$ exists in a neighbourhood of $c$ and are continuous at $c$, then $f$ is differentiable at $c$.
However, when we talk about partial derivatives, we talk about the directional derivative in the direction of the standard basis vectors (0,...,1,...0)of $\mathbb{R}^p$.
First of all, why do we choose and formulate the theorems with respect to these basis ? I mean would any orthonormal basis of $\mathbb{R}^p$ work in the same way ?
I mean, for example, if we think geometrically, for $\mathbb{R}^3 $the choice of $x,y,z$ coordinates is arbitrary, hence any orthonormal basis of $\mathbb{R}^3$ should work, but how about any basis of $\mathbb{R}^3$ ?
Secondly, if we were to formulate, for example, the above theorem wrt any basis of $\mathbb{R}^p$, how could we do that ?
Edit:
Please provide an argument to your statements.
Edit 2:
In the book that I'm using (The elements of Real Analysis by Bartle), for example, the above theorem is given before stating that the derivative of $f$ can be written in terms of the gradient, so to answer to this question, I cannot use this fact.
The basis you chose to work with does matter. Indeed, consider a basis $v_1,\ldots, v_n$ of $\mathbb R^n$. Rewriting the Fréchet derivative, for any $h=\sum_{k=1}^n h_i v_i$,
$$df(x)(h)=df(x)(\sum_{k=1}^n h_iv_i)=\sum_{k=1}^n h_i df(x)(v_i)$$
If you define $\frac{\partial f}{\partial x_i}(x)$ as $df(x)(v_i)$ then you get the formula $df(x)(h) = \sum_{k=1}^n h_i \frac{\partial f}{\partial x_i}(x)$.
However, you may not write this last quantity as $\nabla f(x)^Th$ because the $h_i$ are not necessarily the coordinates of $h$ in the canonical basis.