I have seen the claim that the directional derivative of $f$ in the direction $\hat{v}$, where $||\hat{v}|| = 1$, (denoted $\nabla_\hat{v} f$) is equal to the gradient of $\nabla f$ dotted with $\hat{v}$.
I have tried to prove this to myself, but I got stuck: $\newcommand{\R}{\mathbb{R}}$ $\newcommand{\limit}[2]{\lim_{#1 \to #2}}$ $\newcommand{\pderiv}[2]{\dfrac{\partial#1}{\partial#2}}$
Let $f : \R^n \to \R$
I accept that
$$ \nabla_{\hat{v}} f = \limit{h}{0} \frac{f(x + h\hat{v}) - f(x)}{h}$$
for making intuitive sense. Furthermore I know that
$$ \pderiv{f}{x_i} = \limit{h}{0} \frac{f(x + h\hat{i}) - f(x)}{h} $$
where $\hat{i}$ is the unit vector of the $i$-th dimension. I also know that
$$\nabla f = \left( \pderiv{f}{x_1}, \dots, \pderiv{f}{x_n} \right)$$
Now I want to show that $\nabla_\hat{v} f = \nabla f \bullet \hat{v}$:
\begin{align*} \nabla f \bullet \hat{v} & = \left( \pderiv{f}{x_1}, \dots, \pderiv{f}{x_n} \right) \bullet v\hat{v}\\ &= \sum_{i = 1}^{n} \pderiv{f}{x_i} \cdot \hat{v}_i\\ &= \sum_{i = 1}^{n} \limit{h}{0} \frac{f(x + h\hat{i}) - f(x)}{h} \cdot \hat{v}_i\\ &= \sum_{i = 1}^{n} \limit{h}{0} \frac{\hat{v}_if(x + h\hat{i}) - \hat{v}_if(x)}{h}\\ \end{align*}
And now I am stuck. I don't see a way to transform the last line into $\limit{h}{0} \frac{f(x + h\hat{v}) - f(x)}{h}$ to reach $\nabla_{\hat{v}} f$
Can you help me out here?
Edit
I now conceptually understand why the dotproduct of direction and gradient is the directional derivative:
Let's say we have a differentiable function $f$ from $\mathbb{R}^2$ to $\mathbb{R}^3$ mapping the $xy$-plane into the $xyz$-space and we want to know the directional derivative of $f$ at a point $p = (x', y')$ for some vector $u$.
First what we need to know is, how much $f$ changes in $x$ direction and how much it changes in $y$ direction.
Then we need to realize that for small distances whatever direction we go along the surface of $f$, the total change in height is the sum of the change in height in the $x$ component of our direction and the change in height in $y$ component in our direction.
But now we can weight the partial derivatives for the $x$ and $y$ direction with the components of $u$ to get their individual contributions for the direction in which $u$ is pointing!
So if $u$ has an $x$-component of $u_x$ we weight the partial derivative of $f$ for the $x$ direction accordingly. When we do the same for $y$ we get:
$f'(x') \cdot u_x + f'(y') \cdot u_y$
which is in fact $\nabla f \bullet u$
I understood this after listeing to this lecture:
Let $v:=(v_1,v_2,...,v_n)$ and $x:=(x_1,...,x_n)$. Define $y$ with $y_k:=x_k+hv_k$ for $h>0$ and $k=1,...,n$ so that $y=x+hv$. Then $$\frac{df(y_1,...,y_n)}{dh}=\frac{\partial f}{\partial y_1}\frac{dy_1}{dh}+...+\frac{\partial f}{\partial y_n}\frac{dy_n}{dh}=\frac{\partial f}{\partial y_1}v_1+...+\frac{\partial f}{\partial y_n}v_n=\langle\nabla_y f,v\rangle$$ On the other hand $$\frac{df(y_1,...,y_n)}{dh}=\frac{df(x+hv)}{dh}:=g'(h)$$ where $g(h):=f(x+hv)$. Therefore $$g'(0)=\lim_{h\to 0}\frac{f(x+hv)-f(x)}{h}=\frac{df}{dh}\Big|_{h=0}=\langle\nabla_y f,v\rangle\Big|_{h=0}=\langle\nabla_x f,v\rangle$$