Why is $df|_p(X)$ called the "gradient of $f$" in differential geometry?

540 Views Asked by At

In undergraduate applied vector calculus, the gradient is simply the one-dimensional array of partial derivatives of a function $f$: $(\delta_1f,...,\delta _df)$

In differential geometry (at least in the general relativity course I'm taking), the gradient of $f$ (a function on a manifold $M$) is defined as a map $$(df)_p: T_pM \to \mathbb R,$$ $$(df)_p(X) := X(f),$$ where $T_pM$ is the tangent space on point $p \in M$.

Why do these definitions intuitively capture the same concept of "gradient"?

1

There are 1 best solutions below

3
On

If you fix a Riemannian metric $g$ on $M$, then the vector field $\nabla f$ determined by $$g_p(\nabla f, X) = X_p(f) = (df)_p(X)$$ is a direct generalization of the gradient in calculus. Specifically, if $M = \mathbb{R}^n$ and $g$ is the usual scalar product then $\nabla f$ is just the vector of partial derivatives.

Referring to $(df)_p$ itself as the gradient of $f$ is a slight abuse of notation.

Edit Here is what happens when $M = \mathbb{R}^n$. If $X = \partial_i$, then $$g(\nabla f, \partial_i) = \partial_i (f);$$ since these are an orthonormal basis of $T_p M$ at every $p$, it follows that $$\nabla f = \sum_{i=1}^n g(\nabla f, \partial_i) \partial_i = \sum_{i=1}^n (\partial_i f) \partial_i,$$ so the components of $\nabla f$ are indeed just the $\partial_i f$.

As Travis points out, $\nabla f$ depends on the choice of $g$, while $df$ does not. This makes $df$ a more natural object.