Thanks for reading.
Short Version
Why is it enough for the tangent plane to only agree with the slope of a differentiable 2-input function in the $x$ and $y$ directions at a certain point for it to agree in every direction at that point?
Isn’t it still possible for the function to cross the tangent plane, even if their slopes agree in the $x$ an $y$ directions, when we move in some other direction?
Long Version
Say we have some two-input differentiable function $f(x,y)$ defining some surface in 3D space. The input into the function is $(x,y)$ coordinates, and the output of the function is the height of the surface (or hill) at each input point.
Consider some point $(x_0,y_0)$ on this surface. At this point, the surface will have an infinite number of slopes, since there are an infinite number of directions we can move in. However, for two of those directions, the slopes are easy to find:
The slope is $\frac{\partial f}{\partial x}$ in the $x$ direction.
The slope is $\frac{\partial f}{\partial y}$ in the $y$ direction.
For a plane to be tangent to a function at $(x_0,y_0)$, it means it only touches the surface at that point, and doesn't cross the surface anywhere.
When we find a tangent plane to the surface at $(x_0,y_0)$, its enough to make the slope of the plane agree with the slope of the surface in 2 directions. Usually the $x$ and $y$ directions.
I'm aware that two slopes, one in the $x$ and one in the $y$, uniquely define a single plane at that point.
However, there are an infinite number of functions that may have those slopes at that point.
Tangency implies that the slope of the tangent plane agrees with the slope of the function at that point in every direction, not just the $x$ and $y$ directions.
If it didn't agree with the slope of the function at that point in some arbitrary direction, then were we to draw lines in that direction on both the "tangent" (not actually tangent) plane and the function, those lines would intersect, and thus the plane wouldn't be brushing the surface at only a single point, and it wouldn't be a tangent plane.
Why is it enough for the tangent plane to only agree with the slope of a differentiable 2-input function in the $x$ and $y$ directions at a certain point for it to agree in every direction at that point?
Isn’t it still possible for the function to cross the tangent plane, even if their slopes agree in the $x$ an $y$ directions, when we move in some other direction?
I'd like to be able to see the answer intuitively...but I can't, and I need help.
Thank you!
Edit:
I added this as a comment to David's answer below, but I'll add it as part of the question too, as I think it may help communicate what I'm trying to "see".
Take the function $f(x)=x^2$. That's a parabola in 2D space.
If a tangent line agrees with the slope $2x$ at a certain point...well, there's only one slope to agree with, so I KNOW that the line will be tangent.
But, now take the surface $f(x,y)=x^2$. That's a surface in 3D space.
A tangent plane at a certain point must have a slope of $2x$ in the $x$ direction and $0$ in the $y$ direction
Lines drawn on a certain "tangent" plane in those two directions will be tangent to the surface at that point, since the slopes agree in those two directions. We used those directions (the $x$ and $y$ directions) to define the plane.
But, how can I intuitively "see" that lines drawn on the plane in ANY direction will be tangent to the function at that point?
Edit 2
As someone pointed out in the comments, a tangent line doesn't not crossing the function only applies if the function is convex (or concave) like $x^2$. Here, let me draw out the function, and draw a tangent line to $f(x)=x^2$.
The tangent line (blue) agrees with the slope of the function (red) in the one direction it has to agree with the slope in (the $x$ direction).
Therefor, I know that the tangent line won't cross the parabola. If it didn't agree with the slope, then it would cross it either when we moved a little bit towards the right from the point of tangency, or a little bit towards the left.
However, lets say we draw a tangent plane the graph of $f(x,y)=x^2$, with a tangent plane drawn to it.
Although I know lines on the plane won't cross the surface when moving in the $x$ direction or in the $y$ direction (the two white tangent lines won't cross the $x^2$ surface) since the plane agrees with the slope of the surface in those two directions, how can I know that if I draw lines on the plane in some other arbitrary direction, those lines also won't cross the surface?
Why was the slope agreeing in the $x$ and $y$ directions enough to define the tangent plane?
Thanks!
Final Thoughts:
After watching some of Ted's (in the comments) lectures on YouTube, I've realized that when I asked this question, although I had an intuitive "feel" for differentiability in one dimension, I hadn't really thought enough about what it meant in higher dimensions.
Differentiability (for a two dimensional surface) means that the function is locally flat.
Say that $T(x ⃗ )$ is the tangent plane to a function $f(x ⃗ )$ at a point $a ⃗$.
Therefor, it must agree with the slopes of $f(x ⃗ )$ in every direction at $a ⃗$, not just the $m$ Cartesian directions, for which we have the partial derivatives of $f(x ⃗ )$.
However, when we define $T(x ⃗ )$, we only make it agree with the slopes of $f(x ⃗ )$ in the $m$ Cartesian directions.
$$T(x ⃗ )=\frac{∂f}{∂x_0} (x_0-a_0 )+\frac{∂f}{∂x_1}(x_1-a_1 )+⋯+\frac{∂f}{∂x_m}(x_m-a_m )$$
(Each of the $x_i$'s are orthogonal Cartesian coordinates, they're the components of the input vector $\vec{x}$, and each of the partial derivatives above are evaluated at $\vec{a}$).
This is because given the $m$ partial derivatives of $f$ at $\vec{a}$, there’s only one unique hyper-plane approximation for $f(x ⃗ )$ which can be defined!!
That is, the $m$ partial derivatives of $f(x ⃗ )$ at $a ⃗$ define a unique hyper-plane. There is no other hyper-plane that has $m$ slopes and passes through $(\vec{a}, f(\vec{a}))$ in $(m+1)$ dimensional space ($m+1$ because we're adding one more dimension for the function's output itself).
My original question was why this implied that the tangent plane agreed with the slopes in every direction.
The answer?
It doesn't!
It's entirely possible for this "tangent" (in quotations because in this case, it's not actually tangent) to agree with the slopes in the $m$ Cartesian directions, but not in every direction (Ted has some good examples in the lectures referenced in the comments, and here's a picture from "Math-Insight").
However, if that unique “tangent” plane doesn’t agree with the function’s slopes in every direction (not only the Cartesian directions) that just means that the function isn’t differentiable at $a ⃗$!!! In other words, at $\vec{a}$ the function isn't locally flat.
To prove that a function is differentiable at $\vec{a}$, we attempt to build a "tangent" plane at that point by making it agree with the slope of our function in the $m$ Cartesian directions, and then show that the "tangent plane" indeed is a tangent-plane by showing that...
$$\mathrm{lim_{(|dx ⃗ |→0)}}(\frac{[(f(a ⃗+dx ⃗ )-f(a ⃗ )]-[T(a ⃗+dx ⃗ )-T(a ⃗ )])}{|dx ⃗ |})=0$$
...regardless of the direction in which we move from $\vec{a}$ (regardless of the direction of $\vec{dx}$).
In his videos, Ted shows an example of this (with the difference that in his example, $T(\vec{x})$ is not a tangent plane but a linear approximation passing through the origin, although the idea is the same.)
In a nutshell, to answer my original question, if the function is differentiable, then it’s enough for the tangent plane to agree with the slope of the function in the $m$ Cartesian directions for it to agree in every direction, because $m$ slopes define a unique tangent-plane in $m+1$ dimensional space, and that the tangent plane agrees with the slopes of the function in every direction is just the definition of differentiability - that the function is locally flat.
And to show that a function indeed is differentiable, we have to show that a potential unique "tangent" (in quotes because it may not be a tangent plane if the function isn't differentiable) plane to the function at a certain point that agrees with the slope of the function in the $m$ Cartesian directions agrees with the slope of the function in every direction, so that the function indeed is locally flat.
Thank you!




If the slope of the function in any particular direction does not match the slope of the tangent plane in that same direction, then this so-called "tangent" plane was not really a tangent plane, was it?
Consider the function $f(x,y) = \lvert x+y\rvert - \lvert x-y\rvert.$ If you take directional derivatives in the $x$ or $y$ directions at $(x,y) = (0,0),$ you get zero. But if you look at the function along any other line through $(0,0)$ you get something proportional to $\lvert t\rvert$ where $t$ is the distance along the line. This has "right" and "left" derivatives of opposite signs at $(0,0)$.
In fact there is no tangent plane to this function at $(0,0)$ any more than there is a tangent line to $g(x) = \lvert x\rvert$ at $x = 0.$ The plot of $f$ consists of pieces of four planes intersecting at $(0,0).$ Any of those planes is equally eligible (or ineligible) to be the "tangent" plane. In fact there is no unique tangent plane at $(0,0),$ and the function is not differentiable there.
What it comes down to is that the thing that makes a multivariable function differentiable at a particular point is that the function has a unique "tangent plane" at that point and that no matter which direction you go from that point, the instantaneous rate of change in the function will give you a line whose slope matches the slope of the tangent plane in that direction. If anything happens to contradict this, the function simply is not differentiable at that point.
In short, the tangent plane at a point $(x_0,y_0)$ says exactly and completely what the directional slope of the function will be in any direction from $(x_0,y_0)$. All you need to do is to identify the tangent plane.
But as a reminder, all of this depends on the function being differentiable in the first place. Fortunately we have lots of ways to know that a function is differentiable without having to check its slope in every possible direction, just as we don't have to do another delta-epsilon proof on every single differentiable function we deal with in single-variable analysis.