A geometric reason why the square of the focal length of a hyperbola is equal to the sum of the squares of the axes?

3.5k Views Asked by At

When I teach conics, I give a simple geometric argument for the Pythagorean relation for ellipses. (By "Pythagorean relation" I mean the fact that the square of the focal length $c$ is the difference of the squares of the semimajor and semiminor axes, i.e. $c^2=|a^2-b^2|$ for the ellipse defined by $\frac{x^2}{a^2}+\frac{y^2}{b^2}=1$.) The picture argument goes as follows: just draw the obvious isosceles triangle from the two foci to one of the two points at the end of the minor axis. We see two right triangles; in each, one leg is the focal length $c$ and the other leg is the semiminor axis, while the hypotenuse is the semimajor axis. The result follows from the Pythagorean theorem applied to this right triangle.

But when it comes to the Pythagorean relation for hyperbolas, I can't think of an equally convincing picture to draw. Instead I have to give a rather inelegant algebraic argument for the relation $c^2=a^2+b^2$ (for the hyperbola defined by $\frac{x^2}{a^2}-\frac{y^2}{b^2}=1$, with focal length $c$), directly from the definition of the hyperbola. The messy details are below.*

Now, obviously there is an asymmetry between the argument for the ellipse and the argument for the hyperbola: the ellipse argument is almost trivial and doesn't involve working straight from the locus definition, but the hyperbola argument is tedious and does require working straight from the definition. So, my question:

Is there an equally persuasive picture I could draw to convince high school students of the Pythagorean relation for hyperbolas -- without having to get technical with points at infinity, say?

The natural triangle to draw is the right triangle with legs from the center to a vertex and from that vertex to an asymptote, which has legs of length $a$ and $b$; but then the problem becomes why the length along the asymptote should be equal to $c$. I doubt there is a purely (Euclidean) geometrical reason because this triangle lies on the asymptote, not on the hyperbola itself; I suspect some sort of limiting procedure will be required.


[*] The tedious algebraic details:

The foci lie at $(\pm c,0)$. Say $a>b$. Then hyperbola is the locus of points $(x,y)$ such that

$$\left|\sqrt{(x+c)^2+y^2}-\sqrt{(x-c)^2+y^2}\right|=2a$$

Assume for simplicity the expression between the absolute value bars is positive. (The algebra is even more tedious if we don't make such a simplifying assumption.) Then moving the right root to the other side and squaring gives

$$(x+c)^2+y^2=4a^2+(x-c)^2+y^2+4a\sqrt{(x-c)^2+y^2}$$

Simplifying gives

$$xc-a^2=a\sqrt{(x-c)^2+y^2}$$

and upon squaring again we have

$$x^2c^2-2xca^2+a^4=a^2(x^2-2xc+c^2+y^2)=a^2x^2-2xca^2+c^2a^2+a^2y^2$$

which reduces to

$$x^2(c^2-a^2)-y^2a^2=a^2(c^2-a^2)$$

Defining $b^2:=c^2-a^2$ brings the equation into the form $x^2/a^2+y^2/b^2=1$, and the Pythagorean relation follows from the definition of $b$.

5

There are 5 best solutions below

6
On BEST ANSWER

Consider the hyperbola: $$ \frac{x^2}{a^2}-\frac{y^2}{b^2}=1\tag{1} $$ Given $f_1,f_2$, the two foci of a hyperbola, one property of a hyperbola is that there is a constant, $\Delta\text{ distance}$, so that any point on the hyperbola, p, satisfies $$ \Big||p-f_1|-|p-f_2|\Big|=\Delta\text{ distance}\tag{2} $$ Consider a point at the intersection of the hyperbola and the line between the foci. The $\Delta\text{ distance}$ given in $(2)$ is the distance between the two branches of the hyperbola.

$\hspace{3.3cm}$enter image description here

Using $(1)$, we get that the distance between the two branches of the hyperbola is $2a$. Therefore, $$ \Delta\text{ distance}=2a\tag{3} $$ Consider a point at an infinite distance on the upper right branch of the hyperbola. Because $\triangle gpf_2$ is essentially isosceles, the $\Delta\text{ distance}$ given in $(2)$ is $$ \Delta\text{ distance}=|f_1-f_2|\cos(\theta)\tag{4} $$ Using $(1)$, we get that $$ \begin{align} \tan(\theta)&=\lim_{x,y\to\infty}\frac yx=\frac ba\\ \cos(\theta)&=\frac{a}{\sqrt{a^2+b^2}}\tag{5} \end{align} $$ Combining $(3)$, $(4)$, and $(5)$, we get $$ |f_1-f_2|=2\sqrt{a^2+b^2}\tag{6} $$ Thus, if $c$ is the distance from the center of the hyperbola to each of the foci, then $(6)$ gives $$ c^2=a^2+b^2\tag{7} $$

2
On

You can solve this the same way as the ellipse, by taking the point with greatest y-value. However, that point lies at infinity, so instead of drawing an exact triangle, you take a limit of triangles with points on the curve (as you suggested) and scale them down so they don't go off to infinity themselves. This can be made precise in projective geometry, because then that point at infinity is a real point (there are 2, I believe). In fact, the hyperbola is just an ellipse turned inside out, so that it's greatest and least y-values lie at infinity (just like a line is a circle with a point at infinity).

2
On

Since the hyperbola is defined as the locus where the difference of the two distances is constant, I don't see any analogous argument. But certainly a limit argument does work: As $(x,y)\to \infty$ (with $x>0$), we have $\left|\frac yx\right|\to\frac ba$, and so \begin{align*} \sqrt{(x+c)^2+y^2}-\sqrt{(x-c)^2+y^2}&=\\\left(\sqrt{(x+c)^2+y^2}-\sqrt{(x-c)^2+y^2}\right)&\frac{\sqrt{(x+c)^2+y^2}+\sqrt{(x-c)^2+y^2}}{\sqrt{(x+c)^2+y^2}+\sqrt{(x-c)^2+y^2}}\\ &=\frac{4c}{\sqrt{(x+c)^2+y^2}+\sqrt{(x-c)^2+y^2}}\\&=\frac{4c}{\sqrt{(1+c/x)^2+(y/x)^2}+\sqrt{(1-c/x)^2+(y/x)^2}}\\ &\to \frac{2ac}{\sqrt{a^2+b^2}}\,. \end{align*}

0
On

Here's a solution using for the special case of a hyperbola formed as a "vertical" section of a cone ... that is, for the case in which the cutting plane is parallel to the cone axis.

Take a "sideways" look at one of the hyperbola's Dandelin Spheres; let $C$ be its center, and let it meet the cone tangentially at $T$. Our view of the configuration is parallel to the cutting plane (and perpendicular to the cone axis), which here appears as the vertical line $FM$. Were we to look perpendicular to the plane, point $M$ would be the center of the hyperbola, point $F$ would be a focus (so that $|MF| = |OC| = c$ in traditional notation), and point $V$ would be a vertex (so that $|MV| = a$); importantly ---and this is something of a leap of faith--- the crossed lines would be the asymptotes (with "slope" $\pm b/a$, where Dandelin radius $b = |CF|$ gives the rise for a run of $a$).

Sideways view of a vertical conic section

Since right triangles $\triangle OTC$ and $\triangle VMO$ have matching legs ($CT \cong OM$, via mutual congruence with $CF$) and acute angles ($\angle COT \cong \angle OVM$), the triangles are themselves congruent, so that $|OT|=|MV| = a$ and $$|OT|^2 + |TC|^2 = |OC|^2 \qquad \implies \qquad a^2 + b^2 = c^2$$


For now, I'll leave it as an exercise for the reader to tweak the argument to cover hyperbolas created by oblique cutting planes (as well as to cover ellipses), where the situation is much more complicated. The reader should also justify the "leap of faith" (which, of course, has to be so, because we know how the story ends).

BTW, the figure was drawn using GeoGebra.


Regarding the oblique case ... If the cone's generator makes an acute angle $\theta$ with its axis, and the cutting plane makes an acute angle $\phi$ with that axis, then one can show $$a = r \cos\theta \qquad c = r \cos\phi$$

where $r$ is the radius of the circle through the Dandelin centers, centered at their midpoint. Interestingly, the consequent relation $$b^2 = c^2 - a^2 = r^2 \left( \cos^2\phi - \cos^2\theta \right) = r^2 \left( \sin^2\theta - \sin^2\phi \right)$$ reveals $b$ as the geometric mean of the Dandelin radii, $r \left( \sin\theta - \sin\phi \right)$ and $r \left( \sin\theta + \sin \phi \right)$.

While it's possible to find $a$, $b$, $c$ represented by segments in a "sideways" diagram analogous to the one above, the connection between $a$, $b$, and the hyperbola's asymptotes is far less obvious. After all, the asymptotes arise from intersecting the cone, at its apex, with a plane parallel to the cutting plane; those lines don't have the same "slope" as the cone's generators, so the geometry of the sideways diagram isn't sufficient for capturing their nature.

0
On

This image may help: https://en.wikipedia.org/wiki/File:Hyperbel-leitl-e.svg

Let us look at the right branch of a hyperbola. Let $C$ be the center, $F$ be the right focus, and $L$ be the point where the right directrix line intersects with the major axis. The length of $CF$ is $c=\sqrt{a^2+b^2}$, and that of $CL$ is $l=a^2/c$.

Draw a circle with center $C$ and radius $a$. It intersects with the directrix line at $E$. $\triangle LCE$ and $\triangle ECF$ share the same angle at $C$, and because $l/a=a/c$, they have the same ratio of side lengths at the angle. Thus $\triangle LCE$ and $\triangle ECF$ are similar, and $\angle CEF=\angle CLE=\frac{\pi}{2}$. In other words, $EF$ is a tangent line of the circle at $E$. The length of $EF$ is $b=\sqrt{c^2-a^2}$. Then $\tan\angle ECF=\frac{b}{a}$, and thus line $CE$ is an asymptote line of the hyperbola.