The staircase paradox, or why $\pi\ne4$

Question

The staircase paradox, or why $\pi\ne4$

114.8k Views Asked by Bumbble Comm At 30 Mar 2026 - 11:57

What is wrong with this proof?

Is $\pi=4?$

Original Q&A

There are 12 best solutions below

Bumbble Comm On 03 Dec 2010 - 3:32

This problem illustrates the fact that two functions can be very close: $|f(x)-g(x)|<\epsilon$ for all $x\in [0,1]$, but their derivatives can still be far apart, $|f'(x)-g'(x)|>c$ for some constant $c>0$. In our case, let $x=a(t),y=b(t),0\le t\le 1$ and $x=c(t),y=d(t), 0\le t\le 1$ be the parametrizations of the two curves. By smoothing the corners, we may assume that both are smooth. $$ \|(a(t),b(t))\|\approx \|(c(t),d(t))\|$$ does not imply $$ \|(a'(t),b'(t))\|\approx \|(c'(t),d'(t))\|$$ Therefore $\int_0^1 \|(a'(t),b'(t))\| dt$ need not be close to $\int_0^1 \|(c'(t),d'(t))\| dt.$ Here $\|(x,y)\|$ denotes $\sqrt{x^2+y^2}$.

Bumbble Comm On 03 Dec 2010 - 3:40

The pithy expression for this "paradox" is as follows: let $x_n(t)$ be a sequence of parameterized curves which converges uniformly to a limit curve $x(t)$. Then it need not be the case that the arclengths of $x_n(t)$ approach the arclength of $x(t)$.

[Added after seeing TCL's answer: it is also true that uniform convergence of a sequence of functions does not imply convergence of their derivatives. See Section 3 here for some discussion of this. As TCL points out, since arclength elements are computed using derivatives, the observation about derivatives may be in some sense more fundamental. In other words, I think I like TCL's answer better than mine.]

As Ross Millikan points out, this is more familiarly shown by approximating the hypotenuse of a right triangle by a staircase pattern of horizontal and vertical line segments. I still remember being a senior in high school and having a friend (whom I had had no prior mathematical interactions with) show this to me. I definitely remember thinking that it was not paradoxical but certainly surprising. (And I have mathematically respected this person ever since, even though I haven't seen her since I was a teenager.)

Added much later: if you think about the phenomenon physically rather than geometrically, it seems to me that the surprise disappears. For instance, suppose I'm running and you're riding a motorcycle. It is possible for your speed at every instant to be 25 times (say) faster than mine while maintaining a very small distance from me, e.g. by making very small, very fast circles around me.

Bumbble Comm On 03 Dec 2010 - 3:57

Hilarious! Of course, the circumference is not approximated by the sum of lengths of the lines constructed as shown, but by the sum of the hypotenuses of each of the right-angle triangles formed around the edge of the circle (forming a polygon with vertices on the circle).

Bumbble Comm On 03 Dec 2010 - 4:38

What is wrong with this?

Fundamentally, that you have jumped in without a definition of the length of an arc.

Jill On 20 Feb 2011 - 12:04

This is simply another example of why the "limit of the sum" is not the "sum of the limit."

(Length of curves are a subset of Sums/Integrals which are really the same thing in my mind. If you like, in this case "the limit of the lengths of the curves " is not the "length of the limit curve")

user7263 On 20 Feb 2011 - 12:28

Correct answer: Nothing is wrong with this, as long as your space is defined using a Manhattan metric. Normal Euclidean space is defined using a Euclidean metric.

Bumbble Comm On 23 May 2011 - 5:28

Summary: The construction at the top (pi=4) merely shows an upper bound. It's an upper bound but it is very easy to come up with a lower but still upper bound to the circumference. The =?4 construction doesn't prove or disprove very much more about the length of the circle besides providing an upper bound pi<4. It certainly doesn't prove =4. Finally, we always should rely on physical "experimentation" to support the meaningfulness of any construction.

If we use a hexagon circumscribed around the circle, we can find a different upper bound rather easily that would be lower than 4; hence proving that the construction above is merely an upper bound (of which there are an infinite number.. including pi=?10000, by using a path that weaves all over a small area between a square and circle) but certainly not the lowest upper bound.

To get closer to discovering pi, we can then also use a mirrored complementary approach of increasing lower bounds via inscribed polygons of an increasing number of sides. However, arguably we could "weave" as well with the incribing to create such a "lower" bound approaching say 4 from below! [Ie, by weaving, we can inscribe a weaved path that ends up being arbitrarily large in measurement despite nicely fitting inside the tightening space between the circle and polygons.]

Ultimately, one key to getting sanity is to postulate/believe that the shortest distance between 2 points is a line (Euclid did this a long time ago). We observe, for example, that the accepted height of a person involves a procedure of using a taught measuring stick or, generally, measuring curves by tightening a flexible measuring device as far as it can be tightened while still hugging (remaining within a certain area of) the curvy item being measured. This procedure is very easy to do for a circle made of an iron ring, for example, and would give a very close approximation to pi. This exercise also shows that math is not reality. Math is built upon postulates and definitions (something must be accepted as being true), but these postulates need not match our physical world in order to remain logically consistent. To say meaningful things about the physical world, we must judge the reasonableness of mathematical postulates and definitions [phv3773 noted in an answer how definitions were missing, while others have also noted collectively many of these points]; we must determine just what is a reasonable set of postulates to start with in order to reach a reasonable meaning of length of a circle (ie, of the value of pi). We might conclude for example, that the usual inscribed/circumscribed approach relies upon a framework closer to reality because in fact it approaches the tape measure "experimental" results arbitrarily close.

I googled an excellent essay that goes into detail on Archimedes' essay http://www.ams.org/samplings/feature-column/fc-2012-02. It appears to have been written for the American Mathematical Society but maybe its author (Bill Casselman) can be persuaded to make a contribution here.

[Below is my old response]

What if the measurement we use, patterning it after a string wrapped around this circle, weaves back and forth? Essentially, we can find a series of connected line segments with length that total $1000000000$ and yet "hug" the circle very closely. A string analogy follows closely though line segments have width $0$ so we can fit arbitrarily many.

This is why not just any reasoning about infinity will do. Mathematicians have developed well reasoned arguments and axioms that correlate well in many cases with reality (see also this argument).

So the question of why doesn't $\pi = 4$ is best answered by asking, "Why should it?" We can just as well have used the ridiculous construction above to suggest $\pi =$ any number $> 3.15$.

The approach we take to argue convincingly that the sum of the line segments approaches the "length of the curve" is to find sequences (from series partial sums) that match to functions (note the question example and the weaving example do not constitute a function because of its multiple values at a given "$x$") which have certain characteristics. For example we might use a lower and upper bounding pair of sequences that correspond to function values of line segment endpoints for such created polygons where one remains on one side of the curve and the other on the other side at all times and where these two sequences approach the same limiting value. We might use the Mean Value Theorem or related results to help prove our final answer. In any case, mathematicians leverage a convincing set of arguments and assumptions and don't just ad hoc throw a bunch of twisted string at a problem and claim the amount of string used proves the unprovable.

Bumbble Comm On 30 May 2011 - 7:29

(non rigorous) If you repeat the process a million times it "seems" (visually) that the perimeter approaches in length to the circumference, but if you magnify the picture of a single "tooth" to full screen, you will notice a big difference from the orthogonal segments and the arc of the circumference. No matter how many times you repeat the process that difference will never fade.

ADDED: A visual example of what I meant is folding of a rope. If you imagine the rope not having thickness, you can fold it so many times that you can tend to a point (zero length?). If you unfold it, it will return to its original shape. In the example the perimeter will always be of total length = 4, but it only appears to blend with the circumference.

Bumbble Comm On 30 May 2011 - 5:07

I am probably going a little off-topic with these comments, so feel free to downvote :)

In my opinion this type of proof emphasizes why it is wrong to teach/take “Calculus” instead of Analysis.

For most of the nice applications of integration, we always use the following approach: take some quantity/expression, break it in many pieces, identify the sum of many pieces as a Riemann sum, and thus our quantity is the limit of the Riemann sums, thus the corresponding integral…

Unfortunately, except in serious Analysis courses, not even once do we go into the subtle details: why is the Riemann sum a good approximation for our quantity, namely why does the error in our approximation go to zero…

Most students who take Calculus end up “understanding” lots of false results, which we don’t have the time to disprove in general: any derivative is continuous, any approximation that looks good is good, …

To come back to this problem, not all approximations that look good are good. We always MUST prove that the errors in our approximations go to zero. And for all the formulas we “prove” in calculus, there is an actual mathematical proof, which is pretty technical (and most non-mathematicians would say boring and stupid, but then without such proofs one cannot really understand why the “proof” from the above picture is wrong). But without going through the formal proofs, one cannot truly understand why that particular approximation works in that case, and more importantly why a different approximation won’t work.

Coming back to the above picture, one way to understand it is the following: we approximate the circle by a sequence of polygons. Let $c_n$ be the length of the $n$th polygon and $c$ be the length of the circle. At each step the error in our approximation is $4-\pi$, which doesn’t go to zero. This means that the arclength of the circle might not be the limit of arclengths of the polygons. The only thing we can conclude is that, if all the quantities and limits that appear in the picture exist, then the limit approximates the arclength of the circle with an error of at most the limsup of the errors. In other words, $4 \approx \pi$ with an error less than or equal to $4-\pi$. Hmm, what is wrong with this?

Bumbble Comm On 08 Jun 2012 - 1:16

Ah, the old engineer vs mathematician thought process.

Place an engineer and a mathematician at one end of a room. At the other end is a beautiful woman. At each "step", they can each move half of the remaining distance between their current position and the woman. The mathematician will say you'll never reach her. The engineer will say you can get close enough.

This problem is similar. A unit square's outermost corners are being "bent" inward to touch a 1/2-unit circle until there are so many corners that the square is, at this zoom level, indistinguishable from the circle itself (similar to using rectangular pixels). Repeated "to infinity" the two shapes would have the same area. However, this process will never yield a mathematical circle; only an engineer's approximation ("close enough") This will always produce the same perimeter measurement even as the areas of the two shapes converge. If instead you were to measure around the hypotenuses as you iterated this shape definition, the perimeter WOULD begin to approach that of the circumference of the half-unit circle, $\pi$.

The fallacy of the proof is illustrated if you consider the shape made by any two line segments that intersect at a point other than on the circle. These two lines will inscribe an arc length as they each intersect a different point on the circle. For simplicity, you can think of the resulting shape as a right triangle. The proof is basically claiming that the sum of the length of the two legs of that triangle is equal to the hypotenuse. This is never true, because the Pythagorean Theorem of $a^2+b^2=c^2$ never holds for any $a,b,c > 0$ where $a+b=c$.

The only way it can work is for an $a$ or $b$ that is zero and thus the area of the shape is zero; this never happens in the construction being generated, at any interval, because by the definition of the construction we have two points that lie on the circle and one point lying outside the circle, and from geometry, any three non-colinear points will always inscribe a shape within a plane of non-zero area.

Bumbble Comm On 08 Feb 2022 - 4:35

I am not satisfied with any of the answers so far, so here is mine:
Let $S(n)$ be the shape we have after $n$ "foldings". (For example $S(0)$ is the square at the beginning. )

The construction of the "paradox" could claim one of the following:

Since the area of $S(n)$ is decreasing, the perimeter of $S(n)$ approaches the area of $C$.

Or,

Since the area of $S(n)$ is decreasing, the area of $S(n)$ approaches the area of $C$.

The first one does not even make sense, since the area of a shape is not the same with the perimeter. There is no reason to connect the area with the perimeter.

The second one is correct. After one million repetitions, the area of the polygon we obtain from "folding" $S$ will be approximately equal to $\pi$. But this has nothing to do with the number $4$, since this was the perimeter of $S$.

**Bumbble Comm** · Accepted Answer

This question is usually posed as the length of the diagonal of a unit square. You start going from one corner to the opposite one following the perimeter and observe the length is $2$, then take shorter and shorter stair-steps and the length is $2$ but your path approaches the diagonal. So $\sqrt{2}=2$.

In both cases, you are approaching the area but not the path length. You can make this more rigorous by breaking into increments and following the proof of the Riemann sum. The difference in area between the two curves goes nicely to zero, but the difference in arc length stays constant.

Edit: making the square more explicit. Imagine dividing the diagonal into $n$ segments and a stairstep approximation. Each triangle is $(\frac{1}{n},\frac{1}{n},\frac{\sqrt{2}}{n})$. So the area between the stairsteps and the diagonal is $n \frac{1}{2n^2}$ which converges to $0$. The path length is $n \frac{2}{n}$, which converges even more nicely to $2$.

The staircase paradox, or why $\pi\ne4$

There are 12 best solutions below

Related Questions in GEOMETRY

Related Questions in ANALYSIS

Related Questions in CONVERGENCE-DIVERGENCE

Related Questions in PI

Related Questions in FAKE-PROOFS

Trending Questions

Popular # Hahtags

Popular Questions