Can a proof be just words?

4.8k Views Asked by At

I suppose this is a question about mathematical convention. In a problem in Introduction to Probability by Bertsekas and Tsitsiklis, they ask the reader to prove an identity. But then their proof is mostly words:

Problem 3.* Prove the identity $$A \cup \Bigg( \bigcap_{n=1}^\infty B_n \Bigg) = \bigcap_{n=1}^\infty\big(A \cup B_n\big).$$

Solution. If $x$ belongs to the set on the left, there are two possibilities. Either $x \in A$, in which case $x$ belongs to all of the sets $A \cup B_n$, and therefore belongs to the set on the right. Alternatively, $x$ belongs to all of the sets $B_n$ in which case, it belongs to all of the sets $A \cup B_n$, and therefore again belongs to the set on the right.

Conversely, if $x$ belongs to the set on the right, then it belongs to $A \cup B_n$ for all $n$. If $x$ belongs to $A$, then it belongs to the set on the left. Otherwise, $x$ must belong to every set $B_n$ and again belongs to the set on the left.

In mathematics, why is this allowed? Can you say that this is more correct a proof that is, "Oh, it's obvious!" or "Just keep distributing $A$ over and over ad nauseum and you get the term on the right"?

I'm not trolling. I'm genuinely curious as to how thorough one must be when using words as proof.

10

There are 10 best solutions below

0
On BEST ANSWER

Exactly as thorough as you would have to be using any other kinds of symbols. It's just that vast messes of symbols are hellish for humans to read, but sentences aren't. Adding symbols to something doesn't make it more rigorous, less likely to be wrong, or really anything else. Symbols are useful for abbreviating in situations where this adds clarity, and making complex arguments easier to follow, but shouldn't be used where they do not help in this regard.

7
On

Yes they can and I'm of the opinion that symbolism and notation should be avoided unless it serves to simply the presentation of the material or to perform calculations. For example you want to cut a cube so that each face has a three by three grid of smaller cubes similar to the Rubix cube and with a little thought and experimentation once might conjecture that six is the minimal number of cuts. The best proof of this that I know of is simply "Consider the faces of the center cube." They require six cuts because there are six faces and it follows immediately. No symbols or calculation but still logical and mathematically sound.

3
On

Natural language for expressing mathematical statements can be indeed vague and ambigous. However, when you study mathematics, one thing you will usually learn at the beginning is how to use mathematical terminology in a rigid, unambigous way (at least for communication with other people trained in mathematical terminology). This process takes usually some time if you are not a genius (I guess it took me about two years at the university until I became reasonable fluent), so unfortunately I fear I cannot tell you a small set of rules which kind of language is "right" for mathematical proofs, and which is "wrong". This is something you can only learn by practicing.

Hence, the answer is IMHO "yes, words are fine, when used correctly by a trained expert". (Amazingly, one could say the same about more formal proofs using symbols.)

Note that historically, before the 18th century, proofs using natural language was the de facto standard in mathematics. Most of the symbolic notation we usually use today was developed in the 18th and 19th century.

8
On

Two points:

(i) Historically, all proofs were done in words—the use of standardised symbols is a surprisingly recent development. This is obscured a bit because a modern edition of, say, Euclid's Elements is likely to have had the words translated into modern notation.

(ii) Before symbols can be used they have to be defined, and ultimately that definition will be in words. It's easy to forget this, especially with ones that we use all the time and learnt in childhood. But, for example, we once had to learn that $2+3=5$ was short for "Two things together with three things is the same as five things".

Though a lot of us learnt instead that $2+3=5$ meant "Three things added to two things makes five things".

Now, these two definitions are different. One makes $2+3$ into an operation done to $2$, and treats $=$ as an instruction to carry it out; the other says that the number on the right has the same value as the expression on the left. The notation, though, doesn't make this distinction, and it's possible to spend years using the $=$ sign as though it meant "put the result of the operation on the left on the right".

So in this case we've got one string of symbols ($2+3=5$) a correct definition and a misleading definition. And how do we clarify the correct meaning of the symbols? By choosing which verbal definition to use. The precision is in the words (at least if they're well chosen).

Of course, more advanced symbols will most likely have some mathematical symbols in their definitions—but ultimately, we'll get back to words.

1
On

Behind the proof system is logic... you need to write a reasoning that is fool proof and can be reproduced by the reader to lead to the same conclusion, and every step of the proof must be unambiguous and without "exceptions" (if there are special cases, they must be stated). As long as this is respected, the proof is correct and complete. When you see a symbolic proof, you can still read it in plain language, as long as you understand what it means, so there is no real difference (as long as the proof is rigorous, without "holes" or ambiguous statements).

Note that this excludes statements such as "this is obvious". You need to tell the reader of the proof what steps to take in his own mind to come to a single unmistaken conclusion. This part is very important - not understanding this leads some people to rejects proofs as opinions (all pseudoscience relies on this fallacy).

Now, just as words are just notation for thoughts, so are symbolic expressions just short notation for longer words. Symbolic notation has the advantage to being language-independent, and exact within their previous agreed upon definition. They a lot of times simplify things in algebra, arithmetics and functional analysis, where reasoning just follows simple steps without decision making and reasoning.

However, when it comes to logic, deduction, and other high-level thought processes, notation gets clumsier and a lot of times harder to understand (there are symbols for "therefore" and statements such as "A implies Β", but the author might not choose to use them). Instead of calculations, you have something that very much resembles formal computer programs, and fewer people are trained to read them fluently.

Think of lawyers: law is written in "english", but most "everyday english" isn't used, because it's ambiguous. Instead, the words are meticulously put together to try to cover all the corner cases and have only one interpretation (so much, that for a layman, the text is almost incomprehensible). The metaphor is not the best, because in lawmaking, there is no rigorous foundation (no true axioms) to rely upon, but I hope you understand the point.

0
On

For your particular example:

Just keep distributing $A$ over and over ad nauseum and you get the term on the right.

would not be a convincing proof. This is not because it is in words, however -- words are perfectly fine.

But it fails to convince because the intersection is over an infinite family of sets. Your proposal would work fine for a finite intersection, in that it gives a recipe for constructing an algebraic proof that would itself be convincing. And in ordinary mathematics a convincing recipe for a convincing proof is itself as good as the real thing.

But for an infinite intersection, the algebraic calculation you're describing never ends! No matter how many steps you do, there will still be an intersection of infinitely many $A_i$s that have yet to be distributed over in your expression. So your recipe does not lead to a finite proof, and infinite things (to the extent they are "things" at all) are not convincing arguments.


There are ways to convert some cases of infinitary intuition into actual convincing proofs, but they have subtle pitfalls, so you can't get away with using them -- no matter whether with words or with symbols -- unless you also convince the reader/listener that you know what these pitfalls are and have a working strategy for avoiding them. Typically this means you need to explicitly describe how you handle the step from "arbitrarily but finitely many" to "infinitely many" (or in more sophisticated phrasing: what do you do at a limit ordinal?).

A somewhat unheralded part of mathematics education is that over time you will get to see sufficiently many examples of this that you collect a toolbox of "usual tricks". When communicating in a situation where you trust everyone knows the usual tricks you can often get away with not even specifying which trick you're using, if everybody present is experienced enough to see quickly that there's one of the usual tricks that will obviously work.

8
On

Yes. All proofs can be written in words. While some will say that this is because you can use words in certain ways with formal descriptions and such, ultimately the real reason is because all mathematical symbols and statements correspond to written words! Now this isn't to say that things cannot get messy, but for instance take $4 + 5 = 9$. That is a symbolic statement. There is nothing fundamentally wrong with me instead saying that four plus five equals nine. The same could be said with a lot of other statements. Obviously some things will get messy due to lack of proper names, but I think one would be hard pressed to find something that cannot be expressed in words.

However, in the problem 3 example you give the proof does use words. I think the problem here is that you are confusing "proof" with "algebra/symbol manipulation". If you write a proof with nothing but math symbols I wouldn't really call that a proof. Perhaps on stack exchange it might qualify, but seriously to whoever does that - wrap it in a sentence and don't be lazy.

When I was taught proofs there were a few basic rules.

  1. Proofs are a piece of writing. Everything must be complete English sentences.

  2. Never use the word "obvious" or any synonyms. They are filler words and are usually placeholders for "I'm too lazy to do this or have a lack of knowledge".

  3. Never state things in the form "if done repeatedly" or the form "if continue doing this over and over we obtain". It can create pitfalls if you use the same language with infinite steps rather than finite steps. Instead say, things like "expanding the equation further we obtain" or "integrating three more times we get".

  4. Don't write equations in words verbatim. In other words, if you have $4 + 5 = 9$ don't write "four plus five equals nine". Technically this has no bearing on the validity of the proof, but it's annoying for the reader.

  5. Write in formal language and keep it succinct. Don't go into details about your thought process and how you came up with the proof. Write that separately if wanted, like in a response.

The list has probably evolved for me over time, but I think this is the crux of what you need to make sure you do in a proof. And yes, saying something is obvious is technically alright if the "proof" is a sarcastic response to someone asking for a proof of something truly obvious such as asking for a proof of "4 + 5 = 9" in the context of a proof of a calculus identity. In that case saying it's true because it's assumed to be true in the context of that proof is alright, because you don't have to rebuild the entire foundation of arithmetic when proving that integration by parts is a valid integration formula, not unless you have some unusual desire to do that.

0
On

Yes, it's perfectly acceptable to write proofs using mostly words.

In modern mathematics, all statements can be written using only the symbols $\forall, \exists, \vee, \wedge, \implies, \lnot, \in, (, )$ and a countable collection of variables. Notice that each of these symbols have a slight english meaning as well:

  • $\forall$ : for all
  • $\exists$ : there exists
  • $\vee$ : or
  • $\wedge$ : and
  • $\implies$ : implies
  • $\lnot$ : not

In most "heavily-worded" mathematical proofs, the words used are often a rough image of the precise symbols above. For a small example from your proof:

if $x$ belongs to the set on the right, then it belongs to $A \cup B_n$ for all $n$

translates to

$$(x \in \cap_{n = 1}^\infty(A \cup B_n)) \implies \forall n( n \in \mathbb{N} \implies x \in A\cup B_n))$$

and that's not even the most precise form, as there are ways (using the primitive symbols above) to translate the union, intersection, and $\mathbb{N}$ symbol to their more primitive forms. Can you imagine translating your entire proof into this symbolic form? The rough image (the "word form") of this formalism is often enough for the reader to understand the precise meaning of Theorems and their proofs.

0
On

All proofs are words. Display and inline equations are words, phrases, and sentences written using a significantly more precise syntax and semantics than most natural languages, but they're still words.

When you speak to tell your friend, "$x = 2$", do you say words or do you somehow switch to some other mode of communication?

1
On

I will prove that "proofs of only words" exist.

Assume not. That is, assume that all proofs must involve more than words. If this were true, then this proof would be impossible, yet it is completed with this sentance.


Another from a textbook somewhere.

All people can be catagorized according to some catagorization.

Proof by construction: divide all people into two categories, those who believe this statement and those who do not.