If you want to use infinitesimals to teach calculus, what kind of example of an infinitesimal can you give to the students? What I am asking for are specific techniques for explaining infinitesimals to students, geometrically, algebraically, or analytically.
What's an example of an infinitesimal?
4.9k Views Asked by Mikhail Katz https://math.techqa.club/user/mikhail-katz/detail AtThere are 11 best solutions below
An infinitesimal is some quantity that is explicitly nonzero and yet smaller in absolute value than any real quantity ( http://mathworld.wolfram.com/Infinitesimal.html )
Consider a quantity$=1\:$ for example. Then, the half. Then the half of the half. And so on. Of course in continuing to repeat this operation, the quatity tends to $\:0\:$. But a small and more and more smaller quantity always remains to be again divided in two. It's not exactly zero. It is an infinitesimal on the common sens.
More expanation in the above link and in : https://en.wikipedia.org/wiki/Infinitesimal
Comment :
In fact, "infinitesimal" is an abstract concept. Think also to another abstract concept : "Infinity" is an unbounded quantity that is greater than every real number. Both abstract concepts cannot be represented in real world on a concrete manner. Nevertheless, they can be intuitively understood.
A (relatively) explicit construction of a field with infinitesimal elements via ultraproducts. The problem with the sequences is that the cartesian product of fields $${\Bbb R}^{\Bbb N} = \Bbb R\times\Bbb R\times\Bbb R\times\cdots$$ isn't a field because has zero divisors: $$(0,1,0,1,\dots)(1,0,1,0,\dots)=(0,0,0,0,\dots).$$ The solution is taking a quotient: let be $\mathcal U$ a nonprincipal ultrafilter on $\Bbb N$. Define $$(a_1,a_2,\dots)\sim(b_1,b_2,\dots)$$ when $$\{n\in\Bbb N\,|\,a_n=b_n\}\in\mathcal U$$ The quotient $\Bbb R^* = {\Bbb R}^{\Bbb N}/\sim$ will be a field with infinitesimals, like the class of equivalence of the sequence $$(1,1/2,1/3,1/4,\dots).$$
An infinitesimal is a positive number whose absolute value is less than any assignable positive number. Observe the word assignable. The difference between assignable and non-assignable was well understood by the founders of the calculus. It is arrogant to think that the giants like Newton, Lebniz and Euler did not know that there is no nonzero number whose absolute value is less than any positive real. They bear in mind the fact that there are two kinds of reals. Our concept of standard is the ancient assignable in a modern logical disguise.
When one asks for an example of infinitesimal which is the inverse of an infinite and says there are no such, he mocks himself for the numbers are mental constructions, and there are no numbers independent of human beings. That is, he cannot give an example of any number in much the same way as he knows no "real" examples of infinitely large and infinitely small numbers.
If we think about the numbers expressing the quantity of molecules in a room or if we count the grains of sands on a beach, then we see that these numbers are unreachable in contract to the number of fingers. The natural series starts with assignable numbers but there are clearly some numbers that are practically unreachable by successive count. These examples explain the difference between standard or assignable and non-standard or non-assignable numbers on a naïve level.
For this Q (the existence of a concrete, meaningful infinitesimal) to have a mathematical meaning, one has to specify the structure (a non-archimedean extension of $\mathbb R$) wrt which the Q is considered. If it is a typical Robinson-style ultrapower of $\mathbb R$ then the method described by Martín-Blas Pérez Pinilla gives you a lot of concrete infinitesimals in such an ultrapower. One can also name concrete infinitesimals, in different manner, in typical pre-Robinson non-archimedean fields like Levi-Civita.
In the other direction, one can probably ask if there is a non-archimedean extension of $\mathbb R$ with no real-ordinal definable (ROD) infinitesimals, and I wonder if one can get such an extension in the Solovay model
Regardless of what you believe about the axiom of choice, the fact that there are ordered fields with infinitesimals is provably true. e,g. the field Q[t] of rational functions over Q. Comparing infinitesimals to fields with one element is misleading at best: the statement "there is a field with one element" is provably false. Fields with one element are suggestive (for example sets would be just vector spaces over these fields) and if any sense is to be made of the concept it would have to be by considering some larger set of structures.
I am teaching a calculus class using infinitesimals, so this question is very relevant for me too, user72694. I am avoiding formal constructions with ultrafilters and such, because I have found that these are too abstract for my students. What I want is something more concrete that will allow the students to build a reasonably robust concept image for infinitesimals -- something they can appeal to as necessary when they start working with dxs.
So the examples I give them are 0.000...1 (infinity 0s followed by a 1), 0.000...2, 0.000...01, etc. The reason I like these is that in my research students are able to independently reason with them and to develop conjectures about what happens when you add them, multiply, take ratios, square them, etc.
Of course, if someone asks me what these really are, then I will provide a more formal explanation for using sequences (keeping in mind that research shows they don't usually even understand what 0.999... is at this point). Then I'll talk about how you can think of these infinitesimal decimals as specific sequences of numbers that converge to 0 (there are others like 1, 1/2, 1/3, 1/4, ... that I don't know how to write as "decimals"). Ultrafilters etc. arise only when a student figures out that it's actually a bit hairy to compare two convergent sequences where one doesn't strictly dominate the other.
The discussion is mostly revolving around Robinson's non-standard analysis, even though the OP has not actually specified what kind of infinitesimals he is looking for. There are definitely good sources on how to teach Robinson's non-standard analysis, but I am not familiar with those, so I cannot say much about that.
Since the purpose is to teach people, it is worth looking at nilpotent infinitesimals, as known in Synthetic Differential Geometry. They quickly give the students methods of calculation that are practically the same as the methods employed by physicists. Synthetic Differential Geometry can be presented axiomatically, without ultrafilters or much attention to the logical language (standard vs. non-standard, internal formulas and). As long as we are interested in concrete computations, we will not even notice the main snag, which is lack of excluded middle.
Here are some references which explain infinitesimals in an accessible way which ought to appeal to students:
I highly recommend John Bell's A primer on infinitesimal analysis, or his shorter An Invitation to Infinitesimal Analysis. If you tone down the stuff about intuitionistic logic and just skip to the axioms and computations, you can get a lot of interesting stuff quickly.
I apologize for blowing my own horn, but I once wrote a blog post about intuitionistic mathematics for physics which has a section about infinitesimal analysis. This was targeted at students of physics. There is a similar exposition by me in the book A Computable Universe.
If you teach computer science students you can motivate infinitesimals through the use of dual numbers in automatic differentiation, a very cool technique for writing programs that automagically calculate derivatives. You don't quite get "true infinitesimals" but it is a start and it can be quite appealing to computer-sciency students.
You asked specifically how to present to the students infinitesimals, or perhaps how to show them "a concrete" infinitesimal. This is always a bit difficult to do, both in Robinson's non-standard analysis and in Synthetic Differential Geometry. In Robinson's theory things revolve about non-principal ultrafilters, which are probably not the sort of thing you want to teach beginning analysis students. In Synthetic Differential Geometry we have (a squre-nilpotent infinitesimal is defined to be an element of the smooth real line $R$ whose square is zero).
$0$ is not the only square-nilpotent infinitesimal: $\lnot \forall x \in R . (x^2 = 0 \Rightarrow x = 0)$.
No square-nilpotent is distinct from zero: $\lnot \exists x \in R . x^2 = 0 \land (x < 0 \lor x > 0)$.
These two statements taken together are quite counter-intuitive, especially for a person who is used to classical logic (so 99.99% of mathematicians). The second statement actually tells us that we cannot ever display a concrete infinitesimal that is detectably different from $0$. Infinitesimals are intrinsically strange! But you can actually take advantage of the oddity and entice students to question some basic assumptions about how their geometric intuitions work and what sort of things are possible in mathematics. It is a lot of fun. I tried to explain the strange status of infinitesimals in my blog post and the paper, so I will not repeat that here.
My pedagogical answer is to explain one over a generic natural number.
We cannot explicitly write down a generic natural number just as we cannot explicitly write a generic (non-constructible) irrational number. Like a non-constructible irrational number, it is an abstraction. We do know that it is larger than any fixed integer. We have no algorithmic method to determine any of its non-trivial properties, such as whether it is even or odd, prime or composite. Indeed, we have no algorithmic method to distinguish two different generic natural numbers.
I feel that this approach is close to the infinitesimals of old, and it's also highly intuitive. The notion of one over a generic natural number as an "example of an infinitesimal" comes from Kauffman's version of Sergeyev's grossone. It also relates to a view I have heard Tim Gowers express online, that a large integer out to be judged by how much we can say about it, and therefore (my words now) that one over a generic natural number is "functionally" an infinitesimal quantity.
To define "infinitesimal" as either
- a sequence or function that converges to zero
or
- as an adjective for describing such a sequence or function
is a very old tradition that goes back in some form to Cauchy's Cours de Analyse or earlier, and is consistent with (if not necessarily the same as) statements in writings of old masters since Newton. If supplemented with ideas and notation from asymptotic analysis it can do most of what was accomplished with infinitesimal arguments before the rise of abstract analysis in the 20th century.
Of course it is easy to give examples of things that converge to 0.
Disclaimer: The infintesimals simulated here are close to finite approximations of real numbers. aka: finite numbers
Python3 code that simulates infintesimals using surreal representation
nan = ()
bits = 16
def construct(num):
if num is None: return nan
seed = nan,nan
power = 0
while num != 0 and 2**-bits <= abs(num):
if num > 0:
num -= 1/2**power
seed = seed,seed[1]
else:
num += 1/2**power
seed = seed[0],seed
if abs(num) < 1:
power += 1
return seed
surreal representations are created like this:
construct(29.75)
this routine compares representations
def le(a,b):
l,r = a[0],b[1]
return not(l and le(b,l) or r and le(r,a))
surreal representations are compared like this:
le(construct(1/3),construct(27/79))
The infintesimal routines generate representations that will compare closer to the input number than any number that could be created by construct.
def pos_infintesimal(a):
inf = []
inf[:] = [(a),inf]
return inf
def neg_infintesimal(a):
inf = []
inf[:] = [inf,(a)]
return inf
create two infintesimals close to pi...
from math import pi
surreal_pi = construct(pi)
pos_inf_pi = pos_infintesimal(surreal_pi)
neg_inf_pi = neg_infintesimal(surreal_pi)
now neg_inf_pi < surreal_pi < pos_inf_pi, and there is no surreal number you can make using construct() that will come between them.
As a minimum, these will be true and no finite construction will come between them.
le(neg_inf_pi,surreal_pi)
le(surreal_pi,pos_inf_pi)
And also, this will be False (so they are not equal)
le(pos_inf_pi,surreal_pi)
le(surreal_pi,neg_inf_pi)
And there is no number that comes between them, so this should not be true for any x:
n = construct(x)
le(neg_inf_pi,n) and le(n,surreal_pi) or le(surreal_pi,n) and le(n,po_inf_pi)
The reason is that every result from construct() is a finite limited structure. But the infintesimal values are self referential, so they have unlimited bounds for traversing.
One example of infinitesimals that has been used historically is that of hornangles, which were particularly popular in the 17th century. A hornangle $\alpha$ can be thought of as the "crevice" at the origin between the $x$-axis and the graph of the parabola $y=x^2$. If a real number $r>0$ is represented geometrically by the angle (in the first quadrant) between the $x$-axis and the line $y=r x$, it is easy to convince oneself that $\alpha$ is less than $r$ because a sufficiently short arc of the parabola $y=x^2$ will necessarily dip below the line $y=rx$ (when $r$ is fixed).
In a more arithmetic vein, Skolem in 1933 used sequences of integers to construct an extended number system incorporating infinite numbers. Here an infinite number is represented by a sequence tending to infinity. Skolem's construction does not rely on the axiom of choice. Using the quotient field of Skolem's integers, one gets a number system where a large fragment of calculus can be treated.
Similarly, Keisler in his book https://www.math.wisc.edu/~keisler/calc.html on page 913 gives an example of an infinitesimal represented by a sequence tending to zero, such as $(\frac{1}{n})$. Here the infinitesimal represented by $(\frac{1}{n^2})$ will be smaller than the infinitesimal represented by $(\frac{1}{n})$, etc.
Classroom experience shows that students find such examples intuitively appealing.
An additional approach is the one using Levi-Civita fields with the lexicographic ordering. These were used by Shamseddine and colleagues to develop computer implementations with infinity; see http://www.bt.pa.msu.edu/index_cosy.htm Needless to say, these "nonstandard models" are completely explicit.
As editor @nombre pointed out in a comment, the transfer principle is the crux of the matter. Depending on the theory one wishes to transfer, the level of difficulty may vary considerably. For example, if the theory is PA then Skolem's construction in ZF (without relying on the axiom of choice) is enough. If one wishes more powerful tools one will need more foundational input. This is a point often overlooked, even by high-profile people like Connes. The issue of constructiveness of Skolem's procedure is probed in more detail in this question: https://mathoverflow.net/questions/227945
Skolem's numbers are relevant because they actually embed in the hyperreals as explained in this article: http://dx.doi.org/10.1007/s10699-012-9316-5
Teaching experience shows that freshmen react well to examples of infinitesimals as represented by null sequences (i.e., sequences tending to zero). They also have some exposure to equivalence relations usually, so they find comprehensible a comment to the effect that an infinitesimal is not exactly a null sequence but rather an equivalence class of those. Of course the hyperreals cannot be constructed in a freshman calculus class any more than the reals.