During U-substitution, is the $du$ that we use as the infinitesimal in $\int f(u) \ du$ changing size?

581 Views Asked by At

I am familiar and comfortable with the following definition of an integral (defined as the the Riemann Sum where n approaches infinity).

Riemann Summ

I have circled in red two terms to emphasize a concept that I was taught: the $\frac{b-a}{n}$ term effectively becomes the $dx$ term. This intuitively makes sense to me.

Additionally, I am familiar with the following statement (as expressed here: How does $u$-substitution work?)

U-Substitution Formula

However, I have boxed in red something that I find rather confusing when trying to interpret it through the understanding that $\frac{b-a}{n}$ sorta kinda equals $dx$...

Specifically, the $du$ value appears to be changing throughout the interval $[ g(a), g(b)]$. To reframe this in the "Riemann Sum" version,we would have something like: $\text{ "value that varies depending on where you are"} * \frac{g(b)-g(a)}{n}$

This is very confusing to me...because, if this is the correct interpretation, I'm not quite certain I understand why this is allowed.

3

There are 3 best solutions below

6
On BEST ANSWER

You seem to be under the impression that one has to have one fixed $\Delta x$, i.e. a uniform subdivision of the interval. This is a very common misconception and the full definition of what a Riemann sum can be is given in the answer by LutzL.

If you have a Riemann sum with a uniform spacing then a u-substitution (performed at the level of the sum) will lead to a non-uniform spacing so you will have a $\Delta x$ that varies across the interval. For example consider $\int_0^1 2xe^{-x^2}{\rm d}x$ and let's use a uniform division of the interval $x_i = \frac{i}{n}$ to get $$R_n = \sum_{i=0}^{n-1} 2x_i e^{-x_i^2} \Delta x_i$$ where $\Delta x_i = x_{i+1}-x_i \equiv \frac{1}{n}$ is the same across the interval. In a u-substitution we will put $u_i = x_i^2$ to get $$R_n = \sum_{i=0}^{n-1} e^{-u_i} \Delta u_i \cdot \frac{x_i+x_1}{x_i+x_{i+1}}$$ where now $\Delta u_i = u_{i+1} - u_i = \frac{2i+1}{n^2}$ which varies from point to point. The last factor above will approach unity as $n\to\infty$ so for large $n$ this sum is just $\simeq \sum_{i=0}^{n-1} e^{-u_i} \Delta u_i$ which is a Riemann sum for the integral $\int_0^1 e^{-u}{\rm d}u$.

The largest spacing is found close to $u=1$ where $\Delta u_{n-1} = \frac{2n-1}{n^2} \sim \frac{2}{n}$ compared to the spacing close to $u=0$ where $\Delta u_0 = \frac{1}{n^2}$. However as $n\to\infty$ note that this largest spacing does go to zero and this is all we need for the Riemann sum to converge to the value of the integral.

Perhaps it's more instructive to show a figure. Recall that a Riemann sum (before taking the limit) is nothing but a finite sum of rectangles that approximate the (signed) area below the curve. Here is one example of such a sum where the spacing varies from point to point:

enter image description here

If we consider a finer subdivision of the interval, but still non-uniform, we generally get a much better approximation:

enter image description here

You can from this convince yourself that for approximating the area well it doesn't matter that some regions have a larger spacing than others as long as the largest spacing is "small enough". This is the basis of the general definition of a Riemann sum.

2
On

Well consider your first integral

$$\int_a^bf(x)dx$$

Where I now introduce the variable $u=x^2$. Then $$du=2x dx$$ and is now explicitly a function of $x$. The confusion is arising because in the definition of the integral, it is correct that $$dx=\lim_{n\to\infty} (b-a)/n$$ in the integral, but if you make a variable change then this definition no longer applies. In the case that you have presented, you might pretend a variable change was made, and that is how the infinitesimal is a function of $x$. In my example $$du=\lim_{n\to\infty} 2x(b-a)/n$

To respond to your comment below, let's consider a real example. Suppose $f(x)=x^2$, which is easy to integrate. Now, lets choose $a=0$ and $b=1$. I assume that you are familiar with integration, so we have

$$\int_0^1x^2dx$$

which I'm sure you know how to integrate,

$$=\left.\frac{1}{3}x^3\right|^1_0$$ $$=\frac{1}{3}$$

Now let's try the same integration, but with $u=x^2$. This will be the exact same integral, so we need to get the same result or we've done something wrong. Now, $f(u)=u$, and $dx=2xdu=2\sqrt{u}du$. $f(0)=0^2=0$ and $f(1)=1^2=1$ still. Now we have

$$\int_0^1u^{\frac{3}{2}}du$$

Certainly it is the case that

$$du=\lim_{p\to\infty}\frac{1-0}{p}$$

is useful in this case, but in relation to $x$ we have

$$du=2x\lim{n\to\infty}\frac{1-0}{n}$$

The confusion falls out because technically $du=0$ and $dx=0$, because of course they are the limit in the form

$$\frac{1}{\infty}$$

which of course, goes to zero. But in fact, we must add up each individual piece to get a non-zero value because we know the area under these curves are non-zero. It is nice, intuitively, to think of

$$dx\equiv \lim_{n\to\infty}\frac{1-0}{n}$$

but it is clear that if you try to use that definition as an axiom, you're going to encounter difficulties. The truth is that infinitesimals do not really make sense outside of integrals and derivatives. This is because infinitesimals, when outside of their natural habitat, are just zero.

The short response to your comment is that, we changed the definition, so of course the definition changed. That should be obvious. If I said $f(x)=x^3$, and then changed my mind and decided $f(x)=x^{17}$, you should understand that a plot of these two functions should be different, because the definition was changed.

5
On

Actually, the (original) Riemann sum is $$ \sum_{i=0}^{n-1}f(\xi_i)(x_{i+1}-x_i) $$ where $a=x_0<x_1<...<x_n=b$ is a subdivision of the integration interval and $\xi_i\in[x_i,x_{i+1}]$ are some midpoints (in the original text, the $(x_{i+1}-x_i)$ were supposed to be infinitesimals).

The Riemann integral is then the limit over these subdivisions and all selections of midpoints when the maximal length of a sub-interval in the subdivision goes to zero. Thus the inhomogeneous subdivision after application of the substitution rule is the general case, a subdivision by a strict arithmetic sequence is the exceptional case. At an elementary practical level, there is more use for a geometric sequence $x_k=aq^k$, $q>1$, $x_n=b$, in the subdivision.