My Calculus textbook spent a good chunk of the chapter deriving Left/Right Riemann sums, only to ditch them for general Riemann sums, which they never bother to derive or illustrate.
I'm fairly confident that I understand the notion of a right (or left) Riemann sum. Let's say we have a partition $P$, of closed interval $[a,b]$, with subintervals $[x_k, x_k+1]$ (of equal length, of course). If we want to approximate the area under a given curve (on interval $[a,b]$) , we can multiply $Δx_k = (b-a)/n$ (the kth subinterval) by $f( x̄ )$ (the minimum y value on the kth subinterval) then take the sum of $f( x̄ )*Δx_k$ from $k = 1$ to $k = n$. And to obtain a better approximation, we can take the limit of this expression as $n$ goes to infinity. This method of approximation is not only geometrically, but also algebraically, obvious to me.
The mathematical clarity breaks down for me when we start talking about "general" Riemann sums. When in regards to general Riemann sums, the author states: "Since $Δx_k$ can now vary, it is now no longer enough to require that $n$ approaches infinity; we must also require that the length of the longest subinterval approaches zero. Since the latter condition includes the former, we now let "max $x_k$" denote the length of the longest subinterval, and take the sum (from $k = 1$ to $k = n$) of $f(x̄ )$ $Δx_k$ as $\max x_k$ approaches zero."
My question is, where does this relationship come from? With right Riemann sums we had the equation $Δx_k = (b-a)/n$ for each subinterval. This makes it perfectly clear that as $Δx_k$ approaches zero, n must approach infinity (because b-a is constant). For the general Riemann sums this equation doesn't hold, due to the fact that the subintervals vary in relative size, right? So what equation explicitly shows the relationship that the author claims? Namely that "the latter includes the former" in the case of a general Riemann sum?
Notice that as subintervals shrink the designation of 'longest subinterval' won't always apply to the same subinterval. So, as you keep shrinking the subinterval which is currently the longest you will also need to keep adding more subintervals. So, by making sure that you always take the length of the longest subinterval to zero you are also ensuring that the number of subintervals approaches infinity.