For context, I'm trying to prove the change of variables theorem for Riemann integrals. I need to know how a $C^1$ diffeomorphism $f$ changes the volume of a region. Here's what I know already:
The Riemann integral can be equivalently defined using partitions into arbitrary compact Jordan-measurable sets (CJM sets), instead of (hyper)rectangles. (That is, after we've defined Jordan measure using rectangles.)
A $C^1$ diffeomorphism, applied to a CJM set, is a CJM set. (Proposition 10.5.6 in Jiří Lebl's notes)
A linear transformation changes volume by the absolute value of its determinant.
Given that $f$ is continuously differentiable, there is a continuous function $h$ of two variables such that $$f(x) = f(y) + f'(y)\, (x - y) + h(x, y)\, \lVert x - y\rVert,\qquad h(y, y) = 0,$$ for all $x$ and $y$. (Here $f'(y)$ is a matrix, and all the other things are vectors, except of course $\lVert x-y\rVert$.)
Denote the volume of a region $R$ as $V(R) = \int\, [x \in R]\, dx$, using Iverson brackets: $[\text{true}]=1,\; [\text{false}]=0$.
What I want is a bound on the difference $V(f(R)) - |\det f'(y)|\, V(R)$, where $y$ is some point in $R$, and $R$ is some CJM subset of a fixed CJM set. The region $R$ could be a small rectangle, to simplify things. Specifically, for any $\varepsilon>0$, I want some $\delta>0$ such that, for all $R$ with diameter smaller than $\delta$, and for all $y \in R$, that volume difference is smaller than $\varepsilon\, V(R)$.
$$\newcommand{\assign}{:=} \newcommand{\nosymbol}{} \newcommand{\of}{:} \newcommand{\tmop}[1]{{\operatorname{#1}}}$$
In 3.1.49 of http://mtaylor.web.unc.edu/wp-content/uploads/sites/16915/2018/04/analmv.pdf, the author claims that given $\varepsilon > 0$, if $\delta > 0$ is small enough and $\tmop{maxsize} (\mathcal{P}) \leq \delta$, then we have \begin{eqnarray*} \eta_{\alpha} + (1 + \varepsilon) H_{\alpha} & \supset & G (R_{\alpha}) \text{ for all } R_{\alpha} \in \mathcal{P}, \end{eqnarray*} that is, \begin{eqnarray*} G (\xi_{\alpha}) + (1 + \varepsilon) D G (\xi_{\alpha}) \tilde{R}_{\alpha} & \supset & G (R_{\alpha}) \text{ for all } R_{\alpha} \in \mathcal{P}. \end{eqnarray*} We can show this by brute force. By properties of the Riemann integral, we may assume each $R_{\alpha}$ is a cube $R_{\alpha} = \{ x \in \mathbb{R}^n \of | x - \xi_{\alpha} |_{\infty} \leq \delta_{\alpha} \}$. Fix $R_{\alpha} \in \mathcal{P}$. Let $\xi_{\alpha} + y \in R_{\alpha}$ (i.e. $y \in \tilde{R}_{\alpha}$). Then we want to find $v$ such that \begin{eqnarray*} G (\xi_{\alpha} + y) & = & G (\xi_{\alpha}) + D G (\xi_{\alpha}) v. \end{eqnarray*} By the fundamental theorem of calculus, \begin{eqnarray*} G (\xi_{\alpha} + y) & = & G (\xi_{\alpha}) + D G (\xi_{\alpha}) y + \Phi (\xi_{\alpha}, y) y,\\ \Phi (\xi, y) & = & \int_0^1 (D G (\xi + ty) - D G (\xi)) d t. \end{eqnarray*} Thus we want \begin{eqnarray*} G (\xi_{\alpha}) + D G (\xi_{\alpha}) y + \Phi (\xi_{\alpha}, y) y & = & G (\xi_{\alpha}) + D G (\xi_{\alpha}) v. \end{eqnarray*} Performing algebra yields \begin{eqnarray*} v & = & y + D G (\xi_{\alpha})^{- 1} \Phi (\xi_{\alpha}, y) y. \end{eqnarray*} Let us use $\tmop{maxsize} (\mathcal{P}) \assign \max_{R_{\alpha} \in \mathcal{P}} \delta_{\alpha}$. Suppose $\tmop{maxsize} (\mathcal{P}) = \delta$. Then \begin{eqnarray*} {| v |_{\infty}}_{\nosymbol} & \leq & (1 + \| D G (\xi_{\alpha})^{- 1} \Phi (\xi_{\alpha}, y) \|) | y |_{\infty}\\ & \leq & (1 + \sup_{\xi \in R, | x | \leq \delta} \| D G (\xi)^{- 1} \Phi (\xi, x) \|) | y |_{\infty}\\ & \assign & (1 + q (\delta)) | y |_{\infty} \end{eqnarray*} Thus $v \in (1 + q (\delta)) \tilde{R}_{\alpha}$. Since $y$ was arbitrary, $$ G (R_{\alpha}) \subset G (\xi_{\alpha}) + D G (\xi_{\alpha}) (1 + q (\delta)) \tilde{R}_{\alpha} = G (\xi_{\alpha}) + (1 + q (\delta)) D G (\xi_{\alpha}) \tilde{R}_{\alpha} . $$ We now need to show that $q (\delta) = \sup_{\xi \in R, \| x \| \leq \delta} \| D G (\xi)^{- 1} \Phi (\xi, x) \| \rightarrow 0$ as $\delta \rightarrow 0$. Note that since $R \subset \mathcal{O}$ can be as small as we like, we can ensure there exists $r > 0$ such that $\Phi \of R \times \overline{B_r (0)} \rightarrow M (n \times n, \mathbb{R})$ is well defined (we just need $\{ x \of d (x, R) \leq r \} \subset \mathcal{O}$). Then note that $\Phi$ is continuous, and $\Phi (\xi, 0) = 0$. Since $\Phi$ is continuous on a compact set, $\Phi$ is uniformly continuous, so $q (\delta) = \sup_{x \in R, \| x \| \leq \delta} (\Phi (\xi, x) - \Phi (\xi, 0)) \rightarrow 0$ as $\delta \rightarrow 0$.