Let ${\mathcal D} = C^\infty_c$, the space of test functions (smooth functions $\Bbb R \to \Bbb C$ with compact support). Let $\mathcal D^*$ be the space of distributions (continuous linear functionals ${\mathcal D} \to {\Bbb C}$).
I am trying to fill in details of a proof in Richards & Youn's Theory of Distributions: A Nontechnical Introduction that
$\qquad$ If $S \in \mathcal D^*$ and $\varphi, \psi \in {\mathcal D}$, then $S * (\varphi * \psi) = (S * \varphi) * \psi)$.
My sticking point is this: I need to show that $\rho_n \to \rho$ in $\mathcal D$, where
$\qquad \rho(x) = \int_{-\infty}^{+\infty} \varphi(v) \psi(x-v) dv \quad$ (This defines the convolution $\rho := \varphi * \psi.)$ $\qquad \rho_n(x) = \sum_{m=-\infty}^{+\infty} \varphi (\frac m n)\psi(x - \frac m n) {\frac 1 n}$.
By "convergence in $\mathcal D$" is meant that, for each $k = 0, 1, 2, \dots$, the sequence of derivatives $\rho_n^{(k)}$ converges uniformly to the derivative $\rho^{(k)}$.
Because $\text{support }\varphi \subseteq [-M, M]$ for some positive integer $M$, the ostensibly infinite limits can be replaced by finite ones. Observe that $\rho_n$ is a Riemann sum for $\rho$, where $\rho_n$ uses the partition ${\mathscr P}_n = \{ \frac m n \}$ consisting of equally spaced points separated by distance $\frac 1 n$. As $\rho$ necessarily exists and as $\text{mesh } {\mathscr P}_n = {\frac 1 n} \to 0$, one has pointwise convergence $\rho_n(x) \to \rho(x)$ for each $x \in \Bbb R$.
Any test function is uniformly continuous, and $\varphi, \psi, \rho,$ and $\rho_n$ are all test functions. Richards & Youn state:
$\qquad \rho_n \to \rho$ uniformly because of that uniform continuity.
$\qquad$ Can anyone provide details as to why that should be so?
Some facts that might be helpful: $\text{support }\psi \subseteq [-N, N]$ for some positive integer $N$. Therefore $\text{support }\rho, \text{ support }\rho_n \subseteq [-M-N, M+N]$.
(I don't need to prove uniform convergence of the $k$th derivatives, $k>0$. Such a proof would be virtually identical to that for uniform convergence of $\rho_n$ to $\rho$.)
$$|\rho(x)-\rho_n(x)|=\left|\sum_m{\int_{m/n}^{(m+1)/n}{(\varphi(v)\psi(x-v)-\varphi(m/n)\psi(x-m/n))\,dv}}\right| \leq \sum_m{\int_{m/n}^{(m+1)/n}{|\varphi(v)\psi(x-v)-\varphi(v)\psi(x-m/n)+\varphi(v)\psi(x-m/n)-\varphi(m/n)\psi(m/n)|\, dv}}$$
Thus
$$|\rho(x)-\rho_n(x)| \leq \sum_m{\|\psi\|_{\infty}\int_{m/n}{(m+1)/n}{|\varphi(v)-\varphi(m/n)|\,dv}+\|\psi’\|_{\infty}\int_{m/n}^{(m+1)/n}{|\varphi(v)||v-m/n|\,dv}}$$.
Hence
$$|\rho(x)-\rho_n(x)| \leq \|\psi\|_{\infty}\|\varphi’\|_{\infty}\sum_{|m| \leq M}{\int_{m/n}^{(m+1)/n}{|v-m/n|\,dv}}+1/n\|\psi’\|_{\infty}\|\varphi\|_{L^1} \leq C/n$$
where $C$ is a constant depending on the first-order seminorms of $\varphi,\psi$ and on the support of $\varphi$ (assumed to be in $(-(M-1),M-1)$).