On this math.MO post, "What is convolution intuitively?", Terence Tao's answer (in the case where one function is a bump function) involves "blurring" and "fuzz."
Could someone clarify his interpretation more explicitly? The intuition still escapes me. Thanks!
Consider a function $f_1$: $$f_1(x)=\delta(x)$$ where $\delta(x)$ is Dirac delta, and a Gaussian function $g(x)$: $$g(x)=\frac1{\sigma\sqrt{2\pi}}\exp\left(-\frac{x^2}{2\sigma^2}\right).$$ One of main properties of Dirac delta is this: $$\int_{-\infty}^\infty q(x)\delta(x-x_0)dx=q(x_0).\tag1$$ Thus, convolution of $f_1$ and $g$ will equal $g$. Let's see how it looks ($f_1$ is red, $f_1*g$ is blue):
You can see the 'blurring' effect of gaussian convolved with a Dirac delta. Now consider more complex function: $$f_2(x)=\sum_{i=-N}^N a(x_i) f_1(x-x_i)\Delta x_i.\tag2$$ $\Delta x_i$ here is step in $x$, i.e. $\Delta x_i=x_i-x_{i-1}$. We'll need it later. We'll take $\Delta x_i=\Delta x_j\; \forall x,j\in\mathbb{Z}$, then it can be viewed as just a coefficient before sum.
Here's a plot of $f_2$ and $f_2*g$ ($f_2$ is red, $f_2*g$ is blue):
Here you can also see that the result of convolving a gaussian with a sum of Dirac deltas gives you blurred version of original function. It's easy to generalize $f_2$ to a sum of infinite number of deltas with $N\to\infty$: $$f_{2a}(x)=\sum_{i=-\infty}^\infty a(x_i)f_1(x-x_i)\Delta x_i.$$
$f_{2a}(x)$ resembles a Riemann sum. Let's use this and take the limit $\Delta x_i\to0$: $$f_3(x)=\lim_{\Delta x_i\to 0}f_{2a}(x)=\int_{-\infty}^\infty a(t)f_1(x-t)dt\equiv\int_{-\infty}^\infty a(t)\delta(x-t)dt.\tag3$$ From $(1)$, $$f_3(x)=a(x).$$ In $(3)$ after taking limit $x_i$ becomes $t$, and $\Delta x_i$ becomes $dt$. $\Delta x_i$ is needed to gradually attenuate Dirac delta amplitude as $\Delta x_i\to 0$, so that in the limit it becomes finite, resulting in a finite function.
Here's a plot of $f_3$ and $f_3*g$ ($f_3$ is red, $f_3*g$ is blue):
This is finally the convolution of a usual function with a gaussian, which still does resemble the blurring, but is somewhat less obvious.
Also note that since $p*q=q*p$, you could as well start from a Dirac delta, represent $g(x)$ as a sum of scaled&translated $\delta(x)$'s and get the same result by summing $a(x)$'s scaled by $g(x_i)$ instead of $g(x)$'s scaled by $a(x_i)$.