The mathematics behind Fourier transform for image processing

2.7k Views Asked by At

I am following Robert Fisher's page on the Fourier transform. I understand the application of the Fourier transform behind image processing, but, right now, I am curious about the mathematics behind it, and it is giving me a bit of a hard time. For example,

$$ F (k,l) = \sum_{i=0}^{N-1} \sum_{j=0}^{N-1} f (i,j) \exp \left( - \iota 2 \pi \left( \frac{k i}{N} + \frac{l j}{N} \right) \right) $$

In this formula, where do all these equations come from? Could somebody please elaborate the mathematics behind the scene in layman's term?

3

There are 3 best solutions below

5
On

I have given a similar explanation of Fourier series here and here.

The Fourier series of a function $f : \mathbb R / \mathbb Z \to \mathbb C$ is a sum $\sum_{k = -\infty}^\infty \hat{f}(e^{2 \pi i k x}) e^{2 \pi i k x}$. Here, $e^{2 \pi i k x} = \chi_k (x)$ denotes the $k$-th character of the topological group $\mathbb R / \mathbb Z $ and $\hat{f} : \widehat{\mathbb R / \mathbb Z} \to \mathbb C$ denotes the Fourier transform of $f$.

The setting is: You have a topological group $G$, then you consider a space of functions $G \to \mathbb C$ on it, say for example $L^2(G)$. This is a Hilbert space and comes with an inner product $\langle \cdot , \cdot \rangle$ which lets you define orthogonality of functions in the space, and characters in particular. Those characters form an orthonormal basis for your functions which means that you can approximate every function in $L^2(G)$ as $\| f - \sum_{k = -N}^N \hat{f}(\chi_k) \chi_k (x) \|_2 \xrightarrow{N \to \infty} 0$.

0
On

In my opinion it is quite okay to understand Fourier transform as orthogonal basis matrices to evaluate the certain frequencies for a given image. However I have the following link which will be helpful for you to further understand:

http://sharp.bu.edu/~slehar/fourier/fourier.html#harmonics (dead link) The working one

Good luck.

0
On

I think the easiest way of understanding the math, Is understanding what it means. It's also easier to start in 1D and only then move on to higher dimensions.

To see what the sum you mentioned is, try inserting a wave with constant frequency. You see that the summation only gives a peak at the "correct" frequency. If you have a super-position of waves, the sum will give you a number of peaks each corresponding to a different part of the super-position.

Thus, the sum gives you the spectrum of frequencies in the wave or image.