I am aware there are similar questions about the subject, but my question is probably much more simple.
I want to know why is the expectation of the second moment of continuous time white-noise equal to the some constant times the delta function, or:
$E[w(t)w(t+\tau)] = Q \delta(\tau)$
I am aware of the difficulties involving the definition of a function that is discontinuous in all points, but I just wish to know why engineers defined it like this. I appreciate on-line references to the subject. Thank you.
That is because the PSD of the white noise process is by definition a constant $Q$. So, by Wiener-Khinchin theorem the auto-correlation function is $$r_{ww}(\tau):=\mathbb{E}\left(w(t)w(t+\tau)\right)=\mathcal{F}^{-1}(Q)=Q\delta(\tau)$$
The motivation for the definition comes from the frequency domain description of random processes. White noise is a pure theoretical model which simply assumes that the spectral power density is same for the noise process in all frequency bands.
A practical example which has a description close to white noise is the Thermal noise in electrical circuits which has a constant power spectral density over a large band of frequencies. See here for more details.