How does distribution theory plays role in solving differential equations? This question might seem to be very general. I will try to explain, please bear with me.
I understand, distributions make it possible to differentiate functions whose derivatives do not exist in the classical sense and any locally integrable function has a distributional derivative. In terms of differential equations, if coefficients of a differential operator are piece-wise continuous then we make use of distributions (how and why it works?).
I am more interested in their relation with Green's function. Please help me understand, how can I use distribution theory for solving differential equations.
The most basic application is the use of the fundamental solution (also known as the Green's function) to solve inhomogeneous linear problems. When $*$ is convolution and $\delta$ is the Dirac delta centered at zero, $\delta * f=f$ for a wide class of $f$. On the other hand, if $L$ is a linear differential operator, then $Lu * f=L(u*f)$. (Or at least, this is definitely true when $u,f$ are smooth.) This means that if you could find a solution to $Lu=\delta$, then you could convolve it with $f$ on both sides to get $L(u*f)=f$. So $u*f$ is the solution to $Lv=f$ if $u$ is the solution to $Lw=\delta$. This $u$ is called the fundamental solution or Green's function for the operator $L$.
Duhamel's principle lets us extend this to time-dependent problems, provided the spatial differential operator is constant (and again linear). Cf. http://en.wikipedia.org/wiki/Duhamel%27s_principle#General_considerations