Why should I "believe in" weak solutions to PDEs?

Question

Why should I "believe in" weak solutions to PDEs?

13.6k Views Asked by Bumbble Comm At 27 Mar 2026 - 2:59

This is a sort of soft-question to which I can't find any satisfactory answer. At heart, I feel I have some need for a robust and well-motivated formalism in mathematics, and my work in geometry requires me to learn some analysis, and so I am confronted with the task of understanding weak solutions to PDEs. I have no problems understanding the formal definitions, and I don't need any clarification as to how they work or why they produce generalized solutions. What I don't understand is why I should "believe" in these guys, other than that they are a convenience.

Another way of trying to attack the issue I feel is that I don't see any reason to invent weak solutions, other than a a sort of (and I'm dreadfully sorry if this is offensive to any analysts) mathematical laziness. So what if classical solutions don't exist? My tongue-in-cheek instinct is just to say that that is the price one has to pay for working with bad objects! In other words, I do not find the justification of, "well, it makes it possible to find solutions" a very convincing one.

A justification I might accept, is if there was a good mathematical reason for us to a priori expect there to be solutions, and for some reason, they could not be found in classical function spaces like $C^k(\Omega)$, and so we had to look at various enlargements in order to find solutions. If this is the case, what is the heuristic argument that tells me whether or not I should expect a PDE (subject to whatever conditions you want in order to make your argument clear) to have solutions, and what function space(s) are appropriate to look at to actually find these solutions?

Another justification that I would accept is if there was some good analytic reason to discard the classical notion of differentiability all together. Perhaps the correct thing to do is to just think of weak derivatives as simply the 'correct' notion of differentiability in the first place. My instinct is to say that maybe weak solutions are a sort of 'almost-everywhere' type generalization of differentiability, similar to the Lebesgue integral being a replacement for the Riemann integral which is more adept at dealing with phenomena only occurring in sets of measure $0$.

Or maybe both of these hunches are just completely wrong. I am basically brand new to these ideas, and wrestling with my skepticism about these ideas. So can somebody make me a believer?

Worth noting is that there is already a question on this site here, but the answer in this link is essentially that there exist a bunch of nice theorems if you do this, or that physically we don't care very much about what happens pointwise, only in terms of integrals over small regions. It should be clear why I don't like the first reason, and the second reason I may accept if it could be turned into something that looks like my proposed justification #2 - if integrals over small regions of derivatives are the 'right' mathematical formalism for PDEs. I just don't understand how to make that leap. In other words, I would like a reason to find weak solutions interesting for their own sake.

Original Q&A

There are 9 best solutions below

user658409 On 05 Aug 2019 - 8:20

People can maybe talk more generally but I have a really simple example (but helpful in my opinion):

Not all waves are differentiable. We want all waves to satisfy the wave equation (in some sense). That sense is weak.

Bumbble Comm On 05 Aug 2019 - 8:35

Reason 1. Even if you actually care only about smooth solutions, in some cases it is easier to first establish that a weak solution exists and separately show that the structure of the PDE and the geometry of the domain on which you are solving actually enforce it to be smooth. Existence and regularity are handled separately and using different tools.

Reason 2. There are physical phenomena which are described by discontinuous solutions of PDEs, e.g. hydrodynamical shock waves.

Reason 3. Discontinuous solutions may be used as a convenient approximation for describing macroscopic physics neglecting some details of the microscopic theory. For example in electrodynamics one derives from the Maxwell equations that the electric field of an electric dipole behaves at large distances in a universal way, depending only on the dipole moment but not on the charge distributions. On distances comparable to the dipole size these microscopic details start to become important. If you don't care about these small distances you may work in the approximation in which dipole is a point-like object, with charge distribution given by a derivative of the delta distribution. Even though the actual charge distribution is given by a smooth function, it is more convenient to approximate it by a very singular object. One can still make sense of the Maxwell equations, and the results obtained this way turn out to be correct (provided that you understand the limitations of performed approximations).

Reason 4. It is desirable to have "nice" spaces in which you look for solutions. In functional analysis there are many features you might want a topological vector space to have, and among these one of the most important is completeness. Suppose you start with the space of smooth functions on, say, $[0,1]$ and equip it with a certain topology. In this case it is completely natural to pass to the completion. For many choices of the topology you will find that the completed space contains objects which are too singular to be considered as bona fide functions, e.g. measures or distributions. Just to give you an example of this phenomenon: if you are interested in computing integrals of smooth functions, you are eventually going to consider gadgets such as $L^p$ norms on $C^{\infty}[0,1]$. Once you complete, you get the famous $L^p$ spaces, whose elements are merely equivalence classes of functions modulo equality almost anywhere. Space of distributions on $[0,1]$ may be constructed very similarly: instead of $L^p$ norms you consider the seminorms $p_f$ given by $p_f(g)= \int_{0}^1 f(x) g(x) dx$ for $f,g \in C^{\infty}[0,1]$. If you can justify to yourself that it is interesting to look at this family of seminorms, then distibutions (and also weak solutions of PDEs) become an inevitable consequence.

Bumbble Comm On 06 Aug 2019 - 1:55

First, you should not believe in anything in mathematics, in particular weak solutions of PDEs. They are sometimes a useful tool, as others have pointed out, but they are often not unique. For example, one needs an additional entropy condition to obtain uniqueness of weak solutions for scalar conservation laws, like Burger's equation. Also note that there are compactly supported weak solutions of the Euler equations, which is absurd (a fluid that starts at rest, no force is applied, and then it does something crazy and comes back to rest). They are a useful tool, connected to physics sometimes, but that is it.

In general, it is naive to ignore applications when studying or looking for motivations for theoretical objects in PDEs. Nearly all applications of PDEs are in physical sciences, engineering, materials science, image processing, computer vision, etc. These are the motivations for studying particular types of PDEs, and without these applications, there would be almost zero mathematical interest in many of the PDEs we study. For instance, why do we spend so much time studying parabolic and elliptic equations, instead of focusing effort on bizarre fourth order equations like $u_{xxxx}^\pi = u_y^2e^{u_z}$? (hint: there are physical applications of elliptic and parabolic equations). We study an extremely small sliver of all possible PDEs, and without a mind towards applications, there is no reason to study these PDEs instead of others.

You say you do not know anything about physics; well I would encourage you to learn about some physics and connections to PDEs (e.g., heat equation or wave equation) before learning about theoretical properties of PDEs, like weak solutions.

PDEs are only models of the physical phenomenon we care about. For example, consider conserved quantities. If $u(x,t)$ denotes the density (say heat content, or density of traffic along a highway) of some quantity along a line at position $x$ and time $t$, then if the quantity is truly conserved, it satisfies (trivially) a conservation law like $$\frac{d}{dt} \int_a^b u(x,t) \, dx = F(a,t) - F(b,t), \ \ \ \ \ (*)$$ where $F(x,t)$ denotes the flux of the density $u$, that is, the amount of heat/traffic/etc flowing to the right per unit time at position $x$ and time $t$. The equation simply says that the only way the amount of the substance in the interval $[a,b]$ can change is by the substance moving into the interval at $x=a$ or moving out at $x=b$.

The function $u$ need not be differentiable in order to satisfy the equation above. However, it is often more convenient to assume $u$ and $F$ are differentiable, set $b = a+h$ and send $h\to 0$ to obtain (formally) a differential equation $$\frac{\partial u}{\partial t} + \frac{\partial F}{\partial x} = 0. \ \ \ \ \ (+)$$ This is called a conservation law, and we can obtain a closed PDE by taking some physical modeling assumption on the flux $F$. For instance, in heat flow, Newton's law of cooling says $F=-k\frac{\partial u}{\partial x}$ (or for diffusion, Fick's law of diffusion is identical). For traffic flow, a common flux is $F(u)=u(1-u)$, which gives a scalar conservation law.

Whatever physical model you choose, you have to understand that (*) is the real equation you care about, and (+) is just a convenient way to write the equation. It would seem absurd to say that if one cannot find a classical solution of (+), then we should throw up our hands and admit defeat.

Most applications of PDEs, such as optimal control, differential games, fluid flow, etc., have a similar flavor. One writes down a function, like a value function in optimal control, and the function is in general just Lipschitz continuous. Then one wants to explore more properties of this function and finds that it satisfies a PDE (the Hamilton-Jacobi-Bellman equation), but since the function is not differentiable we look for a weak notion of solution (here, the viscosity solution) that makes our Lipschitz function the unique solution of the PDE. This point is that without a mind towards applications, one is shooting in the dark and you will not find elegant answers to such questions.

Bumbble Comm On 06 Aug 2019 - 12:50

Absolutely nothing in physics is completely described by a PDE, if you look at a sufficiently small resolution, because space and time are not continuous. (Since the OP has said in a comment that he doesn't know much physics, google for "Planck length" for more information.)

However almost everything in physics is described at a fundamental level by conservation laws which are most naturally expressed mathematically as integral equations not as differential equations.

Integral equations can be converted to differential equations with some loss of generality - i.e. you exclude solutions of the integral equations which are not sufficiently differentiable. But the solutions you might have excluded are interesting and useful from a physicist's point of view, so excluding them simply "because PDEs are easier to work with than integral equations" is throwing the baby out with the bathwater.

Hence, "weak solutions of PDEs" are a thing worth studying. Of course if you want to convert any interesting theorems about weak solutions back into the language of integral equations, feel free to do that - or even better, figure out a way to unify the two subjects using nonstandard analysis, or something similar! (Nonstandard analysis corresponds very well with physicists' idea of "infinitesimal quantities" which can be treated mathematically as if they are numbers even though they are not!)

Bumbble Comm On 06 Aug 2019 - 9:23

It is a fact that not all physical problems have smooth solutions. Often this situation arises from a set of conservation laws that are expressed mathematically by applying such laws to a finite control volume to obtain an integral equation. Then we let the size of the control volume go to zero and arrive at some PDEs if the flow is smooth. But then we discover that the PDEs are unable to solve many important problems and have to rethink our strategy.

When this first occurred to me I found it a bit shocking because surely differential calculus was the natural language for describing continua? After a bit I realised that the integral calculus is more fundamental. It can be applied to functions that are more general (Anything can be integrated, but not everything can be differentiated) and it is the form in which much physical knowledge comes to us.

I suspect you felt the same surprise that I did. I thought that I wanted to solve differential equations, so why would I start integrating things? The truth is the reverse. I really want to solve integral equations, and the PDE is a powerful tool, but only if it is valid. That it often is should come as another surprise.

Bumbble Comm On 06 Aug 2019 - 10:52

To the excellent longer answers above I will add a short one: weak solutions in a conveniently-chosen (and in particular, finite-dimensional) function space can often be explicitly computed, whereas strong solutions often cannot (even if one can prove a solution must theoretically exist). Computability has obvious and immense practical importance.

Of course, one does not simply believe in the weak solutions: one proves existence, approximability, and conservation theorems, etc, for the weak solutions.

Bumbble Comm On 07 Aug 2019 - 11:06

Well, I hope this doesn't come off as snarky, but why should we expect that $$x^2 +1 =0$$ should have solutions? And why should we abandon the meaning of "squaring" that we all first learned for real numbers and adopt $$(a,b)^2 = (a^2-b^2, 2ab)$$

It's not a perfect analogy but I think it's rather similar to your questions about PDE solutions.

Bumbble Comm On 11 Aug 2019 - 6:21

The existing answers provide good reasons towards the question in the title, but from the perspective of a geometer I feel the applications in physics aren't quite as convincing. It's true that singular phenomena that arises in for example conservation laws requires a suitable notion of a generalised solution, but why is it also useful for geometric problems?

One way I think of weak solutions is that they provide a candidate for a strong solution. Suppose you want to a solve a particular PDE problem with suitable data and you can prove the following:

A weak solution exists.
Any classical solution, if it exists, is also a weak solutions.
The weak solution is suitably unique.

Then from the above you can infer that if a classical solution exists, it must be the unique weak solution. Hence the problem of existence is effectively reduced to proving the regularity of the weak solution.

Hence in nice cases where existence can established in general (e.g. linear elliptic problems), weak solutions provide a way of solving PDE problems using the above methodology. This is method is effective for the technical reason that it allows us to work in spaces with better compactness properties.

If a solution doesn't always exist however, things get more interesting. If you can still establish the first three points, the solubility criterion is reduced to a regularity problem and we can then look for necessary/sufficient conditions based on this.

Example (Harmonic map flow): If $(M,g)$ and $(N,h)$ are Riemannian manifolds, a classical problem in geometric analysis is whether a non-trivial harmonic map $u : M \rightarrow N$ exists. In the case when $M$ is a closed surface, we have the following sufficient condition for existence due to Eells and Sampson; non-trivial harmonic maps $M \rightarrow N$ exist provided there exists no non-trivial harmonic map $S^2 \rightarrow N.$

This theorem can be proved using the harmonic map flow to "evolve" a given map $u_0$ into a harmonic map $u_*,$ which is the work of Struwe. This method doesn't always work as the flow may develop singularities in general, but the non-existence condition about harmonic spheres provides a sufficient condition to prevent these singularities from forming.

**Bumbble Comm** · Accepted Answer

Let's have a look at the Dirichlet problem on some (say smoothly) bounded domain $\Omega$, i.e. $$ -\Delta u=f \text{ in } \Omega\\ u=0~ \text{ on } \partial \Omega $$ for $f \in \text{C}^0(\overline{\Omega})$. Then, Dirichlet's principle states a classical solution is a minimizer of an energy functional, namely $E(u):=\dfrac{1}{2}\int_\Omega \left|\nabla u\right|^2 \mathrm{d}x-\int_\Omega f u ~\mathrm{d}x$. (Here we need some boundary condition on $\Omega$ for the first integral to be finite).

So the question one may ask is, if I have some PDE why not just take corresponding the energy functional, minimize it in the right function space and obtain a solution of the PDE. So far so good. But the problem that may occur is finding this minimizer. It can be shown that such functionals are bounded by below, so we have some infimum. As also stated in the Wikipedia article, it was just assumed (e.g. by Riemann) that this infimum will always be attained, which shown by Weierstrass unfortunately not always is the case (see also this answer on MO).

Hence, we find differentiable functions which are "close" (in some sense) to a "solution" of the PDE, but no actual differentiable solution. I feel that this is quite unsatisfactory.

So have could we save this? We can multiply the PDE (take the Laplace equation for simplicity) with some test function and integrate by parts to obtain $$ \int_\Omega \nabla u \cdot \nabla v~\mathrm{d}x= \int_\Omega fv~\mathrm{d}x $$ for all test functions $v$. But from what space should $u$ come from? What do we need to make sense to the integral?

Well, $\nabla u \in \text{L}^2(\Omega)$ would be nice, because then the first integral is well-defined via Cauchy-Schwarz. But as shown by Weierstrass, classical derivatives are not enough, so we need some weaker sense. And here we got to Sobolev Spaces and looking again at the last formula, we see the weak formulation.

I am aware that this does not give a full explanation to why one should "believe" in weak solutions, Sobolev spaces and so on. What I stated above is a quick run through how in my course on PDE the step from classical to weak theory was motivated and at least I was quite happy about it.

Why should I "believe in" weak solutions to PDEs?

There are 9 best solutions below

Related Questions in PARTIAL-DIFFERENTIAL-EQUATIONS

Related Questions in SOFT-QUESTION

Related Questions in REGULARITY-THEORY-OF-PDES

Trending Questions

Popular # Hahtags

Popular Questions