The old and modern definitions of total variation are actually equivalent?

651 Views Asked by At

According to wikipedia, the total variation of the real-valued function $f$, defined on an interval $[a,b]\subset \mathbb{R}$, is the quantity $$V_b^a=\sup_{P\in\mathcal{P}}\sum_{i=0}^{n_P-1}\left | f(x_{i+1})-f({x_i)}\right |$$ where $\mathcal{P}= \left \{P=\{x_0,\ldots, x_{n_P}\} \mid P \text{ is a partition of } [a,b]\right \}$.

This is actually the more traditional definition of total variation, found for example in Rudin's Principles of Mathematical Analysis from the 1960s. However, in modern image analysis applications, the total variation is defined differently: $$\text{TV}(f,\Omega)= \sup \, \bigg\{ \int_{\Omega} f\, div \phi \, dx : \phi \in C_c^{\infty} (\Omega,\mathbb{R}^N), \, \lvert \phi (x) \rvert \leq 1\, \forall x\in \Omega \bigg \} $$ where $\Omega$ is the domain on which we define the TV (and unlike the previous definition it could have dimension higher than 1).

Are these definitions somehow related? Perhaps equivalent?

2

There are 2 best solutions below

2
On BEST ANSWER

The two definitions, at least in the one-dimensional case, are related but not equivalent.

You can see in a moment that they are not equivalent because the second definition does not depend on the equivalence class of the function $f\in L^1$, whereas the first definition strongly depends on the pointwise value of the function. As an example, the Dirichlet function (i.e. the characteristic function of the rationals) has $V_0^1 = +\infty$ wheres $TV = 0$.

On the other hand, it can be proved that if $f\in L^1(a,b)$ is such that $TV(f, (a,b))$ is finite, then there exists a representative $\tilde{f}$ of $f$ such that $V_a^b(\tilde{f}) = TV(f, (a,b))$.

0
On

The "traditional" definition also appears in the same Wikipedia page, so perhaps look again at that article.

In the case where $f$ is differentiable with integrable derivative, your first definition of total variation can be written as $\int_a^b |f'(x)| \mathop{dx}$.

From the second definition with $\phi(a)=\phi(b)=0$, you may do integration by parts $$\int_a^b f(x) \phi'(x) \mathop{dx} = -\int_a^b f'(x) \phi(x) \le \int_a^b |f'(x)|, \mathop{dx}$$ where the last step is by Cauchy-Schwarz, with equality attained for $\phi(x)=-f'(x)/|f'(x)|$. So the supremum in the second definition becomes the expression I wrote in the previous paragraph.

I have been rather sloppy with assumptions/details, so if anyone more knowledgeable can correct me, please do.