Is Jensen's inequality an iff condition on convex functions?

3.9k Views Asked by At

According to wikipedia this is Jensen's inequality:

If X is a random variable and φ is a convex function, then:

$$\varphi\left(\mathbb{E}[X]\right) \leq \mathbb{E}\left[\varphi(X)\right].$$

Which stated as an implication reads as follows:

If Convex $\implies $ $\varphi\left(\mathbb{E}[X]\right) \leq \mathbb{E}\left[\varphi(X)\right].$

However, I was wondering if the converse was also true.

$\varphi\left(\mathbb{E}[X]\right) \leq \mathbb{E}\left[\varphi(X)\right]$ $\implies $ Convex.

I thought it would be an iff because of the definition of convexity. Recall what convexity means (and let me explain why I thought iff for jensen's):

f is called convex if: $$\forall x_1, x_2 \in X, \forall t \in [0, 1]: \qquad f(tx_1+(1-t)x_2)\leq t f(x_1)+(1-t)f(x_2).$$

Since its a definition, the the word convex and its definition statement are logically equivalent. Therefore, its an iff, i.e.

f is called convex $ \Leftrightarrow \forall x_1, x_2 \in X, \forall t \in [0, 1]: \qquad f(tx_1+(1-t)x_2)\leq t f(x_1)+(1-t)f(x_2).$

right? Does that logic generalize to Jensen's inequality or am I wrong for both?

Context:

The reason that I care about this is because I wanted to proof that a multivariable function f is convex. My plan of attack was to show Jensen's inequality "worked" on it and since Jensen's inequality and convexity are logically equivalent, then I can conclude that f is convex. But that only works if its an iif. Also, I absolutely wanted to avoid taking the Hessian or derivatives to prove it was convex, I strictly wanted to stick with the "original" definition of convexity to prove this (i.e. the inequality definition).

3

There are 3 best solutions below

0
On BEST ANSWER

The convexity of $f$ is equivalent to Jensen's inequality holding for all random variables $X$. (If it holds for all $X$, then in particular it holds for the random variable with $P(X=x_1)=t$ and $P(X=x_2)=1-t$, where Jensen reduces to the definition of convexity.) So if somehow it's easier to prove Jensen for all $X$ (perhaps because the generality of that statement clears away distractions), then sure, your method is sound.

6
On

That is not true take $U$ uniform in $[0,1]$ and consider the function $f(x)=x^3$. Then one can see that $E[\psi(X)]\geq \psi(E[X])$ but $f$ is non-convex.

In general convexity is a much stronger argument than Jensen's inequality.

0
On

It depends on the underlying probability space.

For example, if the measure is $\delta_{1\over2}$ on $[0,1]$, then $E X= X({1 \over 2})$, $E \phi(X) = \phi(X({1 \over 2}))$, so Jensen's inequality holds whether or not $\phi$ is convex.

For all $t\in [0,1]$ you need to be able to find a set $A$ such that $pA = t$.

If you can do this, let $X=x_1 \cdot 1_A + x_2 \cdot 1_{X \setminus A}$. Then $EX = t x_1 + (1-t) x_2$ and $E \phi(X) = t \phi(x_1) + (1-t) \phi(x_2)$, which implies that $\phi$ is convex if Jensen's inequality holds.