Convexity in NMF proof

103 Views Asked by At

I am currently studying a proof for the NMF multiplicative update rule based on generalized Kullback-Leibler divergence in the paper Algorithms for Non-negative Matrix Factorization, and for some reason I just can't seem to figure out how to derive equation 29 (in proof for Lemma 3).

I was thinking about using the classic version of Jensen's inequality, stating that $\varphi\Big(\frac{\sum_{a} \alpha_a x_a}{\sum_{a} \alpha_a} \Big) \leq \frac{\sum_{a} \alpha_a \varphi(x_a)}{\sum_{a} \alpha_a}$ for positive $\alpha_a$'s and $\varphi$ convex. Since the $\alpha_a$'s sum to unity, I guess the inequality can be reduced to $\varphi(\sum_{a} \alpha_a x_a) \leq \sum_{a} \alpha_a \varphi(x_a)$, and hence by letting $x_a = \frac{W_{ia} h_a}{\alpha_a}$, we obtain the inequality \begin{align*} \log \sum_a \alpha_a \frac{W_{ia} h_a}{\alpha_a} = \log \sum_a W_{ia} h_a \leq \sum_a \alpha_a \log \frac{W_{ia} h_a}{\alpha_a}, \end{align*} but then multiplying with -1 the inequlity flips, which is not the case in the paper.

Any suggestions to where I'm wrong and how to fix it?

1

There are 1 best solutions below

2
On BEST ANSWER

You should apply the Jensen inequality (as stated) to the function $-\log$ (which is convex) instead of $\log$ (which is concave).