I was reading this paper but could not understand one of it's condition. It says a function $f(x)$ is twice differentiable and strongly convex with parameter $m$ and Lipschitz continuous gradient with parameter $M$ and $M\geq m$. Then: $$(x-y)^T(\nabla f(x)-\nabla f(y) \geq \frac{mM}{m+M} \lVert x-y \rVert^2+\frac{1}{m+M}\lVert \nabla f(x)-\nabla f(y) \rVert ^2$$
I know that if $f$ is strongly convex and Lipshcitz continuous gradient, then : $$(x-y)^T(\nabla f(x)-\nabla f(y)) \geq m \rVert x-y\lVert ^2$$ and $$(x-y)^T(\nabla f(x)-\nabla f(y)) \geq \frac{1}{M} \lVert \nabla f(x)-\nabla f(y) \rVert ^2 $$. I tried multiply each by $M$ and $m$ and adding up but that didn't quite work.
I emailed the author of the paper, here is the reply I got: "It is a non trivial result which is often used in convex optimization. I guess you can find it in Nesterov's book Introductory Lectures on Convex Optimization: A Basic Course. Theorem 2.1.12"