I am trying to understand why the formulation $\lambda f(x_1)+(1-\lambda)f(x_2)$ should be greater than $f[\lambda x_1+(1-\lambda)x_2]$ and what does it mean geometrically.
Convexity condition of $f$: $$f[\lambda x_1+(1-\lambda)x_2]\leq\lambda f(x_1)+(1-\lambda)f(x_2)\quad\forall \space 0 < \lambda < 1$$
The idea is that the value of $f$ at a point between $x_1$ and $x_2$ is less (or equal) than the value at the same point for the line segment between $f(x_1)$ and $f(x_2)$.
(credit Wikipedia)
The expression $\lambda x_1+(1-\lambda)x_2$ is just a parametrization for all the points between $x_1$ and $x_2$ on $x$ axis and $\lambda f(x_1)+(1-\lambda)f(x_2)$ is the corresponding parametrization for the line segment between $f(x_1)$ and $f(x_2)$.
The concept can be generalized for more points by Jensen's inequality.