My question is more about the meaning of Mahalanobis distance when compared to L2 distance. As a reminder:
- Mahalanobis distance is the distance between a point $x$ and a distribution with mean $\mu$ and covariance $\Sigma$ can be calculated as: $d_{\text{Mahalanobis }} = \sqrt{(x-\mu)^T\Sigma^{-1} (x-\mu)}$
Back to the question: if the Euclidean distance between a point $p_a$ and $p_b$ is for example 0.5 and the Mahalanobis distance between the point $p_a$ and some other distribution $\beta$ is 0.75, could we state that point $p_a$ is closer to $p_b$ than distribution $\beta$?
If not, then under what conditions we could have this comparison valid?
Comparing a Euclidean distance directly to a Mahalanobis distance does not make sense, generally.
However, define distribution $\alpha$, such that $\vec{\mu_{\alpha}} = \vec{p_b}$ and $\Sigma_\alpha = I$. So $\alpha$ is centered at $\vec{p_b}$ and spherical.
The Mahalanobis distance between $\alpha$ and $\vec{p_a}$ is
$$ d_M{(\alpha,p_a)} = \sqrt{(\vec{p_a}-\vec{\mu_\alpha})^T\Sigma_\alpha^{-1}(\vec{p_a}-\vec{\mu_\alpha})} = \sqrt{\sum_{i=1}^n(p_{a_i}-p_{b_i})^2}$$
Thus the Euclidean distance between $\vec{p_a}$ and $\vec{p_b}$ is the Mahalanobis distance between $\vec{p_a}$ and $\alpha$.
Now you can compare Mahalanobis distances and make a statement that $\vec{p_a}$ is closer to $\alpha$ (which has center $\vec{p_b})$ than to $\beta$.