I wanted to know more about the different methods for comparing the similarities of two probability distributions P and Q.
I wanted a list of the different methods that exist for comparing probability distributions.
For example, the two that I am aware that exist are:
- KL-Divergence
- EMD (earths mover's distance)
I was wondering if people knew about more different measures and if they could provide maybe a good reference for learning about it. Also, on top of providing the distance measure that you are suggesting, if you could provide a brief intuition on it before I research it more, it could be very helpful!
There is the total variation distance between finite measures $\mu$ and $\nu$: $$ \frac{1}{2} \sum_{i \in S} |\mu(i) - \nu(i)| \qquad \ (\text{discrete case}) $$
$$ \frac{1}{2} \int_\Omega |\mu(i) - \nu(i)|d\pi \qquad \ (\text{continuous case}) $$ (here $\mu,\nu << \pi$).
There is the $\chi^2$ distance which J.A. Fill used in a paper (can't recall the name) to measure the distance between the distribution $\pi_n$ at time $n$ with the steady state $\pi$ for a Markov Chain on a countable state space $S$: $$ \chi^2_n = \sum_{i \in S} \frac{(\pi_n(i) - \pi(i))^2}{\pi(i)}. $$ This is giving the relative distance at $i$ compared to the size of $\pi(i)$, so the effect is amplified where $\pi$ is small. Fill bounds the total variational distance with the $\chi^2$ distance.
This link lists several other metrics about which I'm afraid I don't know much (sorry!).