Why use rejection sampling in Monte Carlo simulations?

480 Views Asked by At

I've noticed that a lot of physics Monte Carlo simulations make extensive use of rejection sampling, rather than inverse transform sampling. In my research, I'm sampling random energy transfers from energy loss distributions, ie: f(E) represents the relative probability of losing E energy in a collision between two particles. So I need to be able to generate random energy transfers such that the distribution of said transfers will converge to f(E).

I personally use inverse transform sampling all the time, and I'm wondering why people use rejection sampling.

Rejection sampling requires that you generate potentially many random numbers, whereas inverse transform sampling gives you a random variate from the desired distribution every time you generate your uniform variate. Rejection sampling also requires that you can test your variate selection using the original f(E), which could be a complicated function or a large numerical dataset in its own right.

For inverse transform sampling, I generate lookup tables: if I generate a uniform random variate R, I find the closest value in my lookup table and select my non-uniform random variate, which is a O(1) operation, since it's just indexing into an array, so run-time speed is not an issue.

Wikipedia mentions that generating those lookup tables may be computationally difficult, but the example they give of selecting from a random normal distribution is pretty trivial to numerically integrate, so I could generate the probability distribution quite readily. And if something has discontinuities that make it hard to integrate, it may not be any easier to generate a bounding box that's suitable for rejection sampling.

Are there any concrete examples of where this won't work? I suppose for many-dimensional large datasets, the lookup table might become prohibitively large, although I've happily used it with 3D probability distributions with no problems. And if you were generating a new probability distribution on the fly, so that you had to regenerate the lookup tables, then rejection sampling would be the way to go, but that's not the case in most of the papers I've been reading.