When working on variance reduction techniques, I was studying stratified sampling.
Suppose we wanted to estimate a definite integral, and we decided to do so using classical Monte Carlo.
It can be shown that stratified sampling reduces the overall variance of our estimator, but I don't see intuitively why this is true.
In classical Monte Carlo, we sample points uniformly from the interval, evaluate the function at them, and take the average (scaled by the interval length).
In stratified sampling, we partition the interval into strata, collect samples from each stratum, and then combine our results.
So my question is: how does stratified sampling reduce the variance? I can see why the variance might be smaller within a particular stratum, but I don't see why the sum of the per-stratum estimates has a lower overall variance.
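To make the two procedures concrete, here is a minimal sketch of both estimators (the function names and the choice of equal-width strata are mine, not from the question):

```python
import random

def classical_mc(f, a, b, n):
    # Classical Monte Carlo: average f at n uniform points on [a, b],
    # scaled by the interval length.
    return (b - a) * sum(f(random.uniform(a, b)) for _ in range(n)) / n

def stratified_mc(f, a, b, n_strata, n_per_stratum):
    # Stratified sampling: split [a, b] into equal-width strata, estimate
    # each stratum's integral separately, and sum the per-stratum estimates.
    h = (b - a) / n_strata
    total = 0.0
    for k in range(n_strata):
        lo = a + k * h
        s = sum(f(random.uniform(lo, lo + h)) for _ in range(n_per_stratum))
        total += h * s / n_per_stratum
    return total
```

Note that the stratified version spends its total sample budget ($n_{\text{strata}} \times n_{\text{per stratum}}$ points) spread evenly across the interval, which is exactly the "non-clumping" effect discussed below.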
It forces a certain degree of non-clumping of the points that you actually use for quadrature. For example, suppose you are integrating $f(x)=x$ on $[-1,1]$. With regular Monte Carlo, it is possible that all $n$ of your integration points land in $[0,1]$ (this has probability $2^{-n}$, of course, but still). In those cases your quadrature result will be significantly larger than the desired result of $0$, which contributes variance. But if you take strata on $[-1,0]$ and $[0,1]$, for example, then at least the two strata themselves (in this example, the negative and positive values of the integrand) will be equally represented. More formally, by the law of total variance, $\operatorname{Var}[f(U)] = \mathbb{E}[\operatorname{Var}(f(U)\mid S)] + \operatorname{Var}[\mathbb{E}(f(U)\mid S)]$, where $S$ is the stratum containing $U$; stratified sampling with proportional allocation eliminates the second (between-strata) term, so its variance can never exceed that of classical Monte Carlo.
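You can check this numerically for the $f(x)=x$ example above. For a uniform $U$ on $[-1,1]$, the classical estimator with $n$ points has variance $4/(3n)$, while the two-strata estimator with $n/2$ points per stratum has variance $1/(3n)$, a factor-of-four reduction. The sketch below (my own naming; a rough empirical check, not a rigorous one) estimates both variances by repeating each estimator many times:

```python
import random

def mc_estimate(n):
    # Classical Monte Carlo for f(x) = x on [-1, 1]:
    # average of n uniform points, scaled by the interval length 2.
    return 2.0 * sum(random.uniform(-1.0, 1.0) for _ in range(n)) / n

def stratified_estimate(n):
    # Two strata [-1, 0] and [0, 1], n // 2 points each; each stratum has
    # length 1, so the integral estimate is the sum of the two stratum means.
    half = n // 2
    neg = sum(random.uniform(-1.0, 0.0) for _ in range(half)) / half
    pos = sum(random.uniform(0.0, 1.0) for _ in range(half)) / half
    return neg + pos

def empirical_variance(estimator, n, trials):
    # Run the estimator repeatedly and compute the sample variance of
    # the resulting integral estimates.
    xs = [estimator(n) for _ in range(trials)]
    m = sum(xs) / trials
    return sum((x - m) ** 2 for x in xs) / trials

random.seed(1)
v_mc = empirical_variance(mc_estimate, 20, 2000)
v_strat = empirical_variance(stratified_estimate, 20, 2000)
# The theoretical values for n = 20 are 4/60 and 1/60 respectively,
# so the stratified variance should come out roughly 4x smaller.
print(v_mc, v_strat)
```

Both estimators have the same mean (the true integral, $0$) and use the same number of function evaluations; only the spread of the estimates differs.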