I have two distributions (generated from binned data) and wish to answer the question: Do they come from the same underlying distribution? I don't have the form of the underlying distribution.
Edit: Really I have ~ 30,000 samples (of some physical quantity, say v) distributed in some large physical space. The distribution I care about is in some very small subset of this physical space that has essentially zero sample points. I'd like to characterize the variability of the v distribution in physical space (or on different physical scales).
Most people use the distribution in the large space. But since I have so much data, I've broken the large space down into about 700 smaller spaces which overlap. I want to know if these 700 small space distributions (~3000 samples) are drawn from a common underlying distribution, and if so if it is the same underlying distribution as the large space's distribution is drawn from.
Part of my worry is that these data all have some (in the case of the smaller space distributions) common points, or in the most extreme case the sample points are a subset of the large space distribution.


Pearson's chi-squared test was invented exactly for this kind of task.