Computing a combined average (mean) of values

2.4k Views Asked by At

My textbook on descriptive statistics says that when we have a data set of m members, the mean is calculated like this:

x-bar = 1/m (Σfixi) where sigma sums values from 1 to k (we have k groups in our frequency table and xi is the representative of each group)

I get the idea, but it immediately asks the reader to prove that given a data set of m members with the mean value of x-bar and another data set of n members with the mean value of y-bar, the combined mean is calculable by the following formula: [m(x-bar)+n(y-bar)]/m+n

I tried to prove this by breaking it down to the definition but I simply can't get to the final formula.

2

There are 2 best solutions below

0
On BEST ANSWER

We know

$$\bar{x} = \frac{1}{m}\sum_{i=1}^m x_i \text{ and } \bar{y} = \frac{1}{n}\sum_{j=1}^n y_j.$$

So

$$\sum_{i=1}^m x_i = m\bar{x} \text{ and } \sum_{j=1}^n y_j = n\bar{y}.$$

Then, calculating the mean of all $x_i, y_j$ directly:

$$\mu = \frac{x_1 + x_2 + ... + x_m + y_1 + y_2 + ... + y_n}{m+n} = \frac{1}{m+n}\left(\sum_{i=1}^m x_i + \sum_{j=1}^n y_j\right) = \frac{m\bar{x} + n\bar{y}}{m+n}.$$

8
On

The mean of the new data set is $$\frac1{m+n}\left(\sum_{i=1}^{k}f_ix_i+\sum_{i=1}^{j}g_iy_i\right)$$ where there $g_i$'s are the frequencies for the new data set and $\sum_{i=1}^k f_i+\sum_{i=1}^j g_i=n+m$. Also $$n\bar{x}=\sum_{i=1}^kf_ix_i,\hspace{10mm}m\bar{y}=\sum_{i=1}^jg_iy_i$$ Therefore the new mean is $$\frac{n\bar{x}+m\bar{y}}{m+n}$$