A simple arithmetic average question - two approaches

165 Views Asked by At

I'm trying to get the weighted average of CpM (Cost per Mile) for the dataset below: enter image description here

I'm calculating the average by:

1st approach: Average CpM = sum(TotalCost / Miles) / count = (2 + 2.85 + 3.1 + 2.9) / 4 = $2.715

However, I've seen a colleague calculate the weighted average CpM the following way:

2nd approach: Average CpM = sum(TotalCost) / sum(Miles) = (100 + 200 + 310 + 290) / (50 + 70 + 100 + 100) = $2.81

When we introduce a new outlier that has extremely high CpM to the dataset, the 1st approach tends to shoot up along with the outlier, while the 2nd is more robust to the outlier:

enter image description here

1st approach: Average CpM = sum(TotalCost / Miles) / count = (2 + 2.85 + 3.1 + 2.9 + 10) / 5 = $5.21

2nd approach: Average CpM = sum(TotalCost) / sum(Miles) = (100 + 200 + 310 + 290 + 100) / (50 + 70 + 100 + 100 + 10) = $3.03

I'm not sure if the 2nd approach of calculating Average CpM is a valid approach, and if not, what this formula means and what it's getting at. I also wonder if there are any other ways to calculate the weighted arithmetic mean.

2

There are 2 best solutions below

7
On BEST ANSWER

I'm trying to get the weighted average of CpM (Cost per Mile) for the dataset below: enter image description here

1st approach: Average CpM = sum(TotalCost / Miles) / count = (2 + 2.85 + 3.1 + 2.9) / 4 = $2.715

This is merely the simple average of the four CpM values, not any weighted average.

2nd approach: Average CpM = sum(TotalCost) / sum(Miles) = (100 + 200 + 310 + 290) / (50 + 70 + 100 + 100) = $2.81

This is the arithmetic mean of the four CpM values weighted by their mileage.

$$\frac{m_1}{m_1+m_2+m_3+m_4}\left(\dfrac{c_1}{m_1}\right)+\frac{m_2}{m_1+m_2+m_3+m_4}\left(\dfrac{c_2}{m_2}\right)+\frac{m_3}{m_1+m_2+m_3+m_4}\left(\frac{c_3}{m_3}\right)+\frac{m_4}{m_1+m_2+m_3+m_4}\left(\frac{c_4}{m_4}\right)\\=\boxed{\frac{c_1+c_2+c_3+c_4}{m_1+m_2+m_3+m_4}}.$$

what does this formula means and what is it getting at

It gives the representative (i.e., 'average') value of the four CpM values such that the trips with higher mileage are given more weight than the trips with lower mileage.

If you require more intuition, please skim the middle section of this answer: Calculating marks using simple average versus weighted average.

I also wonder if there are any other ways to calculate the weighted arithmetic mean.

You could also compute the arithmetic mean of the four CpM values weighted by their associated costs, but not useful.

1
On

Your first method does not give a meaningful result.

Imagine you want to build a house. Someone calculates the average cost of the building materials, by finding out the cost of one plank of wood, one brick, one doorbell, etc., and taking the average. That won't be useful because you'll need much more wood and bricks than doorbells.

You are doing the same here. You are taking the cost of one mile from the first trip, one mile from the second trip, etc, and taking the average. You are however paying for many more miles on some trips than others.