The application of this problem is in bioinformatics, but the problem itself is a general math one, I think. I have a function that converts a sequence score to a value in time units. This function involves a logrithm, so it will of course be non-linear. I also have a sequence score for several different sequences, and I want to find the average.
I don't know whether to apply the function to each individual score, and then average all of those time values, or average the sequence scores, and then apply my function to that average. Both approaches give a different number, and I was hoping someone could tell me why one is obviously wrong.
Cheers, Dave
This is a difficult question, and not a mathematical one. When you measure something, often the number you get needs to be scaled to get the number you want. You might imagine weighing an object on a scale with a nonlinear spring. In this case you would want to apply the function to each measurement. In another case, we know that hearing response is logarithmic-that doubling the sound pressure makes a given increment in perceived volume. Whether you want to average the sound pressure or its log depends upon whether you are interested in perceived volume or how strong something has to be to withstand the pressure. You have to think clearly about what the average is for before you can know how to get it.