From a textbook:
If $X$ is a continuous random variable, then so too is the new random variable $Y = Y (X)$. The probability that $Y$ lies in the range $y$ to $y + dy$ is given by $$g(y)=\int_{dS}f(x)dx,$$ where $dS$ corresponds to all values of $x$ for which $Y$ lies in the range $y$ to $y +dy$. The simplest case occurs when $Y (X)$ possesses a single-valued inverse $X(Y)$. In this case, we may write $$g(y)dy=\lvert\int_{x(y)}^{x(y+dy)}f(x')dx'\rvert=\int_{x(y)}^{x(y)+\lvert \frac{dx}{dy}\lvert dy}f(x')dx',$$ from which we obtain $$g(y)=f(x(y))\lvert\ \frac{dx}{dy} \lvert.$$
What I know:
- For this case, $x$ and $y$ has one to one correspondence, ie for each value of $x$, $Y(x)$ gives a unique value of $y$. Similarly $X(y)$ give unique $x$.
- What the integral does is to integrate a range of infinitesimal probability $f(x)$. That range is all $x$ that gives that particular value of $y$ by $Y(x)$.
What I don't know:
- Why is the integral needed to be "absoluted"? I think $f(x)$ is always positive because it is a probability distribution and by definition $0\le f(x)\le 1$, so there is no need to make sure the integral gives non-negative values.
- How is the absolute sign moved to the upper limit of the integral?
- How to evaluate the equation to give the final result? I think the text just divide the whole equation by $dy$ to give the final result and ignore what happens to the lower and upper limit of the integral. Again I don't how the absolute sign appears.
Attempts to answer: I searched wikipedia and google and found several online lecture notes. None of them explained the source of the absolute sign.
Sorry, I'm on a smartphone so I'll answer shortly.
1) $f>0$ but $x(y)$ could decrease, the integral must be taken from the lowest $x$ so, using the absolute value, you are just swapping the integration limits.
2) same as above, but we don't swap the limits but force the upper limit to be greater than the lower (here we abuse the $dx$ smallness).
3) suppose you know the primitive, the integral is a difference that, for $dx$ going to $0$, becomes again the $f$ function.