Can someone please explain why is $P(X > Y) = \frac{1}{2}$, when $X, Y$ are i.i.d. continuous random variables? I have seen people use the symmetry argument to justify this answer. The argument goes as follows: there are two ways of arranging two numbers $x$ and $y$, and out of these arrangements only one has $x > y$. So, the probability using symmetry is $0.5$.
I don't understand this conclusion. Doesn't this argument make an assumption that the values of $X$ and $Y$ that are drawn are not equal? To take a more concrete example, if we assume $X, Y$ are standard normal, wouldn't the sample space be divided into three events: $X > Y, X < Y, X = Y$? Based on this, we can say that $X > Y$ and $X < Y$ must be equal using symmetry, and let's call this value $\alpha$. So,
$2\alpha + P(X=Y) = 1$.
Clearly, $\alpha < 0.5$, contrary to the first argument. So, which is the correct argument and why?
Edit: based on the comments I am adding the calculations for $X=Y$ in case of normal distributions. Can someone point out the mistake? Thanks!
$P(X=Y) = \int_{-\infty}^{\infty} P(X=Y|Y=y) P(Y=y) dy$
$P(X=Y) = \int_{-\infty}^{\infty} \frac{1}{2\pi} e^{-y^{2}} dy$
$P(X=Y) = \frac{1}{2\sqrt\pi}$.
Intuitively, the symmetry argument feels like an approximation (probably a very good one). Imagine we have a bivariate normal distribution, which is formed using $X$ and $Y$. $P(X>Y)$ represents the region below the line $X=Y$ (in the 1st quadrant). Similarly, we can argue about the values in the other 3 quadrants. By geometry, the area of line is zero (because the line has no width?), and hence you arrive at the 1D analogy that the probability it takes a specific value is zero. Still not sure though why it doesn't show up in the calculations above.
Edit: I realize the mistake I made in the above calculations. Going from step 1 to step 2, when I replaced $P(X=Y|Y=y)$ with $f_{X}(y)$, this is wrong. As pointed out in the comments and answer below, this must be equal to zero. I confused (/abused) the notation for the discrete and continuous cases. Thanks everyone for an interesting discussion.
The distribution of $X-Y$ is symmetric around zero, that is $\mathsf{P}(X-Y\le v)=\mathsf{P}(Y-X\le v)$ for all $v\in \mathbb{R}$ (for example, you may show that the characteristic function of $X-Y$ and $Y-X$ are equal, i.e. $\varphi_{X-Y}(t)=\varphi_{Y-X}(t)=\varphi_X(t)\varphi_X(-t)$). Then as you noticed $$ \mathsf{P}(X-Y>0)+\mathsf{P}(Y-X>0)+\mathsf{P}(X-Y=0)=1, $$ and the result follows assuming that $\mathsf{P}(X-Y=0)=0$. (This is always true for continuous r.v.s.)