I need some help to understand the following proposition (mainly to understand how it is proven):
Let $Y_1,Y_2...,Y_n$ be $n$ random variables which are independent, identically distributed random variables with probability density function $f$. The joint density of the order statistics $Y_{(1)},Y_{(2)},..,Y_{(n)}$ is given by:
$\textbf{(1)}\quad$$f(y_1,y_2,...,y_n)= n!\prod\limits_{i=1}^{n}f(y_1) \qquad y_1 <y_2...<y_n$
$\textbf{(i)} \quad$the preceding follows since: ($Y_{(1)},Y_{(2)},..,Y_{(n)})$ will equal $(y_1,y_2...,y_n)$ if $(Y_1,Y_2,...Y_n)$ is equal to any of the $n!$ permutations of $(y_1,y_2...,y_n)$
** this part i get; they are saying that the ordered statistics is equal to the tuple $(y_1,y_2...,y_n)$ only if the unordered $(Y_1,Y_2,...Y_n)$ is equal to a permutation of $(y_1,y_2...,y_n)$. But what does have for consequence in $\textbf{(1)} ??$ **
$\textbf{(ii)}\quad$ the probability density that $(Y_1,Y_2,...,Y_n)$ is equal to $y_{i_{1}},...,y_{i_{n}}$ is $\prod_{j=1}^{n}f(y_{i_{j}}) = \prod_{j=1}^{n}f(y_j)$ when $i_1,...i_n$ is a permutation of $1,2...,n$
** here Iam completely lost; the probability density that $(Y_1,Y_2,...,Y_n)$ is equal to $y_{i_{1}},...,y_{i_{n}}$
what do they mean? probability density (function)? I don't know what they mean at all with this statement
It may help to see a simple example of small size. Suppose $n = 2$. Then our order statistics consist of the smaller of the two observations, and the larger. So if, say, we observed $Y_1 = 3, Y_2 = 5$, then $Y_{(1)} = Y_1 = 3$ and $Y_{(2)} = Y_2 = 5$; nothing has changed. But if the first observation was $5$ and the second was $3$, we would have to interchange them to write $Y_{(1)} = 3, Y_{(2)} = 5$ as we did before. Notice then that for a given realization of the ordered set of random variables, there is in general two ways to have observed the original variables so as to obtain the ordered result (we can ignore the case where the observations are equal because for continuous random variables, such an outcome has infinitesimal probability).
Therefore, in general, for any given realization of order statistics of a sample of size $n$, there are $n!$ possible ways that we could have observed a sample that, upon ordering, would yield such a realization. That's basically why the joint density is scaled the way it is.