$X \sim U(0,\theta)$. Find the distribution of $X_1\mid \max (X_i)$

160 Views Asked by At

The problem is to find the distribution of $X_1\mid M$ where $M$ is the maximum of the i.i.d. random variables $X \sim U(0,\theta)$. I have a complete solution but am having trouble justifying one step. We use Bayes' Theorem for CDF's to get started:

$$ P(X_1 < x_1 \mid M < m) = \frac{P(M < m \mid X_1 < x_1) P(X_1 < x_1)}{P(M < m)} $$

The cdf's for $M$ and $X_1$ are $(m/\theta)^n$, by independence, and $x_1/\theta$. The cdf for $M\mid X_1$ is $(m/\theta)^{n-1} {\bf 1} [x_1 \leq m]$. The justification I have is that if the observed value $x_1$ is greater than $m$, then $m$ cannot be the maximum. So, I threw the indicator on the cdf in order to justify that $M\mid X_1$ is just the distribution of the maximum excluding $X_1$. So,

$$ \frac{P(M < m \mid X_1 < x_1) P(X_1 < x_1)}{P(M < m)} = \frac{(x_1/\theta) (m/\theta)^{n-1}}{(m/\theta)^n} = \frac{x_1}{m} $$

It follows that $X_1\mid M \sim U(0,m)$.

Is my justification for the distribution of $M\mid X_1$ correct? I believe my final answer is intuitive.

2

There are 2 best solutions below

7
On BEST ANSWER

We want the CDF of $X_1|M=m$. One knows that: \begin{align} F_{X_1|M}(x|m)=\int^x_{-\infty} \frac{f_{X_1,M}(u,m)}{f_M(m)}\,du \end{align} It is easy to find \begin{align} F_{X_1,M}(x,m)=P(X_1<x, M<m)= \begin{cases} \left(\frac{m}{\theta}\right)^n & \text{ if } & 0\leq m \leq x \leq \theta\\ \frac{x}{\theta}\left(\frac{m}{\theta}\right)^{n-1} & \text{ if } & 0\leq x < m \leq \theta\\ \end{cases} \end{align} Note that $F_{X_1,M}$ is not differentiable everywhere. But differentiating where it is differentiable yields: \begin{align} f_{X_1,M}(x,m)=\frac{(n-1)m^{n-2}}{\theta^n}\mathbf{1}_{0\leq x<m\leq\theta} \end{align} We also know that $f_M(m)=\frac{nm^{n-1}}{\theta^n}$, just differentiate the CDF you already have. So we get: \begin{align} F_{X_1|M}(x|m) = \begin{cases} 0 & \text{ if } & x<0\\ \frac{x(n-1)}{mn} & \text{ if } & 0\leq x<m\\ 1 & \text{ if } & m\leq x\\ \end{cases} \end{align}

0
On

Suppose you're asking for $P(X_1 \leq a \mid \max X_i=b)$. Consider $n=2$. Then the relevant set is the intersection between the line segments between $(0,b)$ to $(b,b)$ and $(b,0)$ to $(b,b)$, with the half-plane $X_1 \leq a$. Assuming of course that $0<a<b$, then you're looking at the ratio of the lengths of the segment $(0,b)$ to $(a,b)$ and the whole pair of segments. So that's $\frac{a}{2b}$. The results for $a \leq 0$ and $a \geq b$ are clearly $0$ and $1$ respectively.

For general $n$, what happens? Well, for $0<a<b$ again, you're missing one "face" which is the face where $X_1$ is actually the biggest one; this face has probability $1/n$. Other than that, the fraction of the "area" on each face where $X_1 \leq a$ is $a/b$. So you're looking at a general result of $\frac{a}{b} \left ( 1 - \frac{1}{n} \right )$. Note that this can be interpreted as $P(X_1 \leq a \mid X_1 \leq b) P(X_1<\max X_i)$.

Of course this approach is geometric, but you can use the general formulae for conditional CDFs/PDFs to do it analytically.