Question about convolution of iid random variables with an emphasis on the iid concept

496 Views Asked by At

Let's say I want to find the $n$-fold convolution of $n$ iid binomial random variables. My question is regarding the probability of success $p$. Would $p$ be the same for each random variable in the sample or is the correct way to allow $p$ to vary, that is, probability of success for each $X_i$ is $p_i$ for all $i = 1,\dotsc,n$. If it's the latter, then I don't understand the concept of iid. If these random variables are iid, then the expected value (at least) will be the same for each $X_i$; but if $p_i$'s are not necessarily equal, how can it be claimed that the expected values are equal?

2

There are 2 best solutions below

5
On BEST ANSWER

Let's do it for $n=2.$ If, independently, $X_1, X_2 \sim \mathsf{Binom}(5, 1/3),$ then $Y = X_1 + X_2 \sim \mathsf{Binom}(10, 1/3).$ Intuitively, you can think of $X_1$ as the sum of the first five Bernoulli trials and $X_2$ as the sum of the second five. Perhaps the easiest formal proof is via moment generating functions. (The discussion extends to general $n$ without difficulty.)

The figure below was made by generating 100,000 realizations of $X_1, X_2,$ and $Y = X_1 + X_2,$ then making a histogram of the 100,000 sums $Y.$ Centers of open circles show the exact distribution of $\mathsf{Binom}(10, 1/3).$

enter image description here


R code for the sampling and the figure is shown below.

x1 = rbinom(10^5, 5, 1/3)
x2 = rbinom(10^5, 5, 1/3)
y = x1 + x2
hist(y, prob=T,br=(-1:10)+.5, col="skyblue2", main="Y ~ BINOM(10,1/3)")
  yy = 0:10; pdf=dbinom(yy, 10, 1/3)
  points(yy,pdf,col="red")
0
On

There's no such thing as the convolution of two random variables. Two distributions (ie probability measures on $\Bbb R$) have a convolution, but a distribution is not a random variable.

The notion of the convolution of two probability distributions has a lot to do with independence, nothing necessarily to do with "identically distributed": If $X$ and $Y$ are independent random variables then the distribution of $X+Y$ is the convolution of the distribution of $X$ with the distribution of $Y$. That holds regardless of whether $X$ and $Y$ are identically distributed (and it doesn't say anything about the non-existent "convolution" of $X$ and $Y$.)