Suppose $X_1, ..., X_n \stackrel{iid}{\sim}$ Exponential(rate = $\lambda$) independent of $Y_1, ..., Y_n \stackrel{iid}{\sim}$ Exponential$(1)$.
Define $Z_i \equiv \min\{X_i, Y_i\}$
I want to find the maximum likelihood estimator for $\lambda$ in the following scenario: I observe $Z_1, ..., Z_n$ and $Y_1, ..., Y_n$ but NOT any of the $X_i$.
First I need to determine the likelihood and then maximize it over $\theta > 0$, but I'm not really sure of the right approach. I calculate the joint cdf as follows:
$$P(Z_i \leq z, Y_i \leq y) = \begin{cases} P(Y_i \leq y), & y \leq z \\ P(Y_i \leq z, Y_i \leq X_i) + P(Y_i \leq y, X_i \leq z, X_i < Y_i), & y > z\end{cases} \\ = \begin{cases} 1- e^{-y}, & y \leq z \\ 1-e^{-z} + (e^{-z}-e^{-y})(1-e^{-\lambda z}), & y > z \end{cases}$$
This is because $Z_i \leq Y_i$ always. Would the likelihood function therefore be:
$$L(\lambda |Y_i, Z_i, i \in \{1,...n\}) = \prod_{\{i : Y_i = Z_i\}} (1-e^{-Y_i}) \prod_{\{i:Y_i > Z_i\}} \lambda e^{-Y_i}e^{-\lambda Z_i}$$
splitting into the "discrete" and "continuous" parts? Or am I getting this wrong? Or should I be doing something like here or here? I should note my scenario is different than theirs, as intuitively at least, observing the magnitude of the difference between the minimum and the maximum (in the cases where $Z_i$ and $Y_i$ differ) should give us more information about $\lambda$, right?
Would this be $$\prod_{\{i: Y_i = Z_i\}} \frac{1}{\lambda +1} \prod_{\{i: Y_i > Z_i\}} e^{-Y_i}\lambda e^{-\lambda Z_i} $$
where we just have the point mass/probability of equality contributing when $Y_i = Z_i$ and the joint density contributing otherwise. Can someone please provide some insight?