Let $z_i = (y_i, x_i')'$ be an i.i.d sample of $N$ observations and let $z_i$ have density of the form: $$f(z_i \mid \theta_0) = f_1(y_i \mid x_i, \theta_0) f_2(x_i \mid \theta_0)$$ Consider the joint MLE estimator: $$\widehat{\theta}_J = \operatorname*{arg\,max}_{\theta \in \Theta} \frac{1}{N} \sum_{i=1}^{N} \ln(f_1(y_i \mid x_i, \theta_0) f_2(x_i \mid \theta_0))$$ Now consider the information matrix $I_J$, which is equal to: $$I_J = -\mathbb{E}\left[\frac{\partial^2 \ln\left(f_1(y_i \mid x_i, \theta_0)f_2(x_i \mid \theta_0) \right)}{\partial \theta \, \partial \theta'} \right] \\ = \mathbb{E}\left[\frac{\partial \ln\left(f_1(y_i \mid x_i, \theta_0)f_2(x_i \mid \theta_0)\right) }{\partial \theta} \frac{\partial \ln\left(f_1(y_i \mid x_i, \theta_0)f_2(x_i \mid \theta_0)\right)}{\partial \theta'} \right] $$ One can further decompose the previous expression as: $$I_J = \mathbb{E}\left[\left(\frac{\partial \ln\left(f_1(y_i \mid x_i, \theta_0)\right)}{\partial \theta} +\frac{\partial \ln\left(f_2(x_i \mid \theta_0)\right)}{\partial \theta}\right) \left(\frac{\partial \ln\left(f_1(y_i \mid x_i, \theta_0)\right)}{\partial \theta'} +\frac{\partial \ln\left(f_2(x_i \mid \theta_0)\right)}{\partial \theta'}\right)\right] \\ = \mathbb{E}\left[\frac{\partial \ln\left(f_1(y_i \mid x_i, \theta_0)\right)}{\partial \theta} \frac{\partial \ln\left(f_1(y_i \mid x_i, \theta_0)\right)}{\partial \theta'} \right] + \mathbb{E}\left[\frac{\partial \ln\left(f_1(y_i \mid x_i, \theta_0)\right)}{\partial \theta} \frac{\partial \ln\left(f_2(x_i \mid \theta_0)\right)}{\partial \theta'}\right] \\ {} + \mathbb{E}\left[\frac{\partial \ln\left(f_2(x_i \mid \theta_0)\right)}{\partial \theta} \frac{\partial \ln\left(f_1(y_i \mid x_i, \theta_0)\right)}{\partial \theta'}\right] + \mathbb{E}\left[\frac{\partial \ln\left(f_2(x_i \mid \theta_0)\right)}{\partial \theta} \frac{\partial \ln\left(f_2(x_i \mid \theta_0)\right)}{\partial \theta'} \right] $$ I am told that the cross product expectations are equal to zero, that is, $$\mathbb{E}\left[\frac{\partial \ln\left(f_1(y_i \mid x_i, \theta_0)\right)}{\partial \theta} \frac{\partial \ln\left(f_2(x_i \mid \theta_0)\right)}{\partial \theta'}\right]= \mathbb{E}\left[\frac{\partial \ln\left(f_2(x_i \mid \theta_0)\right)}{\partial \theta} \frac{\partial \ln\left(f_1(y_i \mid x_i, \theta_0)\right)}{\partial \theta'}\right]= 0 $$ Why is this the case?
2026-03-25 01:34:03.1774402443
Why are cross expectations zero in MLE?
109 Views Asked by Bumbble Comm https://math.techqa.club/user/bumbble-comm/detail At
1
There are 1 best solutions below
Related Questions in STATISTICS
- Given is $2$ dimensional random variable $(X,Y)$ with table. Determine the correlation between $X$ and $Y$
- Statistics based on empirical distribution
- Given $U,V \sim R(0,1)$. Determine covariance between $X = UV$ and $V$
- Fisher information of sufficient statistic
- Solving Equation with Euler's Number
- derive the expectation of exponential function $e^{-\left\Vert \mathbf{x} - V\mathbf{x}+\mathbf{a}\right\Vert^2}$ or its upper bound
- Determine the marginal distributions of $(T_1, T_2)$
- KL divergence between two multivariate Bernoulli distribution
- Given random variables $(T_1,T_2)$. Show that $T_1$ and $T_2$ are independent and exponentially distributed if..
- Probability of tossing marbles,covariance
Related Questions in EXPECTATION
- Prove or disprove the following inequality
- Show that $\mathbb{E}[Xg(Y)|Y] = g(Y) \mathbb{E}[X|Y]$
- Need to find Conditions to get a (sub-)martingale
- Expected Value of drawing 10 tickets
- Martingale conditional expectation
- Variance of the integral of a stochastic process multiplied by a weighting function
- Sum of two martingales
- Discrete martingale stopping time
- Finding statistical data for repeated surveys in a population
- A universal bound on expectation $E[X^ke^{-X}]$
Related Questions in MAXIMUM-LIKELIHOOD
- What is the point of the maximum likelihood estimator?
- Finding a mixture of 1st and 0'th order Markov models that is closest to an empirical distribution
- How does maximum a posteriori estimation (MAP) differs from maximum likelihood estimation (MLE)
- MLE of a distribution with two parameters
- Maximum Likelihood Normal Random Variables with common variance but different means
- Possibility of estimating unknown number of items based on observations of repetitions?
- Defects of Least square regression in some textbooks
- What is the essence of Least Square Regression?
- Finding maximum likelihood estimator of two unknowns.
- Mean of experiment results is the maximum likelihood estimator only when the distribution of error is gaussian.
Trending Questions
- Induction on the number of equations
- How to convince a math teacher of this simple and obvious fact?
- Find $E[XY|Y+Z=1 ]$
- Refuting the Anti-Cantor Cranks
- What are imaginary numbers?
- Determine the adjoint of $\tilde Q(x)$ for $\tilde Q(x)u:=(Qu)(x)$ where $Q:U→L^2(Ω,ℝ^d$ is a Hilbert-Schmidt operator and $U$ is a Hilbert space
- Why does this innovative method of subtraction from a third grader always work?
- How do we know that the number $1$ is not equal to the number $-1$?
- What are the Implications of having VΩ as a model for a theory?
- Defining a Galois Field based on primitive element versus polynomial?
- Can't find the relationship between two columns of numbers. Please Help
- Is computer science a branch of mathematics?
- Is there a bijection of $\mathbb{R}^n$ with itself such that the forward map is connected but the inverse is not?
- Identification of a quadrilateral as a trapezoid, rectangle, or square
- Generator of inertia group in function field extension
Popular # Hahtags
second-order-logic
numerical-methods
puzzle
logic
probability
number-theory
winding-number
real-analysis
integration
calculus
complex-analysis
sequences-and-series
proof-writing
set-theory
functions
homotopy-theory
elementary-number-theory
ordinary-differential-equations
circles
derivatives
game-theory
definite-integrals
elementary-set-theory
limits
multivariable-calculus
geometry
algebraic-number-theory
proof-verification
partial-derivative
algebra-precalculus
Popular Questions
- What is the integral of 1/x?
- How many squares actually ARE in this picture? Is this a trick question with no right answer?
- Is a matrix multiplied with its transpose something special?
- What is the difference between independent and mutually exclusive events?
- Visually stunning math concepts which are easy to explain
- taylor series of $\ln(1+x)$?
- How to tell if a set of vectors spans a space?
- Calculus question taking derivative to find horizontal tangent line
- How to determine if a function is one-to-one?
- Determine if vectors are linearly independent
- What does it mean to have a determinant equal to zero?
- Is this Batman equation for real?
- How to find perpendicular vector to another vector?
- How to find mean and median from histogram
- How many sides does a circle have?
I'm going to try and clean this up later, but it should be correct. There are some regularity conditions on the score function of $f_1(y|x,\theta)$ that I need to show.
We want to show that : \begin{equation} \mathbb{E}\left[\frac{\partial \ln\left(f_1(y_i \mid x_i, \theta_0)\right)}{\partial \theta} \frac{\partial \ln\left(f_2(x_i \mid \theta_0)\right)}{\partial \theta'}\right]= \mathbb{E}\left[\frac{\partial \ln\left(f_2(x_i \mid \theta_0)\right)}{\partial \theta} \frac{\partial \ln\left(f_1(y_i \mid x_i, \theta_0)\right)}{\partial \theta'}\right]= 0 \end{equation}
The derivative of the cross terms is:
\begin{equation} \mathbb{E}\left[\frac{1}{f_1(y|x,\theta)f_2(x|\theta)}\frac{\partial}{\partial \theta}f_1(y|x,\theta)\frac{\partial}{\partial \theta'}f_2(x|\theta')\right] \hspace{2mm} \& \hspace{2mm}\mathbb{E}\left[\frac{1}{f_1(y|x,\theta)f_2(x|\theta)}\frac{\partial}{\partial \theta'}f_1(y|x,\theta')\frac{\partial}{\partial \theta}f_2(x|\theta)\right] \end{equation}
If you take the expectation of $h(x,y)$ you're evaluating $\int_{\mathcal{X,Y}}h(x,y)f_1(y|x,\theta)f_2(x|\theta)$. Canceling out the $f_2(x|\theta)$ in the expectation and the fractional part of the partial derivative:
\begin{equation} \int_{\mathcal{X,Y}}\frac{\frac{\partial}{\partial \theta}f_1(y|x,\theta)}{f_1(y|x,\theta)}\frac{\partial}{\partial \theta'}f_2(x|\theta')f_1(y|x,\theta) dxdy \hspace{2mm} \& \hspace{2mm} \frac{\frac{\partial}{\partial \theta'}f_1(y|x,\theta)}{f_1(y|x,\theta)}\frac{\partial}{\partial \theta}f_2(x|\theta)f_1(y|x,\theta) dxdy \end{equation} Taking first the integral with respect to $Y$:
\begin{equation} \int_{\mathcal{X}}\frac{\partial}{\partial \theta'}f_2(x|\theta)\int_{\mathcal{Y}}-V_{f_1}(\theta',X)f_1(y|x,\theta) dydx \hspace{2mm} \& \hspace{2mm} \int_{\mathcal{X}}\frac{\partial}{\partial \theta'} f_2(x|\theta)\int_{\mathcal{Y}}-V_{f_1}(\theta,X)f_1(y|x,\theta) dydx \end{equation}
we have that the $Y$ integral is the integral of the score function of $f_1$, which, under some regularity conditions is equal to 0.
This doesn't work on the cross terms because when you evaluate the derivatives instead of getting the score function you obtain the product of two score functions.
Edit: Regularity conditions for expected score to be 0.
I was trying to give more explicit conditions such that the third property was satisfied, but was having some trouble. It should be related to the existence of a unique MLE or something to that effect.