Can someone explain this answer to me? I would understand how to find the maximum likelihood if it were only one random variable and a variance but what happens here? I understand I have to equal it to zero to find the peak but whats up with the min and max? Why take the arg of it? How did they get to equation 20 and why?
2026-03-31 21:54:45.1774994085
How to find the maximum likelihood estimator when you have two random variable with two variances but one mean?
533 Views Asked by Bumbble Comm https://math.techqa.club/user/bumbble-comm/detail At
1
There are 1 best solutions below
Related Questions in STATISTICS
- Given is $2$ dimensional random variable $(X,Y)$ with table. Determine the correlation between $X$ and $Y$
- Statistics based on empirical distribution
- Given $U,V \sim R(0,1)$. Determine covariance between $X = UV$ and $V$
- Fisher information of sufficient statistic
- Solving Equation with Euler's Number
- derive the expectation of exponential function $e^{-\left\Vert \mathbf{x} - V\mathbf{x}+\mathbf{a}\right\Vert^2}$ or its upper bound
- Determine the marginal distributions of $(T_1, T_2)$
- KL divergence between two multivariate Bernoulli distribution
- Given random variables $(T_1,T_2)$. Show that $T_1$ and $T_2$ are independent and exponentially distributed if..
- Probability of tossing marbles,covariance
Related Questions in NORMAL-DISTRIBUTION
- Expectation involving bivariate standard normal distribution
- How to get a joint distribution from two conditional distributions?
- Identity related to Brownian motion
- What's the distribution of a noncentral chi squared variable plus a constant?
- Show joint cdf is continuous
- Gamma distribution to normal approximation
- How to derive $E(XX^T)$?
- $\{ X_{i} \}_{i=1}^{n} \thicksim iid N(\theta, 1)$. What is distribution of $X_{2} - X_{1}$?
- Lindeberg condition fails, but a CLT still applies
- Estimating a normal distribution
Related Questions in MAXIMUM-LIKELIHOOD
- What is the point of the maximum likelihood estimator?
- Finding a mixture of 1st and 0'th order Markov models that is closest to an empirical distribution
- How does maximum a posteriori estimation (MAP) differs from maximum likelihood estimation (MLE)
- MLE of a distribution with two parameters
- Maximum Likelihood Normal Random Variables with common variance but different means
- Possibility of estimating unknown number of items based on observations of repetitions?
- Defects of Least square regression in some textbooks
- What is the essence of Least Square Regression?
- Finding maximum likelihood estimator of two unknowns.
- Mean of experiment results is the maximum likelihood estimator only when the distribution of error is gaussian.
Trending Questions
- Induction on the number of equations
- How to convince a math teacher of this simple and obvious fact?
- Find $E[XY|Y+Z=1 ]$
- Refuting the Anti-Cantor Cranks
- What are imaginary numbers?
- Determine the adjoint of $\tilde Q(x)$ for $\tilde Q(x)u:=(Qu)(x)$ where $Q:U→L^2(Ω,ℝ^d$ is a Hilbert-Schmidt operator and $U$ is a Hilbert space
- Why does this innovative method of subtraction from a third grader always work?
- How do we know that the number $1$ is not equal to the number $-1$?
- What are the Implications of having VΩ as a model for a theory?
- Defining a Galois Field based on primitive element versus polynomial?
- Can't find the relationship between two columns of numbers. Please Help
- Is computer science a branch of mathematics?
- Is there a bijection of $\mathbb{R}^n$ with itself such that the forward map is connected but the inverse is not?
- Identification of a quadrilateral as a trapezoid, rectangle, or square
- Generator of inertia group in function field extension
Popular # Hahtags
second-order-logic
numerical-methods
puzzle
logic
probability
number-theory
winding-number
real-analysis
integration
calculus
complex-analysis
sequences-and-series
proof-writing
set-theory
functions
homotopy-theory
elementary-number-theory
ordinary-differential-equations
circles
derivatives
game-theory
definite-integrals
elementary-set-theory
limits
multivariable-calculus
geometry
algebraic-number-theory
proof-verification
partial-derivative
algebra-precalculus
Popular Questions
- What is the integral of 1/x?
- How many squares actually ARE in this picture? Is this a trick question with no right answer?
- Is a matrix multiplied with its transpose something special?
- What is the difference between independent and mutually exclusive events?
- Visually stunning math concepts which are easy to explain
- taylor series of $\ln(1+x)$?
- How to tell if a set of vectors spans a space?
- Calculus question taking derivative to find horizontal tangent line
- How to determine if a function is one-to-one?
- Determine if vectors are linearly independent
- What does it mean to have a determinant equal to zero?
- Is this Batman equation for real?
- How to find perpendicular vector to another vector?
- How to find mean and median from histogram
- How many sides does a circle have?

Because there are two independent random variables, their joint PDF is the product of the individual PDFs, which is where (19) comes from. Since the $\mu$ are the same, each of the individual PDFs in the joint have the same $\mu$.
Since the sample consists of only one point from the random vector, i.e. one sample from each of the two independent random variables, the likelihood function is the same as the joint PDF.
Taking the arg max of the likelihood function is the definition of a maximum likelihood estimate. You are finding which value of the parameter would maximize the likelihood.
For equation (20), they are finding which value of $\mu$ maximizes the likelihood function. They ignore the constants multiplying the exponentials, since they don't have $\mu$ in them (which is why I called them constants, we are maximizing over $\mu$).
In their next step, they do two things:
Because the exponential function is monotonic, you can maximize/minimize it by maximizing/minimizing its argument, which is why they drop the exponential function.
Also, maximizing a quantity is the same as minimizing the negative of that quantity, so they take the negative and switch from arg max to arg min.
That's how they get (20).
From that point, as you have said, they find the min by setting the derivative equal to zero.