I am reading the book Mathematics for Machine learning and I am a bit confused with maximum likelihood estimation. I understand that the likelihood is the probability of get certain observations $x$, given a set of parameters $\theta$: $$p(x|\theta)$$ So, find a maximum value implies that you have chosen the set of parameters for which data is more likely to have been measured. But then book gives an example for linear regression where it assumes a gaussian likelihood function and it says that it is: $$p(y_n|\mathbf{x}_n, \mathbf{\theta}) = \mathcal{N}(y_n|\mathbf{x}_n^\intercal\mathbf{\theta}, \sigma^2)$$ What I dont understand is how data $\mathbf{x}_n$ now is in the other side of $|$. I think that $\mathbf{x}_n$ is part of the data observed, along with $y_n$, and $\mathbf{\theta}$ the parameters that should be on the right side of the conditional probability.
2026-03-11 14:36:04.1773239764
How data goes to the other side in maximum likelihood estimation
17 Views Asked by Bumbble Comm https://math.techqa.club/user/bumbble-comm/detail AtRelated Questions in PROBABILITY
- How to prove $\lim_{n \rightarrow\infty} e^{-n}\sum_{k=0}^{n}\frac{n^k}{k!} = \frac{1}{2}$?
- Is this a commonly known paradox?
- What's $P(A_1\cap A_2\cap A_3\cap A_4) $?
- Prove or disprove the following inequality
- Another application of the Central Limit Theorem
- Given is $2$ dimensional random variable $(X,Y)$ with table. Determine the correlation between $X$ and $Y$
- A random point $(a,b)$ is uniformly distributed in a unit square $K=[(u,v):0<u<1,0<v<1]$
- proving Kochen-Stone lemma...
- Solution Check. (Probability)
- Interpreting stationary distribution $P_{\infty}(X,V)$ of a random process
Related Questions in MACHINE-LEARNING
- KL divergence between two multivariate Bernoulli distribution
- Can someone explain the calculus within this gradient descent function?
- Gaussian Processes Regression with multiple input frequencies
- Kernel functions for vectors in discrete spaces
- Estimate $P(A_1|A_2 \cup A_3 \cup A_4...)$, given $P(A_i|A_j)$
- Relationship between Training Neural Networks and Calculus of Variations
- How does maximum a posteriori estimation (MAP) differs from maximum likelihood estimation (MLE)
- To find the new weights of an error function by minimizing it
- How to calculate Vapnik-Chervonenkis dimension?
- maximize a posteriori
Related Questions in BAYESIAN
- Obtain the conditional distributions from the full posterior distribution
- What it the posterior distribution $\mu| \sigma^2,x $
- Posterior: normal likelihood, uniform prior?
- If there are two siblings and you meet one of them and he is male, what is the probability that the other sibling is also male?
- Aggregating information and bayesian information
- Bayesian updating - likelihood
- Is my derivation for the maximum likelihood estimation for naive bayes correct?
- I don't understand where does the $\frac{k-1}{k}$ factor come from, in the probability mass function derived by Bayesian approach.
- How to interpret this bayesian inference formula
- How to prove inadmissibility of a decision rule?
Related Questions in BAYES-THEOREM
- Question to calculating probability
- Bayes' Theorem, what am I doing wrong?
- A question about defective DVD players and conditional probabaility.
- Is my derivation for the maximum likelihood estimation for naive bayes correct?
- 1 Biased Coin and 1 Fair Coin, probability of 3rd Head given first 2 tosses are head?
- Conditional Probability/Bayes Theory question
- Dependence of posterior probability on parameters
- Probability Question on Bayes' Theorem
- Coin probability
- What is the probability of an event to happen in future based on the past events?
Related Questions in MAXIMUM-LIKELIHOOD
- What is the point of the maximum likelihood estimator?
- Finding a mixture of 1st and 0'th order Markov models that is closest to an empirical distribution
- How does maximum a posteriori estimation (MAP) differs from maximum likelihood estimation (MLE)
- MLE of a distribution with two parameters
- Maximum Likelihood Normal Random Variables with common variance but different means
- Possibility of estimating unknown number of items based on observations of repetitions?
- Defects of Least square regression in some textbooks
- What is the essence of Least Square Regression?
- Finding maximum likelihood estimator of two unknowns.
- Mean of experiment results is the maximum likelihood estimator only when the distribution of error is gaussian.
Trending Questions
- Induction on the number of equations
- How to convince a math teacher of this simple and obvious fact?
- Find $E[XY|Y+Z=1 ]$
- Refuting the Anti-Cantor Cranks
- What are imaginary numbers?
- Determine the adjoint of $\tilde Q(x)$ for $\tilde Q(x)u:=(Qu)(x)$ where $Q:U→L^2(Ω,ℝ^d$ is a Hilbert-Schmidt operator and $U$ is a Hilbert space
- Why does this innovative method of subtraction from a third grader always work?
- How do we know that the number $1$ is not equal to the number $-1$?
- What are the Implications of having VΩ as a model for a theory?
- Defining a Galois Field based on primitive element versus polynomial?
- Can't find the relationship between two columns of numbers. Please Help
- Is computer science a branch of mathematics?
- Is there a bijection of $\mathbb{R}^n$ with itself such that the forward map is connected but the inverse is not?
- Identification of a quadrilateral as a trapezoid, rectangle, or square
- Generator of inertia group in function field extension
Popular # Hahtags
second-order-logic
numerical-methods
puzzle
logic
probability
number-theory
winding-number
real-analysis
integration
calculus
complex-analysis
sequences-and-series
proof-writing
set-theory
functions
homotopy-theory
elementary-number-theory
ordinary-differential-equations
circles
derivatives
game-theory
definite-integrals
elementary-set-theory
limits
multivariable-calculus
geometry
algebraic-number-theory
proof-verification
partial-derivative
algebra-precalculus
Popular Questions
- What is the integral of 1/x?
- How many squares actually ARE in this picture? Is this a trick question with no right answer?
- Is a matrix multiplied with its transpose something special?
- What is the difference between independent and mutually exclusive events?
- Visually stunning math concepts which are easy to explain
- taylor series of $\ln(1+x)$?
- How to tell if a set of vectors spans a space?
- Calculus question taking derivative to find horizontal tangent line
- How to determine if a function is one-to-one?
- Determine if vectors are linearly independent
- What does it mean to have a determinant equal to zero?
- Is this Batman equation for real?
- How to find perpendicular vector to another vector?
- How to find mean and median from histogram
- How many sides does a circle have?