Deriving the Bayes Optimal Classifier (Mitchell, Machine Learning)


I am trying to recreate the Bayes optimal classifier result given in the Machine Learning textbook by Mitchell. Below, I've included the desired result from the text and my own work.

I think I've taken the right approach but the final equality has a difference in the conditional. Is my approach incorrect or is there an intuitive rationalization for why the conditionals are really the same?

DESIRED RESULT

$$P(v_j|D) = \sum_{h_i \in H} P(v_j|h_i)P(h_i|D)$$

MY DERIVATION

[Derivation image: the same expansion, but ending with $P(v_j|h_i, D)$ in the sum rather than $P(v_j|h_i)$]

BEST ANSWER

We have

$$P(v_j|D) = \sum_{h_i}P(v_j|h_i,D)P(h_i|D)$$ by the law of total probability.

I think the key to understanding the equality is that $P(v_j|h_i, D)=P(v_j|h_i)$: given the hypothesis $h_i$, the probability that $v$ takes the value $v_j$ is fully determined, so conditioning on $D$ adds nothing further. In other words, $v_j$ is conditionally independent of $D$ given $h_i$, because the training data influences the prediction only through the posterior over hypotheses.
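The identity can be checked numerically. Below is a minimal sketch using the classic three-hypothesis illustration from Mitchell's chapter (the posterior values 0.4, 0.3, 0.3 are the book's illustrative numbers; the dictionary names are mine): each hypothesis deterministically predicts a label, so $P(v_j|h_i)$ is 0 or 1, and $P(v_j|D)$ is just the total posterior mass of the hypotheses predicting $v_j$.

```python
# P(h_i | D): posteriors over three hypotheses (illustrative numbers)
posteriors = {"h1": 0.4, "h2": 0.3, "h3": 0.3}

# P(v_j | h_i): h1 predicts "+", h2 and h3 predict "-" (deterministic)
likelihoods = {
    "h1": {"+": 1.0, "-": 0.0},
    "h2": {"+": 0.0, "-": 1.0},
    "h3": {"+": 0.0, "-": 1.0},
}

def p_v_given_D(v):
    """P(v | D) = sum_i P(v | h_i) P(h_i | D), by total probability."""
    return sum(likelihoods[h][v] * posteriors[h] for h in posteriors)

probs = {v: p_v_given_D(v) for v in ("+", "-")}
best = max(probs, key=probs.get)
print(probs)  # {'+': 0.4, '-': 0.6}
print(best)   # '-'
```

Note that the Bayes optimal classification ("-", with probability 0.6) disagrees with the MAP hypothesis h1 (which predicts "+"): the sum over all hypotheses, weighted by their posteriors, is exactly what the formula above prescribes.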