In machine learning I often see expressions of the form
$$P(x_1, x_2, \ldots, x_n) = \prod_{i=1}^{n} P(x_i | x_{i-1}, x_{i-2}, ..., x_2, x_1)$$
(e.g. here, page 2) when modeling sequences (where $P$ is a probability-valued function, i.e. a function that maps "things" to probabilities). I don't understand what these expressions mean (anyone care to explain?), and it makes me wonder whether the following is true:
$$ P(x_n|x_{n-1}) \cdot \ldots \cdot P(x_3 | x_2) \cdot P(x_2 | x_1) = P(x_n | x_{n-1}, \ldots, x_2, x_1) $$
and why (not)? And, if not, under what conditions is it true?
The first expression is just the recursive application of the definition of conditional probability.
Let $x, y, z$ be events.
By definition, the conditional probability $p[x | y]$ (which people read as "$x$ given $y$", but which you can think of as "the probability of $x$ in the $y$-world", or "$x$ under the $y$-lens") is:
$$p[x|y] := {p[x \cap y] \over p[y]}.$$
Now:
$$p[x \cap y] = p[x|y] \cdot p[y]$$ $$p[x \cap y] = p[y|x] \cdot p[x]$$
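Both factorizations can be checked numerically (a sketch using numpy, with a hypothetical discrete joint distribution given as a 3×4 probability table — the variable names are mine):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical joint distribution over two discrete variables:
# joint[i, j] = p[x = i ∩ y = j].
joint = rng.random((3, 4))
joint /= joint.sum()

p_x = joint.sum(axis=1)                # marginal p[x]
p_y = joint.sum(axis=0)                # marginal p[y]

p_x_given_y = joint / p_y              # p[x|y] = p[x ∩ y] / p[y] (columnwise)
p_y_given_x = joint / p_x[:, None]     # p[y|x] = p[x ∩ y] / p[x] (rowwise)

# Both factorizations recover the joint:
assert np.allclose(p_x_given_y * p_y, joint)
assert np.allclose(p_y_given_x * p_x[:, None], joint)
```

The assertions hold by construction: each conditional is the joint divided by the corresponding marginal, so multiplying the marginal back in returns the joint.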
Therefore:
$$p[x \cap y \cap z] = p[x \cap (y \cap z)] = p[x|y \cap z] \cdot p[y \cap z] = p[x|y \cap z] \cdot p[y|z] \cdot p[z]$$ $$p[x \cap y \cap z] = p[y \cap (z \cap x)] = p[y|z \cap x] \cdot p[z \cap x] = p[y|z \cap x] \cdot p[z|x] \cdot p[x]$$ $$p[x \cap y \cap z] = p[z \cap (x \cap y)] = p[z|x \cap y] \cdot p[x \cap y] = p[z|x \cap y] \cdot p[x|y] \cdot p[y].$$
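As for the equality you ask about: it fails in general. A numerical sanity check (a sketch using numpy with a small, randomly generated joint distribution over three binary variables $x_1, x_2, x_3$ — all names here are mine) shows that the chain rule holds exactly, while the product $p[x_3|x_2] \cdot p[x_2|x_1]$ does not recover $p[x_3|x_1 \cap x_2]$ for a generic joint:

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical joint distribution over three binary variables:
# joint[i, j, k] = p(x1 = i ∩ x2 = j ∩ x3 = k).
joint = rng.random((2, 2, 2))
joint /= joint.sum()

p_x1 = joint.sum(axis=(1, 2))                  # p(x1)
p_x2 = joint.sum(axis=(0, 2))                  # p(x2)
p_x1x2 = joint.sum(axis=2)                     # p(x1 ∩ x2)
p_x2x3 = joint.sum(axis=0)                     # p(x2 ∩ x3)

p_x2_given_x1 = p_x1x2 / p_x1[:, None]         # p(x2|x1)
p_x3_given_x2 = p_x2x3 / p_x2[:, None]         # p(x3|x2)
p_x3_given_x1x2 = joint / p_x1x2[:, :, None]   # p(x3|x1 ∩ x2)

# Chain rule: p(x1 ∩ x2 ∩ x3) = p(x3|x1 ∩ x2) · p(x2|x1) · p(x1) — exact.
chain = p_x3_given_x1x2 * p_x2_given_x1[:, :, None] * p_x1[:, None, None]
assert np.allclose(chain, joint)

# The question's product p(x3|x2) · p(x2|x1), as a function of (x1, x2, x3):
product = p_x3_given_x2[None, :, :] * p_x2_given_x1[:, :, None]
# For a generic joint it does NOT equal p(x3|x1 ∩ x2):
print(np.allclose(product, p_x3_given_x1x2))   # → False for this joint
```

Even in the special case of a Markov chain, where $P(x_i | x_{i-1}, \ldots, x_1) = P(x_i | x_{i-1})$, the product of adjacent conditionals $P(x_n|x_{n-1}) \cdots P(x_2|x_1)$ equals $P(x_n, \ldots, x_2 | x_1)$ (by the chain rule above), not the single conditional $P(x_n | x_{n-1}, \ldots, x_1)$ on the right-hand side of your equation.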