I have questions on understanding this article about Dirichlet process. If you look at the beginning of section 2.1, it shows three equations 2.1, 2.2, 2.3. The question is I don't understand what exactly those probabilities represents and why we need them. And one thing that the article confuses me is that they removed the subscripts in equations 2.1 and 2.2. They said that $L_i=k$ implies that $X_i \in k$. So if $L_1 = 1$, then $X_1$ belongs to $1$ cluster. How can I interpret these two equations? Also, can anyone suggest articles about Bayesian nonparametrics, Dirichlet process and Indian buffet process?? I am trying to understand the unsupervised clustering method using these processes. Thank you!
2026-03-29 16:24:36.1774801476
Question on understanding Dirichlet process
140 Views Asked by Bumbble Comm https://math.techqa.club/user/bumbble-comm/detail At
1
By this model, the data are assumed to be generated in the following way.
These are our assumptions, that is, somebody had distributions $\mathbb{P}\{L=k\}$ and $\mathbb{P}[X \in \cdot \mid L=k]$, and followed this procedure to create a dataset.
The usual task is to go backwards: to take a given dataset, and [assuming it was created in this manner], figure out what these two distributions are.