So I've been trying to make sense of the clustering algorithm on page 6 of this paper.
Are the "first" k eigenvalues they refer to the smallest eigenvalues?
What are the $y_i$ exactly? I don't see the motivation for using them.
If anyone could link other literature or explain how/why the first eigenvectors are being used, that would be great.
Yes the eigenvalues are the first $k$ smallest eigenvalues, starting from the eigenvalue zero. The vectors $y_i$ are the rows of the matrix $U$ where the columns of $U$ are the eigenvectors $u_1, u_2, \ldots$ of the Laplacian.
For the symmetric Laplacian $L$, $y_i$ and $u_i$ are the same. For the non-symmetric Laplacian $L_{rw}$ people often talk about left or right eigenvectors. The author probably tried to avoid defining left eigenvectors and used $y_i$ instead.
As far as I know, the reason why the eigenvectors of the Laplacian are so good at revealing clustering and partitioning properties of complex systems is not very well understood. The intuition behind using the first eigenvectors is best understood using an analogy from physics. The eigenvectors of the Laplacian can be though of as the vibration modes of a drum skin. The eigenvector $u_1$ of the to the lowest eigenvalue corresponds to the ground state. The first eigenvector $u_1$ corresponds to the first harmonic, dividing the drum skin into half and so on.