The lower-division linear algebra course at my university taught the computational steps for finding the trace of a matrix, but neither the intuition behind it nor its purpose. What information is gained from summing the diagonal entries?
Why is the trace of a matrix important?
The trace is useful for several reasons. Besides being an invariant like the determinant, it allows us to generalize several interesting operations to more general settings.
For instance, the inner products on $\ell_2$ and $L_2$ extend to the matrix case as
$$\langle X,Y\rangle_{\ell_2}=\mathrm{trace}\left(\sum_{i=0}^\infty X_i^*Y_i\right)\quad \mathrm{and}\quad \langle X,Y\rangle_{L_2}=\mathrm{trace}\left(\int_0^\infty X(s)^*Y(s)\,\mathrm{d}s\right),$$
and this propagates all the results and concepts to the matrix case with no effort whatsoever. This also has applications in the analysis of matrix-valued dynamical systems.
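To make the trace-based inner product concrete, here is a minimal numerical sketch of the $\ell_2$ formula, assuming the infinite sum is truncated to finitely many terms. The function name `trace_inner` and the array layout `(N, m, n)` (a sequence of `N` matrices of size `m x n`) are my own illustrative choices, not anything standard. It checks that the trace formula agrees with the ordinary inner product of the flattened entries.

```python
import numpy as np


def trace_inner(X, Y):
    """Finite-sum sketch of <X, Y> = trace(sum_i X_i^* Y_i).

    X and Y have shape (N, m, n): sequences of N matrices of size m x n.
    (Hypothetical helper; the truncation to N terms is an assumption.)
    """
    X = np.asarray(X)
    Y = np.asarray(Y)
    # G[n, k] = sum over i and m of conj(X[i, m, n]) * Y[i, m, k],
    # i.e. G = sum_i X_i^* Y_i, an n x n matrix.
    G = np.einsum("imn,imk->nk", X.conj(), Y)
    return np.trace(G)


# Example data: two random sequences of complex 3 x 2 matrices.
rng = np.random.default_rng(0)
X = rng.standard_normal((5, 3, 2)) + 1j * rng.standard_normal((5, 3, 2))
Y = rng.standard_normal((5, 3, 2)) + 1j * rng.standard_normal((5, 3, 2))

# The trace formula coincides with the entrywise inner product
# sum conj(x) * y taken over all entries of all matrices.
print(np.isclose(trace_inner(X, Y), np.vdot(X, Y)))
```

In this truncated setting, `trace_inner(X, X)` is the sum of the squared Frobenius norms of the `X_i`, which is why the trace gives a genuine inner product on matrix-valued sequences.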