For a given $n \times n$-matrix $A$, and $J\subseteq\{1,...,n\}$ let us denote by $A[J]$ its principal minor formed by the columns and rows with indices from $J$.
If the characteristic polynomial of $A$ is $x^n+a_{n-1}x^{n-1}+\cdots+a_1x+a_0$, then why $$a_k=(-1)^{n-k}\sum_{|J|=n-k}A[J],$$ that is, why is each coefficient the sum of the appropriately sized principal minors of $A$?
One way to see it: $A:V\to V$ induces the (again linear) maps $\wedge^k A:\wedge^k V\to \wedge^k V$. Your formula (restated in an invariant way, i.e. independently of basis) says that $$\det(xI-A)=x^n-x^{n-1}\operatorname{Tr}(A)+ x^{n-2}\operatorname{Tr}(\wedge^2 A)-\cdots(*)$$ We can conjugate $A$ so that it becomes upper-triangular with diagonal elements $\lambda_i$ ($\lambda_i$'s are the roots of the char. polynomial). Now for upper triangular matrices the formula $(*)$ says that $$(x-\lambda_1)\cdots(x-\lambda_n)=x^n-x^{n-1}(\sum\lambda_i)+x^{n-2}(\sum\lambda_i\lambda_j)-\cdots$$ which is certainly true, hence $(*)$ is true.