I am reading this paper on probabilistic algorithms for matrix decomposition. I don't understand Section 1.2. Given a matrix $A \in \mathbb{R}^{m \times n}$, we want an "approximate basis for the range of $A$", $Q \in \mathbb{R}^{r \times n}$ such that:
$$ Q \text{ has orthonormal columns and } A \approx QQ^* A $$
I understand that the range of $A$ is the column space of $A$, but I don't understand how finding a matrix $Q$ s.t. $A \approx QQ^* A$ means that $Q$ is a good approximation of that column space.
I was confused because I wondered why we cared that $Q$ was invertible because
$$ A \approx QQ^*A \implies QQ^* \approx I $$
But I don't think that's the right intuition. For me, the key was to write the expression like this:
$$ A \approx Q(Q^*A) $$
What we want is a matrix $Q$ with as few orthogonal columns as possible such that, if we project $A$ into this $k$-dimensional subspace and then reconstruct $A$, our reconstruction of $A$ is within some error tolerance.