I was asking in the mathematics chatroom about the difference between $\equiv$ and $=$, and I fell into an interesting vein of discussion that I'd like to ask about. Modular arithmetic is apparently not defined as an operation, but a "similarity" (for lack of a better term), which is why $\equiv$ and not $=$ is used. However, I was also told that you can define an operation:
like $a\mod m$ is the smallest positive number equivalent to $a\mod m$, - Akiva Weinberger
but then you'd write it like $a~{\rm mod}~m=b~{\rm mod}~m$ instead of $a\equiv b\mod m$. - Akiva Weinberger
I was also told that
The more useful point of view is that $a$ and $b$ fall into the same "category" (more formally, equivalence class), than that they have the same image under that operation. - Akiva Weinberger
but why is it more useful?
My one thought here is that it might not be useful because when if say that $a\mod m = b \mod m$, that doesn't necessarily mean that $a=b$; however, with other operations, such as addition, if you have $a+m=b+m$ you know that $a=b$ (I don't know the name of this "property", sorry). So maybe an operation must fulfill certain "properties" and modular arithmetic doesn't fulfill this one? I'm not sure though.
The "more useful" claim comes because
It follows a pattern that we see often in other parts of math and
It saves you the trouble of having to pick out a "special" element of each equivalence class to use as the "name" of that class.
In your example, for $m = 3$, the class $\{0, \pm 3, \pm 6, \pm 9, \ldots \}$ is called $0$, and similarly for the things named $1$ and $2$, and that works out fine. In this case, the element "0" is the "special" one in its equivalence class.
But if you look at something like $S^3$, the set of pairs of complex numbers $(z, w)$ with $|z|^2 + |w|^2 = 1$, there's a lovely equivalence relation on these: we say that $(z, w) \sim (z', w')$ whenever $zw' = wz'$. It turns out that geometrically $S^3$ consists of all points in 4-dimensional space whose distance from the origin is 1, and each equivalence class turns out to be a circle within this "3-sphere". So we can write the 3-sphere as a union of these circles. But for each circle, there's no 'preferred' choice of $(z, w)$ to represent that circle, so a naming scheme like the one you've used isn't really available. But the decomposition is still very useful: it's called the Hopf Fibration, and is one of the fundamental examples in homotopy theory.
In computer science, it's very common to want to pick out special elements so as to deal with individual things rather than sets, so "mod" (as an operation on ints) is very often defined in computer languages in exactly the way you suggest (although what $n \bmod k$ means when $n$ or $k$ is negative turns out to be profoundly undecided among computer scientists -- almost every imaginable definition has been used, and no two seem to agree).
Let me make one more argument in favor of the "equivalence class" definition. If you think of $\Bbb Z$ and $\Bbb Z/ n \Bbb Z$, with the latter being "sets of equivalence classes, then there's a very natural function from the first to the second: $$ p: \Bbb Z \to \Bbb Z/ n \Bbb Z: k \mapsto \text{the class containing $k$} $$
By contrast, writing down the "rule" to make $k$ to its modular remainder is a (small) pain. The formula I've given for "projection to the quotient" is the same for any equivalence relation, and is so simple that we hardly need to remember it.
When it comes time to define operations on the quotient (like "modular addition"), the strucutre of the proof that the operation makes sense is (in the equivalence class case) always the same: you say
"Suppose that $a \sim b$ and $a' \sim b'$; we will show that $$ p(a+a') = p(b + b') $$ thus showing that addition on the original set ($\Bbb Z$) 'passes to the quotient.' Since $a \sim b$, there's an integer $u$ with $a = b + un$; since $a' \sim b'$, there's an integer $v$ with $a' = b' + vn$. But then $a+a' = (b + b') + (u+v)n$, showing that $a+a' \sim b + b'$."
For some equivalence relations, the details of that middle paragraph are far more complicated, but the structure remains constant, which can be a big help in letting you know what you must prove.