How does the following definition of isomorphism between vector spaces imply "structure-preserving bijection"?

I am really confused by the concept of isomorphism; it seems to be a new name for something that is already well understood. Every time I look up the definition of isomorphism, it changes. In other fields the maps are called homomorphisms, which is easy to confuse with homeomorphisms (let's not get into that). The definitions are of course equivalent, but the argument is quite subtle.

According to the following reliable source (Naylor and Sell):

Given linear spaces $X, Y$ over $\mathbb{F}$, $T: X \to Y$ is an isomorphism iff (i) $T$ is $1$-$1$, (ii) $T$ is onto, (iii) $T$ is linear, (iv) $T^{-1}$ is linear.

Ok, nice and simple! Computationally tractable. Something you would use on your exams to check if $T$ is an isomorphism.

But when you look at the other discussions on Math.SE, there seems to be a multi-panel debate as to what an isomorphism even is.

For example: What's the difference between a bijection and an isomorphism?

First answer: "As another example, if the sets are vector spaces, then an isomorphism is a bijection that preserves vector addition and scalar multiplication."

"The answer is "vice versa." An isomorphism is a structure-preserving bijection. The specific meaning of "structure" will vary, depending on the context."

Where are people getting their definition from? At no point in the definition given above was anything said about "preservation of vector addition and scalar multiplication". Why is that so?


What does it mean for $T$ to be linear?

It means precisely that, given $u$ and $v$ in $X$ and $\alpha \in \Bbb F$,

  1. $T(u + v) = T(u) + T(v)$ (i.e., $T$ preserves vector addition).
  2. $T(\alpha u) = \alpha T(u)$ (i.e., $T$ preserves scalar multiplication).

So the definition you listed does state that $T$ must preserve vector addition and scalar multiplication, but it does so concisely by saying "$T$ is linear".
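
As a side remark, the two conditions are often folded into a single one, which can be quicker to check on an exam. The scalar $\beta \in \Bbb F$ is introduced here only for illustration; it is not part of the quoted definition. Applying condition 1 and then condition 2 gives, for all $u, v \in X$ and $\alpha, \beta \in \Bbb F$,

$$T(\alpha u + \beta v) = T(\alpha u) + T(\beta v) = \alpha T(u) + \beta T(v).$$

Conversely, taking $\alpha = \beta = 1$ in this identity recovers condition 1, and taking $\beta = 0$ recovers condition 2, so nothing is lost by checking the combined form.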


A vector space is a commutative additive group $V$ together with an action of a field $k$ on $V$: for each scalar $\lambda\in k$ there is a map $\mu_\lambda : V\to V$ that sends $v\mapsto \lambda v$. Since $V$ is also a commutative group under addition, we have a binary operation $\sigma: V\times V\to V$ that sends a pair $(v,w)\mapsto v+w$. A linear transformation from a space $V$ to a space $W$ is then a map $f$ that commutes with this collection of "structure" functions on $V$ and $W$. We want that, for each $\lambda\in k$ and all $v,w\in V$, $$f(\lambda v)=f(\mu_\lambda(v))=\mu'_\lambda(f(v))=\lambda f(v)$$ and $$f(v+w)=f(\sigma(v,w))=\sigma'(f(v),f(w))=f(v)+f(w),$$


where the $\mu_\lambda$ are the scalar multiplications in $V$ and $\sigma$ is the sum in $V$, and $\mu'_\lambda,\sigma'$ are the corresponding operations in $W$. It is in this sense that we say $f$ preserves the structure of vector spaces.

One usually has a collection of objects, in your case vector spaces, and a collection of arrows from and to objects. An isomorphism is defined to be an arrow $f$ that admits an inverse that is also an arrow. In your case, one can check that a linear transformation admitting an inverse is the same as the linear transformation itself being a bijection as a map of sets, and hence the result. In general, there are collections of objects, such as topological spaces, where there are arrows with no inverse arrow but such that, as a function, the arrow is bijective. A classical example is the map that wraps the half-open interval $[0,1)$ onto the unit circle $S^1$, by sending $t\mapsto e^{2i t\pi}$. This is continuous, injective and onto, but its inverse is not continuous, and hence is not a map in the category (I could only avoid the word for so long) of topological spaces.
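
To see concretely why the inverse fails to be continuous, here is a short check. The names $f$ and $t_n$ are introduced only for this illustration. Write $f(t)=e^{2it\pi}$ and take $t_n = 1-\tfrac{1}{n}$ in $[0,1)$. Then

$$f(t_n)=e^{2i\pi(1-1/n)}=e^{-2i\pi/n}\longrightarrow 1=f(0)\quad\text{in } S^1,$$

$$\text{while}\quad f^{-1}(f(t_n))=t_n\not\longrightarrow 0=f^{-1}(1)\quad\text{in } [0,1).$$

So $f^{-1}$ takes a sequence converging to $1\in S^1$ to a sequence that stays away from $f^{-1}(1)=0$, which a continuous map cannot do.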


The proper and universal definition of an isomorphism (in any given category) is that it is a morphism between objects that admits an inverse morphism. (If such an inverse exists it will always be unique.) The category will tell you what the (homo)morphisms are; in the category of $F$-vector spaces they are $F$-linear maps. That morphisms preserve structure is just an informal statement, which really means that we call "structure" whatever is preserved by morphisms. For instance, in vector spaces a linear dependency between some vectors implies the same relation between their images under a linear map, so such relations are part of the structure in vector spaces.
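
Spelled out, with symbols chosen here only for illustration ($T:V\to W$ an $F$-linear map, $a_1,\dots,a_n\in F$ and $v_1,\dots,v_n\in V$): if $a_1v_1+\cdots+a_nv_n=0$, then

$$a_1T(v_1)+\cdots+a_nT(v_n)=T(a_1v_1+\cdots+a_nv_n)=T(0)=0.$$

The converse can fail when $T$ is not injective (the zero map sends every family of vectors to a dependent one). For an isomorphism, applying the same computation to the inverse map shows that the relations among the $v_i$ and among the $T(v_i)$ are exactly the same.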

Now in many categories morphisms are maps, and maps having inverses must be bijections. And often the conditions of being a morphism and being bijective together imply that the inverse map is also a morphism; this is true in the category of $F$-vector spaces (which is a nice exercise to prove), and in many other algebraic categories (but not for instance in the category of topological spaces where morphisms are continuous maps). When it holds one will often find instead of the above "high definition" of isomorphism the "low definition" of an isomorphism as a bijective morphism. It can be of some advantage in practice to not have to check explicitly that the inverse map of a morphism is a morphism, but still I feel that this should be formulated as a proposition, not integrated into the definition of isomorphism. I suppose it is done this way because students have a strong tendency to use the definition instead of trying to apply a proposition.
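
For reference, one way the exercise for vector spaces can go; this is only a sketch, and the names are assumptions made for the illustration. Let $T:V\to W$ be a bijective $F$-linear map, let $w,w'\in W$ and $\alpha\in F$, and put $v=T^{-1}(w)$, $v'=T^{-1}(w')$, so that $T(v)=w$ and $T(v')=w'$. Then

$$T^{-1}(w+w')=T^{-1}\bigl(T(v)+T(v')\bigr)=T^{-1}\bigl(T(v+v')\bigr)=v+v'=T^{-1}(w)+T^{-1}(w'),$$

$$T^{-1}(\alpha w)=T^{-1}\bigl(\alpha T(v)\bigr)=T^{-1}\bigl(T(\alpha v)\bigr)=\alpha v=\alpha T^{-1}(w).$$

The second equality in each line uses the linearity of $T$, and the third uses $T^{-1}\circ T=\mathrm{id}_V$. No analogous argument is available for continuous maps, which is why the topological case behaves differently.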

Your book more or less follows the high definition, but instead of just asking for the existence of an inverse linear map (which would imply (i) and (ii), and also obviously has to coincide with $T^{-1}$), it prefers to spell out conditions that imply the existence of an inverse map $T^{-1}$ (this is the only reason that (i) and (ii) are needed) and then requires that inverse map to be linear (iv).