In my Linear Algebra and Geometry textbook, it defines the image of a linear transformation $T$ as:
$$\operatorname{Im}\, (T) := \{\; w \in W : \; w=Tv \;\;\text{ for some } v \in V \} $$
As far as I can see, this is just the same as:
$$\operatorname{Im} \, (T) := \{ \;Tv \in W : \;v \in V\}$$
Is there any difference in these definitions?
If not, why is the first one used?
Each definition uses Set-Builder Notation: notation which allows us to describe any given set of elements in any number of ways, and/or from different perspectives or for different purposes. E.g.:
Let $E_2 = \{\;n \in \mathbb{Z} \mid n\equiv 0 \pmod{2}\};$
Let $E_3 = \{\;n \in \mathbb{Z} : 2\mid n\;\}$.
Each of $E_1$, $E_2$, and $E_3$ each define the same set of even integers. There is only one set being defined; which definition one chooses depends on context.
Back to your two definitions:
The first is often used to establish, e.g., surjectivity of a function $f: V\to W$. If $f$ is onto, then for every $w \in W$, there exists a $v\in V$ such that $f(v) = w.$ So it's not uncommon to define the image of a function as it is defined in the first case.