I'm having trouble understanding an example in Guillemin and Pollack. If $f:R^k\rightarrow R$ be defined by
$f(x) = |x|^2 = x_1^2+...+x_k^2$
The derivative $df_x$ at the point $a = (a_1,..,a_k)$ has matrix $(2a_1,..,2a_k)$. Thus
$df_a:R^k\rightarrow R$ is surjective unless $f(a)=0$, so every nonzero real number is a regular value of $f$. I can't understand how to check $df_a$ is surjective and why it is obvious that every nonzero real number is a regular value of $f$? A hint is appreciated. Thanks.
We know that the differential of $f(x^1,\dots,x^k) = (x^1)^2 + \cdots + (x^k)^2$ at $a=(a_1,\dots,a_k) \in \mathbb{R}^k$ is $df_a = \sum 2a_i \, \text{d}x^i $. It is clear that if $f(a)=a_1^2+\cdots+a_k^2 = 0$, then $a=0$ which implies $df_a = (0,\dots,0)$ is the zero map (obviously not surjective).
For $f(a)\neq 0$ happen exactly at $a\neq 0$, so at least there is one component of $a$ which is nonzero. W.l.o.g assume $a_1 \neq 0$. For any $\lambda \in \Bbb{R}$, $$ df_a\Big(\frac{\lambda}{2a_1} \frac{\partial}{\partial x^1}\Big) =\frac{\lambda}{2a_1}\, df_a\Big( \frac{\partial}{\partial x^1} \Big) = \frac{\lambda}{2a_1} \cdot 2a_1= \lambda. $$ So $df_a : T_a \mathbb{R}^k \to \mathbb{R}$ is onto.