So, I know for a function $f :\mathbb{R}^m\to\mathbb{R}^n$, $f$ is differentiable on $U\subset\mathbb{R}^m$ iff for every point $a\in U$ there exists a linear transformation, call it $Df(a)$ such that $$ \lim_{\vec{h}\to 0}\frac{\|f(a + \vec{h}) - f(a) - Df(a)\vec{h}\|}{\|\vec{h}\|} = 0 $$ You can determine from this that if $f$ is differentiable then all the partial derivatives of $f$ exist on $U$ and $Df(a)$ is the Jacobian of $f$. We write $f\in C^1(U)$ to say it is once-differentiable on $U$.
So, what exactly does it mean to say $f\in C^k(U)$ for $U\subset\mathbb{R}^m$, and $k>1$? My only thought is that it requires all the partial derivatives of order $k$ to exist, but this is not enough information for the $C^1$ case since not only must the partials exist, but the above limit with $Df(a)$ equal to the Jacobian must also be zero (which are not necessarily always simultaneously true). My second thought was that, at least for the $k = 2$, the Hessian of $f$ must exist, and some limit involving the Hessian must go to zero, but I can't figure out what it is.
There really is, as far as I can tell, no reasonable idea of the "second-derivative" of a vector function exists. The "derivative" of a vector function, as I understand it, is $Df(a)$, which is a function that maps a vector $a$ to a matrix. So if we were to take the derivative of this, the "second" derivative, what kind of object would it be? Would it be a tensor?
Well, that's pretty simple if you take the abstract point of view: as you mention it, the differential of a map $f\colon U\subset \mathbf R^m\longrightarrow \mathbf R^n$ at a point $a\in\mathbf R^m$ is a linear map map $Df(a)\in \mathcal L(\mathbf R^m,\mathbf R^n)$ which is tangent to $f$ at $a$.
Now, if $f$ is differentiable at every point of $U$, you define a (first order) differential map \begin{align} Df\colon U\subset\mathbf R^m & \longrightarrow \mathcal L(\mathbf R^m,\mathbf R^n),\\ a & \longmapsto Df(a). \end{align} This differential map $Df$ may in turn be differentiable at a point $a\in U$. You then obtain a second order differential $D^2f(a)$, which is a linear map in $\;\mathcal L\bigl(\mathbf R^m,\mathcal L(\mathbf R^m,\mathbf R^n)\bigr)$, tangent to $Df$ at $a$.
As we have a canonical isomorphism $\;\mathcal L\bigl(\mathbf R^m,\mathcal L(\mathbf R^m,\mathbf R^n)\bigr)\simeq\mathcal L^2(\mathbf R^m,\mathbf R^n)$ (the set of bilinear maps from $\mathbf R^m$ to $\mathbf R^n$, we identify $D^2f(a)$ with the corresponding bilinear map, which is represented by the Jacobian matrix once a basis has been chosen.