I'm mostly self taught in undergraduate physics and maths, so I haven't had much of an education in this stuff. Basically, I know how the theory of general relativity is formulated in terms of tensors and tensor equations. You equate components of the curvature tensor with components of the energy-momentum tensor, as in $R_{\mu \nu} -\frac{1}{2}Rg_{\mu \nu} = \frac{8\pi G}{c^4}T_{\mu \nu}$. But I wasn't really satisfied with this, because of how much it seemed to depend on the components of the tensor. Hence I started to research other definitions of tensors (after all, vectors can be defined completely independently of components).
What I found was that tensors are defined as multilinear maps: \begin{align} T: V^*\times \cdots \times V^*\times V\times \cdots \times V\rightarrow \mathbb{R}.\end{align}
Now I understand all of the terms in this definition (e.g. dual spaces, direct products), but I don't know how this relates to the definition I've seen before in physics, as objects that are invariant under coordinate transformations: \begin{align}(T')^{m_1 \cdots} _{n_1 \cdots} = \frac{\partial (x')^{m_1}}{\partial x^{p_1}}\cdots \frac{\partial x^{q_1}}{\partial (x')^{n_1}}\cdots T^{p_1 \cdots}_{q_1 \cdots}.\end{align} So my question is this: how do these definitions relate to each other? They seem completely different, and I'm not even sure how to think of a "physics tensor" as a multilinear map to $\mathbb{R}$. Also, if you're knowledgeable in physics, is there a form of the Einstein Field Equations that is completely independent of coordinates?
To simplify notation, let me restrict the discussion to $(1,1)$-tensors. Unfortunately, one uses the word "tensor" in mathematics in (at least) two different contexts:
How is this related to the transformation rule you wrote?
In physics, one usually doesn't start with the mathematical description of tensors and tensor fields I just gave you and instead they usually take the transformation rule as the "definition" of a tensor. More formally, physicists usually think of tensors as a rule which assigns for each coordinate system functions $\mathcal{T}^i_j$ such that for different coordinate systems, the components of the functions are related by the transformation rule above. This avoids all sorts of mathematical discussions about multi-linearity, tensor products, tensor bundles but can obscure what a "is really" a tensor.