A connection can be thought of as a map $\Gamma(TM) \times \Gamma(TM) \to \Gamma(TM)$, so $\nabla_X Y$ is section of the tangent bundle. But it can also be thought of as a map $\Gamma(TM) \to \Gamma(T^*M \otimes TM)$, but why is that? I don't see how.
I thought maybe one keeps $X$ fixed in $\nabla_X Y$ but this doesn't give me a section of $T^*M \otimes TM$. Can someone explain it, please?
A section of $TM \otimes T^\ast M$ is just a $TM$-valued $1$-form. In other words, a section $\sigma \in \Gamma(TM \otimes T^\ast M)$ can be considered as a linear map $$\tilde{\sigma}: \Gamma(TM) \longrightarrow \Gamma(TM),$$ $$\tilde{\sigma}(p, v) = \sigma_p(v).$$
For a concrete example, consider the constant section $(1,2) \otimes dx \in \Gamma(T\Bbb R^2 \otimes T^\ast \Bbb R^2)$. Then the associated linear map $\Gamma(T\Bbb R^2) \longrightarrow \Gamma(T\Bbb R^2)$ in this case is $$(p, v) \mapsto ((1,2) \otimes dx)_p(v) = (1,2) \cdot dx_p(v) = (p,(v_1,2v_1)),$$ where $v = (v_1, v_2)$.
If you want to view a connection as a map $\nabla: \Gamma(TM) \longrightarrow \Gamma(TM \otimes T^\ast M)$ as opposed to a map $\nabla_X : \Gamma(TM) \longrightarrow \Gamma(TM)$ for some vector field $X$, then you should imagine it as the map $$Y \mapsto (X \mapsto \nabla_X Y).$$ In other words, you instead think of $X$ as the argument of the associated linear map $$\nabla_\bullet Y : \Gamma(TM) \longrightarrow \Gamma(TM).$$ For each choice of $X$ you then get a section $\nabla_X Y \in \Gamma(TM)$, so you should be able to see that $\nabla_\bullet Y$ is a section of $TM \otimes T^\ast M$.