According to how the covariant derivative acts on smooth functions:
$ \nabla_V(T(w, Y)) = V^k \partial_k (T^i_j w_i Y^j) = V^k [ \partial_k T^i_j w_i Y^j + T^i_j \partial_k w_i Y^j + T^i_j w_i \partial_k Y^j ]. $
This is because we get the value of the fully evaluated tensor at a point by multiplying and summing the coefficient functions as above. I mean, the sum of products $T^i_j w_i Y^j$ gives us the smooth function. Now, the above formula holds because we are just multiplying smooth functions. Also the covariant derivative of a sum of two tensors is the sum of the covariant derivatives. (Functions are (0,0) tensors.)
Now according to Leibniz rule:
$\nabla_V (T(w,Y)) = (\nabla_V T)(w,Y) + T(\nabla_V w,Y) + T(w, \nabla_V Y)$,
Term by term:
$ (\nabla_V T)(w, Y) = [V^k \partial_k T^i_j + \Gamma^i_{mk} T^m_j V^k - \Gamma^n_{jk} T^i_n V^k] w_i Y^j, (1) $
$ T(\nabla_V w, Y) = T^i_j (V^k \partial_k w_i - \Gamma^l_{ik} w_l V^k) Y^j,(2) $
$ T(w, \nabla_V Y) = T^i_j w_i (V^k \partial_k Y^j + \Gamma^j_{lk} Y^l V^k).(3) $
Adding these together: $V^k [ \partial_k T^i_j w_i Y^j + T^i_j \partial_k w_i Y^j + T^i_j w_i \partial_k Y^j ]+ V^k([\Gamma^i_{mk} T^m_j - \Gamma^n_{jk} T^i_n ] w_i Y^j - T^i_j \Gamma^l_{ik} w_l Y^j+ T^i_j \Gamma^j_{lk} Y^l w_i )). $
The sum of the first terms on the right hand side of equations 1,2,3 is the same as the right hand side of the first equation, but in total the sum is not the same. What's wrong?
Nothing is wrong; you’re just confused by the tons of indices. You need to shuffle some of them around and relabel the dummy indices. Your “extra” term consists of 4 parts (I ungrouped your brackets): \begin{align} V^k\left(\Gamma^i_{mk} T^m_jw_iY^j - \Gamma^n_{jk} T^i_nw_iY^j - \Gamma^l_{ik} T^i_j w_l Y^j+ \Gamma^j_{lk} T^i_j Y^l w_i \right). \end{align} The first and third term cancel, and the second and fourth cancel. To see this,