Is it true, that if the integrand in the $$\int_{0}^{T}X\left(t\right)dY\left(t\right)$$ integral is deterministic (so it is the same “function” for each $\omega\in\Omega$) and continuous, then it doesn't matter what we choose as $s_{i}$ in the $$\sum_{i}X\left(s_{i}\right)\left(Y_{t_{i}}-Y_{t_{i-1}}\right),\;\;\;s_{i}\in\left[t_{i-1},t_{i}\right]$$ integral approximating sum: the sum will eventually converge to the same random variable $\int_{0}^{T}X\left(t\right)dY\left(t\right)$ in $L^{2}\left(\Omega\right)$? (I will refer this integral approximating sum as “the sum” below.)
I mean this is very surprising for me, because in the very famous example of $\int_{0}^{T}W\left(t\right)dW\left(t\right)$, where $W$ is a Wiener-process it does matter what $s_{i}$s are, i.e. the sum will converge to different random variables depending on what $s_{i}$s are. I thought the only reason of converging to different limits is the fact, that the $W$ integrator hasn't got finite variation. However, I still think this is the most importan reason, but considering the statement above this isn't the only one.
E.g.: I haven't thought so far that there is “reasonable” difference between a Wiener-process and a Weierstrass function choosing as integrand, in terms of the $\int_{0}^{T}X\left(t\right)W\left(t\right)$ integral. I've thought that in both cases the $\sum_{i}X\left(s_{i}\right)\left(W_{t_{i}}-W_{t_{i-1}}\right)$ will converge to different values depending on the $s_{i}$ points. According to the statement above, this is not true. Choosing $X$ as a Weierstrass function, it doesn't matter what $s_{i}$s are, the sum will converge to the same random variable. The only fair difference I see between the Weierstrass function or a Wiener process is that one of them is deterministic and the other one is a stochastic process.
So am I right to point out that in several cases there are more reasons that the sum will converge to different values as $s_{i}$s are different:
1, $Y$ hasn't got finite variation,
2, $X$ is a stochastic process (so it is $\omega$ dependent)?
If it is indeed true, then what is the “intuition” behind this? Why is it way different when the integrand is deterministic and not stochastic? Other words, why can I choose whatever $s_{i}$ in case of the (deterministic) Weierstrass function as an integrand and the sum will converge to the same variable, while on the other hand choosing $s_{i}$ is important when the integrand is a (stochastic) Wiener-process?
Yes. Following Revuz-Yor or Le-Gall section on Stochastic integrals, they build stochastic integrals using elementary processes $H(s)=\sum H_{i} 1_{(t_{i},t_{i+1}]}$ for $H_{i}\in \mathcal{F}_{t_{i}}$ to define for martingale $M$
$$\int H(s)dM_{s}:=\sum H_{i}(M_{t_{i+1}}-M_{t_{i}})$$
and then extend to more general L^2 processes (Theorem 5.4 in Le Gall). But if $H_{i}$ is deterministic, then the filtration constraint is not relevant i.e. we immediately get that this sum is a martingale and so the same proof goes through.
The key issue is the need for martingale and the various martingale convergence theorems. They use that to obtain the L^2-limit. If it is deterministic, there is no effect. If it is random, we have to be careful eg. for $s_{i}=\frac{t_{i}+t_{i+1}}{2}$, we get the Stratonovich integral (Definitions of the Stratonovich integral and why the "average" definition is arguably correct).
To be clear this is only for the L2 convergence. In the almost-sure convergence, it fails even for deterministic. For example, if we interpret $\int f(t)dB_{t}$ as a Riemann-Stieljes integral i.e.
$$\int f(t)dB_{t}\approx \sum f(s_{i})(B_{t_{i+1}}-B_{t_{i}}),$$
for $s_i\in [t_{i},t_{i+1}]$, then this is not allowed to converge due to the Banach-Steinhaus Theorem because it implies that $B$ has finite variation
If ,however, $f$ is of bounded variation, then we can define
$$\int^T f(t)dB_{t}:=f(T)B_{T}-f(0)B_{0}-\int^T B_{s}df_{s}.$$
For more posts see
What is the explicit obstruction to almost sure convergence in stochastic integrals?
and
Understand better stochastic integral through a.s. convergence
Also, see Rough paths for the more recent developments in extending stochastic integration to more degenerate fields. This topic explores the issue of the choice of partitions/divisions and how different choices/lifts give different types of stochastic integrals.