Consider the following ODE: $$ \frac{d^2}{dx^2}f(x)+\frac{3}{2x}f(x)+x^{7/2}\sin f(x) =0\,. $$ This seems to have a non-continuous dependence on its initial conditions. For example, choosing \begin{align} f(0.1) &= \lambda \,, \\ f'(0.1) &= \mu \quad \,, \end{align} and letting $\lambda \le 2.21918, \mu = 1$, produces the following solution
and letting $\lambda \ge 2.21919, \mu = 1$, completely changes the form of the solution.
The critical point (for example, $\lambda = 2.21918$) depends on the choice of $\mu$ (in this example, $\mu = 1$).
I would like to know what is responsible for this behaviour, and in generic situations, how we can predict whether a given ODE has smooth dependence on its initial conditions or not.


The continuous dependence of the solution on the initial value is proven using the
For this problem use $u=(y,y')$, $\|u(x)\|=\max(|y(x)|,|y'(x)|)$ so that then on $x>0$ $$ \|\Delta u'(x)\|\le\max(|Δy'(x)|,\frac3{2x}|Δy(x)|+x^{7/2}|Δy(x)|)\le\max(1,\frac3{2x}+x^{7/2})\,\|Δu(x)\|. $$
So essentially, there are 2 segments for the Lipschitz constant,
As one can see, on the second segment the factor relating the error at $x_0$ to the error at $x$ deteriorates rapidly, for instance to get $\|Δu(5)\|<1$ one would need at $x_0=0.1$ $$ \|Δu(0.1)\|\le \exp\left(-\frac29(5^{9/2}-1.1^{9/2}\right)\left(\frac{0.1}{1.1}\right)^{3/2}\le 5.12205\cdot 10^{-137} $$ which is impossible to realize in standard floating point values. Even allowing that this estimate is somewhat pessimistic, the practical result will be a jump in behavior for minimal changes in the initial conditions.
To make an observable prediction, let the initial condition have a variance in the machine epsilon range, $\|Δu(x_0)\|\sim 10^{-15}$ and calculate the interval where the solutions are visually close, that is $\|Δu(x)\|<10^{-2}$. There is no problem on the first segment, so on the second segment we have to solve $$ \exp\left(\frac29(x^{9/2}-1.1^{9/2}\right)\left(\frac{1.1}{0.1}\right)^{3/2} \le 10^{13}$$ leading to $x\le2.9$. This seems consistent with the plots in the question as the graphs start to be different at about $x=3.5$.
As one can see, the actual behavior is several orders of magnitude better than predicted, the non-linearity appears to also work towards stabilizing the solution behavior towards shrinking oscillations around multiples of $2\pi$, so that one does not see the worst case estimate in every interval of initial values.