Here’s a transcript of the original exercise. (There‘s even a hint given by the authors in the textbook as you can see. But precisely this hint confuses me).
2.2.4 Consider the binary until operator $U$. In a model $\mathfrak{M} = (W,R,V)$ its truth definition reads:
$$ \begin{align} &\mathfrak{M},t \Vdash U(\phi, \psi) &\text{iff} & &\text{there is a $v$ such that $Rtv$ and $v \Vdash \phi$, and} \\ & & & &\text{for all $u$ such that $Rtu$ and $Ruv$: $u \Vdash \psi$.} \end{align} $$
Prove that $U$ is not definable in the basic modal language. Hint: think about the following two models, but with arrows added to make sure the relations are transitive:
I’m unsure how to prove this. I would probably attempt something like this:
In order to show that the operator $U$ is not definable you’d need to show that there’s a bisimulation between these two models (which is easy) and then you’d need to argue that only in one of them $U(\phi, \psi)$ holds (Which I don’t see. Doesn’t it hold in both models?). Then it follows that ‘$U$’ is not definable, because bisimilar models are invariant i.e. they make exactly the same transitions possible and couldn’t be distinguished by someone who operates on them.
So my main issue is seeing how these models are bisimilar yet only in one of them $U$ is the case.

Welcome to mse!
You have exactly the right idea, there's a bisimulation between these two models where we relate
It seems like you're comfortable with this, so I won't belabor the point.
Now, I claim that $s_0 \Vdash U(q,p)$ while $s' \not \Vdash U(q,p)$. As you've pointed out, this means that $U(q,p)$ is not definable in the basic modal language.
Now that you know what the correct worlds/formulas are, you might want to try to check this yourself. If you get stuck, I'll leave a solution below the fold:
I hope this helps ^_^