One of the generic ways to obtain chaos in phase space is when the system causes trajectories to stretch and fold.
I understand that the stretching will cause neighboring initial conditions to diverge, which is one of the conditions of chaos, but why is there need for folding?
I’ve also been told that folding into a finite domain is necessary for chaos. Why? I honestly cannot picture this in my head. Why can’t we fold and stretch into an expanding domain and still have chaos?
Stretching alone isn't enough to call a system "chaotic", as you note. Well, neither is folding. It's possible to have a system that does a very regular kind of folding; think of squaring in the complex plane. One of the most studied examples of a chaotic map is $z \mapsto z^2 + c$; if $c=0$ we don't get chaos because it's just folding. But you do see points getting 'mixed up' in a way, since you can find two points in the preimage for most points in the range. You might say that stretching makes points that start close get more distant, and folding makes points that are distant get closer. It's the interaction between the two that makes things fun.