Collatz stopping times

2.4k Views Asked by At

The maximum stopping times for the Collatz $3x+1$ function have been computed up to about $x = 10^{18}$, given at $3x+1$ delay records. Plotting those results gives this:

enter image description here

A more interesting presentation is given on a semi-log plot:

enter image description here

It can be seen that the stopping time tends to a line as $x$ increases, and is confined to relatively narrow bounds. If anything, it seems to be tending to slightly narrower bounds as $x$ increases. Using a best fit of the upper part of the curve gives this equation:

$S=147.8\log_{10}x-366.9$

where $S$ is the maximum stopping time.

My question is, is there a (perhaps statistical) argument for why the maximum stopping time would tend to follow this line?

A further question is what is the likelihood that the relation could hold at higher powers of 10. For example, $x_0=10^{200}$ gives a predicted maximum stopping time for $x<x_0$ of about 29,194.

1

There are 1 best solutions below

7
On

I can't really explain it exactly but I can give a heuristic to see that this is basically the simplest scaling that makes any sense.

Suppose $x$ is a number whose Collatz iteration reaches the familiar cycle $4 \to 2 \to 1 \to 4 \to \dots$ Then there is a number $S_x$, the number of iterates before reaching $1$, and a number $p_x$, which is the fraction of its iterates that are even (again counting only until the first time that the sequence reaches 1). For example, $S_5=5,p_5=4/5$. An even Collatz iterate gives a division by 2 and an odd Collatz iterate roughly gives a multiplication by 3, so the Collatz sequence should be just a little bit bigger than $x_n=x 2^{-np_x} 3^{n(1-p_x)}=x 3^n 6^{-np_x}=x (3 \cdot 6^{-p_x})^n$ (bigger because I've dropped the $+1$s). If $p_x>\log(3)/\log(6)$ which is between $0.61$ and $0.62$ then this goes exponentially to zero.

The statement $S_x \sim k \log(x)$ for a constant $k$ amounts to saying that $p_x$ is asymptotically independent of $x$. If this holds then $S$ has the same asymptotic. Of course $p_x$ is actually a rather erratic function of $x$ in reality, but this is the main idea. Most likely this heuristic as stated is false but one could hope that something similar (for example, a relation involving aggregate statistics like your $S$) might hold.