Suppose we have some finite set $S$. Suppose we generate an infinitely long sequence $s_n$ from the elements of this set. Is there guaranteed to exist a ''phrase'' from the set $S$ that repeats an arbitrary (but finitely many) times?
More explicitly: is $s_n$ guaranteed to contain a subsequence of length $mk$ where some phrase of length $m$ occurs $k$ times in a row, for arbitrary but finite $m$ and $k$?
For example if $S = \{0,1\}$, for $m=2$, $k=3$, can we be sure that any sequence generated contains at least one of:
- (0,1) repeated: $0, 1, 0, 1, 0, 1$
- (1,0) repeated: $1, 0, 1, 0, 1, 0$
- (0,0) repeated: $0, 0, 0, 0, 0, 0$
- (1,1) repeated: $1, 1, 1, 1, 1, 1$
It seems to be correct intuitively but can't find an obvious proof (or disproof).
No. Consider the Thue-Morse sequence, whose $n^{th}$ term is $0$ if $n-1$ has an even number of ones in its binary representation, and is $1$ if there are an odd number of ones. This starts like $$ 0,1,1,0,1,0,0,\boxed{1},1,0,0,1,0,1,1,0,\dots $$ For example, the eighth term (boxed above) is a one because $8-1=7$ in binary is $111$, which has an odd number of ones.
It can be shown that the Thue-Morse sequence has no pattern which repeats itself three times consecutively. Therefore, this is a counter-example for any $k\ge 3$.