The Van Eck sequence is defined here: https://oeis.org/A181391
By far the most frequent integer at the beginning of the sequence (up to first 10 million terms) is zero.
Intuitively, one would expect smaller integers to be more frequent than larger integers, and inspection of finite initial slices of the sequence shows a fairly smooth decline in frequency of integers as they get larger.
However, there appear to be a number of pronounced local minima and maxima in the frequency distribution before this pattern of smooth-ish decline begins. For example, in the first 5000 terms, there are 1703 zeros, 345 1s, 105 2s, and 72 3s. There is then a rise back up to 693 5s, then decline.
In the first million terms, there is a minimum at 3:
0 appears 136529 times.
1 appears 24000 times.
2 appears 7643 times.
3 appears 748 times.
4 appears 1644 times.
5 appears 17685 times.
6 appears 48609 times.
7 appears 43041 times.
8 appears 31182 times.
9 appears 19384 times.
10 appears 9447 times.
In the first 10 million terms, the minimum has shifted slightly to 4, but is even more pronounced:
0 appears 1250523 times.
1 appears 215715 times.
2 appears 69497 times.
3 appears 6102 times.
4 appears 2410 times.
5 appears 39936 times.
6 appears 260931 times.
7 appears 453416 times.
8 appears 359203 times.
9 appears 249831 times.
10 appears 142729 times.
(there appear to be several further maxima and minima)
Why does this happen? Is 4 rare in the (full) infinite sequence also?
A 0 in the sequence means that the previous term has not occurred previously.
...a0... => a has not previously occurred.
A 1 in the sequence means that the two prior terms are identical.
...xa..(a terms)..xaa1...
A 2 in the sequence...
...ypx..(a terms)..yaxa2...
A 3 in the sequence...
...xpqz..(a terms)..xayza3...
A 4 in the sequence...
...wpqrz..(a terms)..waxyza4...
And so on... Why should 4 be especially rare???
My best guess is that it wasn't fairly common as the length between zeroes. Consider the way the sequence is structured. It is a series of subsequences delimited by zeroes; where the first number is the length of the previous sequence, the last is a number that has not appeared yet, and the middle is a random series of numbers. After it's first instance, a number can reappear as the first number or somewhere in the middle. The average length of the sequence seems to increase from 3 early on and up, Some numbers appear often because they're close to the sequence length average. Some shorter numbers can appear as well when certain numbers appear multiple times in one subsequent. I'm guessing that 4 didn't appear very often as a sequence length and it also was too long to appear in the middle of sequences very often.