Imagine a number base where instead of writing the number of products for each power of your base (e.g. in decimal, $152 = 1 \times 10^2 + 5 \times 10^1 + 2 \times 10^0$), you had one where you write the exponents for the prime factorization for that number (in little endian). So $2$ would be $1$ (because it's the first prime), $5$ would be $100$ (because its the third prime), $15$ would be $110$ because its $3 \times 5$, and so on. Somewhat bizarrely, $1$ would be $0$ because it can be written as $2^0$ but there is no way to write zero.
Since every number (except $0$) can be uniquely factorized, then we can write any positive integer in this notation. If we include $-1$ as one of the primes, we can include negative numbers. If we allow the exponents to be negative, then we can write any rational number, $\frac{p}{q} = p(q^{-1})$. If we allow the exponents to be fractions, then we can write any surd, as well as complex numbers (having included $-1$, we can take its square root).
What I'm wondering is whether there is anything cool or surprising about writing numbers in this way. I've tried thinking of some interesting features and here's a list of pros and cons so far:
Cons:
- No way to represent zero.
- Can't do addition (probably for same reason as above because $1$ is the identity so it's a multiplicative group)
- Inefficient to take a number from decimal to this base (because factoring is hard)
- There's probably no sensible way to count. The additive order (1,2,3,4,5,6,7,8) gives a sequence 0,1,10,2,100,11,3 for which I'm struggling to find a pattern (it's tempting to translate into binary or something except the exponents grow arbitrarily large). Finding a natural way to multiplicatively order them is weird too. You start at $0$, then $1$, then you have two choices: $2$ or $10$. At each point, you have to decide whether to increment an exponent or include a new prime. This means you've got two decisions to make at each new point, each of which continues infinitely. Way out of my depth but perhaps this gives some intuition for the uncountability of irrationals?
Pros:
- Multiplication is much easier (just addition of exponents)
- Exponentiation is much easier (multiplication of exponents)
- Easy to convert back to an additive number base (just iteratively multiply your primes)
- It grows very fast: $111 = 30, 1111 = 210, 11111 = 2310$ (and that's with limiting exponents to at most $1$). I feel like this might be a pro because it gives a compact way of writing some very large numbers (in particular, n-smooth numbers from cryptography) and because it would quickly overwhelm someone who was reading these numbers in decimal. Perhaps there are some cryptographic applications here? Also, maybe some cool stuff happens when we reduce these numbers $\pmod y$, where $y$ is something fancy?
So anyways, thanks for reading this far. I'm putting this on here because I'm curious to hear what more knowledgeable people think about writing numbers in this way. Has this been done before? Are there any surprising patterns here? Is this just an inconvenient way of writing numbers which we already know plenty about or might it offer a different perspective? Are there any applications which could benefit from greatly speeding up multiplication at the expense of forgoing addition?
My final pitch: imagine a bunch of aliens landed on Earth. Wouldn't it be easier to describe this system to them (since primes are presumably universally studied) than decimal numbers which are based off the number of fingers we happen to have on our hands? So isn't this a more natural number base? Let me know!
Saw this recently and was curious about how these numbers grow, even with exponents reduced $\pmod{2}$.
Turns out plotting the sequence $\Pi_2 = \{1, 2, 3, 6, 5, 10, ...\}$ against the binary index $n_2 = \{0, 1, 10, 11, 100, 101, ...\}$ shows that although it doesn't grow monotonically, it has quite a simple self-similar structure. See above
When plotting up to larger values
and with different bases
, the pattern becomes clearer: in base $d$, we have $d$ peaks, each recursively made of up $d$ peaks.
Not only does it look cool but peaks made out of peaks makes sense. Imagine inducting on the exponents base $2$. Then if the first $n$ strings are $0x_1, 0x_2, ..., 0x_n$, the next $n$ strings will simply be $1x_1, 1x_2, ..., 1x_n$. This will look identical to the plot of the first $n$ numbers, except shifted up by some additive constant.
So in base $d$ we get $d$ peaks because we have $0x_1, 0x_2, ... 0x_n$, then $1x_1, 1x_2, ..., 1x_n$, all the way up to $(d-1)x_1, (d-1)x_2, ..., (d-1)x_n$. Once again, each level is just shifted up by $p, p^2, ..., p^{d-1}$ for some prime $p$. So the self-similarity of the graphs just reflect the self-similarity of counting.
Presumably there could be a more formal description of what's going on here. For anyone interested, a useful starting point might be that reducing the exponents mod $d$ (call it $\Pi_d$) gives a subgroup of $(\mathbb{Z}, \times)$. Moreover, $\Pi_{d_1}$ is a subgroup of $\Pi_{d_2}$, if $d_1 \leq d_2$.