Some mathematical truth statements/questions can not be answered with certainty. This could be because
- The computation required to answer them is finite but requires a universe-sized computer to work on it for longer than the age of the universe.
- The question has an answer, but the computation required to answer them is infinite, so that they can never be answered. (I'd like to know an example of this).
- Godel's theorem??? (despite reading a lot on this, I still don't fully understand what it means).
But my question is: Are there examples of problems that are not solvable in an exact way, but where we nevertheless have some exact algorithm that gives us a meaningful probability distribution over answers to the problem?
For example, we could have a function $f:X \to \{\text{yes,no} \}$, and a function $g:X\to [0,1]$, where we know that $f$ is fully defined for every $x\in X$, but we cannot compute it (or it takes a long time to compute so that we can only compute it for say 100 values in a year with a super computer), and where if we calculate $g(x)$ for various $x$, it will on average make meaningful probability statements about what the value of $f(x)$ is. It could be that while $f$ is computationally intractable, $g$ is very efficient.
Is there an example vaguely like this? (it doesn't have to fit my exact formulation).
To be clear, I'm not asking for "approximations" to $f$, but specifically for probability distributions over answers of $f$. I am also not asking for an approximate probability distribution to another probability distribution. I'm asking for a probability distribution over mathematical truth statements that while they may not be computable have an actual true answer.
Edit: The Prime Number Theorem is a very good example. I am also wondering if there are any good examples of theorems that are unproven, but where there is nevertheless some sort of probability estimate possible regarding whether it's true or not. (I've read something regarding the Goldbach conjecture here, but the argument seems obviously wrong).
One very general answer to this is the "logical induction" algorithm described in this paper. Also, the introduction to this paper has some discussion of what a "meaningful" function $g$ might look like.
I should say ahead of time that, like many results in the vein of "this function is computable", the algorithm in question is not going to be remotely efficient: it will take a very long time (an impractical length of time) for it to make nontrivial probability estimates.
In any case, the idea is that the logical inductor will produce consistent probability estimates for all mathematical statements you give it; the more time you give the logical inductor to work, the more reasonable the probability estimates. (By contrast, there is no similar algorithm that produces truth values for all mathematical statements.) Essentially, the longer the logical inductor runs, the more facts it learns, and it uses those facts to guess at truth values it hasn't learned.
(It is easy to come up with an algorithmic prover that, inefficiently, finds more and more proofs of mathematical statements over time. Just iterate over all finite strings, and for each string, check if it's a proof of something. Such a prover feeds into the logical inductor, and gives it a basis for making its guesses. Of course, there are unprovable statements that the prover will never say anything about, and we still want the logical inductor to assign them probabilities.)
You should imagine the algorithm working as follows. Suppose that the statement we're interested in is "The $3^{\text{rd}}$ digit of $\pi$ after the decimal is $1$" encoded as a statement in Peano arithmetic or something. Over time, the logical inductor's beliefs might change as follows:
(These are only intuitions, since of course I don't have a practical implementation to test this on; it might turn out that there are ways to bypass some of these stages.)
The same would happen (over a different scale of time) to something like the $1000000000^{\text{th}}$ digit of $\pi$, or to other statements we don't know the truth value of.