(the old post grew too long so I made a separate question)
With exponential probability, $\Pr(X>s+t | X>s) = \Pr(X>t)$
where X is a waiting time for some event.
Now you estimate $p_1 = \Pr(X>s)$, at time $t_0$ When you reach time s and the event not happened yet, so you update your estimate $p_1 = \Pr(X>s)$ since $X$ is memoryless.
Now your past self could have foreseen the reasoning of the future self's reasoning.
If I don't see $X$ happening for $s$ period, then I'd have to wait for $X$ for another s period with equal probablity $p_1$. $p_1 = \Pr(X>s+s|X>s)$
Oh, then, I realize I can apply this reasoning more times. $\Pr(X<s+t|X>s) = \Pr(X<t) $ which can be derived from $1-\Pr(X>s+t|X>s) = 1- \Pr(X>t)$
Since $X$ happening $0<X<\mathrm{d}t, \mathrm{d}t<X<2\mathrm{d}t, 2\mathrm{d}t<X<3\mathrm{d}t$ is disjoint, I can add the probabilities to get the probability of union $0<X<3\mathrm{d}t$
$p_2 = \Pr(X<\mathrm{d}t)+\Pr(X<2\mathrm{d}t|X>\mathrm{d}t)+\Pr(X<3\mathrm{d}t|X>2\mathrm{d}t) = 3\cdot \Pr(X<\mathrm{d}t) = \Pr(X<3\mathrm{d}t)=\Pr(X<s)$ where $3\mathrm{d}t=s$.
Since pdf is thicker in front, I guess $\Pr(X>s) =1-p_2 < p_1$
(This can't be.. The reasoning must be flawed somewhere...)
Where you have taken some time, you have fewer time left, but $\displaystyle\int_0^{\inf} $ assumes the other.
And it is stating your world time is freshly reborn with the same probabilitic structure as far as the event is concerned.. (ok this doesn't sound mathematical expression..)
I am trying to understand. $Pr(a<X<2a)$ from current time is different from $Pr(0<X<a)$ from time a, although X is refering to the same time frame. So difference seems to spring from where you are standing, not what you are looking..
I can compute future self's $Pr(X<a)$ as current self $Pr(a<X<2a|X>a)$ because it's memoryless.
But somehow asserting $Pr(X<a) = Pr(a<X<2a)$ (this would mean, Pr(a< X<2a) doesn't depend on where X happend in < a, because it's memoryless) is false and not what memoryless means.
$Pr(a<X<2a)$ must mean your weight of belief X happening for the first time in between a<X<2a.
So, we are describing memoryless property with non-memoryless thinking (pdf).
The way we express the idea is subtle. so confusing..
I must feel I can construct some contradiction out of it.
Although I can see the practicality of reasoning memorylessly, and it could also be a non contradictory assumption, I guess I feel at least there must be an alternative perspective on the assumption or interpretation on what assumptions we are making, I want to know if there's other perspective indeed, or my thinking can be proven falsy.
There are some concepts being blurred together here.
Let's start with this point about estimation. Without getting into Bayesian thinking, there is no way to use the information that a single sample from an exponential distribution exceeds $t$ in order to get some kind of useful estimate for the rate or mean. Either way the probability that $X>t$ increases as the rate of events occurring decreases, so your best estimate for the rate is just zero, regardless of how long you have been waiting. (With Bayesian thinking you can start from a guess that the rate is something and gradually adjust your guess as you wait.)
Second, you have this observation that $P(X<t)=P(X<(k+1)t \mid X>kt)$ for all $k$. This is true. The place you have an error is $3P(X<dt)=P(X<3dt)$. Everything you wrote in that line before that is true, but that is not; in fact $3P(X<dt)$ needn't even be less than $1$. You can't convert from a sum of probabilities of disjoint events to a probability of the union in this situation because the conditions aren't the same between the different events.
The last thing is about what's going on with the conditioning. What is going on intuitively is that if you have already been waiting for an amount of time $t$ (mathematically represented by the condition $X>t$ in a conditional probability), you have the same prediction for how long you'll continue to wait. Mathematically this new prediction is $P(X>t+s \mid X>t)$ and the old prediction that it matches up with is $P(X>s)$. However: