What is the differential probability notation used in this paper by Einstein?

Question

What is the differential probability notation used in this paper by Einstein?

1.6k Views Asked by Bumbble Comm At 28 Apr 2026 - 8:29

Earlier today, I saw a discussion on Einstein's 1917 paper On The Quantum Theory of Radiation. Section 2 of the paper introduces three equations that I'm neither sure how they are derived nor what they're supposed to represent.

For instance, section 2 defines the following equation (the other two are similar, but introduce $\rho$, a radiation density):

$$dW = A_m^ndt$$

$A_m^n$ is a constant, and $dt$ is a "very small" unit of time. On the other hand, $dW$ is said to be a probability that emission of radiation occurs during $dt$.

My first confusion is that the probability is represented using a differential $dW$. Earlier in the paper the same variable $W$ is also used to represent a density function. However, setting aside that confusion for a moment, since $A_m^n$ is a constant, and $t$ is presumably unbounded:

$$\int_{-\infty}^{\infty}A_m^ndt = \infty$$

The above should equal 1 for $dW$ to represent a distribution.

This makes me think I'm either missing some implicit assumptions, and/or Einstein is not calculating a traditional integral using Riemann sums.

According to this question, taking the differential of a probability density function occurs in stochastic calculus. Considering we are dealing with molecules in a gas, this makes sense.

So is the above equation an example of describing a stochastic process? If so, is there a less-terse way in writing the equation that would also show more clearly that this is not an integral calculated using Riemann sums? If so, I do not have the background to understand this paper.

Original Q&A

There are 1 best solutions below

**Bumbble Comm** · Accepted Answer

Radiation from a material (neglecting the overall decrease in radioactive atoms) is a stochastic process in which in a given time interval every atom has a certain chance of decaying. If the time interval $\Delta t$ is short enough then there is a very small probability of having 1 decay and an extremely small of having two or more decays. Einsteins $dW$ denotes this probability of having 1 decay (and thus neglects the possibility of two or more). From temporal invariance it must be proportional to the length of the time interval: $dW = a \; dt$. This only makes sense when $dW$ is small compared to 1.

Another way to think of it is as the average number of decays in the interval $dt$. Chopping a (not so small) time interval $[0,T]$ into $n$ tiny intervals where Einsteins argument works allow you to show that e.g. $X$, the total number of decays is a random variable following a Poisson distribution: If $\Delta t=T/n$ is small enough so that we can neglect multiple decays in each interval, then $p=a \Delta t=aT/n$ is the (approx) probability of one decay in each interval, and these decays may be assumed independent. Having $k$ decays in total is (approx) a binomial distribution (let $n\rightarrow \infty$ for fixed $k$): $$ P(X=k)=\left( \begin{matrix} n\\k \end{matrix} \right) \left(\frac{aT}{n}\right)^k \left(1- \frac{aT}{n}\right)^{n-k} \rightarrow \frac{1}{k!} (aT)^k e^{-aT}$$

You may also derive an ODE for the probability of having no decays up to a certain time. Your integral (over some fixed 'long' period of time) represents the average total number of decays in that period. It may not be interpreted as a probability anymore.

What is the differential probability notation used in this paper by Einstein?

There are 1 best solutions below

Related Questions in PROBABILITY-DISTRIBUTIONS

Related Questions in NOTATION

Trending Questions

Popular # Hahtags

Popular Questions