What is the intuition behind gamma distribution?
For instance, I understand how to "construct" Gaussian distribution. This is my intuition:
- Bernoulli distribution - which is simple concept
- A sequence of Bernoulli trials is a Binomial distribution. I understand how binomial coefficient is constructed
- Using Stirling approximation we can deduce Gaussian distribution
Hence I understand that shape of a Gaussian distribution is determined by the binomial coefficients and so on.
How can Gamma distribution be derived step by step using relatively simple concepts?
The short answer is "prove the pdf is non-negative & integrates to 1", since that guarantees we're dealing with a probability distribution. But if you're looking for a motivation for why anything would be Gamma-distributed, here's the idea.
The $\alpha =1$ special case, with pdf $e^{-\beta x}$ for $x>0$, is the so-called exponential distribution, which concerns the lifetime of something that has the same risk of failure in a given time period regardless of its history (we call this memorylessness), e.g. a radioactive atom. (The mean lifetime is $\beta^{-1}$.) If $\alpha$ is a positive integer, you have the distribution obtained by summing $\alpha$ exponential iids. In general $\alpha$ can be any positive real, but that's just a continuous generalisation.