Definition: what is the difference between generating a random variable versus generating a random number?

Question

Definition: what is the difference between generating a random variable versus generating a random number?

892 Views Asked by Bumbble Comm At 27 Mar 2026 - 5:59

Can someone clarify to me the difference between:

generating a random variable (according to a certain distribution) versus
generating a random number (according to a certain distribution)?

I am thoroughly confused by this concept, because my goal is simply to generate a set of data points according to a certain distribution, and every single reference out there is about generating random variables according to a certain distribution.

Mathematically speaking, A random variable is a function. A random number is a scalar. Totally different thing.

For example, this pdf (http://opim.wharton.upenn.edu/~sok/papers/s/rv.pdf) is titled "Generating a random variable" and starts an example with "the most widely used method of generating pseudo-random numbers are the congruential generator". But the notation for generating the pseudo-random number follows the convention of random variables.

In this reference, it talks about generating random variables with the rejection-sampling method. http://www.columbia.edu/~ks20/4703-Sigman/4703-07-Notes-ARM.pdf But I thought this method was used for generating random numbers (which is the end-goal for everyone)?

What is the distinction between these two concepts?

Original Q&A

There are 2 best solutions below

**Bumbble Comm** · Answer 1 · 2019-01-30 01:15:23

A random variable $f$ is a typically defined as a real valued measurable function. There are various characterisations, a typical one is that the set $f^{-1}((-\infty,\alpha])$ is a member of some specified $\sigma$-algebra. There is not necessarily a single distribution associated with the function. The term random is a misnomer, there is nothing random in the colloquial sense.

The constant function $f(\omega) = 1$ is a perfectly well behaved random variable with nothing random about it whatsoever.

A random variable is not generated as such, it is defined.

A random number generator is understood loosely as some process that produces a sequence of numbers whose statistics approach some specified ideal.

Aside:

Here is a nice collection of random variables (functions) stolen from Kac's wonderful monograph "Statistical Independence in Probability, Analysis and Number Theory" for your amusement.

Let $r(x) = (-1)^{\lfloor x \rfloor}$, and let $r_n(x) = r (2^n x)$ for $x \in [0,1]$ and $n \in \mathbb{N}$.

The $r_n$ are known as the the Rademacher functions. If you plot a few you will see that they are square waves of different frequencies.

Clearly there is nothing random about the $r_n$, however we can prove many results that have a probabilistic flavour, such as $\lim_{n \to \infty} {r_1(x)+\cdots + r_n(x) \over n} = 0$ for almost all $x$, or the stronger (& more difficult to prove) $\limsup_{n \to \infty} {|r_1(x)+\cdots + r_n(x) \over \sqrt{n \log(\log n)}}| = \sqrt{2}$ for almost all $x$ (cf. law of the iterated logarithm).

**Bumbble Comm** · Answer 2 · 2019-02-01 21:06:52

I think a large part of the problem here is that there is not a general agreement on a convenient and accurate language to describe the use of pseudorandom numbers in computing. Instead, people tend to borrow words and notations from mathematical probability and abuse them mightily.

We have functions called “random number generators” (RNGs), but all they do is generate pseudorandom numbers with uniform distribution over a large but finite set of rational values. For various purposes we would like to generate random numbers according to a chosen distribution, but that’s an inconvenient phrase to have to repeat. We could abbreviate it to just “generate random numbers”, but then someone might think we mean an RNG with a uniform distribution. So sometimes someone will say “generate a random variable” because it is concise and get the point across (in context) even though it is not really correct.

There is also a long history of the use of pseudorandom numbers going back to the early years of computing, much of it done without mathematical rigor and often not well at all. It may be that some of the language we use nowadays for this purpose originated from practitioners who did not really understand how a random variable is defined mathematically, and the language “stuck” even among those who should know better due to the force of long usage.

Definition: what is the difference between generating a random variable versus generating a random number?

There are 2 best solutions below

Related Questions in PROBABILITY

Related Questions in STATISTICS

Related Questions in DEFINITION

Related Questions in STATISTICAL-INFERENCE

Related Questions in MATHEMATICAL-MODELING

Trending Questions

Popular # Hahtags

Popular Questions