Basic probability: Is event space in probability poorly defined?

313 Views Asked by At

I am in a probability class and we are just getting into random variables.

A random variable to me is a function that maps event to some real number. But the concept of event is difficult to swallow.

For example, let f(x) be a function that maps the set of real numbers x to set of real numbers y, this is the notion of a function or mapping that I am most familiar with.

Let instead f be X, and x $\in$ {head, tail}, then X(head) = 0, and X(tail) = 1 is one of the mapping. But head and tail themselves are ill defined, how can we ever operate on strings? I would be more comfortable if they are numbers instead, since we cannot add heads, divide heads, multiply heads or subtract heads, but we can do it with actual numbers. So something is off here, either X is not a function as I would like to imagine it to be, or the event space itself is ill defined.

How do I wrap my head around this?

3

There are 3 best solutions below

0
On BEST ANSWER

I'm afraid it is your idea of what a function is that is wrong rather then anything in probability theory. You also don't quite have the right probability spaces and functions but that's easier to fix.

Let's first go back to what a function is. In mathematics to have a function you first need at least one set. Preferably we will have two sets $A$ and $B$, ($A$ might be equal to $B$ though). A function from $A$ to $B$ is a set of ordered pairs $f=\{(a,b)|a\in A, b\in B\}$ such that any $a$ only ever appears as the first coordinate of at most one pair. This is often written as $ ((a,b)\in f) \& ((a,c)\in f)\implies b=c$. This condition also allows us to write $f(a)=b$ as we are used to since there is no confusion about which $b$ we should be getting (the unique $b$ such that $(a,b)\in f$.

Putting the above paragraph in even less mathematical terms we can think of a function as a map from $A$ to $B$ such that each element of $A$ maps to exactly one element of $B$.

Notice that the definition of a function $f$ needs absolutely nothing from the sets. As far as we care the set $A$ may contain fruit and the set $B$ may be the fruit's colour (assuming each fruit only has one colour). We certainly can't normally add fruit (and it doesn't really matter if we are adding apples to oranges or apples to apples, we don't have an addition operation defined on either. The operation is defined on the cardinalities of sets of fruit.)

For an example of a function that's closer to your probability example and still uses fruit you can think of each fruit being assigned it's weight. That gives you a function $w$ from fruit to $\mathbb{R}^+$. We still can't add fruit (the domain set) but we can easily add it's weights (the range set). This is very similar to how probability works, you can't usually add "heads" and "tails" but you can add the probabilities that either will happen.

Now to the slight misunderstanding you had about probability. The probability function (usualy) $P$ is not from the set $\Omega=\{"heads","tails"\}$ rather it's (i'll simplify a bit here) from $\mathcal{P}(\Omega)$. That is it's from the power set of $\Omega$. This is important for a number of reasons one of which is you want to have the event that either "heads" or "tails" will happen.

Another useful thing about having $P$ be from $\mathcal{P}(\Omega)$ is that you do actually get "addition" in some sense on the base set in terms of union. This is also part of the definition of the probability function $P(A\sqcup B)=P(A)+P(B)$ where $\sqcup$ denotes disjoint union. (Again I'm simplifying a bit here the definition of probability space requires countable additivity).

0
On

In your first example you say that "heads" $(H)$ and "tails" $(T)$ are ill-defined, but that is only because you neglected to properly define the probability space. If we define our probability space $\Omega$ as all the possible events from flipping a coin, then $\Omega=\{ H, T\}$ and we are good to go.

You are already familiar with a set function that takes these events to the reals, $P:\Omega\to[0,1]$ as $P(H)=1/2$ and $P(H)=1/2$. This is the probability function.

You defined a different function called the tail indicator random variable function, $X:\Omega\to\mathbb{R}$ as $X(H)=0$ and $X(T)=1$.

The random variable type functions $X(\omega)$ are useful when we are trying to describe some quality of $\omega$.

If we consider the probability space generated by flipping a coin three times, then, for example, we can define a random variable function $X:\Omega\to\mathbb{R}$ that tells us the number of heads in any given $\omega\in\Omega$. So $X(HHH)=3$ while $X(HTT)=X(THT)=1$ etc...

Hopefully this helps a bit.

0
On

Your issue is indeed an issue with what a function is.

You can define a function any time you have two sets, $A$ and $B$. Then a function is just a rule that associates with each $a \in A$ some unique $b \in B$, and we write $f(a) = b$ (or other things, but that's not important).

So, we can have functions that don't look anything like what you're used to. For example, we have a "day of the week" function. In my timezone, it's February 2nd, 2015 (that'll be our $a \in A$). Now, our day of the week function maps this date to Wednesday (that's our $b \in B$). So, this 'day of the week' function is a map from dates to days, and it's a perfectly good map, mathematically.

So, what does all this have to do with probability?

Event spaces, like you pointed out, can contain lots of various things that don't look like traditional mathematics (Heads and Tails being a good example). But that's OK, because our probability measure can still map these events to numbers, like probabilities.