I've been reading this very interesting blog post entitled "A review of probability theory" from Terence Tao. Here are a few quotes from the blog post:
Elements of the sample space $\Omega$ will be denoted $\omega$. However, for reasons that will be explained shortly, we will try to avoid actually referring to such elements unless absolutely required to.
...
In order to have the freedom to perform extensions every time we need to introduce a new source of randomness, we will try to adhere to the following important dogma: probability theory is only “allowed” to study concepts and perform operations which are preserved with respect to extension of the underlying sample space. (This is analogous to how differential geometry is only “allowed” to study concepts and perform operations that are preserved with respect to coordinate change, or how graph theory is only “allowed” to study concepts and perform operations that are preserved with respect to relabeling of the vertices, etc..) As long as one is adhering strictly to this dogma, one can insert as many new sources of randomness (or reorganise existing sources of randomness) as one pleases; but if one deviates from this dogma and uses specific properties of a single sample space, then one has left the category of probability theory and must now take care when doing any subsequent operation that could alter that sample space. This dogma is an important aspect of the probabilistic way of thinking, much as the insistence on studying concepts and performing operations that are invariant with respect to coordinate changes or other symmetries is an important aspect of the modern geometric way of thinking. With this probabilistic viewpoint, we shall soon see the sample space essentially disappear from view altogether, after a few foundational issues are dispensed with.
Let’s give some simple examples of what is and what is not a probabilistic concept or operation. The probability $P(E)$ of an event is a probabilistic concept; it is preserved under extensions. Similarly, boolean operations on events such as union, intersection, and complement are also preserved under extensions and are thus also probabilistic operations. The emptiness or non-emptiness of an event $E$ is also probabilistic, as is the equality or non-equality of two events $E,F$ (note how it was important here that we demanded the map $\pi$ to be surjective in the definition of an extension). On the other hand, the cardinality of an event is not a probabilistic concept; for instance, the event that the roll of a given die gives $4$ has cardinality one in the sample space $\{1,\ldots,6\}$, but has cardinality six in the sample space $\{1,\ldots,6\} \times \{1,\ldots,6\}$ when the values of an additional die are used to extend the sample space. Thus, in the probabilistic way of thinking, one should avoid thinking about events as having cardinality, except to the extent that they are either empty or non-empty.
[The bold is mine.]
This seems to be a very insightful viewpoint. But, in introductory probability classes, it is very common to compute the probability of an event by counting the number of outcomes in the event, and dividing by the total number of outcomes in the sample space (assuming that all outcomes in the sample space are equally likely). Is this bad form? In such cases, would it be preferable to compute the probabilities using an approach that does not involve counting the number of elements of an event?
Perhaps an anology is that, in linear algebra, we often prefer proofs that don't use coordinates.
In the 17th century when Pascal and Fermat were working on the Problem of points they used fundamental elementary ideas of Enumerative combinatorics which are important and still valid. In elementary schools we still teach addition and multiplication of counting numbers. Just as the Combinatorial principles of Rule of sum and Rule of product are introduced in probability classes. They are still important and valid. What Terence Tao seems to be referring to is some abstract and theoretical framework very far from what is used in the practice of Statistics where the simple ideas introduced in probability classes can be used to solve concrete problems which come up in real life. I am sure that there is some value in Tao's approach when properly understood but there are obstacles to achieving that proper understanding without first having understood the more naive approach and requires some mathematical maturity.