(1) If $X$ and $Y$ are two sets, we define the Cartesian product $X \times Y$ as the set of ordered pairs $(x,y)$, such that $x \in X$ and $y \in Y$.
(2) On the other hand [Folland, Real Analysis, page 4], if $\{X_\alpha\}_{\alpha \in A}$ is infinite indexed family of sets, their Cartesian product $$ \prod_{\alpha \in A}X_\alpha $$ is defined as the set of maps $f: A \to \bigcup\limits_{\alpha \in A} X_\alpha$ such that $f(\alpha) \in X_\alpha$ for every $\alpha \in A$.
After saying this, Folland remarks:
it should be noted, and promptly forgotten, that when $A = \{1,2\}$, the previous definition of $X_1 \times X_2$ [that's (1) above] is set-theoretically different from the present definition of $\prod_1^2 X_j$ [that's (2) above]. Indeed, the latter concept depends on the mappings, which are defined in terms of the former one.
I am not grasping this remark. Specifically, here are my questions.
Question 1: How is (2) set-theoretically different from (1)? A simple illustrative example?
Question 2: If (1) is extended to infinite families, which definition would be stronger? A simple illustrative example?
Question 3: Why should this be "promptly forgotten"?
I'll probably have more questions depending on the type of answers I'll get to these.
Thanks!
Very briefly. Question 1 has been amply answered; the two definitions produce different sets, although there is a canonical bijection between them.
Question 2. There is no such thing as extending (1) to infinite families, it is a definition for pairs of sets only. Even if you want something for three sets, there are at least two ways to combine two Cartesian products, which give different (though isomorphic) results, neither of which involves ordered triples. But one can never get infinite products in any of these ways. One can interpret (2) as an alternative way to "generalize" this, in a manner that also caters for infinite products.
Question 3: We can forget the distinction between (1) and (2) in the case of a Cartesian product of two sets because of the canonical bijection between the sets produced by the two definitions, which means we can always and consistently translate back and forth between them when necessary. And we should forget the distinction because keeping track of which of the two is officially applied in which situation is a totally unproductive effort. The whole importance of giving these definitions is to have a precise model of a Cartesian product, so that its properties can be deduced from the axioms of set theory, but having more than one equivalent model does not add anything useful.
Absent question: if (2) can do all that (1) can (slightly differently but equivalently) why should we care about (1) in the first place? Because without (1) we would be in a chicken-and-egg situation when trying to formulate (2), not only because (2) produces sets of mappings, which require a Cartesian product (of two sets), but also because the notion of an indexed family of sets itself is defined in terms of mappings.