Why do empty operations give the identity?

739 Views Asked by At

Why is it that empty operations give the identity of that operation?

Wikipedia says that the empty product is taken to be 1 "by convention", but this convention is consistent with ideas that seem like an empty product (like $x^{0}$).

If I talk about the intersection of no sets (taking $U$ to be the "universal set") then logic demands the empty intersection be $U$ (because any element is vacuously in every set in an empty family, and is therefore in the empty intersection). $U$ indeed functions as the identity of intersections; for any $A \subseteq U$, $A \cap U = A$.

Same story for the union of sets. Logic demands that the union of no sets be the empty set, and this is indeed the identity of the union operation.

There seems to be much more than convention going on here, and it is consistent among binary operations of all types. So why is it that the operation on no operands tends to result in the identity of that operation?

3

There are 3 best solutions below

3
On

Pardon, in advance, all the kooky notation I've made up in order to answer this question.

Everything you mention is an instance of a family of operators $\bigodot_{\alpha} : X^{\alpha} \to X$ on some set (or class) $X$ and some set (or class) of ordinals $\alpha$, such that:

  • $\bigodot_{i<1} x_i = x_1$ for all $x_1 \in X$, that is, the $1$-ary $\odot$ is the identity function; and
  • For all $\alpha \ge 1$, $\bigodot_{i<\alpha+1} x_i = \left( \bigodot_{i < \alpha} x_i \right) \odot x_{\alpha+1}$ for all $( x_i : i < \alpha ) \in X^{\alpha}$.

Here I've written $\bigodot_{i<\alpha} x_i$ as shorthand for $\bigodot_{\alpha} ( x_i : i < \alpha )$, and $x \odot y$ as shorthand for $\bigodot_2 ( x,y )$.

In order for the second property above to hold for all $\alpha$, not just all $\alpha \ge 1$, you need to have, for all $x \in X$, $$\left(\bigodot_0 ( ) \right) \odot x = \bigodot_1 \{ x \} = x$$ and likewise $x \odot \left( \bigodot_0 () \right) = x$.

Thus, if it exists, $\bigodot_0()$ (i.e. the 'empty $\odot$') must be the identity for $\odot$.


This can be justified at a more general level. The operators you mention are all instances of products and coproducts in particular categories. The empty product is the terminal object of the category (if it exists), and the empty coproduct is the initial object (if it exists). Indeed:

  • Multiplication of numbers. The natural numbers (thought of as von Neumann ordinals) form a category $\mathsf{FinOrd}$, whose morphisms are functions. In this category, the product coincides with multiplication of natural numbers as we know it, and $1$ is the terminal object, hence the empty product. The coproduct coincides with addition of natural numbers, and $0$ is the initial object, hence the empty sum.
  • Intersection of sets. The class of all sets forms a category, where a unique morphism $X \to Y$ exists if and only if $X \subseteq Y$. In this category, coproducts are unions and the empty set is the initial object, hence the empty union. Products are intersections, but there is no terminal object (since that would imply the existence of a universal set), so there is no empty intersection. However, in the subcategory consisting of subsets of some fixed set $U$, the terminal object is $U$ itself, hence $U$ is the empty intersection in this subcategory.
4
On

The reason for the phenomenon is that we want our operation to be associative. We want the sum of two sums to be one big sum, for instance, and we want this to hold regardless of how many terms are in each sum.

$$\prod_{i\in \emptyset} f(i)\times \prod_{i\in A}f(i) = \prod_{i\in (\emptyset\cup A)} f(i)=\prod_{i\in A}f(i)$$

$$\sum_{i\in \emptyset} f(i)+ \sum_{i\in A}f(i) = \sum_{i\in (\emptyset\cup A)} f(i)=\sum_{i\in A}f(i)$$

$$\bigcup_{i\in \emptyset} f(i)\cup \bigcup_{i\in A}f(i) = \bigcup_{i\in (\emptyset\cup A)} f(i)=\bigcup_{i\in A}f(i) $$

$$\bigcap_{i\in \emptyset} f(i)\cap \bigcap_{i\in A}f(i) = \bigcap_{i\in (\emptyset\cup A)} f(i)=\bigcap_{i\in A}f(i)$$

In all cases, the desire to have the above hold uniquely determines what the empty operation must be.

Note: as filipos points out, we want $\prod_{i\in A}f(i)\times \prod_{i\in B}f(i)=\prod_{i\in (A\cup B)} f(i)$ for any disjoint sets $A,B$.

4
On

It's a very useful convention, at very least; in some cases, though, it really is the correct value given the definitions involved.

As you note, the intersection of an empty collection of sets it's not empty; rather, it's everything. This can be seen by expanding the definition. Suppose $\mathcal{A}$ is empty. Then $$\begin{align} x \in \bigcap \mathcal{A} &\iff \forall A\,(A\in\mathcal{A} \to x\in A) \\ &\iff \forall A\,(A\in\emptyset \to x\in A) \\ &\iff \forall A\,(\mathsf{False} \to x\in A) \\ &\iff \forall A\,(\mathsf{True}) \\ &\iff \mathsf{True} \end{align}$$ So every $x$ is, in theory, a member of the intersection. This isn't a set, unless implicitly all $A\in \mathcal{A}$ are subsets of some set $X$, as is typical. *When that's the case, the empty intersection is $X$, because then implicitly $x$ is constrained to be in $X$.

A similar situation arises more generally in complete lattices — partially ordered sets $(X,\preceq)$ such that every $Y\subseteq X$ has a $\sup$ (least upper bound) and an $\inf$ (greatest lower bound) with respect to the ordering $\preceq$. Work through the definitions and you'll see that in a complete lattice, $$\begin{align} \sup \emptyset &= \mathbf{0} \quad\text{the 'bottom', least element, and} \\ \inf \emptyset &= \mathbf{1} \quad\text{the 'top', greatest element.} \end{align}$$ In fact, because the definition of a complete lattice implies that $\sup$ and $\inf$ exist for $\emptyset$, the definitions of $\sup$ and $\inf$ imply that for every element $x$ of the lattice, $\sup\emptyset \preceq x \preceq \inf \emptyset$, and this establishes that a complete lattice actually has least and greatest elements ($\mathbf{0}, \mathbf{1}$ respectively).

This is consistent with what happens with $\bigcup$ and $\bigcap$: these are $\sup$ and $\inf$ operations in powersets $\mathcal{P}(X)$, which are complete lattices with respect to the partial ordering $\subseteq$.