When an algebraic structure is defined, it is often defined as a set $S$ "along with"/"together with"/"having" operations $\circ_1, \circ_2, \ldots, \circ_n$, and "denoted" by $(S, \circ_1, \circ_2, \ldots, \circ_n)$.
This is true of metric spaces as well. We often say that a metric space is a set $X$ with some distance function $f: X \times X \to \mathbb{R}$.
But none of this seems very formal to me. Are we -- the reader -- supposed to infer that it in each of these cases, we're really talking about an $n$-tuple (in ZFC)?
So for instance, when a theorem says something like this:
Thm Let $X$ be a metric space. If $X$ is a foo, then $X$ is a noo.
It is formally:
Thm Let $(X, d)$ be a metric space. If (the underlying set) $X$ is a foo, then $X$ is a noo.
I guess I'm just confused as to why tuples are glossed over and replaced with "together with" so often, when it detracts from the "axiomaticity" of the material.
It's often okay to define a mathematical object without an explicit set theoretic model. If we need a model we can always take a step back and make sure there is one.
Take for example the following definition:
We can use this definition of a map to define surjectivity, injectivity and any other property of maps and work with those without ever thinking about how we would model this in set theory.
Or do you always think of a map $f\colon A\to B$ as a tuple $(F, A, B)$ where $F\subseteq A\times B$ is a right-unique relation, and the $3$-tuple $(F, A, B)$ is really build from Kuratowski pairs $(F, (A, B))$ with $(A, B) = \{\{A\}, \{A, B\}\}$?