In the Cartesian coordinate system we identify a point in space by the three coordinates $x$, $y$, and $z$.
In the spherical coordinate system, we identify a point in space by the three coordinates $r$, $\theta$, and $\phi$.
First of all, I am guessing that these coordinate systems must be defined relative to an orthogonal set of basis vectors (that form a set of orthogonal axes). Is this true? That is, is an orthogonal set of basis vectors a necessary requirement to define a coordinate system?
Intuitively, it is pretty clear what a coordinate system is. But formally, what constitutes a coordinate system, ie what are the exact components of it?
In the case of a spherical coordinate system, I am guessing the system is a set of orthogonal basis vectors, and a set of coordinates of form $(r, \theta, \phi)$.
Using the same set of orthogonal basis vectors, we can define a Cartesian coordinate system by adding a set of coordinates of form $(x,y,z)$.
When we draw a graph of a function using spherical coordinates, is it fundamentally correct to label the axes $x$, $y$, and $z$? By that I mean, if you do label them as such, do those labels have anything to do with the Cartesian coordinates $x$, $y$, and $z$? It seems like the labeling is just a set of coordinates on the basis vectors. The coordinates in Cartesian coordinates just happen to coincide with them.
When we have spherical coordinates $(r,\theta,\phi)$, they are associated with a particular point $(c_1,c_2,c_3)$ in space. What do we call $(c_1,c_2,c_3)$ in this association with spherical coordinates? It seems like $(c_1, c_2, c_3)$ are the coordinates in space (also because we draw the graph with $x$, $y$, and $z$ on the axes), but then again, $(r,\theta,\phi)$ are the actual coordinates in the spherical system.
The precise notions you need are that of a manifold (and the definition of charts and atlases (this terminology obviously derived from their usual meaning in everyday English) which go along with it). For that, I strongly suggest you Frederic Schuller's lectures.
Let me start of slightly informally. By coordinates, we really mean a mapping of some kind. We take a certain point of interest, and assign it some collection of numbers. So, you could also think of this as a "numericalization" process of some sort. Slightly more explicitly, start with a set $M$ of interest. By a (local) coordinate system on $M$, we mean an injective mapping $\psi:U\to\Bbb{R}^n$ for some $n$, where $U$ is some subset of $M$. Some remarks (a lot of it is terminology/vocabulary):
Notice how in this preliminary definition, I made no reference at all to any sort of linear algebra/vector space structure. I didn't talk about basis vectors (linear algebra), I didn't talk about orthogonality (geometry in linear algebra). None of that. Coordinates should really just be thought of as a process of assigning tuples of numbers to points in a certain set.
For the sake of completeness, let me mention what I did not discuss above. The definition above is incomplete because I didn't really discuss the role of the integer $n$. In order to avoid pathologies, we should impose certain conditions on the set $M$ (namely that it is a topological space of some kind) and that all the maps are homeomorphisms and so on (because it is a simple fact in set theory that $\Bbb{R}\cong \Bbb{R}^n$ are sets of the same cardinality, so there is a bijection between $\Bbb{R}^n$ and $\Bbb{R}^m$ for any $n$ and $m$). This is all done more systematically in the lecture series I linked to above. My purpose in this answer is to tell you roughly what coordinates are, and to really emphasize what they are not.
Finally, I should mention that we can have more than one coordinate system on the same set $M$, and we can also have more than one coordinate system with the same domain $U$, such as $(U,\psi), (U,\tilde{\psi})$, where $\psi,\tilde{\psi}:U\to\Bbb{R}^n$ are two completely different maps. So far I have talked a lot about the abstract definition, but how does it relate to some familiar examples?
Example 1: Vector spaces, bases, and associated coordinate system.
Let $V$ be an $n$-dimensional vector space over $\Bbb{R}$. Let $\beta=\{v_1,\dots, v_n\}$ be a basis for this vector space (linearly independent and spanning). As a result, for each vector $v\in V$, there exist unique scalars $a^1,\dots, a^n\in\Bbb{R}$ such that \begin{align} v&=\sum_{i=1}^na^iv_i. \end{align} We now define a global coordinate system $\psi:V\to\Bbb{R}^n$ as $\psi(v)= (a^1,\dots, a^n)$, where the $a^1,\dots, a^n$ are the unique numbers making the equation above true.
Perhaps a slightly easier way of describing this is that we first consider the mapping $\Bbb{R}^n\to V$ defined as $(a^1,\dots, a^n)\mapsto\sum_{i=1}^na^iv_i$. Now, because $\beta=\{v_1,\dots, v_n\}$ is a basis for the vector space, it follows the mapping is both injective (due to linear independence) and surjective (due to spanning). Hence, it is bijective, and thus has an inverse. This inverse mapping what I called $\psi:V\to\Bbb{R}^n$ above.
I should really write $\psi_{\beta}$ to indicate that this mapping (actually a linear isomorphism) has been constructed from the basis $\beta$. So, it is in this sense that a basis for a vector space gives rise to a coordinate system in the above sense. This is the "natural" way of getting a coordinate system using a basis because it fully incorporates the linear structure on the space $V$. Conversely, given any isomorphism $\psi:V\to \Bbb{R}^n$, we get an associated basis $\beta$ of $V$, namely $\beta=\{\psi^{-1}(e_1),\dots, \psi^{-1}(e_n)\}$, where $e_i=(0,\dots, 1,\dots 0)\in\Bbb{R}^n$ is the vector with a $1$ in $i^{th}$ spot and $0$ elsewhere.
So, in the setting of vector spaces, there is a one-to-one correspondence between linear isomorphisms $\psi:V\to \Bbb{R}^n$ (which qualify as a "coordinate system on $V$") and bases $\beta$ of $V$. So, this (and the fact that in introductory math you usually only deal with vector spaces like $\Bbb{R},\Bbb{R}^2,\Bbb{R}^3$) is why people may initially just define a coordinate system to be a basis. However, I would caution against such an identification; you should keep these notions separate: a coordinate system is different from a basis.
Example 2: $\Bbb{R}^n$
The set $M=\Bbb{R}^n$ is rather special because it appears in the very definition of a coordinate system by virtue of being the target space of $\psi$. So, a global coordinate system on $M=\Bbb{R}^n$ is really just an injective mapping $\psi:\Bbb{R}^n\to\Bbb{R}^n$. There is a very obvious example of such a mapping, namely the identity function: send each point $p\in\Bbb{R}^n$ (which by definition is an $n$-tuple of numbers) to the same $n$-tuple $p$, i.e $\psi(p):=p=(p^1,\dots, p^n)$. So, $(\Bbb{R}^n\text{id}_{\Bbb{R}^n})$ qualifies as a coordinate system on $\Bbb{R}^n$, and it is this special coordinate system that we refer to as "Cartesian coordinates". It is of course standard notation to denote the coordinate functions $\psi^i=(\text{id}_{\Bbb{R}^n})^i$ as $x^i$. So, Cartesian coordinates are $(x^1,\dots, x^n)$, and in lower dimensions we simply use the notation $x$ (if $n=1$), $(x,y)$ if $n=2$, and $(x,y,z)$ if $n=3$ to denote these coordinate functions.
Also, if you think of $M=V=\Bbb{R}^n$ as a vector space, and consider the "standard basis" $\beta=\{e_1,\dots, e_n\}$, then as described in the previous example, this gives rise to a coordinate system $\psi:\Bbb{R}^n\to\Bbb{R}^n$ (which turns out to be a linear isomorphism), and since we chose specifically the basis $\{e_1,\dots, e_n\}$, this mapping $\psi$ is equal to the identity map, i.e we recover "Cartesian coordinates". It also turns out that if you choose the standard inner product on $\Bbb{R}^n$, that this basis is orthonormal.
It is because of these many ways of viewing things that it is often so difficult (in the beginning) to separate the notions of coordinates, basis vectors, Cartesian coordinates, and the notion of orthonormality and so all these notions get jumbled up into one big idea. Hopefully this clarifies the distinction between these objects.
Different Coordinates on the Same Space: Polar Coordinates
As I mentioned previously, if you consider one and the same set $M$, you can have many many different coordinate systems. Let us now introduce polar coordinates in a precise manner. Let $M=\Bbb{R}^3$. Think of this $M$ as the abstract set on which we wish to introduce a polar coordinate system. Now, consider the set $A=(0,\infty)\times (0,\pi)\times (0,2\pi)\subset\Bbb{R}^3$. And now consider the mapping $\tilde{\psi}:A\to M$ defined as \begin{align} \tilde{\psi}(r,\theta,\phi)= (r\sin\theta\cos\phi,r\sin\theta\sin\phi,r\cos\theta). \end{align} Because of how the domain $A$ is defined, it is straight-forward to show that this mapping is injective. We call the image of this mapping $U:=\tilde{\psi}[A]$, and we let $\psi=(\tilde{\psi})^{-1}:U\to A\subset\Bbb{R}^3$. It is the pair $(U,\psi)$ that we call "spherical polar coordinates in $\Bbb{R}^3$" (though we should really say "spherical polar coordinates on the set $U\subset\Bbb{R}^3$" to emphasize that the coordinate system is only defined on a proper subset $U$ of $\Bbb{R}^3$).
Now, the key point is that polar coordinates are a coordinate system on the region $U\subset M=\Bbb{R}^3$. Now, recall the definition of Cartesian coordinates from the previous example. By definition, they make up the identity map $\text{id}_{\Bbb{R}^3}=(x,y,z):\Bbb{R}^3\to\Bbb{R}^3$. Now, by restricting the domain of these maps, we of course get an injective map $(x,y,z):U\to\Bbb{R}^3$, so this is still a coordinate system on $U$.
So, we now have two different coordinate systems on the same set $U\subset M$. One is Cartesian coordinates $(U,(x,y,z))$, and the other is polar coordinates $(U,(r,\theta,\phi))$. The relationship between these functions is that for each $p=(p^1,p^2,p^3)\in U$, we have \begin{align} \begin{cases} x(p)=p^1=r(p)\sin\theta(p)\cos\phi(p)\\ y(p)=p^2=r(p)\sin\theta(p)\sin\phi(p)\\ z(p)=p^3=r(p)\cos\theta(p) \end{cases} \end{align} Since these equalities hold for all $p\in U$, we have that \begin{align} \begin{cases} x&=r\cdot (\sin \circ \theta)\cdot (\cos\circ \phi)\\ y&=r\cdot (\sin \circ \theta)\cdot (\sin\circ \phi)\\ z&=r\cdot (\cos \circ \theta) \end{cases} \end{align} This is a proper way of expressing the relationship between the coordinate functions. However, mathematicians are obviously a little lazy, so we suppress the composition symbol and simply write that "spherical polar coordinates are defined (implicitly) by the relations" \begin{align} \begin{cases} x&=r\sin\theta\cos\phi\\ y&=r\sin\theta\sin\phi\\ z&=r\cos\theta \end{cases}\tag{$*$} \end{align}
Hopefully this example allows you to reconcile the formal/abstract definition of a coordinate system which I gave initially, with the usual language with which it is typically introduced. Also, hopefully this example illustrates why it is sometimes easier to define a parametrization first (i.e the inverse of the coordinate system).
Final Example: Parabolic Coordinates
One typically only encounters Cartesian, polar and cylindrical coordinates, so let me instead give you a different example to illustrate how the formal definition matches up with how things are done in practice. We shall consider Parabolic Coordinates in part of $\Bbb{R}^2$. If you see the first thing which Wikipedia says:
then it may not be clear (a-priori) how this corresponds to the above formal definition I gave for a coordinate system as an injective mapping from some set. WHat are the $M,U,\psi$ here? Well, as you may have hopefully already picked up from the previous examples, it is sometimes easier to provide an implicit definition for the coordinate system in terms of another already well-defined and well-understood coordinate system (i.e defining $\tilde{\psi}:A\to M$, the parametrization is sometimes simpler than defining its inverse $\psi=(\tilde{\psi})^{-1}:U=\tilde{\psi}[A]\to A$). Once we provide the definition of $\tilde{\psi}$, it is then much easier to use abstract tools like the inverse function theorem to prove that it is indeed locally injective/ can be locally inverted.
Anyway, here $M=\Bbb{R}^2$, it is a little annoying to work out what exactly $U,\psi$ are. Anyway, the point I wish to drive home is that these parabolic coordinates, like any other coordinate system, are just a device by which we can take a point $p\in U$ and obtain a tuple of numbers $(\sigma(p),\tau(p))$. The relationship between these coordinates and the usual Cartesian coordinates $x(p):=p^1, y(p)=p^2$ is then given by equation $(**)$ applied to each point $p\in U$.
What is going on with the Pictures
Typically, if we wish to visualize a coordinate system $(U,\psi)$, which has the coordinate functions $(x_{\psi}^1,\dots, x_{\psi}^n)$, what we do is we look at the set of points where each $x_{\psi}^i$ is constant. So, for each number $t\in\Bbb{R}$, we like to look at the set of points \begin{align} \{p\in U\,:\, x_{\psi}^i(p)=t\}\qquad (1\leq i \leq n) \end{align} So, for example, in $M=\Bbb{R}^2$, the set of points with constant $x$-coordinate are "vertical lines", while the set of points with constant $y$-coordinate are "horizontal lines", the set of points with constant $r$-coordinate are circles centered at the origin, and set of points of constant $\theta$-coordinate are half-rays. In the parabolic coordinate system, the set of points with constant $\sigma,\tau$ are parabolas of some kind and so on.
So, whenever one has a coordinate system $(U,\psi)$ in mind and wishes to draw pictures, what we do is we look at everything in the image of this mapping, i.e we draw things in $\psi[U]\subset \Bbb{R}^n$. Since the image is a subset of $\Bbb{R}^n$, it is perfectly fine to draw an $n$-dimensional grid as usual, and label to axes as $x_{\psi}^1,\dots x_{\psi}^n$. So, a tuple $(a^1,\dots, a^n)\in\psi[U]$ then obviously just corresponds to a point $p\in U$ with coordinates $x_{\psi}^1(p)=a^1,\dots, x_{\psi}^n(p)=a^n$.
Bottom line: coordinates are not the same thing as a basis, they are not the same thing as a orthogonal basis. None of that. Yes, given a basis for a vector space, we can define a global coordinate system, but don't let this simple fact lead you to mixing up the two notions. Coordinate systems are just a systematic way of assigning numerical quantities to certain points of interest (and this had better be injective so that two different points do not get assigned the same tuple of numbers). Finally, pictures that we draw are usually represented in the target space of the mapping $\psi$ (we do this because the target space is $\Bbb{R}^n$, and we have developed our intuition from $\Bbb{R}^n$).