I am an undergrad that has taken a few courses in real and complex analysis. I am trying to understand the Fourier transform better at a level of abstraction somewhere between "it moves from physical to frequency space" and a graduate level that talks about things like "Pointryagin dual spaces."
1) (Preserving information) My current intuition for integral transforms comes from measure-theoretic probability. My thinking from scratch: I know the probability distribution of a random variable is essentially the "measure assigned to all Borel sets," so it makes sense that knowing $E[f \circ X]$ for any measurable $f$ suffices to characterize the distribution of $X$. This means knowing $E[X^k]$ for all integers $k$ suffices by polynomial approximation, and so moments characterize a distribution. Then since the exponential function is defined as a power series, the object $E[\exp( tX)]$ tells us the moments, and thus the distribution, of the rv $X$, so a MGF characterizes a distribution. I later learned the MGF is sometimes called a "Laplace Transform," so it makes sense that the Laplace transform preserves information. I also know the Fourier transform is the "characteristic function" of a random variable, but I'm not sure what the analog of the above explanation is for the characteristic function, and in particular the need for complex numbers here and the intuition for what their resulting interpretation (in the above sense) puzzles me. Of course, if you think of the naïve idea that the Fourier transform goes from "physical to frequency space" then it's obvious that it preserves information. But I am looking for an explanation in the sense given above.
2) (Change of basis) I have read the Fourier transform is a "change of basis" and I have also heard the fact that the exponential function is an eigenfunction of the Laplace operator, but I'm not sure how these two operators are related. In particular, I know the Fourier transform sends $\mathcal{F}: L_2 \to L_2$, where that function space has a countable basis. So I would expect a "change of basis" to be something like $\hat{f} = \sum_i a_i \phi_i$ where $\phi_i$ are a basis set of functions for the space. But the Fourier transform involves an integral, not a sum, somehow implying an "uncountable infinite basis" if we view $\hat{f}(x) = \int f(t) \exp(-2\pi i x t) dt$ the values $f(t)$ as the co-efficients $a_i$ and the $\exp(-2\pi i x t)$ as the eigenfunctions. What's going on here? In what precise sense is it a change of basis?
Since you seem to understand the mathematical part, I'll give in for granted and just focus on the interpretation part. I'm not sure I understand your questions exactly, but I'll try to answer in a general sense.
I like to think of the Fourier transform as a sort of 'improper' change of basis. In fact, when we express a vector $\mathbf{v}$ given a set of basis vectors $\{ \mathbf{\phi}_i\}$ we write
\begin{equation} \mathbf{v} = \sum_j c_j \mathbf{\phi}_i \end{equation}
or, using coordinates:
\begin{equation} \label{eqn:v_e} v_i = \sum_k c_k \phi_{ik} \end{equation}
in which ${c_k}$ are the coefficients and $\phi_{ik}$ is a (really bad) notation for the $k$-th element of the $i$-th vector. This means that, given the basis, knowing the set of coefficients is the same as knowing the vector (which is in fact expressed in this basis by the coefficients themselves).
The same principle can hold for a function $f(x)$ that we can express as a combination of basis-function ${\phi_k(x)}$.
\begin{equation} f(x) = \sum_k c_k \phi_k(x) \end{equation}
and, just like before, for a given set of basis functions, knowing the coefficients gives us all the information we need to reconstruct the original function. This is the case, e.g., of the Fourier Series (a discrete version of the Fourier Transform).
Finally, if instead of using a discrete set of basis-function we use a continuous one we need to parametrize it using a continuous variable instead of discrete one ($\phi_k(x) \rightarrow \phi(k,x)$) . Moreover, the summation will turn into an integral ($\sum_k \rightarrow \int dk$).
\begin{equation} \label{eqn:f_e} f(x) = \int dk ~ c(k) \phi(k,x) \end{equation}
Note the similarity between this expression and the expression for $v$. In the Fourier transform, $\hat{p}$ are the coefficients, and the complex exponential functions are the basis on which we express $p$.
So, how does this relate to your questions?
Preserving information: Since by the knowledge of the Fourier coefficients we can can recover our original function, we have the same information about it. The F-transforms of a function is another representation of the function, but the function is still there. A good parallel might be representing a probability distribution using its PDF, using its CDF or is MGF: it is always the same distribution, just expressed in different terms.
Change of Basis Fourier series is a change of basis in the sense you are referring to. Fourier transform is an integral transform which can be seen as a sort of "change of basis" for the reasons I've explained above, but it is not as discussed for example in this reply.