I have proven to myself that if you assume some continuous function $f(x)$ to be an infinite sum of sinusoids, then the integral techniques of a Fourier series yield exactly the correct amplitude for a given frequency. With the Fourier transform, as the period is extended to the whole real line, the integration no longer, it seems to me, gives the exact amplitude for a given frequency. However, very intuitively, all the resources I have seen state that the peaks of the Fourier transform show which frequencies are present in the original function. But the graph of Fourier transform isn’t a Dirac-Delta-esque impulse spike; it is a smooth function (in the graphs I’ve seen), which suggests that the frequencies that differ ever so slightly from the “key” frequencies of the function are still present in some sense, are still shown to have some amplitude, and now this gets extremely hand wavy, which I dislike in maths. YouTube videos and Wikipedia visual demonstrations are one thing, but I appreciate rigour.
My (self intuited - any better phrasing or writing of anything I am saying would be appreciated) thoughts (on Fourier - I’m getting to Laplace):
I am working from a background of someone who can prove the validity of Fourier series, but has seen no rigour on the subject of Fourier transforms. I assume this is the rationale: take $f(x)=\sum_{n\in S}c_n\exp(2\pi in\cdot x)$, where $S$ are the frequencies that constitute $f$. Then one convention of the Fourier transform says, assuming $\xi$ is a frequency present in $f$: $$\begin{align}\hat f(\xi)&=\int_{\Bbb{R}}f(x)\exp(-2\pi ix\cdot\xi)\,dx\\&=\int_{\Bbb{R}}c_\xi\exp((2\pi i\xi-2\pi i\xi)x)\,dx+\sum_{(n\neq\xi)\in S}\int_{\Bbb{R}}c_n\exp(2\pi ix(n-\xi))\,dx\\&=\int_{\Bbb{R}}c_\xi\,dx+\sum_{(n\neq\xi)\in S}\int_{\Bbb{R}}c_n\exp(2\pi ix(n-\xi))\,dx\end{align}$$
And I’ve gathered that the latter sum of the integrals $(n\neq\xi)$ is supposed to fizzle out somehow, as the complex exponential has a non-zero argument and as such will ... rotate around and around and integrate to zero? This was hand-waved in a resource I viewed, and it struck me as very informal. A proof of this, or a better replacement statement, is one of my questions. As for the first integral, well this one won’t rotate around and fizzle out, since it has no exponential, but I fail to see how the integral doesn’t diverge, and merely just peaks instead when $\xi$ is a frequency of $f$; the amplitude associated with $\xi$, which I’ve denoted $c_\xi$, is being integrated over the whole real line and should diverge (and not finitely peak) if it is non-zero... I am very confident my error is in treating $f$ as a weighted sum, as we do in Fourier series, but I don’t see an alternative way! None of this explains either the reality that the graph of the transform still has height in the neighbourhood of some component frequency, suggesting the frequencies near to it are present... how does the transform capture this? And why is it the case? Indeed, is the transform equal to zero when evaluated at a non-component frequency, or is it just very close to zero? I’ve never been able to tell by eyeballing graphs.
I have similar issues with the Laplace transform. My entire argument as above is the same, leading to the same issues, just considering $f$ as a sum of exponentials, be they complex or real. However there is one key thing to add: a video on the Laplace transform explained that the poles of the Laplace transform show which complex $s$ are present in $f$, considered as a (I assume weighted) sum of the $\exp(st)$... again, my Fourier-series intuition of a discrete sum is creating issues that I am unsure how to resolve. But why poles! The Fourier transform (that I’ve seen) tends to just peak when it is evaluated at a component frequency - what is so different about Laplace that it has poles at the component frequencies? And what does it say about the frequencies very near to the component ones, as they will have very large height on the graph too?
Everything I’ve seen on either transform just talks of them as natural generalisations; in particular, I gather a student is expected to blindly swallow: “the Fourier transform is just like the Fourier series with infinite period” but that creates issues! There must be something different going on; plus, there are many different conventions for the transform, all of which would yield different amplitudes for a given frequency, which is incongruous with the idea of a Fourier transform being “just like” the Fourier series.
Many thanks to anyone who can provide the rigour that is lacking in all these surface level explanations of my own half-baked thoughts and reading. It would be so nice if Wikipedia provided this kind of information, instead of just statements of fact!
Lastly (but I don’t care so much about this particular question), this is also doing my head in:
The Poisson summation formula says that the sum of Fourier series coefficients is the same as the sum of the evaluations of the transform at the component frequencies - but if my hunch about there being a key difference between the idea of Fourier series and Fourier transform is correct, this makes no sense to me!


This may not fully answer your question but the followings are all I know about Fourier and Laplace transform, at least according to where I came from (mechanical engineering).
Background
Consider the following linear Ordinary Differential Equation (ODE):
$$ \sum_{k}{b_{k}\frac{d^{k}}{dx^{k}}y\left(x\right)}=f\left(x\right) $$
In my field I am often interested to find the particular solution of this ODE. If $f(x)$ is a linear combination of exponential function:
$$ \begin{align} \sum_{k}{b_{k}\frac{d^{k}}{dx^{k}}y\left(x\right)}&=\sum_{m}{C_{m}e^{s_{m}x}}\\ \\ y_{p}\left(x\right)&=\sum_{m}{H\left(s_{m}\right)C_{m}e^{s_{m}x}}\\ \\ H(s_{m})&=\frac{1}{\sum_{k}{b_{k}s_{m}^{k}}} \end{align} $$
Notice how easy and simple it is to find the particular solution. We just scale each exponential function by a function $H\left(s\right)$, this function is called transfer function. Therefore it is of interest to represent an arbitrary function $f(x)$ as combination of (maybe infinitely many) exponential functions.
Fourier Transform
Fourier claimed that many functions can be represented by infinitely many pure imaginary exponentials:
$$ f\left(x\right)=\frac{1}{2\pi}\int_{-\infty}^{\infty}F\left(\Omega\right)e^{i\Omega x}\phantom{x}d\Omega $$
In practice we want to know the function $F\left(\Omega\right)$ for a given $f(x)$. We do this like the following:
$$ \begin{align} \int_{-\infty}^{\infty}f(x)e^{-i\Omega x}\phantom{x}dx&=\frac{1}{2\pi}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}F\left(\omega\right)e^{i\omega x}\phantom{.}d\omega\phantom{.} e^{-i\Omega x}\phantom{.}dx\\ \\ &=\frac{1}{2\pi}\int_{-\infty}^{\infty}F\left(\omega\right)\int_{-\infty}^{\infty}e^{i\left(\omega - \Omega\right) x}\phantom{.}dx \phantom{.}d\omega\\ \\ &=\frac{1}{2\pi}\int_{-\infty}^{\infty}F\left(\omega\right)\phantom{.}2\pi\phantom{.}\delta\left(\omega - \Omega\right)\phantom{.}d\omega\\ \\ &=F\left(\Omega\right) \end{align} $$
As you know, these two equations are Inverse Fourier Transform and Fourier Transform respectively. Notice that we have never required $f(x)$ to be periodic so in a sense there are no period nor frequency. So when you see example of Fourier Transform that is smooth continuous curve and wonder why there are no single "peak", maybe the function is simply not periodic.
I think it's schools' fault to teach Fourier Series first before Fourier Transform.
Fourier Series
In a special case in which $f(x)$ is a linear combination of pure imaginary exponential functions with period $T$, the Fourier Transform becomes a linear combination of delta functions:
$$ \begin{align} f(x)&=\sum_{m\in\mathbb{Z}}{C_{m}e^{i\frac{2\pi m}{T}x}}\\ \\ F\left(\Omega\right)&=2\pi\sum_{m\in\mathbb{Z}}{C_{m}\delta\left(\Omega-\frac{2\pi m}{T}\right)} \end{align} $$
We are not too interested in this complete result and we usually only want the coefficient $C_{m}$. We do this by using the Fourier Series expansion:
$$ C_{m}=\frac{1}{T}\int_{0}^{T}f(x)e^{-i\frac{2\pi m}{T}x}\phantom{.}dx $$
This use the fact that integration of imaginary exponential over their period returns zero as you may already know.
Conclusion
The difference between Fourier Transform and Fourier Series lies in the fact that Fourier Series is for periodic function while Fourier Transform can be applied to non periodic function as well, which leads to smooth functions rather than singular peaks.
Later on the Fourier Series will be the basic for Discrete Time Fourier Transform (DTFT) and Fast Fourier Transform (FFT).
Bonus: Laplace Transform
Recall that many (not all) function can be represented by linear combination of pure imaginary exponential functions. This is because some function cause the integral to diverge. However, notice that while the Fourier Transform for $f(x)$ does not exist, the transform for $e^{-\sigma x}f(x)$ may exist. We call the Fourier Transform of $f(x)$ after multiplication by $e^{-\sigma x}$ the Laplace Transform.
$$ \begin{align} F(\sigma, i\Omega)&=\int_{-\infty}^{\infty}e^{-\sigma x}f(x)e^{-i\Omega x}\phantom{.}dx\\ \\ &=\int_{-\infty}^{\infty}f(x)e^{-\left(\sigma+i\Omega\right) x}\phantom{.}dx\\ \\ \\ e^{-\sigma x}f(x)&=\frac{1}{2\pi}\int_{-\infty}^{\infty}F(\sigma, i\Omega)e^{i\Omega x}\phantom{.}d\Omega\\ \\ f(x)&=\frac{e^{\sigma x}}{2\pi}\int_{-\infty}^{\infty}F(\sigma, i\Omega)e^{i\Omega x}\phantom{.}d\Omega\ \end{align} $$
The Laplace Transform only exist for certain value of $\sigma$, this is called the region of convergence.