Possible significant error in proof of the spectral theorem, Brian C Hall, Quantum Theory for Mathematicians

Question

Possible significant error in proof of the spectral theorem, Brian C Hall, Quantum Theory for Mathematicians

277 Views Asked by Bumbble Comm At 29 Mar 2026 - 6:30

Note/Edit: Read the below paragraphs for context. I think I found a counterexample, though I don't have the energy to work through it now. Suppose $\mathcal{H} = L^2([0,1],m) \oplus L^2([0,1],m)$ where $m$ denotes Lebesgue measure. Consider the "position" operator $X$ given by $Xf(x) = xf(x)$ and consider the operator $A$ given by $Af = 1_{[1/2,1]}f$, each operator is a bounded self-adjoint operator on $L^2([0,1],m)$. Then define the bounded self-adjoint operator $X \oplus A$ on $\mathcal{H}$ in the obvious way: $X \oplus A (f,g) = (Xf,Ag)$. You can look at Hall's construction of the $(W_j, \psi_j)$, but you can convince yourself that we could take $W_1 = L^2([0,1],m) \times \{0\}$ and have $\psi_1 = (1,0)$ (standard techniques using density of polynomials will then convince you that the closed span of the $(X \oplus A)^n \psi_1$ is $W_1$). Then note that $1 \in \sigma(X \oplus A|_{W_1})$, $1_{\{1\}}(X \oplus A) \neq 0$ (as $(0, 1_{[1/2,1]} g)$ is an eigenvector of $X \oplus A$ with eigenvalue $1$ for all $g \in L^2([1/2,1],m)$), hence $\mu(\{1\}) \neq 0$ but $\mu_{\psi_1}(\{1\}) = 0$, which contradicts surjectivity as I noted in the 3rd to last paragraph. Let me know if you think this works or doesn't. If if does then I guess there's a serious problem with the proof in the book.

So I am in the middle of reading through B.C. Hall's proof of the spectral theorem for bounded self-adjoint operators on separable Hilbert spaces, so this question is a little hard to state given the amount of context.

Let $A$ a self-adjoint bounded operator on a separable Hilbert space $\mathcal{H}$. Suppose $\{W_j, \psi_j\}$ is a (possibly finite) sequence of pairwise orthogonal subspaces of $\mathcal{H}$ s.t. they are invariant under $A$, for fixed $j$ the span of $A^n \psi_j$ is dense in $W_j$, and $\mathcal{H} = \bigoplus_j W_j$ (the orthogonal direct sum). Let $A_j = A|_{W_j}$. Let $\mu^A$, $\mu^{A_j}$ denote the projection valued measure on the Borel $\sigma$-algebra on $\sigma(A), \sigma(A_j)$ resp. (where $\sigma(B)$ is the spectrum of $B$). Let $\mu_{\psi_j}$ be the positive measure given by $\mu_{\psi_j}(E) = (\psi_j, \mu^{A_j}(E) \psi_j)$. Let $\mu$ be a positive measure on $\mathcal{B}_{\sigma(A)}$ s.t. $\mu(E) =0 \iff \mu^A(E) = 0$. Then note that $\mu^{A_j}(E) = 1_E(A_j) = 1_E(A|_{W_j}) = 1_E(A)|_{W_j}$ (where we are referring to the functional calculus induced by the respective operators, and the last equality follows from a lemma). Thus $\mu(E) = 0$ implies $0=\mu^A(E)= 1_E(A)$, hence $\mu^{A_j}(E) = 1_E(A_j) = 1_E(A)|_{W_j} = 0$ which implies $\mu_{\psi_j}(E) = 0$. So $\mu_{\psi_j}$ is absolutely continuous w/r/t $\mu$ on $\mathcal{B}_{\sigma(A_j)}$. Let $\rho_j$ denote its Radon-Nikodym derivative.

Finally we get to the part I'm stuck at. The author now claims "one can easily see that" the map $f \mapsto \rho_j^{1/2}f$ is unitary from $L^2(\sigma(A_j), \mu_{\psi_j}) \to L^2(\sigma(A_j), \mu)$. It is easy to verify it is norm preserving but why is it surjective?

Note that we only showed that $\mu_{\psi_j}$ was absolutely continuous w/r/t $\mu$. I claim that we need the converse to hold. Suppose we had the converse. Then $\rho_j$ is a.e. (both $\mu$ and $\mu_{\psi_j}$ since these agree on null sets) to a function that is everywhere nonzero, so we can WLOG suppose that $\rho_j$ is everywhere nonzero. Then we note that $\int_{\sigma(A_j)} |f \rho_j^{-1/2}|^2 d\mu_{\psi_j} = \int_{\sigma(A_j)} |f|^2 \rho_j^{-1} \rho_j d\mu = \int_{\sigma(A_j)} |f|^2 d\mu$. So if $f \in L^2(\sigma(A_j), \mu)$, then $f\rho_j^{-1/2} \in L^2(\sigma(A_j), \mu_{\psi_j})$. So the map is onto. On the other hand if we have that the converse doesn't hold, by considering a set that is $\mu_{\psi_j}$ null but not $\mu$ null ($\mu$ can be taken to be a finite measure, we can WLOG suppose this set has finite measure), and considering the indicator function on this set, we can see that no function in $L^2(\sigma(A_j), \mu_{\psi_j})$ will map onto it.

So why does this converse hold (to me it doesn't seem like it actually does so it feels like there is a near fatal error in this proof)? I tried constructing examples, but they got confusing fast.

Sorry for the wall of text, the question is really deep into a long proof but I'm totally stuck and any help would be greatly appreciated.

Original Q&A

There are 1 best solutions below

**Bumbble Comm** · Accepted Answer

You don't need any converse, if $\rho\ne0$ a.e. $\mu_\psi$. Let me drop the $j$ to type less. If $f\in L^2(\sigma(A),\mu)$, all you need for surjectivity is that $\rho^{-1/2}f\in L^2(\sigma(A),\mu_\psi)$. And that works: you have that, for any measurable nonnegative $g$, $$ \int_E g\,d\mu_\psi=\int_E g\,\rho\,d\mu. $$ Thus $$ \int_{\sigma(A)}|\rho^{-1/2}\,f|^2\,d\mu_\psi=\int_{\sigma(A)}\rho^{-1}\,|f|^2\,\rho\,d\mu=\int_{\sigma(A)}|f|^2\,d\mu<\infty. $$ So $U$ is surjective: $f=U(\rho^{-1/2}f)$.

More details about Hall's proof: As written, the proof doesn't work. But it's more about the implementation than the idea. As written, the proof seems to work only when $H$ is separable, but that's an assumption over the whole book, so not an issue.

In Lemma 8.12, one should be more careful about the choice of $\mu$. Namely, instead of taking $\mu$ as described in its proof, one can take $$\tag1 \mu=\sum_k\tfrac1{k^2}\,\mu_{\psi_k}. $$ Here, for the nontrivial part of the proof, if $\mu_{\psi_k}(E)=0$ fr all $k$, we have $\mu^A(E)\psi_k=0$, and then for any $f$ $$0=f\mu^A(E)\psi_k=\mu^A(E)\,f\psi_k.$$As $\psi_k$ is cyclic, we get that $\mu^A(E)=0$ each $W_k$, and so $\mu^A(E)=0$.

The other important observation is this: a point $\lambda\in \sigma(A)$ is an eigenvalue for $A$ if and only if $\mu(\{\lambda\})>0$. That is, the eigenvalues of $A$ are precisely the atoms of $\mu$. Indeed, if $\lambda$ is an eigenvalue for $A$, then there exists $j$ such that $\lambda$ is an eigenvalue for $j$. As $A_j$ goes to the multiplication operator in $L^2(\sigma(A_j),\mu_{\psi_j})$, the eigenvalue equation says that there exists $g$ with $\|g\|_2=1$ $$ tg(t)=\lambda g(t),\ \ \ \ t\in \sigma(A_j)\ \text{ a.e.} $$ This forces $g(t)=0$ a.e. on $\sigma(A_j)\setminus\{\lambda\}$. Thus $$ 1=\int_{\{\lambda\}} |g(t)|^2\,d\mu_{\psi_k}=|g(\lambda)|^2\,\mu_{\psi_k}(\{\lambda\}) $$ and so $\{\lambda\}$ is an atom for $\mu_{\psi_k}$.

Now, whenever there is an eigenvalue $\lambda\in\sigma(A)$, we can fix an orthonormal basis $\{\phi_r\}$ of eigevectors for it, and we can decompose the eigenspace as $\bigoplus_r \mathbb C\,\phi_r$, each invariant for $A$ with cyclic vector. What this means is that in the original decomposition in subspaces $W_j$ we may have all eigenvalues accounted for. In other words, each $\mu_{\psi_j}$ will be either a Dirac delta or a diffuse measure. So, after renaming and changing coefficients, we may write $(1)$ as $$\tag2 \mu=\sum_k\tfrac1{k^2}\delta_{\lambda_k} + \sum_k\tfrac1{k^2}\,\mu_{\psi_k}, $$ where each of the sums could be finite, and the $\mu_{\psi_k}$ are all diffuse.

Now suppose that $A_j=\{\lambda_k\}$. Then the Radon-Nikodym derivative is $c\,1_{\lambda_k}$ for an appropriate $c>0$. Indeed, $$ \int_{\sigma(A_j)}f\,1_{\lambda_k}\,d\mu=f(\lambda_k)\,\mu(\{\lambda_k\})=f(\lambda_k)\,\sum_{j:\ \lambda_j=\lambda_k}\mu_{\psi_j}(\{\lambda_k\})=f(\lambda_k)\,\sum_{j:\ \lambda_j=\lambda_k}\tfrac1{j^2}. $$ In particular you get that $\mu_{\psi_k}$ and $\mu$ are mutually absolutely continuous on $A_j$.

When $A_j$ does not contain any eigenvalue, I know how to salvage the proof, but not with elementary arguments. I'll write them anyway. The result, stated in our context, is that if $\mu_{\psi_k}$ is diffuse, then $L^2(\sigma(A_j),\mu_{\psi_k})$ is isomorphic to $L^2([0,1],m)$ via a unitary that preserves the multiplication operator. Because both $\mu$ and $\mu_{\psi_k}$ are diffuse on $A_j$, we can map from one to the other via $L^2([0,1],m)$. The proof I know of this is about masas in von Neumann algebras. Namely, if $\mathcal A\subset B(H)$ is a diffuse masa, then there exists a unitary $U:H\to L^2[0,1]$ such that $U\mathcal A U^*=L^\infty[0,1]$ acting as multiplication operators (usually this result is quoted as "well-known" in von Neumann algebra literature, so written proofs do not abound; the one I know is Lemma 2.3.6 in Sinclair-Smith's Finite von Neumann Algebras and Masas). At this stage, I'm not sure if all the above can be done without the Spectral Theorem!

Possible significant error in proof of the spectral theorem, Brian C Hall, Quantum Theory for Mathematicians

There are 1 best solutions below

Related Questions in REAL-ANALYSIS

Related Questions in FUNCTIONAL-ANALYSIS

Related Questions in HILBERT-SPACES

Related Questions in SPECTRAL-THEORY

Related Questions in QUANTUM-MECHANICS

Trending Questions

Popular # Hahtags

Popular Questions