Groups, spectra and particles

This post is aimed at students of MAST20022 Group Theory and Linear Algebra.

Introduction

The two big topics of our course — advanced linear algebra and group theory — are unified in deep, elegant and surprising ways by quantum mechanics. In this first post, I’ll give a brief and non-rigorous precis of quantum mechanics, focusing on the spectral theorem. In the next, I’ll discuss how symmetries get represented in quantum systems, and finish by combining these ideas to get the basic building blocks of nature: particles.

Quantum mechanics

Like classical physics, the goal of quantum physics is to describe physical systems and understand how they evolve. In classical physics, the state of the system can generally be specified by a finite number of coordinates. For instance, a single particle moving around in $\mathbb{R}^3$ and subject to a force field $\mathbf{F}(\mathbf{x})$ can be described by a pair of $3$-vectors: position $\mathbf{x}(t)$ and momentum $\mathbf{p}(t)$. The evolution of the system obeys Newton’s second law of motion:

\[\mathbf{F}(\mathbf{x}) = \frac{d}{dt}\mathbf{p}(t).\]

The vectors $\mathbf{x}(t)$ and $\mathbf{p}(t)$ are called dynamical variables since they change with time. In quantum mechanics, the description of the state and its evolution is governed by five rules:

1. The state of a system at time $t$, aka the wavefunction $\Psi(t)$, is a unit vector in some complex inner product space $V$. There are other technical conditions on $V$ we will ignore for the time being.
2. Every physical property of the system corresponds to a Hermitian operator $\hat{A}: V \to V$. (To distinguish them from classical variables, these operators usually have hats.) Physical properties are also called observables, with typical examples being position $\hat{x}$, momentum $\hat{p}$, and energy $\hat{H}$.
3. The state of the system satisfies a differential equation (famously due to Schrödinger) \[i\hbar\frac{d}{dt}\Psi(t) = \hat{H}\Psi(t),\] where $\hbar$ is the reduced Planck constant. In SI units, $\hbar \sim 10^{-34} \text{ J s}$.
4. For an observable $\hat{A}$, the values you can obtain when measuring $\hat{A}$ are precisely the eigenvalues of $\hat{A}$.
5. If you measure $\hat{A}$ at time $t_0$, the system will subsequently be in a (normalised) $\lambda$-eigenstate $v_\lambda$ of $\hat{A}$ with probability \[\mathbb{P}(\Psi \to v_\lambda) = |\langle v_\lambda, \Psi(t_0)\rangle|^2.\]

I'll make a few comments about these rules, and leave further elaboration to a quantum mechanics course. First, recall that Hermitian operators have real eigenvalues. This means that we can only measure real values! This is physically sensible, and in fact, is the motivation for making observables Hermitian in the first place.

Secondly, rule 3 and rule 5 tell totally different stories about how the system evolves. Rule 5, often called the "collapse of the wavefunction", rudely interrupts the Schrödinger evolution and resets the system to an eigenvector of the observable. This seems very strange, and very ugly, and it is.
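To make rules 4 and 5 a bit more concrete, here is a minimal Python sketch for a two-dimensional $V = \mathbb{C}^2$. The observable, its eigenvectors and the state `psi` are all made up for illustration; the only thing taken from the rules is the formula $|\langle v_\lambda, \Psi\rangle|^2$.

```python
import math

def inner(u, v):
    """Inner product <u, v>, conjugate-linear in the first slot."""
    return sum(a.conjugate() * b for a, b in zip(u, v))

def born_probabilities(eigenstates, psi):
    """Rule 5: probability of ending up in each normalised eigenstate."""
    return [abs(inner(v, psi)) ** 2 for v in eigenstates]

# Hypothetical observable A = [[0, 1], [1, 0]] on C^2.
# Rule 4: the possible measurement results are its eigenvalues, +1 and -1.
s = 1 / math.sqrt(2)
eigenvalues = [1.0, -1.0]
eigenstates = [[s, s], [s, -s]]  # normalised eigenvectors for +1 and -1

# Rule 1: the state is a unit vector, since 0.6^2 + 0.8^2 = 1.
psi = [0.6 + 0j, 0.8 + 0j]

probs = born_probabilities(eigenstates, psi)
for lam, p in zip(eigenvalues, probs):
    print(f"measure {lam:+.0f} with probability {p:.2f}")
print("total probability:", round(sum(probs), 12))
```

For this particular state, the result $+1$ is far more likely than $-1$ (roughly $0.98$ versus $0.02$), and the probabilities add up to $1$ because `psi` is a unit vector and the eigenstates form an orthonormal basis.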

Observables and the spectral theorem

Let's talk more about the spectrum of values we can measure for an observable $\hat{A}$. From rule 4, these are just the eigenvalues of $\hat{A}$. (By the way, this is where the name of the spectral theorem comes from.) To begin with, suppose $V$ is finite-dimensional. We can use the second version of the spectral theorem to deduce that there exist Hermitian operators $P_i$ and scalars $\lambda_i$ for $i=1, \ldots, k$, such that:

1. $\lambda_i \neq \lambda_j$ if $i\neq j$;
2. $P_i^2 = P_i$;
3. $\sum_{i=1}^k P_i = 1_V$; and
4. $\sum_{i=1}^k \lambda_i P_i = \hat{A}$.

More concretely, the eigenvalues of $\hat{A}$ are the $\lambda_i$, and the operators $P_i$ project onto the $\lambda_i$-eigenspace. So we are guaranteed to have a nice set of measurable values! Suppose we measure $\lambda_i$, and in addition, the $\lambda_i$-eigenspace is $1$-dimensional, with a normalised eigenvector $v_i$. Then

\[P_i \Psi = \langle v_{i}, \Psi \rangle v_{i} \quad \Longrightarrow \quad \mathbb{P}(\Psi \to v_{i}) = ||P_i \Psi||^2.\]

What if the $\lambda_i$-eigenspace is $>1$-dimensional? This is called degeneracy, and a bit more work is needed, which again I’ll leave to a quantum mechanics course.
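If you'd like to see the finite-dimensional statement in action, the following Python sketch builds the projectors for a small Hermitian matrix and checks that they are idempotent, sum to the identity, and rebuild the operator when weighted by the eigenvalues. The matrix `A`, its eigenvectors and the helper names are my own choices for illustration, not part of the theorem.

```python
import math

def outer(v):
    """Projector v v^dagger onto the line spanned by the unit vector v."""
    return [[a * b.conjugate() for b in v] for a in v]

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def apply(M, v):
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def combine(coeffs, mats):
    """Linear combination sum_i coeffs[i] * mats[i]."""
    n = len(mats[0])
    return [[sum(c * M[i][j] for c, M in zip(coeffs, mats)) for j in range(n)] for i in range(n)]

def close(X, Y, tol=1e-12):
    return all(abs(a - b) < tol for rx, ry in zip(X, Y) for a, b in zip(rx, ry))

# Hypothetical observable with eigenvalues 3 and 1.
A = [[2.0, 1.0], [1.0, 2.0]]
s = 1 / math.sqrt(2)
eigenvalues = [3.0, 1.0]
eigenvectors = [[s, s], [s, -s]]
projectors = [outer(v) for v in eigenvectors]
identity = [[1.0, 0.0], [0.0, 1.0]]

idempotent = all(close(matmul(P, P), P) for P in projectors)        # P_i^2 = P_i
resolves_identity = close(combine([1.0, 1.0], projectors), identity)  # sum P_i = 1_V
rebuilds_A = close(combine(eigenvalues, projectors), A)              # sum lambda_i P_i = A

# Measurement probabilities two ways: ||P_i psi||^2 and |<v_i, psi>|^2.
psi = [0.6, 0.8]
prob_from_projector = [sum(abs(c) ** 2 for c in apply(P, psi)) for P in projectors]
prob_from_overlap = [abs(sum(a.conjugate() * b for a, b in zip(v, psi))) ** 2
                     for v in eigenvectors]

print(idempotent, resolves_identity, rebuilds_A)
print(prob_from_projector, prob_from_overlap)
```

The last two lists agree, which is just the implication $P_i\Psi = \langle v_i, \Psi\rangle v_i \Rightarrow \mathbb{P}(\Psi \to v_i) = ||P_i\Psi||^2$ for non-degenerate eigenvalues.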

Now, in reality, $V$ is almost never finite-dimensional. But the extra condition I mentioned in rule 1 is that $V$ must be a very well-behaved complex inner product space called a Hilbert space. You will learn much more about these in third year. Hilbert spaces have the marvellous property that, even in the infinite-dimensional case, a version of the spectral theorem applies. In fact, this is the motivation for the technical condition on $V$ in formulating quantum mechanics.

So, I’ll close the section by stating the general spectral theorem for Hilbert spaces. Suppose $V$ is a Hilbert space, and $\hat{A}: V \to V$ is a Hermitian operator. Then for each $\lambda \in \mathbb{R}$, there is an operator $E(\lambda)$ satisfying:

1. $E(\lambda)E(\lambda') = E(\min\{\lambda, \lambda'\})$;
2. $\lim_{\lambda\to-\infty} E(\lambda) = 0$ and $\lim_{\lambda\to \infty} E(\lambda) = 1_V$;
3. $\int_{-\infty}^\infty dE(\lambda) = 1_V$; and
4. $\int_{-\infty}^\infty \lambda\, dE(\lambda) = \hat{A}$.

In the finite-dimensional case, let's check this reduces to the familiar spectral theorem. Set
\[
E(\lambda) \equiv \sum_{\lambda_i < \lambda} P_i
\]and interpret the integrals in properties 3 and 4 using
\[
\int_{-\infty}^\infty f(\lambda)\, dE(\lambda) \equiv \sum_{i}f(\lambda_i) P_i.
\]I'll leave it as an exercise to check that this has properties 1-4. In the infinite-dimensional case, we would make the sum an integral, so heuristically we have something like
\[
E(\lambda) = \int_{-\infty}^\lambda P(\lambda') \, d\lambda'.
\]I won't tell you how to integrate an infinite-dimensional operator (this requires functional analysis), but hopefully it doesn't seem too far removed from the finite-dimensional case. The moral is that even in infinite dimensions, we can "diagonalise" a Hermitian operator (write it as a sum of projection operators) and interpret the projection operators in quantum mechanical terms.
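For the finite-dimensional exercise, a quick numerical sanity check can be reassuring before attempting a proof. The Python sketch below uses a made-up diagonal observable on $\mathbb{R}^3$ (so each $P_i$ just picks out one coordinate) and tests the defining properties of $E(\lambda)$ on a handful of sample points. It is an illustration under those assumptions, not a proof.

```python
EIGENVALUES = [1.0, 2.0, 5.0]  # hypothetical spectrum of a diagonal observable
N = len(EIGENVALUES)

def zeros():
    return [[0.0] * N for _ in range(N)]

def projector(i):
    """P_i: projects onto the i-th coordinate axis (the lambda_i-eigenspace)."""
    return [[1.0 if r == c == i else 0.0 for c in range(N)] for r in range(N)]

def add(X, Y):
    return [[X[r][c] + Y[r][c] for c in range(N)] for r in range(N)]

def sub(X, Y):
    return [[X[r][c] - Y[r][c] for c in range(N)] for r in range(N)]

def scale(k, X):
    return [[k * X[r][c] for c in range(N)] for r in range(N)]

def matmul(X, Y):
    return [[sum(X[r][k] * Y[k][c] for k in range(N)) for c in range(N)] for r in range(N)]

def E(lam):
    """Spectral family: sum of P_i over all eigenvalues lambda_i < lam."""
    total = zeros()
    for i, lam_i in enumerate(EIGENVALUES):
        if lam_i < lam:
            total = add(total, projector(i))
    return total

identity = [[1.0 if r == c else 0.0 for c in range(N)] for r in range(N)]

# Property 1, checked on a grid of sample points.
grid = [0.0, 1.0, 1.5, 2.0, 3.0, 5.0, 6.0]
property_1 = all(matmul(E(a), E(b)) == E(min(a, b)) for a in grid for b in grid)

# Property 2: E is zero far to the left and the identity far to the right.
property_2 = E(-1e9) == zeros() and E(1e9) == identity

# Property 4: weight each jump of E by the eigenvalue where it happens.
A = zeros()
for lam_i in EIGENVALUES:
    jump = sub(E(lam_i + 1e-9), E(lam_i))  # equals P_i
    A = add(A, scale(lam_i, jump))

print(property_1, property_2, A)
```

The "jump" of $E$ at each eigenvalue is exactly the projector $P_i$, which is why the integral against $dE(\lambda)$ collapses to a sum over eigenvalues.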

Quantising classical systems

So far I haven't discussed the link between the classical and the quantum description of a system. It turns out you need such a link in order to figure out the operators $\hat{A}$ corresponding to observables; we can't deduce them from the rules alone. The process of going from a classical to a quantum description of the same system is called quantisation. Again, I'll leave most of the details to a quantum mechanics course. However, since the evolution of the system is governed by the energy operator $\hat{H}$ (also called the Hamiltonian) let's see how we calculate it, in principle.

In the classical setup, there is an energy function $H$ which takes the state of the system and spits out its energy (a real number). For the single particle system described above, it's just the sum of kinetic energy (associated with motion) and potential energy (to do with moving around in the force field $\mathbf{F}(\mathbf{x})$). If the force is conservative, i.e. satisfies
\[
\mathbf{F}(\mathbf{x}) = - \nabla V(\mathbf{x})
\]for some scalar potential $V(\mathbf{x})$, and the particle has mass $m$, then the energy function is
\[
H(\mathbf{x}, \mathbf{p}) = \frac{|\mathbf{p}|^2}{2m} + V(\mathbf{x}).
\]The energy operator in the quantum description is related very simply to this: it is just
\[
\hat{H} = \frac{\hat{\mathbf{p}}^2}{2m} + V(\hat{\mathbf{x}}),
\]where $\hat{\mathbf{x}}$ and $\hat{\mathbf{p}}$ are the operators associated to the position and momentum observables. In other words, just replace dynamical variables with their operators! So the usual form of Schrödinger's equation,
\[
i\hbar \frac{d}{dt}\Psi = -\frac{\hbar^2}{2m}\nabla^2\Psi + V\Psi
\]just comes from replacing $\mathbf{p}$ with its quantum counterpart $\hat{\mathbf{p}} = -i\hbar \nabla$. (It's not at all obvious that momentum should become a differential operator, but it turns out to be right. Consult your local quantum mechanics course for details on how to find $\hat{\mathbf{x}}$ and $\hat{\mathbf{p}}$.)
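One way to convince yourself that $-i\hbar\nabla$ behaves like momentum is to apply it to a plane wave $e^{ikx}$ and see that it simply multiplies the wave by $\hbar k$. Here is a rough one-dimensional Python sketch using finite differences. The units ($\hbar = m = 1$), the wavenumber and the step sizes are assumptions chosen for convenience.

```python
import cmath

HBAR = 1.0  # assumed units in which hbar = 1
MASS = 1.0  # assumed unit mass

def momentum(psi, x, h=1e-5):
    """Apply p = -i hbar d/dx to psi at x, using a central difference."""
    return -1j * HBAR * (psi(x + h) - psi(x - h)) / (2 * h)

def kinetic(psi, x, h=1e-4):
    """Apply p^2 / 2m = -(hbar^2 / 2m) d^2/dx^2, using a second difference."""
    second = (psi(x + h) - 2 * psi(x) + psi(x - h)) / h ** 2
    return -(HBAR ** 2) / (2 * MASS) * second

k = 2.0  # hypothetical wavenumber

def plane_wave(x):
    return cmath.exp(1j * k * x)

x0 = 0.3  # any sample point will do
p_eigenvalue = momentum(plane_wave, x0) / plane_wave(x0)
e_eigenvalue = kinetic(plane_wave, x0) / plane_wave(x0)

print("p psi / psi  ~", p_eigenvalue)  # close to hbar * k
print("H psi / psi  ~", e_eigenvalue)  # close to (hbar * k)^2 / (2 m), with V = 0
```

Up to discretisation error, the plane wave is an eigenvector of $\hat{p}$ with eigenvalue $\hbar k$, and of the free Hamiltonian ($V = 0$) with eigenvalue $\hbar^2 k^2 / 2m$, matching the classical formula $|\mathbf{p}|^2/2m$.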

Classical symmetries

Finally, it's time for group theory to make an entrance. Recall that we can describe a classical particle in 3D using position $\mathbf{x}$ and momentum $\mathbf{p}$. We can smush them together into a vector $v \equiv (\mathbf{x}, \mathbf{p}) \in \mathbb{R}^3\times\mathbb{R}^3$. The combined position-momentum space $M \equiv \mathbb{R}^3\times\mathbb{R}^3$ is called phase space. A symmetry of the system will be an invertible transformation of phase space $T: M \to M$ that leaves it "invariant", that is, unchanged with respect to an equivalence relation of our choice. We choose "invariant" to mean "the energy of the system isn't changed by $T$, whatever state it's in".
Formally, for all $v \in M$,
\[
H(Tv) = H(v).
\]The collection of all the symmetries forms a group $\mathcal{G}$, as you can check. [Technical comment: The evolution of the system can be related to the derivatives of the energy function by Hamilton's equations. This guarantees that the physics will be the same under a symmetry transformation, given the way we've defined it.]
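As a concrete instance of the condition $H(Tv) = H(v)$, the Python sketch below takes a particle in a central potential and compares a rotation (which should be a symmetry) with a translation (which should not). The potential $V(\mathbf{x}) = |\mathbf{x}|^2/2$, the mass and the sample state are hypothetical choices.

```python
import math

MASS = 2.0  # assumed mass

def hamiltonian(x, p):
    """H(x, p) = |p|^2 / 2m + V(x), with the assumed potential V = |x|^2 / 2."""
    kinetic = sum(c * c for c in p) / (2 * MASS)
    potential = 0.5 * sum(c * c for c in x)
    return kinetic + potential

def rotate_z(vec, theta):
    """Rotate a 3-vector about the z-axis by angle theta."""
    c, s = math.cos(theta), math.sin(theta)
    x, y, z = vec
    return (c * x - s * y, s * x + c * y, z)

def rotation(state, theta):
    """Transformation of phase space: rotate position and momentum together."""
    x, p = state
    return rotate_z(x, theta), rotate_z(p, theta)

def translation(state, shift):
    """Transformation of phase space: shift position, leave momentum alone."""
    x, p = state
    return tuple(a + b for a, b in zip(x, shift)), p

state = ((1.0, 2.0, -0.5), (0.3, -1.0, 4.0))  # a point v = (x, p) in M

energy = hamiltonian(*state)
rotated_energy = hamiltonian(*rotation(state, 0.7))
shifted_energy = hamiltonian(*translation(state, (1.0, 0.0, 0.0)))

print("H(v)            =", energy)
print("H(rotated v)    =", rotated_energy)   # same as H(v)
print("H(translated v) =", shifted_energy)   # different from H(v)
```

Checking one state is of course not a proof that rotations are symmetries (the definition asks for all $v \in M$), but a single state where the energy changes is enough to rule translations out for this potential.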

Some examples

Nature seems to favour groups which are simple enough for us to figure out, but mathematically non-trivial. A few physically important examples:

rotations and reflections of $\mathbb{R}^n$, also known as the orthogonal group $\mathrm{O}(n)$, which arise in systems with rotational invariance;
the complex analogue, "rotations" of $\mathbb{C}^n$, also known as the unitary group $\mathrm{U}(n)$, which often come from conservation of probability in quantum mechanics;
translations, spatial rotations, and Newtonian "boosts" of $\mathbb{R}^4$, also called the Galilean group, which connect (Newtonian) physics in different inertial frames;
rotations and relativistic "boosts" of $\mathbb{R}^4$, also called the Lorentz group $\mathrm{SO}(1,3)$, which connect inertial frames in relativistic mechanics;
the Poincaré group, which just adds translations to the Lorentz group.

Note that "translations and rotations" includes pure translations, pure rotations, and combinations of the two; the same remark applies to boosts.

Quantum symmetries and representations

Now, suppose we want to quantise a classical system with symmetry group $G$. Symmetries are not always preserved when we quantise, but if they are, what should they look like? Well, before $G$ acted on the phase space $M$. Now it should act on the Hilbert space of the quantum theory, $G \curvearrowright V$. Since $V$ is a vector space, and we want symmetries to preserve the vector space structure, they should act as linear transformations.

Thus, each group element $T \in G$ should be assigned a matrix, $\rho(T) \in \mathrm{GL}(V)$, where $\mathrm{GL}(V)$ denotes the invertible linear operators on $V$. To ensure that these matrices define a group action, we require the matrix assignment function $\rho: G \to \mathrm{GL}(V)$ to be a homomorphism:
\[
\rho(T\cdot S) = \rho(T) \cdot\rho(S).
\]Since "matrix assignment homomorphism" is a bit of a mouthful, $\rho$ is instead called a representation of $G$. In fact, we've already seen representations in the guise of matrix groups, e.g., the general linear group $\mathrm{GL}_n(\mathbb{F})$, the orthogonal group $\mathrm{O}(n)$, and the unitary group $\mathrm{U}(n)$. Finally, instead of leaving the energy invariant, the closest condition in quantum mechanics is that $\rho(G)$ not interfere with the measurement of energy. More formally, for every $g \in G$, $\rho(g)$ should commute with the energy operator $\hat{H}$:
\[
[\hat{H}, \rho(g)] = \hat{H} \rho(g) - \rho(g)\hat{H} = 0.
\]Note that a group $G$ may act on $V$ without commuting with $\hat{H}$, but it is not then a symmetry of the system.
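Here is a small Python sketch of both conditions, the homomorphism property and commuting with $\hat{H}$, for a toy example. I've assumed the cyclic group $\mathbb{Z}_3$ acting on $\mathbb{C}^3$ by cyclically shifting basis vectors, and two made-up Hamiltonians: one that treats all three basis states alike, and one that doesn't.

```python
def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def rho(g):
    """Hypothetical representation of Z_3 = {0, 1, 2} (addition mod 3):
    rho(g) sends the basis vector e_j to e_{(j + g) mod 3}."""
    return [[1 if i == (j + g) % 3 else 0 for j in range(3)] for i in range(3)]

def commutator(X, Y):
    """[X, Y] = XY - YX."""
    XY, YX = matmul(X, Y), matmul(Y, X)
    return [[XY[i][j] - YX[i][j] for j in range(3)] for i in range(3)]

ZERO = [[0] * 3 for _ in range(3)]
GROUP = range(3)

# Homomorphism: rho(g + h) = rho(g) rho(h) for all group elements.
is_homomorphism = all(
    rho((g + h) % 3) == matmul(rho(g), rho(h)) for g in GROUP for h in GROUP
)

# A made-up Hamiltonian that treats the three basis states identically...
H_symmetric = [[2, -1, -1], [-1, 2, -1], [-1, -1, 2]]
# ...and one that gives each basis state a different energy.
H_lopsided = [[1, 0, 0], [0, 2, 0], [0, 0, 3]]

is_symmetry = all(commutator(H_symmetric, rho(g)) == ZERO for g in GROUP)
is_broken = any(commutator(H_lopsided, rho(g)) != ZERO for g in GROUP)

print(is_homomorphism, is_symmetry, is_broken)
```

In both cases $\rho$ is a perfectly good representation of $\mathbb{Z}_3$ on $V$. It only counts as a symmetry for the first Hamiltonian, since the shifts fail to commute with the second.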

Particles

If the matrices $\rho(G) \subset \mathrm{GL}(V)$ have a common invariant subspace, the representation $\rho$ is reducible. Just to remind you, an invariant subspace of a linear operator $L: V \to V$ is a nontrivial, proper subspace $W \subset V$ such that $L(W) \subset W$. A representation which is not reducible is irreducible. In nice cases (for instance, representations of a finite group), we can think of a reducible representation as a set of block diagonal matrices with compatible block structures. In other words, in a suitable basis, every $\rho(g)$ splits into diagonal blocks $A_1(g), A_2(g), \ldots, A_n(g)$, where each $A_i$ is a representation on an $m_i$-dimensional subspace of $V$:
\[
\rho(g) = \left[
\begin{array}{cccc}
A_1(g)&&&\\
&A_2(g)&&\\
&&\ddots&\\
&&&A_n(g)
\end{array}
\right].
\]We can keep breaking a reducible representation down into blocks until we can't go any further. In many cases (for instance, a finite group $G$), this process will lead to a unique decomposition into irreducible blocks.
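To see a decomposition like this in a small case, consider the symmetric group $S_3$ permuting the coordinates of a $3$-dimensional space. This example and the helper names are my own choice. The Python sketch below checks that the line spanned by $(1,1,1)$ and the plane of vectors whose coordinates sum to zero are both invariant, and then reads off the $2\times 2$ block acting on the plane.

```python
from itertools import permutations

def perm_matrix(perm):
    """Matrix sending the basis vector e_j to e_{perm[j]}."""
    n = len(perm)
    return [[1 if perm[j] == i else 0 for j in range(n)] for i in range(n)]

def apply(M, v):
    return [sum(m * x for m, x in zip(row, v)) for row in M]

# All six elements of S_3, represented as 3x3 permutation matrices.
group = [perm_matrix(p) for p in permutations(range(3))]

ones = [1, 1, 1]                            # spans a 1-dimensional subspace
zero_sum_basis = [[1, -1, 0], [0, 1, -1]]   # spans the plane v1 + v2 + v3 = 0

# Every group element fixes (1, 1, 1) ...
line_is_invariant = all(apply(M, ones) == ones for M in group)
# ... and maps zero-sum vectors to zero-sum vectors.
plane_is_invariant = all(
    sum(apply(M, w)) == 0 for M in group for w in zero_sum_basis
)

def coords(u):
    """Coordinates (a, b) of a zero-sum vector u = a*(1,-1,0) + b*(0,1,-1)."""
    return [u[0], u[0] + u[1]]

# For each group element, the columns of its 2x2 block on the plane
# (each inner list is one column).
blocks = [[coords(apply(M, w)) for w in zero_sum_basis] for M in group]

print(line_is_invariant, plane_is_invariant)
for columns in blocks:
    print(columns)
```

In the basis $(1,1,1), (1,-1,0), (0,1,-1)$, every permutation matrix therefore becomes block diagonal with a $1\times 1$ block (always equal to $1$) and one of the $2\times 2$ blocks printed above. The $2$-dimensional block is the standard representation of $S_3$, which is known to be irreducible, so the process stops there.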

What is the physical significance of these blocks? Well, in the classical case, orbits of the symmetry group $G$ are things which look the same. I could be in one state, or any of the other states connected by symmetries, and the energy function can't tell. Since they are physically indistinguishable, they are probably related. When we quantise, the irreducible subspaces are the equivalent of these orbits of indistinguishable states. They consist of vectors which are mixed together by the symmetries (acting as linear operators on $V$), and which cannot be split into smaller sets of vectors which mix amongst themselves. Generally, vectors correspond to degrees of freedom of our system. But a lump of inextricably linked degrees of freedom has a natural interpretation: a particle! So, a particle is an irreducible representation $A$ of $G$ on a $k$-dimensional subspace
\[
A(g) = \overset{\text{mixed together under $G$}}{\overbrace{\left[
\begin{array}{ccc}
a_{11}(g)&\cdots&a_{1k}(g)\\
\vdots&\ddots&\vdots\\
a_{k1}(g)&\cdots&a_{kk}(g)
\end{array}
\right]}}.
\]But what group $G$ should we use? When we say that an electron is a particle, we mean it is a fundamental degree of freedom of the universe. The fundamental symmetry group of spacetime (provided we can ignore gravity) is the Poincaré group of special relativity. Thus, a fundamental particle is usually defined as an irreducible representation of the Poincaré group.

Summary

We started by defining a classical symmetry as a transformation of phase space that always left the energy invariant. We then argued that in quantising, symmetries should act as linear transformations on the Hilbert space of the quantum theory which commute with the energy operator. Finally, we saw that irreducible subspaces (under the representation) are just directions in Hilbert space that inextricably mix under symmetries. We interpret them as the degrees of freedom of a particle. For physical reasons, we are often thinking specifically of the Poincaré group of special relativity. So, that completes our GTALA-motivated crash course in quantum mechanics and particle physics. Hope you learnt something!

References

Lie Algebras in Particle Physics (1982), Howard Georgi.
Mathematics of classical and quantum physics (1969), Byron and Fuller.
The Quantum Theory of Fields: Volume 1 (2005), Steven Weinberg.

Written on October 6, 2015