Perturbation Theory
When exact solutions to the SchrΓΆdinger equation are unavailable β as is the case for virtually every system beyond the hydrogen atom β perturbation theory provides a systematic, order-by-order framework for constructing approximate solutions. Starting from a solvable reference Hamiltonian, perturbation theory builds corrections to both energies and wavefunctions, yielding some of the most important quantitative predictions in quantum chemistry and physics.
1. Introduction: When Exact Solutions Don't Exist
The SchrΓΆdinger equation can be solved exactly for only a handful of systems: the particle in a box, the harmonic oscillator, the rigid rotor, and the hydrogen atom. For any system with two or more interacting electrons, or for a one-electron system in an external field, we must resort to approximation methods.
Perturbation theory addresses the following scenario: suppose we can write the full Hamiltonian as
$\hat{H} = \hat{H}^{(0)} + \lambda \hat{H}'$
where $\hat{H}^{(0)}$ is a "zeroth-order" Hamiltonian whose eigenstates and eigenvalues are known exactly, $\hat{H}'$ is a "perturbation" representing the additional interaction, and $\lambda$ is a dimensionless parameter that tracks the order of the correction. The key assumption is that $\hat{H}'$ is "small" compared to $\hat{H}^{(0)}$, so that the exact eigenstates and eigenvalues can be expanded as power series in $\lambda$.
The power of perturbation theory lies in its generality: it applies whenever the problem can be decomposed into a solvable part plus a small correction. This includes atoms in external fields (Stark and Zeeman effects), electron-electron repulsion in helium, anharmonic corrections to molecular vibrations, spin-orbit coupling, and the intermolecular interactions that give rise to van der Waals forces.
Central idea: If the perturbation is small, the first few terms of the expansion provide an excellent approximation, and systematic improvement is possible by going to higher order.
2. Non-Degenerate Rayleigh-SchrΓΆdinger Perturbation Theory
2.1 Setting Up the Expansion
We assume that the unperturbed Hamiltonian $\hat{H}^{(0)}$ has a complete set of orthonormal eigenstates $|\psi_n^{(0)}\rangle$ with eigenvalues $E_n^{(0)}$:
$\hat{H}^{(0)} |\psi_n^{(0)}\rangle = E_n^{(0)} |\psi_n^{(0)}\rangle$
and that the state of interest $|\psi_k^{(0)}\rangle$ is non-degenerate, meaning$E_k^{(0)} \neq E_n^{(0)}$ for all $n \neq k$. We expand both the exact eigenvalue and eigenstate as power series in $\lambda$:
$E_k = E_k^{(0)} + \lambda E_k^{(1)} + \lambda^2 E_k^{(2)} + \lambda^3 E_k^{(3)} + \cdots$
$|\psi_k\rangle = |\psi_k^{(0)}\rangle + \lambda |\psi_k^{(1)}\rangle + \lambda^2 |\psi_k^{(2)}\rangle + \cdots$
Here $E_k^{(1)}$ is the first-order energy correction, $E_k^{(2)}$ is the second-order correction, and so forth. We adopt the intermediate normalization convention:$\langle \psi_k^{(0)} | \psi_k \rangle = 1$, which implies$\langle \psi_k^{(0)} | \psi_k^{(j)} \rangle = 0$ for all $j \geq 1$.
2.2 Collecting Terms by Powers of Ξ»
Substituting the expansions into the full SchrΓΆdinger equation$(\hat{H}^{(0)} + \lambda \hat{H}')|\psi_k\rangle = E_k |\psi_k\rangle$ and collecting terms order by order in $\lambda$:
Order $\lambda^0$:Β $\hat{H}^{(0)} |\psi_k^{(0)}\rangle = E_k^{(0)} |\psi_k^{(0)}\rangle$β(the unperturbed equation)
Order $\lambda^1$:Β $\hat{H}^{(0)} |\psi_k^{(1)}\rangle + \hat{H}' |\psi_k^{(0)}\rangle = E_k^{(0)} |\psi_k^{(1)}\rangle + E_k^{(1)} |\psi_k^{(0)}\rangle$
Order $\lambda^2$:Β $\hat{H}^{(0)} |\psi_k^{(2)}\rangle + \hat{H}' |\psi_k^{(1)}\rangle = E_k^{(0)} |\psi_k^{(2)}\rangle + E_k^{(1)} |\psi_k^{(1)}\rangle + E_k^{(2)} |\psi_k^{(0)}\rangle$
2.3 First-Order Energy Correction
From the first-order equation, project onto $\langle \psi_k^{(0)} |$ from the left:
$\langle \psi_k^{(0)} | \hat{H}^{(0)} | \psi_k^{(1)} \rangle + \langle \psi_k^{(0)} | \hat{H}' | \psi_k^{(0)} \rangle = E_k^{(0)} \langle \psi_k^{(0)} | \psi_k^{(1)} \rangle + E_k^{(1)}$
Since $\hat{H}^{(0)}$ is Hermitian,$\langle \psi_k^{(0)} | \hat{H}^{(0)} | \psi_k^{(1)} \rangle = E_k^{(0)} \langle \psi_k^{(0)} | \psi_k^{(1)} \rangle$. The terms involving $\langle \psi_k^{(0)} | \psi_k^{(1)} \rangle$ cancel on both sides (by intermediate normalization, this inner product is zero). We obtain the celebrated result:
$\boxed{E_k^{(1)} = \langle \psi_k^{(0)} | \hat{H}' | \psi_k^{(0)} \rangle}$
The first-order energy correction is the expectation value of the perturbation in the unperturbed state.
2.4 First-Order Wavefunction Correction
To find $|\psi_k^{(1)}\rangle$, we project the first-order equation onto$\langle \psi_n^{(0)} |$ with $n \neq k$. Since$\langle \psi_n^{(0)} | \hat{H}^{(0)} = E_n^{(0)} \langle \psi_n^{(0)} |$:
$E_n^{(0)} \langle \psi_n^{(0)} | \psi_k^{(1)} \rangle + \langle \psi_n^{(0)} | \hat{H}' | \psi_k^{(0)} \rangle = E_k^{(0)} \langle \psi_n^{(0)} | \psi_k^{(1)} \rangle$
Solving for the expansion coefficient $\langle \psi_n^{(0)} | \psi_k^{(1)} \rangle$:
$\langle \psi_n^{(0)} | \psi_k^{(1)} \rangle = \frac{\langle \psi_n^{(0)} | \hat{H}' | \psi_k^{(0)} \rangle}{E_k^{(0)} - E_n^{(0)}}$
Since $|\psi_k^{(1)}\rangle$ can be expanded in the complete basis of unperturbed states, and using intermediate normalization ($\langle \psi_k^{(0)} | \psi_k^{(1)} \rangle = 0$):
$\boxed{|\psi_k^{(1)}\rangle = \sum_{n \neq k} \frac{\langle \psi_n^{(0)} | \hat{H}' | \psi_k^{(0)} \rangle}{E_k^{(0)} - E_n^{(0)}} |\psi_n^{(0)}\rangle}$
Note: this requires $E_k^{(0)} \neq E_n^{(0)}$ β the non-degeneracy condition.
2.5 Second-Order Energy Correction
From the second-order equation, projecting onto $\langle \psi_k^{(0)} |$:
$\langle \psi_k^{(0)} | \hat{H}' | \psi_k^{(1)} \rangle = E_k^{(2)}$
Substituting the expression for $|\psi_k^{(1)}\rangle$ derived above:
$\boxed{E_k^{(2)} = \sum_{n \neq k} \frac{|\langle \psi_n^{(0)} | \hat{H}' | \psi_k^{(0)} \rangle|^2}{E_k^{(0)} - E_n^{(0)}}}$
The second-order energy requires knowledge of all unperturbed states, not just the state of interest.
Important properties of $E_k^{(2)}$:
- β’ For the ground state ($k = 0$), every denominator$E_0^{(0)} - E_n^{(0)} < 0$ (since $E_0^{(0)}$ is the lowest energy), so $E_0^{(2)} \leq 0$. The second-order correction always lowers the ground-state energy.
- β’ States that are close in energy to state $k$ contribute most strongly (small denominator).
- β’ States not connected by the perturbation ($\langle \psi_n^{(0)} | \hat{H}' | \psi_k^{(0)} \rangle = 0$) do not contribute, regardless of their energy.
3. Degenerate Perturbation Theory
3.1 The Problem with Degeneracy
When two or more unperturbed states share the same energy, $E_k^{(0)} = E_l^{(0)}$, the denominator $E_k^{(0)} - E_l^{(0)}$ in the first-order wavefunction vanishes, and non-degenerate perturbation theory breaks down. The physical reason is clear: when states are degenerate, any linear combination of them is also an eigenstate of $\hat{H}^{(0)}$, and the perturbation must "choose" the correct linear combinations.
3.2 The Secular Equation
Suppose states $|\psi_1^{(0)}\rangle, |\psi_2^{(0)}\rangle, \ldots, |\psi_d^{(0)}\rangle$ are$d$-fold degenerate with energy $E^{(0)}$. We seek the correct zeroth-order states as linear combinations:
$|\phi\rangle = \sum_{i=1}^{d} c_i |\psi_i^{(0)}\rangle$
Substituting into the first-order equation and projecting onto each $\langle \psi_j^{(0)} |$within the degenerate subspace yields the secular equation:
$\sum_{i=1}^{d} \left( H'_{ji} - E^{(1)} \delta_{ji} \right) c_i = 0$
where $H'_{ji} = \langle \psi_j^{(0)} | \hat{H}' | \psi_i^{(0)} \rangle$. This is an eigenvalue problem: diagonalize the perturbation matrix within the degenerate subspace. The eigenvalues give the first-order corrections $E^{(1)}$, and the eigenvectors give the correct zeroth-order states.
$\boxed{\det \left( \mathbf{H}' - E^{(1)} \mathbf{I} \right) = 0}$
The secular determinant: a $d \times d$ eigenvalue problem within the degenerate subspace.
3.3 Procedure
- Identify the degenerate subspace: all states with $E_i^{(0)} = E^{(0)}$.
- Construct the $d \times d$ matrix of the perturbation $H'_{ij}$ within this subspace.
- Diagonalize $\mathbf{H}'$ to find eigenvalues $E_\alpha^{(1)}$ and eigenvectors $|\phi_\alpha\rangle$.
- The correct zeroth-order states are the eigenvectors $|\phi_\alpha\rangle$.
- The first-order energies are $E_\alpha = E^{(0)} + \lambda E_\alpha^{(1)}$.
- If the degeneracy is fully lifted, higher-order corrections follow from non-degenerate PT using the new basis.
4. The Stark Effect in Hydrogen (n = 2)
4.1 Setup
The Stark effect β the splitting of spectral lines in an external electric field β is the classic application of degenerate perturbation theory. Consider the $n = 2$ level of hydrogen, which is 4-fold degenerate (ignoring spin):
$|2,0,0\rangle, \quad |2,1,0\rangle, \quad |2,1,1\rangle, \quad |2,1,-1\rangle$
All four states have energy $E_2^{(0)} = -13.6/4 = -3.4$ eV.
The perturbation due to a uniform electric field $\mathcal{E}$ along the $z$-axis is:
$\hat{H}' = e\mathcal{E}z = e\mathcal{E}r\cos\theta$
4.2 Selection Rules and Matrix Elements
Since $z = r\cos\theta$ is odd under parity, the perturbation connects only states of opposite parity. Furthermore, $\cos\theta \propto Y_1^0$, so the selection rules are:
- β’ $\Delta l = \pm 1$ (parity selection rule)
- β’ $\Delta m = 0$ (since $\hat{H}'$ has no $\phi$-dependence)
The only nonzero matrix element within the $n = 2$ subspace is:
$\langle 2,0,0 | \hat{H}' | 2,1,0 \rangle = e\mathcal{E} \langle 2,0,0 | r\cos\theta | 2,1,0 \rangle = -3e\mathcal{E}a_0$
This matrix element is computed by evaluating the radial integral$\int_0^\infty R_{20}(r) \, r \, R_{21}(r) \, r^2 \, dr = -3\sqrt{6} \, a_0$ (using the explicit hydrogen radial wavefunctions) and the angular integral$\int Y_0^0 \cos\theta \, Y_1^0 \, d\Omega = 1/\sqrt{3}$.
4.3 The Perturbation Matrix and Diagonalization
In the basis $\{|2,0,0\rangle, |2,1,0\rangle, |2,1,1\rangle, |2,1,-1\rangle\}$, the perturbation matrix is:
$\mathbf{H}' = \begin{pmatrix} 0 & -3e\mathcal{E}a_0 & 0 & 0 \\ -3e\mathcal{E}a_0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix}$
The secular equation $\det(\mathbf{H}' - E^{(1)}\mathbf{I}) = 0$ yields:
$(E^{(1)})^2 \left[ (E^{(1)})^2 - 9e^2\mathcal{E}^2 a_0^2 \right] = 0$
The four eigenvalues are:
$\boxed{E^{(1)} = +3e\mathcal{E}a_0, \quad 0, \quad 0, \quad -3e\mathcal{E}a_0}$
The linear Stark effect: splitting proportional to the electric field strength.
The correct zeroth-order states corresponding to the nonzero eigenvalues are:
$|\phi_{\pm}\rangle = \frac{1}{\sqrt{2}} \left( |2,0,0\rangle \mp |2,1,0\rangle \right)$
The states $|2,1,\pm1\rangle$ remain degenerate and unshifted to first order. The 4-fold degenerate level splits into 3 levels, with the middle level remaining doubly degenerate. This is the linear Stark effect, characteristic of hydrogen where states of different$l$ but the same $n$ are degenerate.
5. Ground State Polarizability
5.1 Second-Order Energy in an Electric Field
For the non-degenerate ground state of an atom in a uniform electric field, the first-order energy correction vanishes by parity ($\langle 0 | z | 0 \rangle = 0$ for states of definite parity). The leading effect comes from the second-order correction:
$E^{(2)} = e^2 \mathcal{E}^2 \sum_{n \neq 0} \frac{|\langle n | z | 0 \rangle|^2}{E_0 - E_n}$
This energy shift is proportional to $\mathcal{E}^2$, which defines the static electric polarizability $\alpha$ through the relation$E^{(2)} = -\frac{1}{2}\alpha \mathcal{E}^2$:
$\boxed{\alpha = 2e^2 \sum_{n \neq 0} \frac{|\langle n | z | 0 \rangle|^2}{E_n - E_0} = 2 \sum_{n \neq 0} \frac{|\langle n | \mu_z | 0 \rangle|^2}{E_n - E_0}}$
where $\mu_z = ez$ is the $z$-component of the dipole moment operator.
5.2 Closure Approximation for Hydrogen
For the hydrogen atom ground state, the exact sum over all excited states (including the continuum) is difficult to evaluate directly. The closure approximation replaces all energy denominators by an average excitation energy $\Delta E$:
$\alpha \approx \frac{2e^2}{\Delta E} \sum_{n \neq 0} |\langle n | z | 0 \rangle|^2 = \frac{2e^2}{\Delta E} \left( \langle 0 | z^2 | 0 \rangle - |\langle 0 | z | 0 \rangle|^2 \right)$
where we used completeness. For the hydrogen 1s state, $\langle 0 | z | 0 \rangle = 0$ by parity and$\langle 0 | z^2 | 0 \rangle = \langle 0 | r^2 \cos^2\theta | 0 \rangle = \frac{1}{3}\langle r^2 \rangle_{1s} = a_0^2$. Using $\Delta E \approx E_2 - E_1 = 10.2$ eV as a lower bound for the average excitation energy:
$\alpha \approx \frac{2e^2 a_0^2}{\Delta E} \approx 4 a_0^3 \cdot \frac{4\pi\epsilon_0}{e^2/(2a_0)} \cdot \frac{2 \cdot 13.6}{10.2} \approx 5.3 \, a_0^3 \cdot 4\pi\epsilon_0$
The exact result, obtained by Epstein (1926) using parabolic coordinates, is:
$\boxed{\alpha = \frac{9}{2} a_0^3 \cdot 4\pi\epsilon_0 = 4.5 \, a_0^3 \cdot 4\pi\epsilon_0}$
Exact polarizability of the hydrogen atom ground state.
The closure approximation overestimates the polarizability because $\Delta E$ was taken as the minimum excitation energy rather than a proper average. Nevertheless, it provides a useful estimate and correctly captures the $a_0^3$ scaling.
6. Applications
Molecular Polarizabilities
Second-order perturbation theory predicts how molecules respond to external electric fields. The polarizability tensor $\alpha_{ij}$ governs the induced dipole moment$\mu_i^{\text{ind}} = \sum_j \alpha_{ij} \mathcal{E}_j$ and determines refractive indices, dielectric constants, and Raman scattering intensities. Larger molecules with low-lying excited states tend to have larger polarizabilities.
Van der Waals Interactions
London dispersion forces arise from second-order perturbation theory applied to the instantaneous dipole-dipole interaction between two atoms or molecules. The resulting attractive potential scales as $V(R) \sim -C_6/R^6$, where $C_6$ depends on the polarizabilities and ionization energies of both species. These ubiquitous forces are responsible for the condensation of noble gases and much of the cohesion in molecular crystals.
Zeeman Effect
The interaction of an atom with an external magnetic field, $\hat{H}' = -\hat{\boldsymbol{\mu}} \cdot \mathbf{B}$, splits degenerate $m_l$ and $m_s$ sublevels. In the weak-field (anomalous) Zeeman effect, degenerate perturbation theory within the $J$ manifold yields energies $E_m = g_J \mu_B B m_J$, where $g_J$ is the LandΓ© g-factor.
Spin-Orbit Coupling
The relativistic interaction between an electron's spin and its orbital angular momentum,$\hat{H}_{\text{SO}} = \xi(r)\hat{\mathbf{L}} \cdot \hat{\mathbf{S}}$, is treated as a perturbation to the non-relativistic Hamiltonian. This lifts the degeneracy of states with the same $L$ and $S$ but different $J$, producing the fine structure of atomic spectra. The splitting follows the LandΓ© interval rule.
MΓΈller-Plesset Perturbation Theory in Quantum Chemistry
In modern computational chemistry, MΓΈller-Plesset (MP) perturbation theory uses the Hartree-Fock Hamiltonian as $\hat{H}^{(0)}$ and the difference between the exact electron-electron repulsion and the mean-field HF potential as $\hat{H}'$. The second-order correction (MP2) captures a large fraction of the electron correlation energy at a cost scaling as $\mathcal{O}(N^5)$ with system size, making it one of the most widely used post-Hartree-Fock methods for computing molecular energies, geometries, and properties.
7. Historical Context
Lord Rayleigh develops perturbation methods for vibrating systems in classical mechanics, establishing the mathematical framework that would later be adapted for quantum theory. His approach of expanding solutions in power series of a small parameter laid the foundation for what would become Rayleigh-SchrΓΆdinger perturbation theory.
Johannes Stark observes the splitting of hydrogen spectral lines in an electric field, providing the experimental phenomenon that would become one of perturbation theory's greatest triumphs when explained quantum mechanically.
Erwin SchrΓΆdinger formulates the quantum mechanical version of perturbation theory in his foundational papers on wave mechanics. He derives the formulas for energy and wavefunction corrections and applies them to the Stark effect, reproducing Epstein's earlier old-quantum-theory results within the new framework.
Brillouin and Wigner independently develop an alternative formulation of perturbation theory in which the exact energy appears in the denominators rather than the unperturbed energy. Brillouin-Wigner perturbation theory is not size-extensive but is conceptually simpler and converges differently from the Rayleigh-SchrΓΆdinger form.
Christian MΓΈller and Milton Plesset apply Rayleigh-SchrΓΆdinger perturbation theory to many-electron atoms and molecules using the Hartree-Fock solution as the reference state. Their formulation (MΓΈller-Plesset perturbation theory, or MP theory) remains one of the most important methods in computational quantum chemistry, with the MP2 approximation becoming a standard tool for including electron correlation.
8. Python Simulation: Anharmonic Oscillator
The following simulation computes energy corrections for an anharmonic oscillator with potential$V(x) = \frac{1}{2}x^2 + \lambda x^4$ using perturbation theory (first and second order) and compares them against exact diagonalization in a truncated harmonic oscillator basis. The matrix elements of $x^4$ are computed exactly using the creation and annihilation operator representation. Observe how perturbation theory becomes progressively less accurate as $\lambda$ increases.
Anharmonic Oscillator: Perturbation Theory vs Exact Diagonalization
PythonCompares first-order and second-order perturbation theory energy corrections with exact eigenvalues for the anharmonic oscillator V = x^2/2 + lambda*x^4.
Click Run to execute the Python code
Code will be executed with Python 3 on the server
9. Fortran Simulation: Stark Effect
This Fortran program computes the energy splittings of the hydrogen $n = 2$ states as a function of the external electric field strength. The 4Γ4 perturbation matrix is constructed using the analytically known matrix element $\langle 2,0,0 | eEz | 2,1,0 \rangle = -3ea_0\mathcal{E}$and diagonalized using Jacobi rotations. The output confirms the linear Stark splitting into three distinct energy levels.
Stark Effect: Hydrogen n=2 Energy Splittings
FortranComputes energy levels of the hydrogen n=2 states as a function of electric field strength, demonstrating the linear Stark effect via degenerate perturbation theory.
Click Run to execute the Fortran code
Code will be compiled with gfortran and executed on the server
Key Equations Summary
| Quantity | Expression |
|---|---|
| First-order energy | $E_k^{(1)} = \langle \psi_k^{(0)} | \hat{H}' | \psi_k^{(0)} \rangle$ |
| First-order wavefunction | $|\psi_k^{(1)}\rangle = \sum_{n \neq k} \frac{\langle \psi_n^{(0)} | \hat{H}' | \psi_k^{(0)} \rangle}{E_k^{(0)} - E_n^{(0)}} |\psi_n^{(0)}\rangle$ |
| Second-order energy | $E_k^{(2)} = \sum_{n \neq k} \frac{|\langle \psi_n^{(0)} | \hat{H}' | \psi_k^{(0)} \rangle|^2}{E_k^{(0)} - E_n^{(0)}}$ |
| Degenerate PT secular eq. | $\det(\mathbf{H}' - E^{(1)} \mathbf{I}) = 0$ |
| Stark splitting (n=2) | $E^{(1)} = \pm 3e\mathcal{E}a_0, \; 0$ |
| Polarizability | $\alpha = 2 \sum_{n \neq 0} \frac{|\langle n | \mu_z | 0 \rangle|^2}{E_n - E_0}$ |