Part III: Experimental Methods | Chapter 4

Cryo-Electron Microscopy

Single-particle reconstruction, electron scattering physics, contrast transfer function, Fourier slice theorem, and the resolution revolution in structural biology

1. Introduction: The Resolution Revolution

2017 Nobel Prize in Chemistry

The 2017 Nobel Prize in Chemistry was awarded jointly to Jacques Dubochet, Joachim Frank, and Richard Henderson "for developing cryo-electron microscopy for the high-resolution structure determination of biomolecules in solution." This prize recognized decades of work that transformed cryo-EM from a niche technique producing blurry blobs into a method rivaling X-ray crystallography in resolution, without requiring crystals.

The term "resolution revolution" was coined by Werner Kuhlbrandt in 2014 to describe the dramatic improvement in cryo-EM resolution enabled by direct electron detectors and improved computational methods. Before 2013, sub-4 Γ… resolution structures by cryo-EM were exceedingly rare. By 2020, structures at 1.2 Γ… resolution had been achieved, and thousands of near-atomic resolution structures were being deposited annually.

Unlike X-ray crystallography, cryo-EM does not require crystals. Biological specimens are flash-frozen in vitreous ice, preserving their native hydrated state. Individual particle images are recorded, classified, and computationally combined to reconstruct three-dimensional density maps. This approach is particularly powerful for large, flexible, and heterogeneous macromolecular complexes.

Workflow diagram of cryo-electron microscopy showing sample preparation, imaging, and 3D reconstruction steps
Cryo-EM workflow: from sample vitrification through electron imaging to computational 3D reconstruction β€” Source: Wikimedia Commons

The fundamental challenge of cryo-EM is that biological molecules are composed of light atoms (C, N, O, H) that scatter electrons weakly. Individual particles produce images with extremely low signal-to-noise ratios. The power of single-particle analysis lies in averaging: by combining tens of thousands to millions of noisy images of identical particles in random orientations, a high-resolution 3D reconstruction can be obtained.

In this chapter, we derive the key physics underlying cryo-EM: the electron-specimen interaction, the contrast transfer function, the Fourier slice theorem that enables 3D reconstruction, and the statistical principles governing resolution. We then explore landmark applications and the historical development of the technique.

2. Derivation 1: Electron Scattering Physics

Electrons in a transmission electron microscope (TEM) are accelerated through a potential difference $V$ (typically 200-300 kV for cryo-EM). At these energies, relativistic effects are significant and must be included in computing the de Broglie wavelength.

De Broglie Wavelength with Relativistic Correction

The kinetic energy of an electron accelerated through potential $V$ is$E_k = eV$. The relativistic energy-momentum relation gives:

$$E^2 = (pc)^2 + (m_0 c^2)^2$$

where the total energy is $E = E_k + m_0 c^2 = eV + m_0 c^2$. Solving for momentum:

$$p = \frac{1}{c}\sqrt{E^2 - (m_0 c^2)^2} = \frac{1}{c}\sqrt{(eV + m_0 c^2)^2 - (m_0 c^2)^2}$$

Expanding the square:

$$p = \frac{1}{c}\sqrt{e^2V^2 + 2eVm_0c^2} = \frac{1}{c}\sqrt{2m_0 c^2 eV\left(1 + \frac{eV}{2m_0c^2}\right)}$$

The de Broglie wavelength $\lambda = h/p$ becomes:

$$\boxed{\lambda = \frac{h}{\sqrt{2m_0 eV\left(1 + \frac{eV}{2m_0c^2}\right)}}}$$

At 300 kV, this gives $\lambda \approx 0.0197$ Γ… β€” far smaller than interatomic distances. The resolution in cryo-EM is therefore not limited by diffraction but by radiation damage, specimen movement, and the contrast transfer function.

The Contrast Transfer Function (CTF)

In the weak phase object approximation (WPOA), the specimen acts as a thin phase plate. The image formed by the objective lens is modulated by the contrast transfer function, which arises from the combination of defocus and spherical aberration:

The electron wave after the objective lens acquires a phase shift $\chi(k)$ that depends on the spatial frequency $k$:

$$\chi(k) = \pi \lambda \Delta f\, k^2 - \frac{\pi}{2} C_s \lambda^3 k^4$$

where $\Delta f$ is the defocus (positive for underfocus),$C_s$ is the spherical aberration coefficient, and$k = |\mathbf{k}|$ is the spatial frequency. The CTF for a weak phase object is:

$$\boxed{\text{CTF}(k) = \sin\!\left(\pi \lambda \Delta f\, k^2 - \frac{\pi}{2} C_s \lambda^3 k^4\right)}$$

The CTF oscillates between +1 and -1 as a function of spatial frequency, passing through zero at specific frequencies. At these zeros, information is lost entirely. The first zero occurs at:

$$k_1 = \frac{1}{\sqrt{\lambda \Delta f}}$$

(neglecting the $C_s$ term for moderate spatial frequencies). This is why cryo-EM images are typically collected at a series of defocus values: information lost at the CTF zeros of one defocus value can be recovered from images at other defocus values.

The oscillatory nature of the CTF produces characteristic Thon rings in the power spectrum of cryo-EM micrographs. These concentric rings of alternating contrast and contrast reversal are diagnostic of the defocus and astigmatism, and their visibility to high spatial frequencies indicates the quality of the data.

3. Derivation 2: Single Particle Analysis & 3D Reconstruction

The mathematical foundation of single-particle cryo-EM reconstruction rests on the Fourier slice theorem (also called the projection-slice theorem or central section theorem), which establishes a direct connection between 2D projection images and the 3D Fourier transform of the object.

The Fourier Slice Theorem

Let $\rho(\mathbf{r})$ be the 3D electron density of the specimen, and let$P_\theta(x)$ be a 1D projection along direction $\theta$ (for the 2D case, which generalizes straightforwardly to 3D):

$$P_\theta(x) = \int_{-\infty}^{\infty} \rho(x\cos\theta - y\sin\theta,\; x\sin\theta + y\cos\theta)\, dy$$

Taking the 1D Fourier transform of the projection:

$$\hat{P}_\theta(k) = \int_{-\infty}^{\infty} P_\theta(x)\, e^{-2\pi i k x}\, dx$$

Substituting the projection integral and changing variables, one can show that:

$$\boxed{\hat{P}_\theta(k) = \hat{\rho}(k\cos\theta,\; k\sin\theta)}$$

Interpretation: The 1D Fourier transform of a projection at angle $\theta$ gives a central slice through the 2D Fourier transform of the object, oriented at the same angle $\theta$. In 3D, each 2D projection image provides a central plane through the 3D Fourier transform. By collecting projections at many orientations (which cryo-EM achieves through random particle orientations on the grid), one can fill in the 3D Fourier space and reconstruct $\rho(\mathbf{r})$ by inverse Fourier transform.

Back-Projection Reconstruction

The simplest reconstruction method is direct back-projection. Given a set of projections $\{P_{\theta_i}\}$, the back-projection estimate of the density is:

$$\rho_{\text{BP}}(\mathbf{r}) = \sum_{i=1}^{N} P_{\theta_i}(\mathbf{r} \cdot \hat{n}_i)$$

where $\hat{n}_i$ is the projection direction. This simple sum produces a blurred reconstruction because low spatial frequencies are overrepresented (each projection contributes a central slice, and slices are denser near the origin). The correction is weighted back-projection:

$$\rho(\mathbf{r}) = \mathcal{F}^{-1}\!\left[\frac{\sum_i \hat{P}_{\theta_i}(k) \cdot W_i(k)}{\sum_i W_i(k)}\right]$$

where $W_i(k)$ are appropriate weighting functions. In practice, modern cryo-EM software (RELION, cryoSPARC) uses iterative maximum-likelihood refinement to simultaneously determine particle orientations and reconstruct the 3D map.

Fourier Shell Correlation and the 0.143 Criterion

The resolution of a cryo-EM reconstruction is assessed using the Fourier Shell Correlation (FSC). The dataset is randomly split into two halves, each independently reconstructed, and the correlation between the two maps is computed in shells of spatial frequency:

$$\text{FSC}(k) = \frac{\sum_{k_i \in \text{shell}} \hat{\rho}_1(k_i) \cdot \hat{\rho}_2^*(k_i)}{\sqrt{\sum_{k_i} |\hat{\rho}_1(k_i)|^2 \cdot \sum_{k_i} |\hat{\rho}_2(k_i)|^2}}$$

The FSC ranges from 1 (perfect correlation at low frequencies) to 0 (pure noise at high frequencies). The resolution is defined as the spatial frequency where FSC drops below a threshold.

The widely adopted FSC = 0.143 criterion (also called the half-bit criterion) was proposed by Rosenthal and Henderson (2003). The rationale is that at FSC = 0.143 for the half-map correlation, the FSC of the full dataset (with all particles) corresponds to FSC $\approx 0.5$, which is the point where the signal equals the noise. Using the relation:

$$\text{FSC}_{\text{full}} = \frac{2 \cdot \text{FSC}_{\text{half}}}{1 + \text{FSC}_{\text{half}}}$$

Setting $\text{FSC}_{\text{half}} = 0.143$ gives$\text{FSC}_{\text{full}} = 2(0.143)/(1+0.143) = 0.250$. This threshold corresponds to approximately the half-bit information criterion per voxel, providing a statistically principled cutoff for resolution estimation.

4. Derivation 3: Signal-to-Noise in Low-Dose Imaging

Biological specimens in the electron microscope are exquisitely sensitive to radiation damage. A typical protein molecule can tolerate approximately 20 electrons/Γ…Β² before significant structural damage occurs at near-atomic resolution. This fundamental limitation forces cryo-EM to operate in a low-dose regimewhere individual images are dominated by noise.

Shot Noise and the SNR of Individual Images

For a pixel receiving $n$ electrons on average, electron counting follows Poisson statistics. The signal is $n$ and the noise (standard deviation) is$\sqrt{n}$, giving a per-pixel SNR:

$$\text{SNR}_{\text{pixel}} = \frac{n}{\sqrt{n}} = \sqrt{n}$$

For a typical cryo-EM exposure of 20 e$^-$/Γ…Β² on a detector with 1 Γ… pixel size, $n \approx 20$, giving SNR $\approx 4.5$per pixel. However, the contrast of unstained biological specimens is only a few percent, so the structural SNR is much lower:

$$\text{SNR}_{\text{structural}} \approx \frac{\Delta n}{\sqrt{n}} \ll 1$$

where $\Delta n$ is the difference in electron counts due to the specimen contrast. For a typical protein, $\Delta n / n \sim 0.02$, making individual particle images essentially invisible by eye.

SNR Improvement by Averaging

The key to cryo-EM is that averaging $N$ independent, aligned images of identical particles improves the SNR. If each image has signal $s$ and independent noise with variance $\sigma^2$:

The average signal is unchanged: $\bar{s} = s$. The noise variance of the average decreases as:

$$\text{Var}(\bar{x}) = \frac{\sigma^2}{N}$$

Therefore the SNR after averaging $N$ particles is:

$$\boxed{\text{SNR}_N = \sqrt{N} \cdot \text{SNR}_1}$$

To achieve SNR $\sim 10$ from images with SNR $\sim 0.1$, one needs$N \sim 10{,}000$ particles. For higher resolution (smaller features, lower contrast), the required number grows. Modern high-resolution reconstructions routinely use$10^5$ to $10^6$ particles.

This analysis assumes perfect alignment. In practice, alignment errors introduce an additional noise source that limits resolution. The required number of particles to achieve resolution$d$ scales approximately as $N \propto 1/d^2$ (more particles are needed for higher resolution, since the signal at high spatial frequencies is weaker).

5. Derivation 4: CTF Correction with the Wiener Filter

Since the CTF modulates the image in Fourier space, raw cryo-EM images contain contrast reversals and missing information at CTF zeros. Accurate CTF correction is essential for high-resolution reconstruction.

Image Formation Model

In the weak phase object approximation, the observed image in Fourier space is:

$$\hat{I}(k) = \text{CTF}(k) \cdot \hat{\rho}_{\text{proj}}(k) + \hat{N}(k)$$

where $\hat{\rho}_{\text{proj}}$ is the Fourier transform of the projected density and $\hat{N}$ is additive noise. The goal is to recover$\hat{\rho}_{\text{proj}}$ from the observed $\hat{I}$.

Naive division by the CTF diverges at zeros. The Wiener filter provides the optimal linear estimate (in the minimum mean-square error sense):

$$\boxed{H_{\text{opt}}(k) = \frac{\text{CTF}(k)}{\text{CTF}^2(k) + \frac{1}{\text{SNR}(k)}}}$$

The corrected estimate is $\hat{\rho}_{\text{est}} = H_{\text{opt}} \cdot \hat{I}$.

Derivation of the Wiener Filter

We seek the filter $H(k)$ that minimizes the expected squared error:

$$\mathcal{E} = \left\langle \left| H(k)\hat{I}(k) - \hat{\rho}(k) \right|^2 \right\rangle$$

Substituting $\hat{I} = \text{CTF}\cdot\hat{\rho} + \hat{N}$ and expanding:

$$\mathcal{E} = |H|^2 \left(\text{CTF}^2 S_\rho + S_N\right) - 2\,\text{Re}\!\left[H \cdot \text{CTF} \cdot S_\rho\right] + S_\rho$$

where $S_\rho = \langle|\hat{\rho}|^2\rangle$ is the signal power spectrum and$S_N = \langle|\hat{N}|^2\rangle$ is the noise power spectrum. Setting$\partial \mathcal{E}/\partial H = 0$:

$$H_{\text{opt}} = \frac{\text{CTF} \cdot S_\rho}{\text{CTF}^2 S_\rho + S_N} = \frac{\text{CTF}}{\text{CTF}^2 + S_N/S_\rho}$$

Identifying $\text{SNR}(k) = S_\rho(k)/S_N(k)$ recovers the boxed result above. At CTF zeros, $H_{\text{opt}} \to 0$, gracefully suppressing the noisy frequencies rather than amplifying them. At spatial frequencies where $\text{SNR} \gg 1$,$H_{\text{opt}} \approx 1/\text{CTF}$, performing full phase correction.

In practice, CTF correction in single-particle cryo-EM is performed during 3D reconstruction. Each particle image has its own CTF (determined by its position on the micrograph and the local defocus), and the reconstruction algorithm applies per-particle CTF correction. Combining images from many defocus values fills in the CTF zeros, since zeros at one defocus are non-zeros at another.

6. Applications of Cryo-EM

Diagram of ribosome translating mRNA into a polypeptide chain, showing large and small subunits
The ribosome translating mRNA: cryo-EM has revealed the structural dynamics of this molecular machine at near-atomic resolution β€” Source: Wikimedia Commons

The Ribosome

The ribosome was one of the first large macromolecular complexes to be studied by cryo-EM. Joachim Frank's pioneering single-particle analysis work in the 1980s and 1990s used the ribosome as a primary test system. The ribosome is ideal for cryo-EM: it is large (~2.5 MDa for the bacterial 70S ribosome), abundant, and functionally important. Early reconstructions at 25 Γ… resolution revealed the overall architecture, while modern cryo-EM achieves sub-2.5 Γ… resolution, enabling visualization of individual amino acids, drug binding sites, and conformational states during translation.

Crucially, cryo-EM can capture the ribosome in multiple functional states simultaneously. Computational classification of heterogeneous datasets reveals ribosomes at different stages of translation, bound to different tRNAs and elongation factors, providing a molecular movie of protein synthesis.

SARS-CoV-2 Spike Protein

Cryo-EM played a critical role in the rapid response to the COVID-19 pandemic. Within weeks of the publication of the SARS-CoV-2 genome in January 2020, Jason McLellan's group at the University of Texas determined the structure of the prefusion spike glycoprotein at 3.5 Γ… resolution using cryo-EM (Wrapp et al., Science, 2020). This structure revealed the receptor binding domain (RBD) in its "up" conformation, poised to engage the human ACE2 receptor.

The spike structure was immediately shared with vaccine developers and became the basis for the mRNA vaccines produced by Pfizer-BioNTech and Moderna. Subsequent cryo-EM structures mapped the epitopes of neutralizing antibodies, tracked mutations in variants of concern, and guided the design of broadly protective antigens.

Drug Discovery

Cryo-EM has become an essential tool in structure-based drug design. Unlike X-ray crystallography, which requires well-diffracting crystals, cryo-EM can determine structures of membrane proteins, flexible complexes, and heterogeneous assemblies that are intractable to crystallization. Pharmaceutical companies now routinely use cryo-EM for lead optimization, visualizing how candidate drugs bind to their targets at near-atomic resolution.

Notable examples include structures of TRPV1 (the capsaicin receptor), GPCRs in complex with G proteins, and the proteasome bound to inhibitors. The ability to resolve multiple conformational states in a single dataset provides insight into the mechanism of drug action and selectivity.

Time-Resolved Cryo-EM

An emerging frontier is time-resolved cryo-EM, which captures structural snapshots at defined time points after initiating a reaction. Rapid mixing and spray-plunging devices can achieve time resolution of milliseconds. By vitrifying samples at different time points, one obtains a series of cryo-EM datasets that together describe the structural trajectory of a biological process.

Applications include watching ribosome assembly intermediates, following enzyme catalytic cycles, and observing viral maturation. Combined with computational sorting of heterogeneous particles, time-resolved cryo-EM provides unprecedented insight into the dynamics of macromolecular machines.

7. Historical Development

The development of cryo-EM spans more than five decades, from the first electron microscope images of biological specimens to the resolution revolution of the 2010s.

DeRosier & Klug (1968): 3D Reconstruction from EM

David DeRosier and Aaron Klug published the first 3D reconstruction from electron micrographs in 1968, using images of the T4 bacteriophage tail. They demonstrated that a series of 2D projection images, taken at different tilt angles, could be computationally combined to reconstruct the 3D structure using the Fourier slice theorem. This seminal work established the mathematical framework that underpins all of modern cryo-EM reconstruction. Klug received the Nobel Prize in Chemistry in 1982 for this and related work on crystallographic electron microscopy.

Henderson (1975): First Atomic-Resolution EM Structure

Richard Henderson and Nigel Unwin determined the first structure of a membrane protein at near-atomic resolution using electron microscopy of 2D crystals of bacteriorhodopsin (1975). By combining images and electron diffraction from purple membrane patches, they obtained a 7 Γ… resolution map showing seven transmembrane alpha-helices. Henderson later improved the resolution to 3.5 Γ… (1990), and crucially demonstrated that radiation damage was the fundamental limiting factor, not the physics of electron optics.

Henderson's key insight was that the electron wavelength is short enough for atomic resolution, and that the resolution limit in biological EM is set entirely by the tolerable dose. He calculated that approximately 10,000 images of identical molecules should suffice for atomic-resolution reconstruction even at minimal dose, laying the theoretical groundwork for single-particle cryo-EM.

Joachim Frank: Single-Particle Analysis

Joachim Frank developed the computational methods for single-particle analysis beginning in the 1970s. His group created algorithms for aligning, classifying, and averaging heterogeneous sets of particle images without requiring crystals or regular arrays. The key innovations included multivariate statistical analysis for classification, correlation-based alignment, and iterative refinement of 3D reconstructions from 2D projections.

Frank's software package SPIDER became the standard tool for single-particle EM. His work transformed EM from a qualitative imaging technique into a quantitative structural method. The ribosome served as his primary test specimen, and his group produced increasingly detailed reconstructions through the 1990s and 2000s.

Jacques Dubochet: Vitrification

The critical breakthrough that made cryo-EM practical was Jacques Dubochet's development of vitrification in the early 1980s at the EMBL in Heidelberg. Previous EM methods required staining or dehydration, which distorted or destroyed biological structures. Dubochet discovered that by rapidly plunging a thin aqueous film into liquid ethane (cooled by liquid nitrogen), water could be frozen so quickly that it formed vitreous (amorphous) ice rather than crystalline ice.

Vitreous ice preserves the native hydrated structure of biological specimens, avoids staining artifacts, and provides a uniform background. Dubochet's method (published 1984) made it possible to image unstained, fully hydrated macromolecules in the electron microscope. Combined with Frank's single-particle methods and modern direct electron detectors with movie-mode data collection, vitrification enabled the resolution revolution.

Timeline of Key Milestones

  • 1968 β€” DeRosier & Klug: first 3D reconstruction from EM images
  • 1975 β€” Henderson & Unwin: bacteriorhodopsin structure at 7 Γ…
  • 1981 β€” Dubochet: vitrification of water for cryo-EM
  • 1984 β€” Dubochet & Adrian: first cryo-EM images of viruses in vitreous ice
  • 1987 β€” Frank: SPIDER software for single-particle analysis
  • 1990 β€” Henderson: bacteriorhodopsin at 3.5 Γ… from electron crystallography
  • 1995 β€” BΓΆttcher et al.: hepatitis B capsid at sub-nanometer resolution
  • 2008 β€” Yu et al.: GroEL at 4.2 Γ… (first sub-5 Γ… single-particle structure)
  • 2012 β€” Direct electron detectors (Gatan K2, FEI Falcon) become available
  • 2013 β€” Liao et al.: TRPV1 at 3.4 Γ… β€” start of the resolution revolution
  • 2014 β€” Kuhlbrandt coins "resolution revolution"
  • 2017 β€” Nobel Prize to Dubochet, Frank & Henderson
  • 2020 β€” Yip et al.: apoferritin at 1.2 Γ… (highest cryo-EM resolution)
  • 2020 β€” SARS-CoV-2 spike structure enables vaccine design

8. Python Simulation: CTF & Reconstruction

The following simulation demonstrates three fundamental aspects of cryo-EM: (1) the contrast transfer function for multiple defocus values, showing how CTF zeros shift with defocus, (2) simulated Thon rings as seen in a power spectrum, and (3) a simple back-projection reconstruction from projections at different angles.

Cryo-EM: CTF & Reconstruction

Python
script.py291 lines

Click Run to execute the Python code

Code will be executed with Python 3 on the server