Module 5
Protein Structure & AlphaFold2
Protein structure prediction was the “second half-century problem” of biology until AlphaFold2 essentially solved it in 2020. This module reviews the pre-AF2 methods (homology, threading, ab initio), the AlphaFold2 architecture (Evoformer + Structure Module), and the AlphaFold-DB era of practical use.
1. Pre-AF2 Methods
Homology modelling (Modeller, SWISS-MODEL) copies coordinates from a related experimentally-solved structure. Threading (I-TASSER, RaptorX) aligns to fold libraries even without close homology. Ab initio (Rosetta, QUARK) samples conformations against a force-field + statistical potential. Before 2020, CASP competitions showed slow but steady progress; accuracy on novel folds plateaued around GDT-TS 40–60.
2. AlphaFold2 Architecture
Jumper 2021 (Nature) described AF2. Core components:
- MSA + template search: recover co-evolutionary and structural context from UniProt / BFD / PDB.
- Evoformer: 48 blocks of attention that iteratively refine MSA embeddings and pair representations.
- Invariant Point Attention (IPA) Structure Module: 8 blocks that produce 3-D coordinates + atomic-level refinement.
- Recycling: the whole network is run 3× with iterated inputs for refinement.
Simulation: Accuracy & Confidence
Click Run to execute the Python code
Code will be executed with Python 3 on the server
3. AlphaFold-DB & Practical Use
AlphaFold Database (Varadi 2022) hosts predicted structures for ~200 million proteins — essentially all of UniProt. pLDDT (per-residue confidence), PAE (predicted aligned error) between residue pairs, and global pTM are used for quality control. Downstream uses: structure-based drug discovery, docking (HADDOCK, AutoDock Vina), variant-effect prediction, metagenome annotation. AlphaFold3 (Abramson 2024) extended to protein-protein, protein-DNA, protein-ligand with unified attention over multi-modal inputs.
Key References
• Jumper, J. et al. (2021). “Highly accurate protein structure prediction with AlphaFold.” Nature, 596, 583–589.
• Varadi, M. et al. (2022). “AlphaFold Protein Structure Database.” Nucleic Acids Res., 50, D439–D444.
• Abramson, J. et al. (2024). “Accurate structure prediction of biomolecular interactions with AlphaFold 3.” Nature, 630, 493–500.
• Lin, Z. et al. (2023). “Evolutionary-scale prediction of atomic-level protein structure.” Science, 379, 1123–1130.