Part II
The Genetic Basis of Cancer
Oncogenes, tumour-suppressor genes, the two-hit hypothesis, drivers vs. passengers, mutational signatures, hereditary syndromes, and the genome-scale landscape that modern oncology reads.
1. Cancer is a Genetic Disease
Every cancer arises through the somatic accumulation of mutations in genes that regulate cell proliferation, survival, and genome maintenance. The somatic-mutation theory of cancer — proposed by Boveri in 1914 and now canonical — holds that the malignant phenotype is a cell-autonomous, heritable property of clonal descendants of a single mutated cell.
The genes whose mutation drives cancer fall into two complementary classes:
Oncogenes
Activated by gain-of-function mutations. A single allele is sufficient (genetically dominant at the cellular level). Encode growth-promoting proteins: receptor tyrosine kinases, cytoplasmic kinases, GTPases, transcription factors, anti-apoptotic factors.
Tumour suppressor genes
Inactivated by loss-of-function mutations. Both alleles must be lost (recessive at the cellular level) — the two-hit hypothesis. Encode brakes on proliferation, DNA-damage responders, apoptosis effectors, repair enzymes.
2. Oncogenes — Gain of Function
An oncogene begins life as a normal cellular gene — its proto-oncogene — whose function is to promote proliferation, survival, or metabolic activation when appropriately regulated. Mutations that lock the proto-oncogene in an active state, or that drive its over-expression, convert it into an oncogene.
Mechanisms of activation
- Point mutation — KRAS G12, BRAF V600E, EGFR L858R; locks the kinase or GTPase in a constitutively active conformation.
- Gene amplification — HER2 (ERBB2) in breast cancer, MYCN in neuroblastoma; produces 10–100× normal protein levels.
- Chromosomal translocation — BCR-ABL in CML (Philadelphia chromosome, t(9;22)), MYC-IgH in Burkitt lymphoma, EWS-FLI1 in Ewing sarcoma.
- Insertional mutagenesis — viral promoter inserted near c-myc by ALV (avian leukosis virus); the mechanism that originally revealed v-onc/c-onc.
- Promoter mutations — TERT C228T/C250T create new ETS sites, the most common non-coding mutation across cancers.
- Gene-fusion drivers — ALK, ROS1, RET fusions in NSCLC; NTRK fusions are now pan-tissue actionable (larotrectinib).
ABL kinase domain bound to imatinib (Gleevec)
Schindler et al., Science 2000. The first targeted cancer drug — designed against the BCR-ABL fusion product of the Philadelphia chromosome in CML. Imatinib (orange sticks) occupies the ATP-binding pocket of the inactive kinase conformation. Median survival in CML went from 4 years to a normal lifespan.
3. Tumour Suppressors — Loss of Function
Tumour-suppressor genes normally restrain cell proliferation or maintain genome integrity. Loss-of-function inactivation, requiring (in the canonical model) the elimination of both alleles, releases the brake.
Mechanisms of inactivation
- Truncating mutations — nonsense, frameshift, splice-site — produce non-functional protein.
- Loss of heterozygosity (LOH) — chromosomal deletion, gene conversion, mitotic recombination, or whole-chromosome loss eliminates the second wild-type allele.
- Promoter hypermethylation — epigenetic silencing of e.g. MLH1 (sporadic MSI cancers), CDKN2A, BRCA1.
- Dominant-negative mutations — p53 missense alleles disrupt tetramers containing wild-type protein, producing a partial dominant phenotype before LOH.
- Viral inactivation — HPV E6 targets p53 for degradation; HPV E7 sequesters Rb. A virus achieving in trans what a mutation does in cis.
4. Knudson’s Two-Hit Hypothesis
In 1971, Alfred Knudson analysed the age-of-onset distribution of retinoblastoma in children and made a beautiful inference. Sporadic (unilateral, unifocal) cases peaked later and followed second-order kinetics; familial (bilateral, multifocal) cases peaked earlier and followed first-order kinetics. He proposed that retinoblastoma required two independent rate-limiting events:
\[ P(\text{tumour by age } t) \;=\; 1 - e^{-(\mu t)^k}, \quad k = \begin{cases} 1 & \text{familial (one hit needed)} \\ 2 & \text{sporadic (two hits needed)} \end{cases} \]
Children with hereditary retinoblastoma carry a germline mutation in one RB1 allele in every cell; only one further somatic hit (LOH, point mutation, or methylation of the second allele) is needed for tumour formation — hence linear age dependence. Sporadic cases require two independent somatic hits in the same cell, giving the second-order curve. The hypothesis was experimentally confirmed in 1986 by Friend, Dryja and Weinberg, who cloned RB1.
5. Classical Oncogenes — Atlas
| Gene | Class | Activation | Cancers | Drug |
|---|---|---|---|---|
| KRAS | small GTPase | G12/G13/Q61 missense | pancreatic 90%, CRC 40%, NSCLC 25% | sotorasib (G12C) |
| BRAF | Ser/Thr kinase | V600E missense | melanoma 50%, thyroid, hairy-cell leukaemia | vemurafenib, dabrafenib |
| EGFR | RTK | L858R, exon-19 del | NSCLC adeno (Asian female non-smoker) | erlotinib, gefitinib, osimertinib |
| HER2 / ERBB2 | RTK | amplification | breast 20%, gastric | trastuzumab, pertuzumab, T-DXd |
| PIK3CA | PI3K p110α | E545K, H1047R hotspots | breast, endometrial, CRC | alpelisib (HR+ breast) |
| MYC | bHLH-Zip TF | amplification, t(8;14) IgH | Burkitt, neuroblastoma (MYCN) | (BET inhibitors — investigational) |
| BCR-ABL | fusion kinase | t(9;22) Philadelphia | CML, Ph+ ALL | imatinib, dasatinib, ponatinib |
| ALK | RTK | EML4-ALK fusion | NSCLC ~5%, ALCL | crizotinib, alectinib, lorlatinib |
| JAK2 | tyrosine kinase | V617F | polycythaemia vera, ET, MF | ruxolitinib |
| FLT3 | RTK | internal-tandem dup | AML 30% | midostaurin, gilteritinib |
| KIT | RTK | exon-11 deletions | GIST, mast-cell disease | imatinib, sunitinib |
| CTNNB1 | β-catenin TF | S37, S33, T41 (no GSK3 phos) | HCC, CRC, desmoid tumour | (no approved direct inhibitor) |
MYC–MAX heterodimer bound to the E-box DNA element
Nair & Burley, Cell 2003. MYC is the master oncogenic transcription factor — amplified, translocated, or stabilised in roughly 70% of all human cancers. It dimerises with MAX through its bHLH-Zip domain to bind CACGTG E-boxes and activate ribosome biogenesis, glycolysis and proliferation. MYC has resisted direct drugging for 40 years.
6. Classical Tumour Suppressors — Atlas
| Gene | Function | Hereditary syndrome | Sporadic cancers |
|---|---|---|---|
| TP53 | DNA-damage TF, “guardian of genome” | Li-Fraumeni | >50% of all human cancers |
| RB1 | cell-cycle G1/S restraint | familial retinoblastoma + osteosarcoma | SCLC, retinoblastoma |
| APC | β-catenin destruction complex | familial adenomatous polyposis (FAP) | colorectal cancer (gatekeeper) |
| BRCA1 / BRCA2 | HR DNA-repair | HBOC (hereditary breast/ovarian) | breast, ovarian, prostate, pancreatic |
| MLH1, MSH2, MSH6, PMS2 | mismatch repair | Lynch syndrome (HNPCC) | colorectal, endometrial |
| VHL | HIF-α ubiquitin ligase | von Hippel-Lindau | clear-cell RCC, haemangioblastoma |
| NF1 | RasGAP (Ras-off) | neurofibromatosis-1 | plexiform neurofibroma, MPNST, glioma |
| NF2 | cytoskeleton (merlin), Hippo | neurofibromatosis-2 | schwannoma, meningioma |
| PTEN | PIP3 phosphatase, anti-PI3K | Cowden syndrome | endometrial, prostate, glioma |
| CDKN2A | p16INK4a, p14ARF | familial melanoma | melanoma, pancreatic, NSCLC |
| SMAD4 | TGF-β signaling | juvenile polyposis | pancreatic, CRC |
| STK11 / LKB1 | AMPK kinase, polarity | Peutz-Jeghers | NSCLC, cervical adeno |
Classical distinction: gatekeepers (APC, RB1, TP53) directly regulate proliferation/death; caretakers(BRCA1/2, MMR genes) maintain genome integrity. Loss of a caretaker accelerates the acquisition of further driver mutations — the “mutator phenotype.”
7. Driver vs Passenger Mutations
A typical adult solid tumour carries thousands of somatic mutations — melanoma and lung cancer (mutagen-exposed) often exceed 10⁵, while paediatric tumours and leukaemias may have only dozens. The vast majority are passengers — mutations that arose during the clonal expansion but contribute nothing to fitness. A small minority, typically 2–10 per tumour, are drivers — mutations that conferred a positive selective advantage on the cell or its ancestors.
How drivers are inferred
- Recurrence above background — mutations seen at the same residue across many tumours (KRAS-G12, IDH1-R132, BRAF-V600).
- dN/dS > 1 selection signal — non-synonymous > synonymous mutations relative to a neutral expectation (Martincorena et al. 2017).
- Functional clustering — mutations concentrated in protein domains essential for activity/inhibition.
- Mutual exclusivity / co-occurrence — pathway-level driver patterns (e.g. mutations in PTEN and PIK3CA are mutually exclusive).
Tumour mutational burden (TMB) — the total non-synonymous mutation count per Mb — is itself prognostic and predictive, especially for response to immune-checkpoint blockade: high-TMB tumours present more neoantigens.
8. Mutational Signatures — A Mutagen Fingerprint
Different mutagens leave characteristic patterns of single-base substitutions across the 96 trinucleotide contexts (each of 6 substitution classes × 16 flanking dinucleotides). Alexandrov, Stratton and colleagues (Nature 2013, 2020) decomposed the substitution profiles of tens of thousands of cancer genomes by NMF into single-base-substitution (SBS) signatures. The COSMIC catalogue now lists 80+ such signatures, many with assigned aetiologies:
| Signature | Aetiology | Cancers |
|---|---|---|
| SBS1 | spontaneous 5mC deamination (clock) | all (correlates with patient age) |
| SBS4 | tobacco smoke (PAHs → bulky adducts) | NSCLC, head & neck, oesophageal |
| SBS7a/b | UV-light pyrimidine dimers | melanoma, cutaneous SCC |
| SBS6, 15, 26 | defective MMR | Lynch / MSI-high tumours |
| SBS3 | HR deficiency (BRCA1/2) | breast, ovarian, pancreatic |
| SBS2 + SBS13 | APOBEC cytidine deaminase | breast, bladder, cervical, head & neck |
| SBS22 | aristolochic acid | upper-tract urothelial in endemic regions |
| SBS24 | aflatoxin B1 | HCC in regions with mouldy maize/peanuts |
| SBS10a/b | POLE proofreading defect (ultra-hypermutator) | endometrial, CRC |
Beyond aetiologic insight, signatures have therapeutic relevance: SBS3 (HR deficiency) correlates with PARP-inhibitor sensitivity even in BRCA-wild-type tumours; APOBEC activity flags potential checkpoint-blockade responsiveness. Tools: SigProfiler, MutationalPatterns, deconstructSigs.
9. Hereditary Cancer Syndromes
Roughly 5–10% of cancers occur in the context of a high-penetrance germline predisposition syndrome. Most are mendelian consequences of one inherited tumour-suppressor allele — the first hit is in every cell of the body.
Hereditary Breast/Ovarian (HBOC)
BRCA1, BRCA2
breast 60–80%, ovarian 20–60% by age 80
Lynch syndrome (HNPCC)
MLH1, MSH2, MSH6, PMS2
colorectal 40–80%, endometrial 25–60%
Familial adenomatous polyposis (FAP)
APC
CRC ~100% by age 40 if untreated
Li-Fraumeni
TP53
multi-cancer (sarcoma, breast, brain, ACC) early-onset
Retinoblastoma
RB1
bilateral retinoblastoma + 2° osteosarcoma
von Hippel-Lindau
VHL
clear-cell RCC, haemangioblastoma, phaeochromocytoma
Neurofibromatosis-1
NF1
plexiform NF, optic glioma, MPNST
MEN1 / MEN2
MEN1 / RET
parathyroid + pituitary + pancreas / medullary thyroid + phaeo
Cowden
PTEN
breast, thyroid, endometrial, hamartomas
Familial melanoma
CDKN2A
melanoma + pancreatic adenocarcinoma
Recognising these syndromes informs surveillance (early colonoscopy in Lynch, MRI in HBOC), prophylactic surgery (mastectomy/oophorectomy in BRCA carriers), targeted therapy (PARP inhibitors in BRCA-mutated), and cascade testing of relatives.
10. The Cancer Genome Atlas Era
Between 2008 and 2018 the Cancer Genome Atlas (TCGA), ICGC, and the Pan-Cancer Analysis of Whole Genomes (PCAWG)consortium sequenced ~25,000 tumours across 38 cancer types, producing the first genome-scale atlases of human cancer. Key generalisations that emerged:
- Mutation rates span 5 orders of magnitude — pilocytic astrocytoma (~10 mutations) to UV-driven melanoma and POLE-mutant endometrial (>10⁵).
- Few drivers per tumour — median 4–6 driver events; the rest are passengers.
- Recurrent driver landscape is small — ~300 genes (the “Cancer Gene Census”) account for most drivers across cancers; 30 genes carry the bulk.
- Pathway convergence — different tumours converge on common dysregulated pathways (RTK-Ras-PI3K, p53, cell cycle, TGF-β, Wnt, Notch, chromatin).
- Non-coding drivers exist — TERT promoter, splice-site, regulatory-element mutations.
- Subclonal architecture — tumours are heterogeneous; deep sequencing reveals branching evolution and treatment-resistant minor clones.
11. Therapeutic Implications
The genetic basis of cancer maps directly onto modern targeted therapy. Three complementary strategies:
A. Direct oncogene targeting
Imatinib (BCR-ABL), erlotinib (EGFR), vemurafenib (BRAF V600E), sotorasib (KRAS G12C), larotrectinib (NTRK fusions). Patient selection by molecular profile.
B. Synthetic lethality
Exploits TSG loss: PARP inhibition is selectively lethal in BRCA1/2-deficient tumours (HR-deficient cells cannot repair PARP-trapped collapsed forks). Wnt-pathway losses synthetic-lethal with tankyrase inhibition (under investigation).
C. Genotype-tied immunotherapy
MMR-deficient (Lynch) and high-TMB cancers are exquisitely sensitive to anti-PD-1 — pembrolizumab was the first tissue-agnostic approval (2017) on the basis of microsatellite instability alone.
Resistance. Targeted therapy almost always fails through secondary mutations (T790M in EGFR after erlotinib; T315I in BCR-ABL), bypass-pathway activation, or selection of pre-existing minor clones. Evolutionary dynamics is now the central problem of clinical oncology.