Part IV: Synthesis & Strategy | Chapter 16

Retrosynthetic Analysis

Working backwards from target molecule to available starting materials

1. Introduction: Corey's Retrosynthetic Analysis

Retrosynthetic analysis is a technique for planning organic syntheses by working backwards from the target molecule (TM) to progressively simpler precursors, ultimately arriving at commercially available starting materials. The approach was formalized by E. J. Corey at Harvard University in the 1960s, and earned him the 1990 Nobel Prize in Chemistry.

The fundamental idea is deceptively simple: instead of asking "What can I make from these starting materials?" (the synthetic direction), we ask "What precursors could give me my target molecule?" (the retrosynthetic direction). This reversal of logic transforms synthesis planning from a combinatorial nightmare into a structured, logical process.

The Retrosynthetic Arrow

The open-headed double arrow $\Longrightarrow$ (the "retrosynthetic arrow" or "transform arrow") indicates a retrosynthetic step. It means: "can be made from."

$\text{Target Molecule (TM)} \Longrightarrow \text{Precursor(s)}$

This arrow is never interchangeable with the forward reaction arrow ($\rightarrow$). The retrosynthetic arrow signifies a mental operation — atransform — not a chemical reaction.

A complete retrosynthetic analysis produces a retrosynthetic tree: a branching diagram in which the target sits at the root and the leaves are available starting materials. Each node in the tree is connected to its children by retrosynthetic arrows. The tree may be linear (a single chain of transforms) or convergent (branches that merge), with convergent strategies generally preferred because they maximize overall yield.

Linear vs Convergent Synthesis

For a linear synthesis of $n$ steps, each with yield $y$, the overall yield is:

$\text{Overall yield (linear)} = y^n$

For a convergent synthesis that joins two branches of $n/2$ steps each:

$\text{Overall yield (convergent)} = y^{(n/2)} \times y^{(n/2)} \times y = y^{n/2 + 1}$

For $n = 10$ steps at $y = 80\%$ each: linear gives $0.8^{10} = 10.7\%$, while convergent gives $0.8^6 = 26.2\%$ — nearly 2.5 times better.

2. Key Concepts: Disconnections and Synthons

The retrosynthetic approach relies on a precise vocabulary for describing transforms and the fragments they produce. Mastering these terms is essential before tackling complex synthetic problems.

Core Terminology

Transform ($\Longrightarrow$): A retrosynthetic operation that converts a target molecule into potential precursor structures. It is theexact reverse of a synthetic reaction, applied as a mental operation.
Disconnection: The imaginary cleavage of a bond in the target molecule to reveal potential precursor fragments. Disconnections focus on C–C bonds (for building the carbon skeleton) and C–X bonds (for introducing functional groups).
Synthon: An idealized ionic fragment resulting from a disconnection. Synthons represent the reactivity pattern needed, not actual reagents. For example, disconnecting acetone at the C–C bond gives a $ {}^{-}\text{CH}_2\text{COCH}_3$ carbanion synthon and an $\text{R}^+$ cation synthon.
Synthetic Equivalent: A real, commercially available reagent that provides the reactivity of a given synthon. For the carbanion synthon$ {}^{-}\text{CH}_2\text{COCH}_3$, the synthetic equivalent is the enolate of acetone (generated with LDA or NaH).

Donor and Acceptor Synthons

Every heterolytic disconnection produces two synthons with complementary polarity:

Nucleophilic (Donor, $d^n$)

  • $d^0$: C=C, C=O (as nucleophile)
  • $d^1$: $\text{R}^-$ (Grignard, organolithium)
  • $d^2$: Enolates, enamines
  • $d^3$: Homoenolate equivalents

Electrophilic (Acceptor, $a^n$)

  • $a^0$: Epoxides, aziridines
  • $a^1$: $\text{R}^+$ (alkyl halides, carbonyls)
  • $a^2$: $\alpha,\beta$-unsaturated carbonyls
  • $a^3$: $\alpha$-halo carbonyls

Functional Group Interconversion (FGI)

FGI is a retrosynthetic operation that replaces one functional group with another to enable a disconnection that was not previously possible. No C–C bonds are broken or formed; only the functional group changes. Common FGIs include:

  • Alcohol $\Longrightarrow$ Alkene (dehydration/hydration)
  • Alkene $\Longrightarrow$ Alkyne (partial reduction)
  • Ketone $\Longrightarrow$ Alcohol (oxidation/reduction)
  • Amine $\Longrightarrow$ Nitro compound (reduction)
  • Carboxylic acid $\Longrightarrow$ Ester (hydrolysis)
  • 1,2-Diol $\Longrightarrow$ Alkene (dihydroxylation)

FGI is particularly useful when the target's functional groups do not suggest an obvious disconnection. By converting one group to another, new retrosynthetic pathways open up.

3. C–C Bond Forming Reactions

The formation of carbon–carbon bonds is the central challenge of organic synthesis. Each C–C bond in the target molecule represents a potential disconnection site. The following catalog presents the most important C–C bond forming reactions, showing both the retrosynthetic disconnection and the forward synthetic reaction.

3.1 Grignard Reaction

The Grignard reaction adds an organomagnesium halide ($\text{RMgX}$) to a carbonyl compound, forming a new C–C bond. It is one of the most versatile tools for building carbon skeletons.

Retrosynthetic disconnection:

$\text{R}_1\text{R}_2\text{CHOH} \Longrightarrow \text{R}_1\text{MgBr} + \text{R}_2\text{CHO}$

Forward reaction:

$\text{R}_1\text{MgBr} + \text{R}_2\text{CHO} \xrightarrow{1.\,\text{Et}_2\text{O}} \xrightarrow{2.\,\text{H}_3\text{O}^+} \text{R}_1\text{R}_2\text{CHOH}$

Scope: With formaldehyde $\rightarrow$ primary alcohol; with aldehyde$\rightarrow$ secondary alcohol; with ketone $\rightarrow$ tertiary alcohol; with CO$_2$ $\rightarrow$ carboxylic acid; with ester$\rightarrow$ tertiary alcohol (2 equiv).

3.2 Wittig Reaction

The Wittig reaction converts a carbonyl compound to an alkene using a phosphorus ylide. It is the method of choice for placing a C=C double bond at a specific position.

Retrosynthetic disconnection:

$\text{R}_1\text{CH}=\text{CHR}_2 \Longrightarrow \text{R}_1\text{CHO} + \text{Ph}_3\text{P}=\text{CHR}_2$

Forward reaction:

$\text{R}_1\text{CHO} + \text{Ph}_3\text{P}=\text{CHR}_2 \rightarrow \text{R}_1\text{CH}=\text{CHR}_2 + \text{Ph}_3\text{P}=\text{O}$

Stereochemistry: Unstabilized ylides give predominantly Z-alkenes; stabilized ylides (with EWG) give E-alkenes. The Horner–Wadsworth–Emmons (HWE) modification using phosphonate esters gives E-selectivity with easier purification.

3.3 Aldol Condensation

The aldol reaction forms a $\beta$-hydroxy carbonyl by coupling an enolizable carbonyl with an aldehyde or ketone electrophile. It is the premier method for constructing 1,3-difunctionalized systems.

Retrosynthetic disconnection:

$\text{R}_1\text{COCH}_2\text{CH(OH)R}_2 \Longrightarrow \text{R}_1\text{COCH}_3 + \text{R}_2\text{CHO}$

Forward reaction:

$\text{R}_1\text{COCH}_3 \xrightarrow{\text{LDA, } -78°\text{C}} \text{enolate} \xrightarrow{\text{R}_2\text{CHO}} \beta\text{-hydroxy ketone}$

Key feature: The directed aldol (using LDA to preform a single enolate) avoids the mixed-product problem of classical aldol conditions. The Evans, Zimmerman–Traxler, and Mukaiyama variants provide stereochemical control.

3.4 Claisen Condensation

The Claisen condensation couples two esters (or an ester with another carbonyl) to produce a$\beta$-keto ester. It is the ester analog of the aldol reaction.

Retrosynthetic disconnection:

$\text{R}_1\text{COCH}_2\text{COOR}_2 \Longrightarrow \text{R}_1\text{COOR} + \text{CH}_3\text{COOR}_2$

Forward reaction:

$2\,\text{RCH}_2\text{COOR}' \xrightarrow{\text{NaOR}'} \text{RCH(COOR')COCHR}_2 + \text{R'OH}$

3.5 Diels–Alder Reaction

The Diels–Alder [4+2] cycloaddition constructs a six-membered ring with up to four new stereocenters in a single step. It is perhaps the most powerful ring-forming reaction in organic chemistry.

Retrosynthetic disconnection:

$\text{cyclohexene} \Longrightarrow \text{1,3-diene} + \text{dienophile}$

Forward reaction:

$\text{diene} + \text{dienophile} \xrightarrow{\Delta \text{ or Lewis acid}} \text{cyclohexene}$

Rules: The reaction is syn with respect to both diene and dienophile (suprafacial on both components). Endo products are kinetically favored (Alder rule). Electron-rich dienes + electron-poor dienophiles (normal demand) or vice versa (inverse demand).

3.6 Transition-Metal Catalyzed Cross-Coupling

Palladium-catalyzed cross-coupling reactions have revolutionized C–C bond formation, particularly for $\text{sp}^2$–$\text{sp}^2$ and $\text{sp}^2$–$\text{sp}$ couplings. The 2010 Nobel Prize was awarded to Heck, Negishi, and Suzuki for this work.

Suzuki Coupling

$\text{Ar--X} + \text{Ar'--B(OH)}_2 \xrightarrow{\text{Pd(0), base}} \text{Ar--Ar'}$

Retrosynthetically: $\text{Ar--Ar'} \Longrightarrow \text{ArX} + \text{Ar'B(OH)}_2$

Heck Reaction

$\text{Ar--X} + \text{CH}_2\text{=CHR} \xrightarrow{\text{Pd(0), base}} \text{Ar--CH=CHR}$

Retrosynthetically: $\text{ArCH=CHR} \Longrightarrow \text{ArX} + \text{CH}_2\text{=CHR}$

Sonogashira Coupling

$\text{Ar--X} + \text{HC} \equiv \text{CR} \xrightarrow{\text{Pd(0), CuI, amine}} \text{Ar--C} \equiv \text{CR}$

Retrosynthetically: $\text{Ar--C} \equiv \text{CR} \Longrightarrow \text{ArX} + \text{HC} \equiv \text{CR}$

General catalytic cycle: Oxidative addition of Ar–X to Pd(0) $\rightarrow$ transmetalation with organometallic partner $\rightarrow$ reductive elimination to form the new C–C bond and regenerate Pd(0).

4. Strategy: Two-Group Disconnections

When two functional groups (FGs) are present in the target molecule, their relative position determines which C–C bond forming reaction is most appropriate. This is the 1,n-difunctionalized strategy, and it is the most systematic approach to retrosynthetic planning. The key insight is that the spacing between the functional groups dictates the polarity pattern, which in turn specifies the reaction type.

4.1 1,2-Difunctionalized Compounds

Two functional groups on adjacent carbons. The polarity pattern requires$d^1 + a^1$ or an $a^0$ synthon (epoxide).

Typical target: 1,2-diol, $\beta$-amino alcohol, $\alpha$-hydroxy ketone

Key disconnection:

$\text{R}_1\text{CH(OH)--CH(OH)R}_2 \Longrightarrow \text{R}_1\text{CH(OH)CHO} + \text{R}_2^-$

Synthetic equivalents: Epoxide opening by nucleophiles (Grignard, organocuprate, amine, azide). The $\text{S}_\text{N}2$ opening of an epoxide with a Grignard reagent gives a 1,2-difunctionalized product with anti stereochemistry.

$\text{R}_1\text{MgBr} + \overset{O}{\overbrace{\text{R}_2\text{CH---CH}_2}} \xrightarrow{\text{CuI cat.}} \text{R}_1\text{CH}_2\text{CH(OH)R}_2$

4.2 1,3-Difunctionalized Compounds

Two functional groups separated by one carbon. The polarity pattern is$d^2 + a^1$, which is the natural reactivity of carbonyl compounds. This is the most common pattern in retrosynthesis.

Typical target: $\beta$-hydroxy carbonyl, $\beta$-keto ester, 1,3-diol

Key disconnection (aldol):

$\text{RCOCH}_2\text{CH(OH)R'} \Longrightarrow \text{RCOCH}_3 \;(d^2) + \text{R'CHO} \;(a^1)$

Reactions: Aldol reaction, Claisen condensation, Knoevenagel condensation, malonic ester synthesis. The 1,3-relationship is sometimes called the "aldol pattern" because the aldol reaction naturally produces it.

4.3 1,4-Difunctionalized Compounds

Two functional groups separated by two carbons. The polarity pattern requires$d^2 + a^2$ or $d^1 + a^3$. This is a "dissonant" pattern when both groups have the same polarity bias.

Typical target: 1,4-dicarbonyl, $\gamma$-butyrolactone

Key disconnection (conjugate addition):

$\text{RCOCH}_2\text{CH}_2\text{COR'} \Longrightarrow \text{RCOCH}_3 \;(d^2) + \text{R'COCH=CH}_2 \;(a^2)$

Reactions: Michael addition (conjugate addition of enolate to $\alpha,\beta$-unsaturated carbonyl), Stetter reaction (umpolung, using $d^1$ acyl anion equivalent + $a^2$). The 1,4-dicarbonyl pattern is the hallmark of conjugate addition.

4.4 1,5-Difunctionalized Compounds

Two functional groups separated by three carbons. The polarity pattern is$d^2 + a^3$, which matches the Michael/aldol combination.

Typical target: 1,5-dicarbonyl, cyclohexenone (Robinson annulation product)

Key disconnection (Robinson annulation):

$\text{cyclohexenone} \Longrightarrow \text{ketone} \;(d^2) + \text{methyl vinyl ketone} \;(a^2 + a^1)$

Reactions: The Robinson annulation is a sequential Michael addition + intramolecular aldol condensation that builds a fused six-membered ring. It combines a 1,4-addition (Michael) with a 1,3-relationship (aldol) to create the 1,5-pattern. This was famously used in steroid synthesis.

Summary Table: Two-Group Disconnection Logic

PatternSynthon PairingKey ReactionConsonance
1,2-difunctional$d^1 + a^0$ or $d^1 + a^1$Epoxide openingDissonant
1,3-difunctional$d^2 + a^1$Aldol / ClaisenConsonant
1,4-difunctional$d^2 + a^2$Michael additionDissonant
1,5-difunctional$d^2 + a^3$Michael + aldolConsonant

5. Worked Examples

Example A: 2-Phenyl-2-butanol via Grignard

Target: 2-phenyl-2-butanol, $\text{C}_6\text{H}_5\text{C(CH}_3\text{)(OH)CH}_2\text{CH}_3$

Analysis: The target is a tertiary alcohol. Any of the three C–C bonds to the carbinol carbon can be disconnected, giving three possible Grignard disconnections:

Disconnection 1: break C–Ph bond

$\text{PhC(CH}_3\text{)(OH)Et} \Longrightarrow \text{PhMgBr} + \text{CH}_3\text{COEt (methyl ethyl ketone)}$

Disconnection 2: break C–Me bond

$\text{PhC(CH}_3\text{)(OH)Et} \Longrightarrow \text{CH}_3\text{MgBr} + \text{PhCOEt (propiophenone)}$

Disconnection 3: break C–Et bond

$\text{PhC(CH}_3\text{)(OH)Et} \Longrightarrow \text{EtMgBr} + \text{PhCOCH}_3 \text{ (acetophenone)}$

Best choice: Disconnection 3, because acetophenone and ethylmagnesium bromide are both cheap and commercially available. This exemplifies the principle: choose the disconnection that leads to the simplest, most readily available starting materials.

Example B: 4-Methylcyclohex-2-enone via Diels–Alder

Target: 4-methylcyclohex-2-en-1-one

Step 1: Identify the Diels–Alder retron

A six-membered ring containing a double bond is the hallmark of a Diels–Alder product. We disconnect across the double bond and the bond para to it.

Step 2: FGI if necessary

The enone must first be converted retrosynthetically to a cyclohexene bearing the correct substituents. FGI: ketone $\Longrightarrow$ alcohol $\Longrightarrow$ Diels–Alder adduct.

Step 3: Disconnection

$\text{cyclohexene adduct} \Longrightarrow \text{(E)-penta-1,3-diene} + \text{acrolein (CH}_2\text{=CHCHO)}$

Forward synthesis: (E)-Penta-1,3-diene + acrolein $\xrightarrow{\Delta}$ Diels–Alder adduct $\xrightarrow{\text{PCC}}$ cyclohexanone $\xrightarrow{\text{selenylation, oxidation, elimination}}$ enone target. The endo rule predicts the correct stereochemistry.

Example C: 4-Bromonitrobenzene via EAS Sequence

Target: para-bromonitrobenzene

Retrosynthetic analysis:

The key question: which substituent is introduced first? The order of EAS steps matters because existing substituents direct incoming groups.

$\text{4-BrC}_6\text{H}_4\text{NO}_2 \Longrightarrow \text{bromobenzene} + \text{HNO}_3/\text{H}_2\text{SO}_4$

or

$\text{4-BrC}_6\text{H}_4\text{NO}_2 \Longrightarrow \text{nitrobenzene} + \text{Br}_2/\text{FeBr}_3$

Analysis: Bromine is an ortho/para director (lone pairs donate to the ring despite its electronegativity). Nitro is a meta director (strong EWG). To get the pararelationship, we must introduce Br first, then nitrate the bromobenzene:

$\text{C}_6\text{H}_6 \xrightarrow{\text{Br}_2,\,\text{FeBr}_3} \text{C}_6\text{H}_5\text{Br} \xrightarrow{\text{HNO}_3,\,\text{H}_2\text{SO}_4} \text{4-BrC}_6\text{H}_4\text{NO}_2$

If we nitrated first, subsequent bromination of nitrobenzene would give the meta-bromo product, not the desired para isomer. This illustrates the critical role of directing effects in aromatic retrosynthesis.

Example D: A Polyketide Fragment via Convergent Strategy

Target: A $\beta$-hydroxy-$\delta$-keto ester fragment, common in polyketide natural products like erythromycin:

$\text{R}_1\text{CH(OH)CH}_2\text{COCH}_2\text{COOR}_2$

Step 1: Identify functional group relationships

The OH and ketone are in a 1,3-relationship ($\beta$-hydroxy ketone = aldol pattern). The ketone and ester are also in a 1,3-relationship (= Claisen pattern). This suggests a two-stage disconnection.

Disconnection 1: Aldol (1,3-OH/C=O)

$\text{target} \Longrightarrow \text{R}_1\text{CHO} + \text{CH}_3\text{COCH}_2\text{COOR}_2$

Disconnection 2: Claisen (1,3-C=O/C=O)

$\text{CH}_3\text{COCH}_2\text{COOR}_2 \Longrightarrow \text{CH}_3\text{COOR} + \text{CH}_3\text{COOR}_2$

Forward synthesis: (1) Claisen condensation of two esters to form the $\beta$-keto ester. (2) Directed aldol of the $\beta$-keto ester enolate with aldehyde $\text{R}_1\text{CHO}$ to install the $\beta$-hydroxy group with Evans auxiliary for stereochemical control. This convergent strategy assembles a complex fragment in just two C–C bond-forming steps.

6. Protecting Groups

In complex molecules with multiple functional groups, a reagent may react with the wrong group. A protecting group is temporarily installed to mask a reactive functional group, allowing selective transformation elsewhere. The ideal protecting group is introduced in high yield, stable to subsequent reaction conditions, and removed cleanly at the end.

When to Protect

  • A nucleophilic group (OH, NH$_2$) would interfere with an electrophilic reaction
  • An acidic proton (OH, NH) would quench a strong base or organometallic reagent
  • A reducible group (C=O) would be reduced by LiAlH$_4$ or NaBH$_4$
  • An oxidizable group (OH, aldehyde) would be destroyed by an oxidant

Common Protecting Groups by Functional Group

Alcohol Protection (–OH)

Protecting GroupInstallRemoveStable to
TBS (tert-butyldimethylsilyl)TBSCl, imidazoleTBAF or HFBase, mild acid, Grignard
THP (tetrahydropyranyl)DHP, PPTSMild acid (PPTS, MeOH)Base, Grignard, LiAlH$_4$
Benzyl (Bn)BnBr, NaHH$_2$/Pd-CAcid, base, oxidation
Acetyl (Ac)Ac$_2$O, pyridineK$_2$CO$_3$/MeOHMild conditions

Amine Protection (–NH$_2$)

Protecting GroupInstallRemove
Boc (tert-butoxycarbonyl)Boc$_2$O, baseTFA or HCl/dioxane
Cbz (benzyloxycarbonyl)CbzCl, baseH$_2$/Pd-C
Fmoc (fluorenylmethyloxycarbonyl)FmocCl, basePiperidine (base)

Carbonyl Protection (C=O)

Acetals and ketals: Treatment with ethylene glycol and acid catalyst converts a ketone or aldehyde to a 1,3-dioxolane. Removal uses aqueous acid. Acetals are stable to base, nucleophiles, reducing agents (LiAlH$_4$), and Grignard reagents — making them one of the most useful protecting groups in synthesis.

$\text{RCOR'} + \text{HOCH}_2\text{CH}_2\text{OH} \rightleftharpoons[\text{H}^+]{} \text{cyclic acetal} + \text{H}_2\text{O}$

Carboxylic Acid Protection (–COOH)

Methyl/ethyl esters: Formed by Fischer esterification or diazomethane. Removed by saponification (NaOH/H$_2$O). tert-Butyl esters: Formed with isobutylene/acid. Removed with TFA (same conditions as Boc).

Orthogonal Protection Strategy

When a molecule contains multiple identical functional groups (e.g., two OH groups), we need protecting groups that can be removed independently under different conditions. This is called orthogonal protection.

Classic example: Protecting two hydroxyl groups with TBS and Bn:

  • TBS is removed by fluoride (TBAF) — Bn survives
  • Bn is removed by hydrogenolysis (H$_2$/Pd) — TBS survives

This allows selective unmasking of either OH group at any point in the synthesis.

In peptide synthesis, the Boc/Bn and Fmoc/tBu strategies exploit orthogonality: Boc is acid-labile while Bn is hydrogenolysis-labile; Fmoc is base-labile while tBu is acid-labile. These complementary pairs allow iterative deprotection/coupling cycles.

7. Applications

Total Synthesis of Natural Products

Retrosynthetic analysis is indispensable in planning the total synthesis of complex natural products. Landmark examples include:

  • Strychnine (Woodward, 1954): A 28-step linear synthesis of this notoriously complex alkaloid, later improved to 12 steps by Vanderwal (2011) using retrosynthetic strategy.
  • Vitamin B$_{12}$ (Woodward & Eschenmoser, 1973): A 100+ researcher collaboration requiring nearly 100 steps, demonstrating both the power and limitations of linear synthesis.
  • Taxol (paclitaxel) (Holton, 1994; Nicolaou, 1994; Danheiser, 1996): Multiple groups achieved total syntheses of this anticancer agent, each using different retrosynthetic strategies highlighting the non-uniqueness of retrosynthetic solutions.
  • Palytoxin (Kishi, 1994): One of the most complex molecules ever synthesized, with 64 stereocenters and 115 steps, the convergent strategy was essential.

Pharmaceutical Process Chemistry

In the pharmaceutical industry, retrosynthetic analysis is used to design practical, scalable routes to drug candidates. Process chemists prioritize:

  • Atom economy: Maximizing incorporation of reactant atoms into the product (Trost, 1991). Defined as $\text{AE} = \frac{M_{\text{product}}}{\sum M_{\text{reactants}}} \times 100\%$
  • Step economy: Minimizing the total number of synthetic steps (Wender, 2006)
  • Scalability: Avoiding reactions that are hazardous, expensive, or difficult to scale (cryogenic conditions, toxic metals, chromatographic purification)
  • Stereoselectivity: Using catalytic asymmetric methods rather than resolution to set stereocenters

Green Chemistry and Sustainability

Modern retrosynthetic planning increasingly incorporates the 12 Principles of Green Chemistry(Anastas & Warner, 1998). Key considerations:

  • Catalytic reactions over stoichiometric reagents (e.g., Pd-catalyzed coupling vs. stoichiometric organocuprate)
  • Biocatalysis: Enzyme-catalyzed transformations (lipases, transaminases, ketoreductases) offer exquisite selectivity under mild conditions
  • Cascade/tandem reactions: Performing multiple bond-forming events in one pot reduces waste, solvent use, and purification steps
  • Renewable feedstocks: Designing syntheses from biomass-derived starting materials rather than petrochemicals
  • Electrochemistry and photochemistry: Electron or photon as the "reagent," minimizing chemical waste

8. Historical Context

E. J. Corey and the Logic of Chemical Synthesis

Elias James Corey (b. 1928) received the 1990 Nobel Prize in Chemistry "for his development of the theory and methodology of organic synthesis." His key contributions include:

  • Formalization of retrosynthetic analysis (1960s): Corey introduced the systematic vocabulary of transforms, disconnections, synthons, and retrons that we use today. His 1967 paper and 1989 book The Logic of Chemical Synthesis are foundational texts.
  • LHASA (Logic and Heuristics Applied to Synthetic Analysis): One of the first computer programs for retrosynthetic planning, developed at Harvard starting in 1969. It anticipated modern AI-driven synthesis planning by decades.
  • Reagent development: Corey developed numerous reagents and reactions, including the Corey–Bakshi–Shibata (CBS) reduction, Corey–Chaykovsky reaction, Corey–Winter olefin synthesis, and Corey–Fuchs reaction.

R. B. Woodward: The Art Before the Science

Robert Burns Woodward (1917–1979) won the 1965 Nobel Prize for his achievements in the art of organic synthesis. Before Corey's formalization, Woodward relied on deep chemical intuition and pattern recognition. His total syntheses include:

  • Quinine (1944), cholesterol (1951), cortisone (1951), strychnine (1954)
  • Reserpine (1956), chlorophyll (1960), vitamin B$_{12}$ (1973, with Eschenmoser)

Woodward's work demonstrated that any molecule, no matter how complex, could in principle be synthesized. His approach was more artistic than algorithmic, relying on visual pattern matching and an encyclopedic knowledge of reactions.

Convergent vs Linear Synthesis: Historical Evolution

Early total syntheses (1940s–1960s) were predominantly linear: each intermediate was prepared sequentially. This approach is conceptually simple but suffers from exponential yield loss. The shift toward convergent strategies — in which fragments are built independently and joined late in the synthesis — was driven by:

  • Recognition that convergent routes dramatically improve overall yield
  • The availability of powerful C–C bond forming reactions (cross-coupling, olefin metathesis)
  • Computer-aided retrosynthetic analysis that could explore convergent disconnections systematically
  • Economic pressures in pharmaceutical manufacturing that demand efficiency

Modern total syntheses, like those from the groups of Baran, MacMillan, and Shenvi, emphasize step economy and ideality (minimizing non-strategic steps like protecting group manipulations and oxidation state changes).

Related Video Lectures

Retrosynthetic Analysis

Protecting Groups

9. Interactive Retrosynthetic Tree Builder

The following Python simulation builds a retrosynthetic analysis tree for a target molecule described by its functional groups. Given a set of functional groups and their positions, the program identifies possible disconnections, ranks them by efficiency criteria (convergence, availability of starting materials, number of steps), and displays the full retrosynthetic tree with scoring.

The algorithm encodes the two-group disconnection logic from Section 4 and the C–C bond forming reactions from Section 3. It uses numpy for numerical scoring and ranking.

Retrosynthetic Analysis: Disconnection Scoring & Yield Comparison

Python
script.py434 lines

Click Run to execute the Python code

Code will be executed with Python 3 on the server