1.2 Chemical Foundations

Chemical Bonds in Biomolecules

Covalent Bonds

Strong bonds formed by sharing electron pairs. Bond energy ranges 200--800 kJ/mol.

  • C-C single bond: ~348 kJ/mol (0.154 nm)
  • C=C double bond: ~614 kJ/mol (0.134 nm)
  • C-O bond: ~360 kJ/mol
  • C-N bond: ~305 kJ/mol
  • C-H bond: ~413 kJ/mol
  • O-H bond: ~463 kJ/mol
  • P-O bond: ~335 kJ/mol (phosphodiester backbone)
  • Peptide bond (C-N): ~370 kJ/mol (partial double-bond character)

Covalent bonds define the primary structure of biomolecules. Breaking them requires enzymes (hydrolases, nucleases, proteases) or extreme conditions (strong acid/base, high temperature).

Non-Covalent Interactions

Individually weak but collectively strong. Determine 3D structure.

  • Hydrogen bonds: 10--40 kJ/mol. Donor (N-H, O-H) to acceptor (O, N). Optimal distance: 2.7--3.1 Angstrom. Directional -- strongest when donor-H-acceptor angle is 180 degrees.
  • Ionic (electrostatic): ~20 kJ/mol in water (greatly reduced from vacuum by dielectric screening). Follows Coulomb's law.
  • Van der Waals: 0.4--4 kJ/mol. Arise from transient dipole-induced dipole interactions. Attractive at optimal distance, strongly repulsive at shorter distances.
  • Hydrophobic effect: Not a true bond. Nonpolar groups are excluded from water, driven primarily by entropy of surrounding water molecules.

Coulomb's Law for Ionic Interactions

$$F = \frac{q_1 q_2}{4\pi\varepsilon_0 \varepsilon_r r^2}$$

where q₁ and q₂ are the charges, r is the distance, $\varepsilon_0$ is the permittivity of free space, and $\varepsilon_r$ is the relative permittivity (dielectric constant). Water has $\varepsilon_r \approx 80$, which dramatically weakens ionic interactions compared to vacuum ($\varepsilon_r = 1$). This is why salt dissolves in water but ionic crystals are very stable in dry conditions.

Van der Waals Interactions: Lennard-Jones Potential

$$V(r) = 4\varepsilon\left[\left(\frac{\sigma}{r}\right)^{12} - \left(\frac{\sigma}{r}\right)^{6}\right]$$

The $r^{-12}$ term models short-range repulsion (Pauli exclusion), while the$r^{-6}$ term models London dispersion attraction. $\varepsilon$ is the depth of the potential well and $\sigma$ is the distance at which V = 0. The equilibrium distance is at $r_{\min} = 2^{1/6}\sigma \approx 1.12\sigma$.

Hydrogen Bonding in DNA

A-T Base Pair

2 hydrogen bonds

~7 kJ/mol total

G-C Base Pair

3 hydrogen bonds

~11 kJ/mol total

Note: Base stacking interactions (van der Waals + hydrophobic) contribute more to DNA duplex stability than hydrogen bonding. Each stacking interaction contributes ~20--40 kJ/mol per base pair step. The nearest-neighbor model accounts for this sequence-dependent stability.

Water Chemistry in Biology

Properties of Water

  • Polarity: The O-H bond is polar (electronegativity difference ~1.2). Bent geometry (104.5 degrees) gives a net dipole moment of 1.85 D. Enables dissolution of ionic and polar compounds.
  • Hydrogen bonding: Each water molecule can form up to 4 H-bonds. In liquid water at 25 C, ~3.4 H-bonds per molecule on average. This gives water unusually high boiling point (100 C vs -161 C for methane), heat capacity (4.18 J/(g K)), and heat of vaporization (40.7 kJ/mol).
  • Dielectric constant: $\varepsilon_r \approx 80$ at 25 C, shielding charges and weakening electrostatic interactions.
  • Amphoteric nature: Water can act as both acid and base (autoionization).

$$\text{H}_2\text{O} \rightleftharpoons \text{H}^+ + \text{OH}^-$$

$$K_w = [\text{H}^+][\text{OH}^-] = 1.0 \times 10^{-14} \text{ at 25°C}$$

pH, pKa, and the Henderson-Hasselbalch Equation

$$\text{pH} = -\log[\text{H}^+]$$

$$\text{p}K_a = -\log K_a$$

The Henderson-Hasselbalch equation relates pH to the ratio of conjugate base [A$^-$] to weak acid [HA]:

$$\text{pH} = \text{p}K_a + \log\frac{[\text{A}^-]}{[\text{HA}]}$$

Derivation: Henderson-Hasselbalch Equation

Starting from the acid dissociation equilibrium for a weak acid HA dissociating into H$^+$ and A$^-$:

Step 1: Write the acid dissociation constant

$$K_a = \frac{[\text{H}^+][\text{A}^-]}{[\text{HA}]}$$

Step 2: Solve for [H$^+$]

$$[\text{H}^+] = K_a \cdot \frac{[\text{HA}]}{[\text{A}^-]}$$

Step 3: Take the negative logarithm of both sides

$$-\log[\text{H}^+] = -\log K_a - \log\frac{[\text{HA}]}{[\text{A}^-]}$$

Step 4: Substitute pH and pKa definitions

Since $\text{pH} = -\log[\text{H}^+]$ and $\text{p}K_a = -\log K_a$:

$$\text{pH} = \text{p}K_a - \log\frac{[\text{HA}]}{[\text{A}^-]}$$

Step 5: Invert the logarithm argument

Using the identity $-\log(x/y) = \log(y/x)$:

$$\boxed{\text{pH} = \text{p}K_a + \log\frac{[\text{A}^-]}{[\text{HA}]}}$$

Step 6: Key insight -- at pH = pKa

When pH = pKa, $\log\frac{[\text{A}^-]}{[\text{HA}]} = 0$, so $[\text{A}^-] = [\text{HA}]$. Exactly half the acid is deprotonated. This is the point of maximum buffering capacity, because equal concentrations of acid and conjugate base are available to absorb added H$^+$ or OH$^-$.

Key pH Values in Biology

  • Blood plasma: 7.35--7.45
  • Cytoplasm: ~7.2
  • Mitochondrial matrix: ~7.8
  • Lysosomes: 4.5--5.0
  • Stomach (gastric acid): 1.5--3.5
  • Small intestine: 7.5--8.0

Biological Buffer Systems

  • Bicarbonate: H₂CO₃/HCO₃$^-$ (pKa = 6.1, blood)
  • Phosphate: H₂PO₄$^-$/HPO₄$^{2-}$ (pKa = 6.86, intracellular)
  • Histidine: imidazole side chain (pKa = 6.0, proteins)
  • Tris: pKa = 8.1 (common lab buffer)

A buffer works best within +/- 1 pH unit of its pKa. At pH = pKa, [HA] = [A$^-$] and the buffering capacity is maximal. The buffer capacity $\beta$ is defined as:

$$\beta = \frac{d[\text{base added}]}{d(\text{pH})} = 2.303 \cdot C \cdot \frac{K_a[\text{H}^+]}{(K_a + [\text{H}^+])^2}$$

where C is the total buffer concentration

Functional Groups in Biomolecules

Organic chemistry of life is built from a limited set of functional groups, each with characteristic reactivity:

Hydroxyl (-OH)

Polar, H-bond donor/acceptor. Found in sugars, serine, threonine. pKa ~16 for alcohols.

Amino (-NH₂)

Basic, H-bond donor. Found in amino acids, nucleotide bases. pKa ~9-10 for primary amines.

Carboxyl (-COOH)

Acidic, ionized at physiological pH. Found in amino acids, fatty acids. pKa ~2-5.

Phosphate (-OPO₃²⁻)

Strongly acidic (pKa1 ~2.1, pKa2 ~6.9). DNA/RNA backbone, ATP. Carries negative charge at pH 7.

Carbonyl (C=O)

Polar, H-bond acceptor. Aldehydes in sugars, ketones, peptide bond. Electrophilic carbon.

Sulfhydryl (-SH)

Weak acid (pKa ~8.3 for cysteine). Forms disulfide bonds (-S-S-). Important in protein folding.

Methyl (-CH₃)

Nonpolar, hydrophobic. DNA methylation (5-methylcytosine) regulates gene expression.

Ester (C-O-C=O)

Links glycerol to fatty acids in lipids. Phosphodiester bonds link nucleotides in DNA/RNA.

Thermodynamics of Biomolecules

Gibbs Free Energy

$$\Delta G = \Delta H - T\Delta S$$

Gibbs Free Energy Equation

$\Delta H$ (Enthalpy)

Heat absorbed/released. Negative = exothermic (bond formation). Positive = endothermic (bond breaking).

$T\Delta S$ (Entropy term)

T in Kelvin. Positive $\Delta S$ = increased disorder (favorable). Hydrophobic effect is entropy-driven.

$\Delta G$ (Free Energy)

Determines spontaneity. $\Delta G < 0$: spontaneous. $\Delta G = 0$: equilibrium. $\Delta G > 0$: non-spontaneous.

Relationship to Equilibrium

$$\Delta G° = -RT\ln K_{eq}$$

where R = 8.314 J/(mol K) and T is temperature in Kelvin. A large $K_{eq}$ means a very negative $\Delta G°$ (reaction strongly favors products).

Derivation: Gibbs Free Energy and Equilibrium Constant

Starting from the fundamental thermodynamic definitions, we derive the relationship between free energy and the equilibrium constant.

Step 1: Define Gibbs free energy from enthalpy and entropy

The Gibbs free energy G is defined as a state function combining enthalpy H and entropy S at constant temperature and pressure:

$$G = H - TS$$

Step 2: For a process at constant T and P, take the change

$$\Delta G = \Delta H - T\Delta S$$

This is the master equation: a reaction is spontaneous when $\Delta G < 0$, which can occur if $\Delta H < 0$ (exothermic) or $T\Delta S > 0$ (entropy increase), or both.

Step 3: Chemical potential and reaction quotient

For a reaction with concentrations differing from standard state, the free energy depends on the reaction quotient Q:

$$\Delta G = \Delta G° + RT\ln Q$$

where $Q = \frac{[\text{Products}]}{[\text{Reactants}]}$ and R = 8.314 J/(mol K).

Step 4: At equilibrium, $\Delta G = 0$ and $Q = K_{eq}$

$$0 = \Delta G° + RT\ln K_{eq}$$

Step 5: Solve for the standard free energy

$$\boxed{\Delta G° = -RT\ln K_{eq}}$$

This is the bridge between thermodynamics and equilibrium. A large $K_{eq}$ (products favored) corresponds to a large negative $\Delta G°$. For example, ATP hydrolysis: $K_{eq} \approx 2 \times 10^5$ gives $\Delta G°' = -30.5$ kJ/mol at 37 C.

Step 6: Numerical example at body temperature (T = 310 K)

$$\Delta G° = -(8.314 \times 10^{-3})(310)\ln(2 \times 10^5) = -2.578 \times \ln(2 \times 10^5) \approx -31.4 \text{ kJ/mol}$$

Derivation: Boltzmann Distribution for Molecular States

The Boltzmann distribution describes the probability of a molecule occupying a state with energy E at thermal equilibrium. This is fundamental to understanding molecular interactions in biology.

Step 1: The fundamental postulate of statistical mechanics

For an isolated system with total energy $E_{total}$, all accessible microstates are equally probable. Consider a small subsystem (a single molecule) in contact with a large heat bath (the rest of the cell).

Step 2: Probability is proportional to the number of reservoir states

When the molecule is in state i with energy $E_i$, the reservoir has energy $E_{total} - E_i$. The number of reservoir microstates is $\Omega(E_{total} - E_i)$, so:

$$P(E_i) \propto \Omega(E_{total} - E_i)$$

Step 3: Use the entropy-microstate relation and Taylor expand

Since $S = k_B \ln \Omega$, we get $\Omega = e^{S/k_B}$. Taylor-expanding the reservoir entropy around $E_{total}$:

$$S(E_{total} - E_i) \approx S(E_{total}) - E_i \frac{\partial S}{\partial E} = S(E_{total}) - \frac{E_i}{T}$$

where we used $\partial S/\partial E = 1/T$ (the thermodynamic definition of temperature).

Step 4: Exponentiate to get the Boltzmann factor

$$P(E_i) \propto e^{S(E_{total})/k_B} \cdot e^{-E_i/k_BT} \propto e^{-E_i/k_BT}$$

The first factor is a constant (same for all states) and is absorbed into normalization.

Step 5: Define the partition function Z for normalization

Since probabilities must sum to 1:

$$Z = \sum_i e^{-E_i/k_BT}$$

$$\boxed{P(E_i) = \frac{e^{-E_i/k_BT}}{Z}}$$

Step 6: Connection to free energy and biological significance

The Helmholtz free energy is $F = -k_BT \ln Z$. At room temperature ($T = 300$ K), $k_BT \approx 2.5$ kJ/mol $\approx 0.6$ kcal/mol. This sets the energy scale for non-covalent interactions: a hydrogen bond (~10 kJ/mol $\approx 4k_BT$) significantly shifts the population, while thermal fluctuations can break individual weak bonds, enabling dynamic molecular recognition.

Derivation: Morse Potential for Bond Energy and Bond Length

The Morse potential provides a more realistic model for the potential energy of a diatomic bond than the harmonic approximation, correctly capturing bond dissociation and anharmonicity.

Step 1: Start with the harmonic approximation near equilibrium

Near the equilibrium bond length $r_0$, Hooke's law gives:

$$V_{\text{harmonic}}(r) = \frac{1}{2}k(r - r_0)^2$$

This parabola goes to infinity at large r, which is unphysical -- real bonds dissociate at finite energy.

Step 2: Require correct asymptotic behavior

A realistic potential must satisfy: $V(r_0) = 0$ (minimum), $V(r \to \infty) = D_e$ (dissociation energy), and $V'(r_0) = 0$ (equilibrium). The Morse potential achieves this with an exponential form:

$$\boxed{E(r) = D_e\left(1 - e^{-a(r - r_0)}\right)^2}$$

Step 3: Verify the boundary conditions

At $r = r_0$: $E = D_e(1 - 1)^2 = 0$ (minimum). As $r \to \infty$: $e^{-a(r-r_0)} \to 0$, so $E \to D_e$ (dissociation plateau). As $r \to 0$: $E$ rises steeply (repulsion).

Step 4: Relate the width parameter a to the force constant k

Expanding around $r_0$ using $e^{-x} \approx 1 - x + x^2/2$:

$$E(r) \approx D_e \cdot a^2(r - r_0)^2 + \cdots$$

Comparing with the harmonic potential: $k = 2D_e a^2$, therefore:

$$a = \sqrt{\frac{k}{2D_e}}$$

Step 5: The force and its relationship to bond stretching

$$F(r) = -\frac{dE}{dr} = 2aD_e\left(1 - e^{-a(r-r_0)}\right)e^{-a(r-r_0)}$$

The maximum restoring force occurs at the inflection point. Beyond this, the bond weakens and eventually breaks.

Step 6: Biological application -- C-C bond parameters

For a C-C single bond: $D_e \approx 348$ kJ/mol, $r_0 = 0.154$ nm, $a \approx 19.0$ nm$^{-1}$. For C=C double bond: $D_e \approx 614$ kJ/mol, $r_0 = 0.134$ nm. The higher $D_e$ and shorter $r_0$ for double bonds reflects the general trend: stronger bonds are shorter, a consequence of greater orbital overlap.

Actual vs. Standard Free Energy

$$\Delta G = \Delta G°' + RT\ln\frac{[\text{Products}]}{[\text{Reactants}]}$$

$\Delta G°'$ is the biochemical standard free energy (pH 7, 25 C, 1 M concentrations). The actual $\Delta G$ depends on cellular concentrations, which can shift reactions that are unfavorable under standard conditions to become favorable in vivo.

ATP: The Energy Currency

$$\text{ATP} + \text{H}_2\text{O} \rightarrow \text{ADP} + \text{P}_i \quad \Delta G°' = -30.5 \text{ kJ/mol}$$

ATP hydrolysis releases energy due to: (1) electrostatic repulsion between the closely spaced negative charges on phosphate groups, (2) better resonance stabilization of products (ADP + Pi), and (3) greater hydration of products. Under cellular conditions ($[\text{ATP}] \approx 5 \text{ mM}$,$[\text{ADP}] \approx 0.5 \text{ mM}$, $[\text{P}_i] \approx 5 \text{ mM}$), the actual $\Delta G$ is approximately -50 to -54 kJ/mol.

Coupled Reactions

Endergonic reactions can be driven by coupling to ATP hydrolysis. For example, glutamine synthesis:

  • Glutamate + NH₃ → Glutamine + H₂O   ($\Delta G°' = +14.2$ kJ/mol) -- unfavorable alone
  • ATP + H₂O → ADP + Pi   ($\Delta G°' = -30.5$ kJ/mol)
  • Coupled: Glutamate + NH₃ + ATP → Glutamine + ADP + Pi   ($\Delta G°' = -16.3$ kJ/mol) -- favorable!

This principle of energetic coupling underpins virtually all biosynthetic reactions. The cell maintains a high [ATP]/[ADP] ratio (typically ~500:1) far from equilibrium, making ATP hydrolysis even more exergonic than the standard value suggests.

Other High-Energy Compounds

  • Phosphoenolpyruvate (PEP): $\Delta G°' = -61.9$ kJ/mol
  • 1,3-Bisphosphoglycerate: $\Delta G°' = -49.4$ kJ/mol
  • Creatine phosphate: $\Delta G°' = -43.1$ kJ/mol
  • ATP (to ADP): $\Delta G°' = -30.5$ kJ/mol
  • Glucose-6-phosphate: $\Delta G°' = -13.8$ kJ/mol
  • Glycerol-3-phosphate: $\Delta G°' = -9.2$ kJ/mol

Compounds above ATP in the thermodynamic scale can phosphorylate ADP to regenerate ATP (substrate-level phosphorylation). Compounds below ATP require ATP hydrolysis to be phosphorylated.

The Four Macromolecules

Nucleic Acids

DNA and RNA -- store and transmit genetic information

Monomers: Nucleotides (base + sugar + phosphate)

Linkage: 3'-5' phosphodiester bonds. DNA: 2-deoxyribose, A/T/G/C. RNA: ribose, A/U/G/C.

Proteins

Enzymes, structural support, signaling, transport, immunity

Monomers: 20 standard amino acids

Linkage: Peptide bonds (condensation). 4 levels of structure: primary, secondary ($\alpha$-helix, $\beta$-sheet), tertiary, quaternary.

Carbohydrates

Energy storage, structural support, cell recognition

Monomers: Monosaccharides (glucose, fructose, galactose)

Linkage: Glycosidic bonds. Examples: starch, glycogen, cellulose, chitin. General formula: C$_n$(H₂O)$_n$.

Lipids

Membranes, energy storage, signaling, insulation

Not true polymers -- diverse hydrophobic structures

Types: fatty acids, phospholipids (bilayer membranes), sterols (cholesterol), sphingolipids, glycolipids. Energy yield: ~37 kJ/g (vs ~17 kJ/g for carbs).

Python: Buffer pH and Thermodynamics Calculator

Buffer pH and Thermodynamics Calculator

Python

Henderson-Hasselbalch equation, Gibbs free energy, and ATP coupling

script.py92 lines

Click Run to execute the Python code

Code will be executed with Python 3 on the server

Cellular Organization

Prokaryotes

  • No membrane-bound nucleus
  • Circular chromosome (+ plasmids)
  • 70S ribosomes (50S + 30S subunits)
  • Cell wall (peptidoglycan in bacteria)
  • Genome: 0.5--10 Mb, gene-dense (~85% coding)
  • Coupled transcription-translation
  • Examples: Bacteria, Archaea

Eukaryotes

  • Membrane-bound nucleus with nuclear pores
  • Linear chromosomes with telomeres/centromeres
  • 80S ribosomes (60S + 40S subunits)
  • Endomembrane system (ER, Golgi, lysosomes)
  • Genome: 10 Mb--100 Gb, much non-coding (~1.5% coding in human)
  • RNA processing (5' cap, splicing, 3' poly-A) before export
  • Mitochondria (own 16.6 kb circular genome in humans)