1.2 Chemical Foundations
Chemical Bonds in Biomolecules
Covalent Bonds
Strong bonds formed by sharing electron pairs. Bond energy ranges 200--800 kJ/mol.
- C-C single bond: ~348 kJ/mol (0.154 nm)
- C=C double bond: ~614 kJ/mol (0.134 nm)
- C-O bond: ~360 kJ/mol
- C-N bond: ~305 kJ/mol
- C-H bond: ~413 kJ/mol
- O-H bond: ~463 kJ/mol
- P-O bond: ~335 kJ/mol (phosphodiester backbone)
- Peptide bond (C-N): ~370 kJ/mol (partial double-bond character)
Covalent bonds define the primary structure of biomolecules. Breaking them requires enzymes (hydrolases, nucleases, proteases) or extreme conditions (strong acid/base, high temperature).
Non-Covalent Interactions
Individually weak but collectively strong. Determine 3D structure.
- Hydrogen bonds: 10--40 kJ/mol. Donor (N-H, O-H) to acceptor (O, N). Optimal distance: 2.7--3.1 Angstrom. Directional -- strongest when donor-H-acceptor angle is 180 degrees.
- Ionic (electrostatic): ~20 kJ/mol in water (greatly reduced from vacuum by dielectric screening). Follows Coulomb's law.
- Van der Waals: 0.4--4 kJ/mol. Arise from transient dipole-induced dipole interactions. Attractive at optimal distance, strongly repulsive at shorter distances.
- Hydrophobic effect: Not a true bond. Nonpolar groups are excluded from water, driven primarily by entropy of surrounding water molecules.
Coulomb's Law for Ionic Interactions
$$F = \frac{q_1 q_2}{4\pi\varepsilon_0 \varepsilon_r r^2}$$
where q₁ and q₂ are the charges, r is the distance, $\varepsilon_0$ is the permittivity of free space, and $\varepsilon_r$ is the relative permittivity (dielectric constant). Water has $\varepsilon_r \approx 80$, which dramatically weakens ionic interactions compared to vacuum ($\varepsilon_r = 1$). This is why salt dissolves in water but ionic crystals are very stable in dry conditions.
Van der Waals Interactions: Lennard-Jones Potential
$$V(r) = 4\varepsilon\left[\left(\frac{\sigma}{r}\right)^{12} - \left(\frac{\sigma}{r}\right)^{6}\right]$$
The $r^{-12}$ term models short-range repulsion (Pauli exclusion), while the$r^{-6}$ term models London dispersion attraction. $\varepsilon$ is the depth of the potential well and $\sigma$ is the distance at which V = 0. The equilibrium distance is at $r_{\min} = 2^{1/6}\sigma \approx 1.12\sigma$.
Hydrogen Bonding in DNA
A-T Base Pair
2 hydrogen bonds
~7 kJ/mol total
G-C Base Pair
3 hydrogen bonds
~11 kJ/mol total
Note: Base stacking interactions (van der Waals + hydrophobic) contribute more to DNA duplex stability than hydrogen bonding. Each stacking interaction contributes ~20--40 kJ/mol per base pair step. The nearest-neighbor model accounts for this sequence-dependent stability.
Water Chemistry in Biology
Properties of Water
- Polarity: The O-H bond is polar (electronegativity difference ~1.2). Bent geometry (104.5 degrees) gives a net dipole moment of 1.85 D. Enables dissolution of ionic and polar compounds.
- Hydrogen bonding: Each water molecule can form up to 4 H-bonds. In liquid water at 25 C, ~3.4 H-bonds per molecule on average. This gives water unusually high boiling point (100 C vs -161 C for methane), heat capacity (4.18 J/(g K)), and heat of vaporization (40.7 kJ/mol).
- Dielectric constant: $\varepsilon_r \approx 80$ at 25 C, shielding charges and weakening electrostatic interactions.
- Amphoteric nature: Water can act as both acid and base (autoionization).
$$\text{H}_2\text{O} \rightleftharpoons \text{H}^+ + \text{OH}^-$$
$$K_w = [\text{H}^+][\text{OH}^-] = 1.0 \times 10^{-14} \text{ at 25°C}$$
pH, pKa, and the Henderson-Hasselbalch Equation
$$\text{pH} = -\log[\text{H}^+]$$
$$\text{p}K_a = -\log K_a$$
The Henderson-Hasselbalch equation relates pH to the ratio of conjugate base [A$^-$] to weak acid [HA]:
$$\text{pH} = \text{p}K_a + \log\frac{[\text{A}^-]}{[\text{HA}]}$$
Derivation: Henderson-Hasselbalch Equation
Starting from the acid dissociation equilibrium for a weak acid HA dissociating into H$^+$ and A$^-$:
Step 1: Write the acid dissociation constant
$$K_a = \frac{[\text{H}^+][\text{A}^-]}{[\text{HA}]}$$
Step 2: Solve for [H$^+$]
$$[\text{H}^+] = K_a \cdot \frac{[\text{HA}]}{[\text{A}^-]}$$
Step 3: Take the negative logarithm of both sides
$$-\log[\text{H}^+] = -\log K_a - \log\frac{[\text{HA}]}{[\text{A}^-]}$$
Step 4: Substitute pH and pKa definitions
Since $\text{pH} = -\log[\text{H}^+]$ and $\text{p}K_a = -\log K_a$:
$$\text{pH} = \text{p}K_a - \log\frac{[\text{HA}]}{[\text{A}^-]}$$
Step 5: Invert the logarithm argument
Using the identity $-\log(x/y) = \log(y/x)$:
$$\boxed{\text{pH} = \text{p}K_a + \log\frac{[\text{A}^-]}{[\text{HA}]}}$$
Step 6: Key insight -- at pH = pKa
When pH = pKa, $\log\frac{[\text{A}^-]}{[\text{HA}]} = 0$, so $[\text{A}^-] = [\text{HA}]$. Exactly half the acid is deprotonated. This is the point of maximum buffering capacity, because equal concentrations of acid and conjugate base are available to absorb added H$^+$ or OH$^-$.
Key pH Values in Biology
- Blood plasma: 7.35--7.45
- Cytoplasm: ~7.2
- Mitochondrial matrix: ~7.8
- Lysosomes: 4.5--5.0
- Stomach (gastric acid): 1.5--3.5
- Small intestine: 7.5--8.0
Biological Buffer Systems
- Bicarbonate: H₂CO₃/HCO₃$^-$ (pKa = 6.1, blood)
- Phosphate: H₂PO₄$^-$/HPO₄$^{2-}$ (pKa = 6.86, intracellular)
- Histidine: imidazole side chain (pKa = 6.0, proteins)
- Tris: pKa = 8.1 (common lab buffer)
A buffer works best within +/- 1 pH unit of its pKa. At pH = pKa, [HA] = [A$^-$] and the buffering capacity is maximal. The buffer capacity $\beta$ is defined as:
$$\beta = \frac{d[\text{base added}]}{d(\text{pH})} = 2.303 \cdot C \cdot \frac{K_a[\text{H}^+]}{(K_a + [\text{H}^+])^2}$$
where C is the total buffer concentration
Functional Groups in Biomolecules
Organic chemistry of life is built from a limited set of functional groups, each with characteristic reactivity:
Hydroxyl (-OH)
Polar, H-bond donor/acceptor. Found in sugars, serine, threonine. pKa ~16 for alcohols.
Amino (-NH₂)
Basic, H-bond donor. Found in amino acids, nucleotide bases. pKa ~9-10 for primary amines.
Carboxyl (-COOH)
Acidic, ionized at physiological pH. Found in amino acids, fatty acids. pKa ~2-5.
Phosphate (-OPO₃²⁻)
Strongly acidic (pKa1 ~2.1, pKa2 ~6.9). DNA/RNA backbone, ATP. Carries negative charge at pH 7.
Carbonyl (C=O)
Polar, H-bond acceptor. Aldehydes in sugars, ketones, peptide bond. Electrophilic carbon.
Sulfhydryl (-SH)
Weak acid (pKa ~8.3 for cysteine). Forms disulfide bonds (-S-S-). Important in protein folding.
Methyl (-CH₃)
Nonpolar, hydrophobic. DNA methylation (5-methylcytosine) regulates gene expression.
Ester (C-O-C=O)
Links glycerol to fatty acids in lipids. Phosphodiester bonds link nucleotides in DNA/RNA.
Thermodynamics of Biomolecules
Gibbs Free Energy
$$\Delta G = \Delta H - T\Delta S$$
Gibbs Free Energy Equation
$\Delta H$ (Enthalpy)
Heat absorbed/released. Negative = exothermic (bond formation). Positive = endothermic (bond breaking).
$T\Delta S$ (Entropy term)
T in Kelvin. Positive $\Delta S$ = increased disorder (favorable). Hydrophobic effect is entropy-driven.
$\Delta G$ (Free Energy)
Determines spontaneity. $\Delta G < 0$: spontaneous. $\Delta G = 0$: equilibrium. $\Delta G > 0$: non-spontaneous.
Relationship to Equilibrium
$$\Delta G° = -RT\ln K_{eq}$$
where R = 8.314 J/(mol K) and T is temperature in Kelvin. A large $K_{eq}$ means a very negative $\Delta G°$ (reaction strongly favors products).
Derivation: Gibbs Free Energy and Equilibrium Constant
Starting from the fundamental thermodynamic definitions, we derive the relationship between free energy and the equilibrium constant.
Step 1: Define Gibbs free energy from enthalpy and entropy
The Gibbs free energy G is defined as a state function combining enthalpy H and entropy S at constant temperature and pressure:
$$G = H - TS$$
Step 2: For a process at constant T and P, take the change
$$\Delta G = \Delta H - T\Delta S$$
This is the master equation: a reaction is spontaneous when $\Delta G < 0$, which can occur if $\Delta H < 0$ (exothermic) or $T\Delta S > 0$ (entropy increase), or both.
Step 3: Chemical potential and reaction quotient
For a reaction with concentrations differing from standard state, the free energy depends on the reaction quotient Q:
$$\Delta G = \Delta G° + RT\ln Q$$
where $Q = \frac{[\text{Products}]}{[\text{Reactants}]}$ and R = 8.314 J/(mol K).
Step 4: At equilibrium, $\Delta G = 0$ and $Q = K_{eq}$
$$0 = \Delta G° + RT\ln K_{eq}$$
Step 5: Solve for the standard free energy
$$\boxed{\Delta G° = -RT\ln K_{eq}}$$
This is the bridge between thermodynamics and equilibrium. A large $K_{eq}$ (products favored) corresponds to a large negative $\Delta G°$. For example, ATP hydrolysis: $K_{eq} \approx 2 \times 10^5$ gives $\Delta G°' = -30.5$ kJ/mol at 37 C.
Step 6: Numerical example at body temperature (T = 310 K)
$$\Delta G° = -(8.314 \times 10^{-3})(310)\ln(2 \times 10^5) = -2.578 \times \ln(2 \times 10^5) \approx -31.4 \text{ kJ/mol}$$
Derivation: Boltzmann Distribution for Molecular States
The Boltzmann distribution describes the probability of a molecule occupying a state with energy E at thermal equilibrium. This is fundamental to understanding molecular interactions in biology.
Step 1: The fundamental postulate of statistical mechanics
For an isolated system with total energy $E_{total}$, all accessible microstates are equally probable. Consider a small subsystem (a single molecule) in contact with a large heat bath (the rest of the cell).
Step 2: Probability is proportional to the number of reservoir states
When the molecule is in state i with energy $E_i$, the reservoir has energy $E_{total} - E_i$. The number of reservoir microstates is $\Omega(E_{total} - E_i)$, so:
$$P(E_i) \propto \Omega(E_{total} - E_i)$$
Step 3: Use the entropy-microstate relation and Taylor expand
Since $S = k_B \ln \Omega$, we get $\Omega = e^{S/k_B}$. Taylor-expanding the reservoir entropy around $E_{total}$:
$$S(E_{total} - E_i) \approx S(E_{total}) - E_i \frac{\partial S}{\partial E} = S(E_{total}) - \frac{E_i}{T}$$
where we used $\partial S/\partial E = 1/T$ (the thermodynamic definition of temperature).
Step 4: Exponentiate to get the Boltzmann factor
$$P(E_i) \propto e^{S(E_{total})/k_B} \cdot e^{-E_i/k_BT} \propto e^{-E_i/k_BT}$$
The first factor is a constant (same for all states) and is absorbed into normalization.
Step 5: Define the partition function Z for normalization
Since probabilities must sum to 1:
$$Z = \sum_i e^{-E_i/k_BT}$$
$$\boxed{P(E_i) = \frac{e^{-E_i/k_BT}}{Z}}$$
Step 6: Connection to free energy and biological significance
The Helmholtz free energy is $F = -k_BT \ln Z$. At room temperature ($T = 300$ K), $k_BT \approx 2.5$ kJ/mol $\approx 0.6$ kcal/mol. This sets the energy scale for non-covalent interactions: a hydrogen bond (~10 kJ/mol $\approx 4k_BT$) significantly shifts the population, while thermal fluctuations can break individual weak bonds, enabling dynamic molecular recognition.
Derivation: Morse Potential for Bond Energy and Bond Length
The Morse potential provides a more realistic model for the potential energy of a diatomic bond than the harmonic approximation, correctly capturing bond dissociation and anharmonicity.
Step 1: Start with the harmonic approximation near equilibrium
Near the equilibrium bond length $r_0$, Hooke's law gives:
$$V_{\text{harmonic}}(r) = \frac{1}{2}k(r - r_0)^2$$
This parabola goes to infinity at large r, which is unphysical -- real bonds dissociate at finite energy.
Step 2: Require correct asymptotic behavior
A realistic potential must satisfy: $V(r_0) = 0$ (minimum), $V(r \to \infty) = D_e$ (dissociation energy), and $V'(r_0) = 0$ (equilibrium). The Morse potential achieves this with an exponential form:
$$\boxed{E(r) = D_e\left(1 - e^{-a(r - r_0)}\right)^2}$$
Step 3: Verify the boundary conditions
At $r = r_0$: $E = D_e(1 - 1)^2 = 0$ (minimum). As $r \to \infty$: $e^{-a(r-r_0)} \to 0$, so $E \to D_e$ (dissociation plateau). As $r \to 0$: $E$ rises steeply (repulsion).
Step 4: Relate the width parameter a to the force constant k
Expanding around $r_0$ using $e^{-x} \approx 1 - x + x^2/2$:
$$E(r) \approx D_e \cdot a^2(r - r_0)^2 + \cdots$$
Comparing with the harmonic potential: $k = 2D_e a^2$, therefore:
$$a = \sqrt{\frac{k}{2D_e}}$$
Step 5: The force and its relationship to bond stretching
$$F(r) = -\frac{dE}{dr} = 2aD_e\left(1 - e^{-a(r-r_0)}\right)e^{-a(r-r_0)}$$
The maximum restoring force occurs at the inflection point. Beyond this, the bond weakens and eventually breaks.
Step 6: Biological application -- C-C bond parameters
For a C-C single bond: $D_e \approx 348$ kJ/mol, $r_0 = 0.154$ nm, $a \approx 19.0$ nm$^{-1}$. For C=C double bond: $D_e \approx 614$ kJ/mol, $r_0 = 0.134$ nm. The higher $D_e$ and shorter $r_0$ for double bonds reflects the general trend: stronger bonds are shorter, a consequence of greater orbital overlap.
Actual vs. Standard Free Energy
$$\Delta G = \Delta G°' + RT\ln\frac{[\text{Products}]}{[\text{Reactants}]}$$
$\Delta G°'$ is the biochemical standard free energy (pH 7, 25 C, 1 M concentrations). The actual $\Delta G$ depends on cellular concentrations, which can shift reactions that are unfavorable under standard conditions to become favorable in vivo.
ATP: The Energy Currency
$$\text{ATP} + \text{H}_2\text{O} \rightarrow \text{ADP} + \text{P}_i \quad \Delta G°' = -30.5 \text{ kJ/mol}$$
ATP hydrolysis releases energy due to: (1) electrostatic repulsion between the closely spaced negative charges on phosphate groups, (2) better resonance stabilization of products (ADP + Pi), and (3) greater hydration of products. Under cellular conditions ($[\text{ATP}] \approx 5 \text{ mM}$,$[\text{ADP}] \approx 0.5 \text{ mM}$, $[\text{P}_i] \approx 5 \text{ mM}$), the actual $\Delta G$ is approximately -50 to -54 kJ/mol.
Coupled Reactions
Endergonic reactions can be driven by coupling to ATP hydrolysis. For example, glutamine synthesis:
- Glutamate + NH₃ → Glutamine + H₂O ($\Delta G°' = +14.2$ kJ/mol) -- unfavorable alone
- ATP + H₂O → ADP + Pi ($\Delta G°' = -30.5$ kJ/mol)
- Coupled: Glutamate + NH₃ + ATP → Glutamine + ADP + Pi ($\Delta G°' = -16.3$ kJ/mol) -- favorable!
This principle of energetic coupling underpins virtually all biosynthetic reactions. The cell maintains a high [ATP]/[ADP] ratio (typically ~500:1) far from equilibrium, making ATP hydrolysis even more exergonic than the standard value suggests.
Other High-Energy Compounds
- Phosphoenolpyruvate (PEP): $\Delta G°' = -61.9$ kJ/mol
- 1,3-Bisphosphoglycerate: $\Delta G°' = -49.4$ kJ/mol
- Creatine phosphate: $\Delta G°' = -43.1$ kJ/mol
- ATP (to ADP): $\Delta G°' = -30.5$ kJ/mol
- Glucose-6-phosphate: $\Delta G°' = -13.8$ kJ/mol
- Glycerol-3-phosphate: $\Delta G°' = -9.2$ kJ/mol
Compounds above ATP in the thermodynamic scale can phosphorylate ADP to regenerate ATP (substrate-level phosphorylation). Compounds below ATP require ATP hydrolysis to be phosphorylated.
The Four Macromolecules
Nucleic Acids
DNA and RNA -- store and transmit genetic information
Monomers: Nucleotides (base + sugar + phosphate)
Linkage: 3'-5' phosphodiester bonds. DNA: 2-deoxyribose, A/T/G/C. RNA: ribose, A/U/G/C.
Proteins
Enzymes, structural support, signaling, transport, immunity
Monomers: 20 standard amino acids
Linkage: Peptide bonds (condensation). 4 levels of structure: primary, secondary ($\alpha$-helix, $\beta$-sheet), tertiary, quaternary.
Carbohydrates
Energy storage, structural support, cell recognition
Monomers: Monosaccharides (glucose, fructose, galactose)
Linkage: Glycosidic bonds. Examples: starch, glycogen, cellulose, chitin. General formula: C$_n$(H₂O)$_n$.
Lipids
Membranes, energy storage, signaling, insulation
Not true polymers -- diverse hydrophobic structures
Types: fatty acids, phospholipids (bilayer membranes), sterols (cholesterol), sphingolipids, glycolipids. Energy yield: ~37 kJ/g (vs ~17 kJ/g for carbs).
Python: Buffer pH and Thermodynamics Calculator
Buffer pH and Thermodynamics Calculator
PythonHenderson-Hasselbalch equation, Gibbs free energy, and ATP coupling
Click Run to execute the Python code
Code will be executed with Python 3 on the server
Cellular Organization
Prokaryotes
- No membrane-bound nucleus
- Circular chromosome (+ plasmids)
- 70S ribosomes (50S + 30S subunits)
- Cell wall (peptidoglycan in bacteria)
- Genome: 0.5--10 Mb, gene-dense (~85% coding)
- Coupled transcription-translation
- Examples: Bacteria, Archaea
Eukaryotes
- Membrane-bound nucleus with nuclear pores
- Linear chromosomes with telomeres/centromeres
- 80S ribosomes (60S + 40S subunits)
- Endomembrane system (ER, Golgi, lysosomes)
- Genome: 10 Mb--100 Gb, much non-coding (~1.5% coding in human)
- RNA processing (5' cap, splicing, 3' poly-A) before export
- Mitochondria (own 16.6 kb circular genome in humans)