Amino Acid Structure & Properties
The building blocks of proteins: structure, classification, and acid-base chemistry
General Amino Acid Structure
All 20 standard amino acids share a common structural framework. Each contains a central carbon atom known as the $\alpha$-carbon ($C_\alpha$), which is bonded to four distinct groups:
- An amino group ($-\text{NH}_3^+$ at physiological pH)
- A carboxyl group ($-\text{COO}^-$ at physiological pH)
- A hydrogen atom ($-\text{H}$)
- A variable side chain ($-\text{R}$), which distinguishes each amino acid
General Formula
The generic structure of an $\alpha$-amino acid can be written as:
At neutral pH, amino acids exist predominantly as zwitterions with the amino group protonated ($-\text{NH}_3^+$) and the carboxyl group deprotonated ($-\text{COO}^-$).
Chirality: L and D Amino Acids
Because the $C_\alpha$ is bonded to four different groups (except in glycine, where $\text{R} = \text{H}$), it is a chiral center (asymmetric carbon). This gives rise to two non-superimposable mirror-image forms called enantiomers, designated L (levorotatory) and D (dextrorotatory) based on their relationship to L- and D-glyceraldehyde in Fischer projections.
In a Fischer projection, the L-amino acid has the amino group on the left side of the $C_\alpha$. Virtually all amino acids found in proteins are of the L-configuration. The D-amino acids occur in bacterial cell walls and certain antibiotics but are exceedingly rare in eukaryotic proteins.
Using the Cahn-Ingold-Prelog (R/S) system, most L-amino acids correspond to the S configuration, with the notable exception of L-cysteine, which is assigned R due to the higher priority of the sulfur-containing side chain.
Classification by R-Group Properties
The 20 standard amino acids are classified according to the chemical properties of their side chains (R-groups). These properties determine the amino acid's role in protein structure and function, its solubility, and its reactivity.
Nonpolar / Hydrophobic (9 amino acids)
These residues have aliphatic or aromatic R-groups that are largely insoluble in water. They tend to cluster in the interior of globular proteins, away from the aqueous environment, stabilizing the tertiary structure through hydrophobic interactions.
| Amino Acid | 3-Letter | 1-Letter | R-Group Character |
|---|---|---|---|
| Glycine | Gly | G | H (simplest; technically achiral) |
| Alanine | Ala | A | Methyl group |
| Valine | Val | V | Branched aliphatic (isopropyl) |
| Leucine | Leu | L | Branched aliphatic (isobutyl) |
| Isoleucine | Ile | I | Branched aliphatic (sec-butyl) |
| Proline | Pro | P | Cyclic (pyrrolidine ring) |
| Phenylalanine | Phe | F | Aromatic (benzyl) |
| Tryptophan | Trp | W | Aromatic (indole ring) |
| Methionine | Met | M | Thioether ($-\text{S}-\text{CH}_3$) |
Polar Uncharged (6 amino acids)
These amino acids have R-groups capable of hydrogen bonding but carry no formal charge at pH 7.4. They are often found on protein surfaces, mediating interactions with water and other polar molecules.
| Amino Acid | 3-Letter | 1-Letter | Functional Group |
|---|---|---|---|
| Serine | Ser | S | Hydroxyl ($-\text{OH}$) |
| Threonine | Thr | T | Hydroxyl ($-\text{OH}$), branched |
| Cysteine | Cys | C | Sulfhydryl ($-\text{SH}$) |
| Tyrosine | Tyr | Y | Phenolic hydroxyl |
| Asparagine | Asn | N | Amide ($-\text{CONH}_2$) |
| Glutamine | Gln | Q | Amide ($-\text{CONH}_2$) |
Positively Charged at pH 7.4 (3 amino acids)
These amino acids carry a net positive charge under physiological conditions.
| Amino Acid | Code | R-Group p$K_a$ | Ionizable Group |
|---|---|---|---|
| Lysine (Lys, K) | K | ~10.5 | $\varepsilon$-amino group |
| Arginine (Arg, R) | R | ~12.5 | Guanidinium group |
| Histidine (His, H) | H | ~6.0 | Imidazole ring |
Negatively Charged at pH 7.4 (2 amino acids)
These amino acids carry a net negative charge under physiological conditions due to deprotonated carboxyl side chains.
| Amino Acid | Code | R-Group p$K_a$ | Ionizable Group |
|---|---|---|---|
| Aspartate (Asp, D) | D | ~3.65 | $\beta$-carboxyl |
| Glutamate (Glu, E) | E | ~4.25 | $\gamma$-carboxyl |
Acid-Base Chemistry of Amino Acids
Amino acids are amphoteric molecules: they can act as both acids and bases. At physiological pH (~7.4), amino acids exist predominantly as zwitterions -- dipolar ions in which the amino group is protonated ($-\text{NH}_3^+$) and the carboxyl group is deprotonated ($-\text{COO}^-$). The net charge of the zwitterion is zero, despite the presence of two charged groups.
Ionization Equilibria
A simple amino acid (no ionizable R-group) undergoes two ionization steps:
The first ionization (p$K_{a1}$) corresponds to loss of a proton from the carboxyl group; the second (p$K_{a2}$) to loss of a proton from the ammonium group.
The Henderson-Hasselbalch Equation
The relationship between pH, p$K_a$, and the ratio of conjugate base to acid is given by:
Key implications:
- When $\text{pH} = \text{p}K_a$, then $[\text{A}^-] = [\text{HA}]$ and the group is 50% ionized
- When $\text{pH} = \text{p}K_a + 1$, the ratio $[\text{A}^-]/[\text{HA}] = 10$ (90.9% deprotonated)
- When $\text{pH} = \text{p}K_a - 1$, the ratio $[\text{A}^-]/[\text{HA}] = 0.1$ (9.1% deprotonated)
- Effective buffering capacity exists within $\pm 1$ pH unit of the p$K_a$
Example: Fractional charge calculation
What fraction of the $\alpha$-carboxyl group of alanine (p$K_{a1} = 2.34$) is deprotonated at pH 3.34?
Therefore, $\frac{[\text{COO}^-]}{[\text{COO}^-] + [\text{COOH}]} = \frac{10}{11} \approx 0.909$, so the carboxyl group is approximately 90.9% deprotonated.
Isoelectric Point (pI)
Definition
The isoelectric point (pI) is the pH at which the amino acid carries no net electrical charge. At the pI, the amino acid will not migrate in an electric field (electrophoresis) and has minimum solubility.
Calculating pI
The pI is calculated as the average of the two p$K_a$ values that flank the zwitterionic (net zero charge) species:
For amino acids with ionizable side chains, identify the two p$K_a$ values that bracket the species with net charge = 0, then average those two.
Example 1: Glycine (no ionizable R-group)
p$K_{a1}$ = 2.34 ($\alpha$-COOH), p$K_{a2}$ = 9.60 ($\alpha$-$\text{NH}_3^+$)
Example 2: Aspartate (acidic R-group)
p$K_{a1}$ = 2.09 ($\alpha$-COOH), p$K_{aR}$ = 3.86 ($\beta$-COOH), p$K_{a2}$ = 9.82 ($\alpha$-$\text{NH}_3^+$)
The zwitterion (net charge = 0) is flanked by p$K_{a1}$ and p$K_{aR}$ (both carboxyl groups):
Example 3: Lysine (basic R-group)
p$K_{a1}$ = 2.18 ($\alpha$-COOH), p$K_{a2}$ = 8.95 ($\alpha$-$\text{NH}_3^+$), p$K_{aR}$ = 10.53 ($\varepsilon$-$\text{NH}_3^+$)
The zwitterion with net charge = 0 is flanked by p$K_{a2}$ and p$K_{aR}$ (both amino groups):
Example 4: Histidine (physiological buffer)
p$K_{a1}$ = 1.82 ($\alpha$-COOH), p$K_{aR}$ = 6.00 (imidazole), p$K_{a2}$ = 9.17 ($\alpha$-$\text{NH}_3^+$)
The zwitterion (net charge = 0) is flanked by p$K_{aR}$ and p$K_{a2}$:
Titration Curves
Titration of an amino acid with a strong base (e.g., NaOH) reveals the p$K_a$ values as inflection points on the titration curve. At each p$K_a$, the amino acid acts as a buffer, and the curve shows a relatively flat region (buffering zone).
Titration of Alanine (2 ionizable groups)
Starting from fully protonated form ($\text{H}_3\text{N}^+ - \text{CHR} - \text{COOH}$, net charge = +1) and adding NaOH:
- 0 to 1 equivalent NaOH: Titration of $\alpha$-COOH (p$K_{a1} = 2.34$). Buffering region near pH 2.34.
- At 0.5 equivalents: pH = p$K_{a1} = 2.34$. Equal amounts of cation and zwitterion.
- At 1 equivalent: pH = pI = $\frac{2.34 + 9.69}{2} = 6.02$. Pure zwitterion form.
- 1 to 2 equivalents NaOH: Titration of $\alpha$-$\text{NH}_3^+$ (p$K_{a2} = 9.69$). Buffering region near pH 9.69.
- At 1.5 equivalents: pH = p$K_{a2} = 9.69$. Equal amounts of zwitterion and anion.
- At 2 equivalents: Fully deprotonated anion ($\text{H}_2\text{N} - \text{CHR} - \text{COO}^-$, net charge = -1).
Titration of Glutamic Acid (3 ionizable groups)
Glutamic acid has three p$K_a$ values corresponding to the $\alpha$-COOH (p$K_{a1} = 2.19$), $\gamma$-COOH (p$K_{aR} = 4.25$), and $\alpha$-$\text{NH}_3^+$ (p$K_{a2} = 9.67$).
The titration curve shows three buffering plateaus and requires 3 equivalents of NaOH for complete titration.
Note: for acidic amino acids, the pI falls between the two lowest p$K_a$ values (both carboxyl groups). For basic amino acids, the pI falls between the two highest p$K_a$ values (both amino groups or amino + imidazole).
Buffering Capacity
The buffering capacity $\beta$ is defined as the amount of strong acid or base required to change the pH by one unit. Mathematically:
where $C_T$ is the total buffer concentration. Buffering capacity is maximal when $\text{pH} = \text{p}K_a$ (i.e., when $[\text{H}^+] = K_a$).
Special Amino Acids
Several amino acids possess unique structural or chemical features that give them special roles in protein biochemistry.
Proline -- The Cyclic Imino Acid
Proline is unique among the 20 standard amino acids because its side chain cyclizes back onto the backbone nitrogen, forming a five-membered pyrrolidine ring. This makes proline technically an imino acid rather than an amino acid, as the nitrogen bears only one hydrogen in its free form.
- The cyclic structure restricts the backbone dihedral angle $\phi$ to approximately $-65°$
- Proline introduces kinks in $\alpha$-helices and is often found in turns
- The $\text{X-Pro}$ peptide bond has a higher propensity for the cis configuration (~10%) compared to other peptide bonds (<0.1%)
- Proline residues are important in collagen, where every third residue is Gly and Pro/Hyp are abundant
Cysteine -- Disulfide Bond Formation
Cysteine's thiol group ($-\text{SH}$, p$K_a \approx 8.3$) can form covalent disulfide bonds ($-\text{S}-\text{S}-$) with another cysteine via oxidation:
The resulting cystine residue is critical for stabilizing the tertiary and quaternary structure of many extracellular proteins (e.g., insulin, immunoglobulins). The standard reduction potential is:
Glycine -- The Simplest Amino Acid
Glycine ($\text{R} = \text{H}$) is the smallest amino acid and the only one that is achiral. Its small size gives it maximal conformational flexibility in proteins:
- Glycine occupies the widest region of the Ramachandran plot
- It is essential in tight turns and in proteins requiring compact packing (e.g., collagen: Gly-X-Y repeat)
- Glycine also serves as a neurotransmitter and a precursor to porphyrins, purines, and glutathione
Histidine -- The Physiological Buffer
Histidine's imidazole side chain has a p$K_a \approx 6.0$, making it the only standard amino acid whose R-group can ionize near physiological pH. This property makes histidine uniquely suited for several roles:
- Physiological buffering: effective buffer in the pH range 5.0-7.0
- Enzyme catalysis: the imidazole ring can act as both proton donor and acceptor (general acid-base catalysis)
- Metal coordination: the nitrogen atoms of the imidazole ring coordinate metal ions (e.g., $\text{Zn}^{2+}$ in carbonic anhydrase, $\text{Fe}^{2+}$ in hemoglobin)
The protonation equilibrium of the imidazole side chain:
At pH 7.4, approximately $\frac{1}{1 + 10^{7.4 - 6.0}} = \frac{1}{1 + 25.1} \approx 3.8\%$ of histidine residues are protonated. This partial protonation at physiological pH is precisely what makes histidine so versatile in enzyme active sites.
Key Concepts
- ā¶All 20 standard amino acids share a common backbone: $\alpha$-amino group, $\alpha$-carboxyl group, H atom, and R-group bonded to the $C_\alpha$.
- ā¶All amino acids in proteins are of the L-configuration. The $C_\alpha$ is a chiral center in all amino acids except glycine.
- ā¶R-groups are classified as nonpolar, polar uncharged, positively charged, or negatively charged at pH 7.4.
- ā¶At physiological pH, amino acids exist as zwitterions with the amino group protonated and the carboxyl group deprotonated.
- ā¶The Henderson-Hasselbalch equation ($\text{pH} = \text{p}K_a + \log[\text{A}^-]/[\text{HA}]$) relates pH, p$K_a$, and the ionization state of each group.
- ā¶The isoelectric point (pI) is the average of the two p$K_a$ values flanking the zwitterionic species.
- ā¶Histidine (p$K_a \approx 6.0$) is the only amino acid that buffers effectively near physiological pH and is critical in enzyme active sites.
- ā¶Cysteine forms disulfide bonds through oxidation of thiol groups, stabilizing protein structure.