
Protein Folding Dynamics

From two-state folding kinetics and chevron analysis through phi-value mapping of transition states to the Zimm-Bragg helix-coil theory and chaperone mechanisms.

Derivation 1: Two-State Folding & the Chevron Plot

Many small single-domain proteins fold in a two-state manner, with only the native (N) and unfolded (U) states significantly populated:

$$\text{U} \;\underset{k_u}{\overset{k_f}{\rightleftharpoons}}\; \text{N}$$

Equilibrium Thermodynamics

The equilibrium constant and free energy of folding are:

$$K = \frac{[\text{U}]}{[\text{N}]} = \frac{k_u}{k_f} = \exp\!\left(\frac{\Delta G}{RT}\right)$$

where $\Delta G = G_N - G_U$ is the free energy of folding, so $\Delta G < 0$ for a stable protein and $K < 1$. At the denaturant midpoint concentration $C_m$, $\Delta G = 0$ and $K = 1$ ($k_f = k_u$).

The fraction unfolded at equilibrium is:

$$f_U = \frac{K}{1 + K} = \frac{1}{1 + \exp(-\Delta G/RT)}$$
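A minimal numerical sketch of the fraction-unfolded formula, assuming $\Delta G$ is the folding free energy in kJ/mol (negative when the native state is favored). The function name `fraction_unfolded` and the sample values are illustrative only, not taken from any dataset.

```python
import math

R = 8.314e-3  # gas constant in kJ/(mol*K)


def fraction_unfolded(delta_g_fold, temperature=298.15):
    """f_U = 1 / (1 + exp(-dG/RT)), with dG = G_N - G_U in kJ/mol."""
    return 1.0 / (1.0 + math.exp(-delta_g_fold / (R * temperature)))


# Illustrative folding free energies (kJ/mol)
for dg in (-20.0, -5.0, 0.0, 5.0):
    print(f"dG = {dg:+6.1f} kJ/mol -> f_U = {fraction_unfolded(dg):.4f}")
```

At $\Delta G = 0$ the function returns exactly one half, which is the midpoint condition used to define $C_m$.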

Kinetics: The Chevron Plot

For a two-state protein, relaxation to equilibrium follows single-exponential kinetics with observed rate:

$$\boxed{k_{\text{obs}} = k_f + k_u}$$
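As a brief sketch of where this comes from, assume the total protein concentration $P = [\text{U}] + [\text{N}]$ is conserved ($P$ is a symbol introduced here only for the derivation). The two-state rate equation is then:

$$\frac{d[\text{N}]}{dt} = k_f[\text{U}] - k_u[\text{N}] = k_f P - (k_f + k_u)[\text{N}]$$

This is a linear first-order equation whose solution relaxes exponentially to the equilibrium value $[\text{N}]_{\text{eq}} = k_f P/(k_f + k_u)$:

$$[\text{N}](t) = [\text{N}]_{\text{eq}} + \left([\text{N}]_0 - [\text{N}]_{\text{eq}}\right) e^{-(k_f + k_u)t}$$

so the single observed rate constant is the sum of the two microscopic rates, whichever direction the experiment is started from.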

Both rates depend on denaturant concentration [D] according to linear free energy relationships:

$$\ln k_f = \ln k_f^{\text{H}_2\text{O}} + \frac{m_f[\text{D}]}{RT}, \qquad \ln k_u = \ln k_u^{\text{H}_2\text{O}} + \frac{m_u[\text{D}]}{RT}$$

where $m_f < 0$ (folding slows with denaturant) and $m_u > 0$ (unfolding accelerates). A plot of $\ln k_{\text{obs}}$ vs [D] produces the characteristic V-shaped chevron plot: the left arm reflects folding and the right arm reflects unfolding.

Tanford $\beta$-Value

The Tanford $\beta$ value quantifies the position of the transition state on the folding reaction coordinate:

$$\boxed{\beta_T = \frac{m_f}{m_f - m_u} = \frac{m_f}{m_{\text{eq}}}}$$

where $m_{\text{eq}} = m_f - m_u$ is the equilibrium m-value. A $\beta_T \approx 0.7$ indicates that the transition state has ~70% of the solvent-accessible surface area buried relative to the native state — it is compact and native-like.

Linear Extrapolation Method (LEM)

The free energy of folding in water is obtained by linear extrapolation:

$$\Delta G([\text{D}]) = \Delta G^{\text{H}_2\text{O}} - m_{\text{eq}}[\text{D}]$$

At $C_m$, $\Delta G = 0$, so $\Delta G^{\text{H}_2\text{O}} = m_{\text{eq}} \cdot C_m$. Kinetic and equilibrium measurements must agree: $\Delta G^{\text{H}_2\text{O}} = -RT\ln(k_f^{\text{H}_2\text{O}}/k_u^{\text{H}_2\text{O}})$.
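The relations above can be checked numerically. The sketch below assumes the sign conventions of this section ($m_f < 0$, $m_u > 0$, $m_{\text{eq}} = m_f - m_u$) and uses hypothetical rate constants and m-values chosen only for illustration; the helper names `lem_from_kinetics` and `rate_at` are not from any library.

```python
import math

RT = 2.479  # kJ/mol at 25 C


def lem_from_kinetics(kf_water, ku_water, m_f, m_u):
    """Return (dG_water, m_eq, C_m, beta_T) for a two-state folder."""
    m_eq = m_f - m_u                                # kJ/(mol*M), negative
    dg_water = -RT * math.log(kf_water / ku_water)  # folding free energy in water
    c_m = dg_water / m_eq                           # from dG_water = m_eq * C_m
    beta_t = m_f / m_eq                             # Tanford beta
    return dg_water, m_eq, c_m, beta_t


def rate_at(k_water, m, denaturant):
    """Rate constant at a given denaturant concentration (linear free energy relation)."""
    return k_water * math.exp(m * denaturant / RT)


# Hypothetical parameters: k_f = 50 s^-1, k_u = 1e-3 s^-1, m_f = -4.5, m_u = 1.5 kJ/(mol*M)
dg_water, m_eq, c_m, beta_t = lem_from_kinetics(50.0, 1e-3, -4.5, 1.5)
print(f"dG_water = {dg_water:.2f} kJ/mol, m_eq = {m_eq:.1f} kJ/(mol*M)")
print(f"C_m = {c_m:.2f} M, beta_T = {beta_t:.2f}")
print(f"k_f(C_m) = {rate_at(50.0, -4.5, c_m):.5f}, k_u(C_m) = {rate_at(1e-3, 1.5, c_m):.5f}")
```

The last line is the consistency check: at the $C_m$ obtained from the equilibrium relation, the two kinetic arms cross, so $k_f = k_u$.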

Derivation 2: $\Phi$-Value Analysis

$\Phi$-value analysis, developed by Alan Fersht, is the most powerful experimental method for mapping the structure of the transition state (TS) at residue-level resolution.

Definition

For a mutation that destabilizes the native state by $\Delta\Delta G°$ and raises the folding barrier by $\Delta\Delta G^\ddagger$:

$$\boxed{\Phi = \frac{\Delta\Delta G^\ddagger}{\Delta\Delta G°}}$$

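where the individual terms are computed from kinetic data, with $K_{\text{eq}} = k_f/k_u = [\text{N}]/[\text{U}]$ denoting the folding equilibrium constant (the inverse of $K$ in Derivation 1):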

$$\Delta\Delta G^\ddagger = -RT\ln\!\left(\frac{k_f^{\text{mut}}}{k_f^{\text{wt}}}\right)$$

$$\Delta\Delta G° = -RT\ln\!\left(\frac{K_{\text{eq}}^{\text{mut}}}{K_{\text{eq}}^{\text{wt}}}\right) = -RT\ln\!\left(\frac{k_f^{\text{mut}}/k_u^{\text{mut}}}{k_f^{\text{wt}}/k_u^{\text{wt}}}\right)$$

Interpretation

$\Phi = 1$: The mutation affects the TS as much as it affects the native state. The residue is fully structured (native-like) in the TS. The mutation slows folding but does not affect unfolding.
$\Phi = 0$: The mutation does not affect the TS at all. The residue is fully unstructured in the TS. The mutation accelerates unfolding but does not affect folding.
$0 < \Phi < 1$: Partial structure formation at the TS. Could reflect fractional native contacts, or an average over parallel pathways.
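The definition and the three limiting cases can be illustrated with a short calculation. This is a sketch under the two-state assumption, using made-up wild-type and mutant rate constants; `phi_value` is a hypothetical helper name.

```python
import math

RT = 2.479  # kJ/mol at 25 C


def phi_value(kf_wt, ku_wt, kf_mut, ku_mut):
    """Return (Phi, ddG_TS, ddG_eq) from folding and unfolding rate constants."""
    ddg_ts = -RT * math.log(kf_mut / kf_wt)
    ddg_eq = -RT * math.log((kf_mut / ku_mut) / (kf_wt / ku_wt))
    return ddg_ts / ddg_eq, ddg_ts, ddg_eq


kf_wt, ku_wt = 50.0, 1e-3  # hypothetical wild-type rates (s^-1)
root10 = math.sqrt(10.0)
cases = {
    "folding 10x slower, unfolding unchanged": (kf_wt / 10.0, ku_wt),
    "unfolding 10x faster, folding unchanged": (kf_wt, ku_wt * 10.0),
    "both rates shifted by sqrt(10)": (kf_wt / root10, ku_wt * root10),
}
for label, (kf_mut, ku_mut) in cases.items():
    phi, ddg_ts, ddg_eq = phi_value(kf_wt, ku_wt, kf_mut, ku_mut)
    print(f"{label}: ddG_TS = {ddg_ts:.2f}, ddG_eq = {ddg_eq:.2f}, Phi = {phi:.2f}")
```

The three hypothetical mutants give $\Phi$ values of 1, 0, and 0.5 respectively, each with the same total destabilization of $RT\ln 10$.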

Requirements for Reliable $\Phi$-Values

Valid $\Phi$-value analysis requires:

Conservative mutations (e.g., $\text{Val} \rightarrow \text{Ala}$, deletion of a methyl group) that remove non-covalent interactions without introducing new ones
Sufficiently large $\Delta\Delta G° > 2\;\text{kJ/mol}$ to avoid noise artifacts
The protein must remain two-state (no change in folding mechanism upon mutation)
Linear chevron plots without rollover (curvature indicates non-two-state behavior)

Key Findings from $\Phi$-Value Studies

Decades of $\Phi$-value analysis across many proteins have revealed that:

Transition states are heterogeneous but compact — generally 60–80% of native contacts are formed
The nucleus (high-$\Phi$ residues) is typically formed by residues from multiple secondary structure elements, confirming the nucleation-condensation mechanism
Proteins with similar topology tend to have similar $\Phi$-value patterns, supporting the idea that topology determines the folding mechanism

Derivation 3: Zimm-Bragg Helix-Coil Transition

The Zimm-Bragg model (1959) provides an exact statistical mechanical treatment of the helix-coil transition in polypeptides using a transfer matrix formalism.

The Two Parameters

$s$ (propagation parameter): The equilibrium constant for adding a helical residue to an existing helix: $\cdots\text{hh}\text{c} \rightleftharpoons \cdots\text{hh}\text{h}$. When $s > 1$, helix propagation is favorable.
$\sigma$ (nucleation parameter): The equilibrium constant penalty for initiating a new helix segment: $\cdots\text{cc}\text{c} \rightleftharpoons \cdots\text{cc}\text{h}$. The statistical weight for nucleation is $\sigma s$. For polypeptides, $\sigma \approx 10^{-3}$ to $10^{-4}$, reflecting the entropic cost of fixing three consecutive residues to form the first H-bond.

Transfer Matrix Formulation

Each residue is in state c (coil) or h (helix). The statistical weight of each pair transition defines the transfer matrix:

$$\mathbf{M} = \begin{pmatrix} 1 & \sigma s \\ 1 & s \end{pmatrix}$$

The rows index the state of residue $i$ (top: c, bottom: h) and the columns index the state of residue $i+1$. The partition function for a chain of $N$ residues is:

$$Z = \mathbf{e}_1^T \cdot \mathbf{M}^N \cdot \mathbf{e}_2$$

where $\mathbf{e}_1$ and $\mathbf{e}_2$ are appropriate boundary vectors.
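To make the transfer-matrix product concrete, the sketch below evaluates $Z$ for one possible choice of boundary vectors, $\mathbf{e}_1 = (1, 0)^T$ (the chain is preceded by a coil residue) and $\mathbf{e}_2 = (1, 1)^T$ (the last residue may be in either state). That choice is an assumption made here, as are the function names. The result is checked against a brute-force sum over all $2^N$ coil/helix sequences.

```python
from itertools import product


def zb_partition_transfer(s, sigma, n):
    """Z = e1^T M^n e2 with assumed boundary vectors e1 = (1, 0), e2 = (1, 1)."""
    coil, helix = 1.0, 0.0  # row vector e1^T: the residue before the chain is a coil
    for _ in range(n):
        # Row vector times M = [[1, sigma*s], [1, s]] (rows: residue i, columns: residue i+1)
        coil, helix = coil + helix, coil * sigma * s + helix * s
    return coil + helix  # contracting with e2 = (1, 1) sums over the final state


def zb_partition_bruteforce(s, sigma, n):
    """Sum the statistical weights of all 2^n coil/helix sequences directly."""
    total = 0.0
    for states in product("ch", repeat=n):
        weight, previous = 1.0, "c"
        for state in states:
            if state == "h":
                # s when extending a helix, sigma*s when starting one after a coil
                weight *= s if previous == "h" else sigma * s
            previous = state
        total += weight
    return total


s, sigma = 1.2, 1e-3  # illustrative values
for n in (1, 2, 10):
    print(n, zb_partition_transfer(s, sigma, n), zb_partition_bruteforce(s, sigma, n))
```

For $N = 1$ both routes give $1 + \sigma s$, and for $N = 2$ they give $1 + 2\sigma s + \sigma s^2$, which can be verified by listing the states cc, ch, hc, and hh by hand.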

Eigenvalue Solution

The eigenvalues of $\mathbf{M}$ are:

$$\lambda_{1,2} = \frac{(1+s) \pm \sqrt{(1-s)^2 + 4\sigma s}}{2}$$

For large $N$, $Z \approx \lambda_1^N$ (the larger eigenvalue dominates), and the helix fraction is:

$$\boxed{\theta = \frac{1}{N}\frac{\partial \ln Z}{\partial \ln s} \approx \frac{1}{2}\left(1 + \frac{s - 1}{\sqrt{(1-s)^2 + 4\sigma s}}\right)}$$

Key features of the Zimm-Bragg model:

The transition midpoint occurs at $s = 1$
The sharpness of the transition is governed by $\sigma$: smaller $\sigma$ gives a sharper (more cooperative) transition
The width of the transition scales as $\Delta s \sim \sqrt{\sigma}$
Longer chains show sharper transitions (finite-size effects)
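The first three features can be checked directly from the long-chain formula. In the sketch below (function names are illustrative), the slope of $\theta(s)$ at the midpoint is estimated by a central difference and compared with $1/(4\sqrt{\sigma})$, the value obtained by differentiating the boxed expression at $s = 1$. The transition width therefore scales as $\sqrt{\sigma}$.

```python
import math


def helix_fraction_long_chain(s, sigma):
    """Large-N Zimm-Bragg helix fraction from the dominant eigenvalue."""
    root = math.sqrt((1.0 - s) ** 2 + 4.0 * sigma * s)
    return 0.5 * (1.0 + (s - 1.0) / root)


def midpoint_slope(sigma, ds=1e-6):
    """Central-difference estimate of d(theta)/ds at s = 1."""
    upper = helix_fraction_long_chain(1.0 + ds, sigma)
    lower = helix_fraction_long_chain(1.0 - ds, sigma)
    return (upper - lower) / (2.0 * ds)


for sigma in (1e-2, 1e-3, 1e-4):
    slope = midpoint_slope(sigma)
    print(f"sigma = {sigma:.0e}: theta(1) = {helix_fraction_long_chain(1.0, sigma):.2f}, "
          f"slope = {slope:.2f}, 1/(4*sqrt(sigma)) = {1.0 / (4.0 * math.sqrt(sigma)):.2f}")
```

Chain-length effects are not captured by this large-$N$ expression; they require the finite-$N$ partition function.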

Temperature Dependence

The propagation parameter $s$ depends on temperature through:

$$s(T) = \exp\!\left(-\frac{\Delta H_{\text{res}}}{R}\left(\frac{1}{T} - \frac{1}{T_m}\right)\right)$$

where $\Delta H_{\text{res}} \approx -4\;\text{kJ/mol}$ is the enthalpy change per residue for helix formation and $T_m$ is the melting temperature. At $T = T_m$, $s = 1$ and, in the long-chain limit, the helix fraction is 0.5.
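A small sketch of this temperature dependence, assuming $\Delta H_{\text{res}} = -4$ kJ/mol as above and an illustrative $T_m$ of 340 K. The function name is hypothetical.

```python
import math

R = 8.314e-3  # kJ/(mol*K)


def propagation_parameter(temp_k, t_m=340.0, dh_res=-4.0):
    """s(T) = exp(-(dH_res / R) * (1/T - 1/T_m)); dh_res in kJ/mol, temperatures in K."""
    return math.exp(-(dh_res / R) * (1.0 / temp_k - 1.0 / t_m))


for temp in (300.0, 340.0, 370.0):
    print(f"T = {temp:.0f} K -> s = {propagation_parameter(temp):.3f}")
```

Because $\Delta H_{\text{res}}$ is negative, $s$ exceeds 1 below $T_m$ (helix favored) and drops below 1 above it.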

Derivation 4: Chaperone Mechanisms

The GroEL/GroES System

GroEL is a tetradecameric (14-subunit) chaperonin arranged as two stacked heptameric rings, forming a barrel with a central cavity (~45 Å diameter). GroES is a heptameric co-chaperonin lid.

Mechanism (Iterative Annealing):

Capture: Unfolded/misfolded substrate binds to the hydrophobic inner surface of the open (trans) ring of GroEL
Encapsulation: ATP binding triggers a conformational change; GroES caps the ring, creating an enclosed, hydrophilic chamber (~65 Å diameter, ~175,000 Å$^3$ volume)
Folding: The substrate folds in the chamber for ~10 s (the time for ATP hydrolysis), protected from aggregation. The confined space may accelerate folding by limiting the conformational search
Release: ATP hydrolysis weakens GroES binding. ATP binding to the opposite (trans) ring triggers GroES and substrate release
Reiteration: If not yet native, the substrate can rebind for another round. This is the "iterative annealing" mechanism

The energetic cost is 7 ATP per folding cycle. Approximately 10–15% of E. coli proteins are GroEL substrates, primarily those with complex $\alpha/\beta$ topologies (TIM barrels, Rossmann folds).

The Hsp70 System

Hsp70 (DnaK in bacteria) is the most abundant cellular chaperone. It works with the co-chaperone Hsp40 (DnaJ) and nucleotide exchange factor (GrpE/BAG). The mechanism:

ATP-bound state: Substrate-binding domain is open (lid up), fast on/off kinetics, low affinity
ATP hydrolysis (stimulated by Hsp40): Lid closes, trapping the substrate. High affinity, slow off-rate
Nucleotide exchange (by GrpE/BAG): ADP is replaced by ATP, lid opens, substrate is released

Hsp70 recognizes short hydrophobic segments (~5 residues) that are exposed in unfolded or misfolded proteins. By repeatedly binding and releasing, Hsp70 prevents aggregation and gives the substrate multiple opportunities to fold correctly — another form of iterative annealing.

Kinetic Partitioning Model

The competition between productive folding and aggregation can be described by kinetic partitioning:

$$\text{Yield}_{\text{native}} = \frac{k_{\text{fold}}}{k_{\text{fold}} + k_{\text{agg}}[\text{U}]}$$

Chaperones act by reducing $k_{\text{agg}}$ (sequestering aggregation-prone intermediates) and effectively increasing the folding yield. This is particularly important during heat shock, where the concentration of unfolded proteins rises dramatically.
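The partitioning expression is easy to explore numerically. The rate constants and concentrations below are hypothetical round numbers chosen to show the trend, and the 100-fold reduction in $k_{\text{agg}}$ is an arbitrary stand-in for chaperone action.

```python
def native_yield(k_fold, k_agg, unfolded_conc):
    """Yield = k_fold / (k_fold + k_agg * [U])."""
    return k_fold / (k_fold + k_agg * unfolded_conc)


k_fold = 1.0   # s^-1 (hypothetical)
k_agg = 1.0e5  # M^-1 s^-1 (hypothetical)

for conc in (1e-6, 1e-5, 1e-4):  # unfolded protein concentration in M
    without = native_yield(k_fold, k_agg, conc)
    with_chaperone = native_yield(k_fold, k_agg / 100.0, conc)
    print(f"[U] = {conc:.0e} M: yield = {without:.3f}, "
          f"with k_agg reduced 100x = {with_chaperone:.3f}")
```

The yield falls as $[\text{U}]$ rises because aggregation is second order in unfolded protein while folding is first order, which is why the effect of lowering $k_{\text{agg}}$ is largest at high unfolded concentrations.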

Applications: Protein Misfolding & Disease

Amyloid Diseases

Amyloidoses are characterized by the deposition of cross-$\beta$ fibrils — highly ordered, insoluble protein aggregates with a shared structural core: a stack of $\beta$-strands running perpendicular to the fibril axis with inter-strand H-bonds running parallel to the axis (the cross-$\beta$ motif).

Major Amyloid Diseases

Alzheimer's disease: $\text{A}\beta_{42}$ peptide and tau protein fibrils
Parkinson's disease: $\alpha$-synuclein Lewy body fibrils
Type 2 diabetes: IAPP (amylin) fibrils in pancreatic islets
Huntington's disease: polyglutamine (polyQ) expansion in huntingtin protein
ALS: SOD1, TDP-43, FUS aggregates
Systemic amyloidosis: immunoglobulin light chain (AL), transthyretin (ATTR)

The thermodynamic driving force for amyloid formation is that the cross-$\beta$ structure is often the global free energy minimum for polypeptide chains. Native protein structures are kinetically trapped metastable states separated from the amyloid state by large barriers.

Prion Diseases

Prions ($\text{PrP}^{\text{Sc}}$) are infectious misfolded forms of the prion protein ($\text{PrP}^{\text{C}}$). The protein-only hypothesis (Prusiner, Nobel 1997) states that $\text{PrP}^{\text{Sc}}$ propagates by templating the conversion of normal $\text{PrP}^{\text{C}}$ to the misfolded form. The conversion involves a dramatic structural rearrangement: $\text{PrP}^{\text{C}}$ is predominantly $\alpha$-helical, while $\text{PrP}^{\text{Sc}}$ is rich in $\beta$-sheet. The mechanism follows a nucleated polymerization model:

$$\text{Rate} = k_{\text{elong}}[\text{PrP}^{\text{C}}][\text{seeds}] + k_{\text{nuc}}[\text{PrP}^{\text{C}}]^n$$

where the first term describes elongation (fast) and the second describes de novo nucleation (extremely slow, accounting for long incubation periods).
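The relative size of the two terms can be illustrated with a short calculation. All parameter values below are hypothetical and dimensionless, chosen only to show how strongly a small seed population shifts the balance toward elongation; they are not measured constants.

```python
def prion_conversion_rate(prp_c, seeds, k_elong, k_nuc, n):
    """Return (total, elongation, nucleation) terms of the nucleated polymerization rate."""
    elongation = k_elong * prp_c * seeds
    nucleation = k_nuc * prp_c ** n
    return elongation + nucleation, elongation, nucleation


# Hypothetical parameters for illustration only
prp_c, k_elong, k_nuc, n = 1.0, 1.0, 1e-8, 6

for seeds in (0.0, 1e-6, 1e-3):
    total, elong, nuc = prion_conversion_rate(prp_c, seeds, k_elong, k_nuc, n)
    print(f"seeds = {seeds:.0e}: elongation = {elong:.1e}, "
          f"nucleation = {nuc:.1e}, total = {total:.1e}")
```

With no seeds present only the nucleation term contributes, and its steep dependence on $[\text{PrP}^{\text{C}}]^n$ means that doubling the monomer concentration multiplies it by $2^n$.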

Drug Design Targeting Misfolded Proteins

Kinetic stabilizers: Tafamidis stabilizes the native tetrameric form of transthyretin (TTR), preventing dissociation to monomers that form amyloid. FDA-approved for ATTR cardiomyopathy.
Anti-amyloid antibodies: Lecanemab and aducanumab target $\text{A}\beta$ aggregates. Lecanemab (FDA-approved 2023) shows modest but significant slowing of cognitive decline.
Chemical chaperones: Small molecules (e.g., 4-phenylbutyrate) that stabilize native protein conformations, investigated in cystic fibrosis for rescuing $\Delta\text{F508}$ CFTR.
Aggregation inhibitors: Compounds that cap growing fibril ends or redirect aggregation pathways toward off-pathway, non-toxic species.

Python Simulation: Two-State Chevron Plot & Kinetics

This simulation generates the chevron plot showing how $\ln(k_{\text{obs}})$ varies with denaturant concentration, along with the equilibrium denaturation curve and the free energy dependence on [denaturant].

Two-State Folding: Chevron Plot, Free Energy, and Denaturation Curve

chevron_plot.py

import numpy as np
import matplotlib
matplotlib.use('Agg')
import matplotlib.pyplot as plt

# Two-state protein folding: Chevron plot
# ln(k_obs) vs [denaturant] gives a V-shaped (chevron) plot

# Parameters for a typical two-state folder (e.g., CI2)
# Folding in water
kf_water = 50.0      # s^-1 (folding rate in water)
ku_water = 0.001     # s^-1 (unfolding rate in water)

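# m-values (sensitivity to denaturant)
# Illustrative magnitudes chosen so that the midpoint (about 4.5 M) lies inside the plotted 0-8 M range
mf = -4.5   # kJ/(mol*M), folding rate decreases with denaturant
mu = 1.5    # kJ/(mol*M), unfolding rate increases with denaturant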
RT = 2.479  # kJ/mol at 25 C

# ln(kf) = ln(kf_water) + mf*[D]/RT
# ln(ku) = ln(ku_water) + mu*[D]/RT
# k_obs = kf + ku

D = np.linspace(0, 8, 500)  # denaturant concentration (M)

ln_kf = np.log(kf_water) + mf * D / RT
ln_ku = np.log(ku_water) + mu * D / RT

kf = np.exp(ln_kf)
ku = np.exp(ln_ku)
k_obs = kf + ku

# Equilibrium: DeltaG = -RT * ln(kf/ku)
DeltaG = -RT * (ln_kf - ln_ku)
# K_eq = ku/kf = exp(DeltaG/RT)
K_eq = ku / kf
f_unfolded = K_eq / (1 + K_eq)

# Midpoint [D]_50% where kf = ku:
# ln(kf_water) + mf*D_mid/RT = ln(ku_water) + mu*D_mid/RT
# => D_mid = RT*(ln(kf_water) - ln(ku_water)) / (mu - mf)
D_mid = RT * np.log(kf_water / ku_water) / (mu - mf)

fig, axes = plt.subplots(1, 3, figsize=(18, 5.5))

# Panel 1: Chevron plot
ax = axes[0]
ax.plot(D, np.log(k_obs), color='#10b981', linewidth=3, label='ln(k_obs)')
ax.plot(D, ln_kf, color='#06b6d4', linewidth=2, linestyle='--', label='ln(k_f)')
ax.plot(D, ln_ku, color='#ef4444', linewidth=2, linestyle='--', label='ln(k_u)')
ax.axvline(D_mid, color='#f59e0b', linestyle=':', alpha=0.7, label=f'[D]_50% = {D_mid:.1f} M')
ax.set_xlabel('[Denaturant] (M)', fontsize=12, color='white')
ax.set_ylabel('ln(k) (s^-1)', fontsize=12, color='white')
ax.set_title('Chevron Plot', fontsize=13, color='#10b981', fontweight='bold')
ax.legend(fontsize=9, facecolor='#1e293b', edgecolor='#10b981', labelcolor='white')
ax.set_facecolor('#0f172a')
ax.tick_params(colors='white')
ax.grid(True, alpha=0.2, color='#10b981')

# Annotate arms
ax.annotate('Folding arm\n(slope = m_f/RT)', xy=(1, ln_kf[100]), fontsize=9, color='#06b6d4',
           bbox=dict(boxstyle='round', facecolor='#1e293b', edgecolor='#06b6d4'))
ax.annotate('Unfolding arm\n(slope = m_u/RT)', xy=(6, ln_ku[400]), fontsize=9, color='#ef4444',
           bbox=dict(boxstyle='round', facecolor='#1e293b', edgecolor='#ef4444'))

# Panel 2: Free energy vs denaturant
ax = axes[1]
ax.plot(D, DeltaG, color='#f472b6', linewidth=2.5)
ax.axhline(0, color='white', alpha=0.5, linewidth=1)
ax.fill_between(D, DeltaG, 0, where=(DeltaG < 0), alpha=0.2, color='#10b981')
ax.fill_between(D, DeltaG, 0, where=(DeltaG > 0), alpha=0.2, color='#ef4444')
ax.axvline(D_mid, color='#f59e0b', linestyle=':', alpha=0.7)
ax.annotate(f'C_m = {D_mid:.1f} M', xy=(D_mid, 2), fontsize=10, color='#f59e0b',
           bbox=dict(boxstyle='round', facecolor='#1e293b', edgecolor='#f59e0b'))
ax.set_xlabel('[Denaturant] (M)', fontsize=12, color='white')
ax.set_ylabel('DeltaG_fold (kJ/mol)', fontsize=12, color='white')
ax.set_title('Folding Free Energy vs [Denaturant]', fontsize=13, color='#f472b6', fontweight='bold')
ax.set_facecolor('#0f172a')
ax.tick_params(colors='white')
ax.grid(True, alpha=0.2, color='#f472b6')

# Panel 3: Fraction unfolded (equilibrium denaturation curve)
ax = axes[2]
ax.plot(D, f_unfolded, color='#a78bfa', linewidth=2.5)
ax.axhline(0.5, color='white', alpha=0.3, linestyle=':')
ax.axvline(D_mid, color='#f59e0b', linestyle=':', alpha=0.7)
ax.annotate(f'C_m = {D_mid:.1f} M', xy=(D_mid, 0.55), fontsize=10, color='#f59e0b',
           bbox=dict(boxstyle='round', facecolor='#1e293b', edgecolor='#f59e0b'))
ax.set_xlabel('[Denaturant] (M)', fontsize=12, color='white')
ax.set_ylabel('Fraction Unfolded', fontsize=12, color='white')
ax.set_title('Equilibrium Denaturation Curve', fontsize=13, color='#a78bfa', fontweight='bold')
ax.set_facecolor('#0f172a')
ax.tick_params(colors='white')
ax.grid(True, alpha=0.2, color='#a78bfa')

fig.patch.set_facecolor('#0f172a')
plt.tight_layout()
plt.savefig('output.png', dpi=150, bbox_inches='tight', facecolor='#0f172a')
plt.close()
print("Two-state chevron plot, free energy, and denaturation curve plotted.")
print(f"\nParameters:")
print(f"  k_f(water) = {kf_water} s^-1")
print(f"  k_u(water) = {ku_water} s^-1")
print(f"  DeltaG_fold(water) = {-RT*np.log(kf_water/ku_water):.1f} kJ/mol")
print(f"  C_m = {D_mid:.1f} M")
print(f"  beta_T (Tanford beta) = m_f/(m_f - m_u) = {mf/(mf-mu):.2f}")


Python Simulation: Zimm-Bragg Helix-Coil Transition

Exploring the Zimm-Bragg model with the transfer matrix eigenvalue solution. The three panels show the effects of the nucleation parameter $\sigma$, chain length $N$, and temperature on the sharpness of the helix-coil transition.

Zimm-Bragg Helix-Coil Transition: Nucleation, Chain Length, and Temperature

zimm_bragg.py

import numpy as np
import matplotlib
matplotlib.use('Agg')
import matplotlib.pyplot as plt

# Zimm-Bragg helix-coil transition theory
# Transfer matrix method for helix fraction as function of s (propagation) and sigma (nucleation)

def helix_fraction_zb(s_vals, sigma, N):
    """
    Compute helix fraction using Zimm-Bragg transfer matrix method.
    s = propagation parameter (equilibrium constant for adding a residue to existing helix)
    sigma = nucleation parameter (penalty for starting a new helix)
    N = chain length

    Transfer matrix M = [[1, sigma*s], [1, s]]
    Helix fraction theta = (1/N) * d(ln Z)/d(ln s)
    """
    s_vals = np.asarray(s_vals, dtype=float)

    def ln_Z(s):
        # Eigenvalues of transfer matrix
        # lambda = [(1+s) +/- sqrt((1-s)^2 + 4*sigma*s)] / 2
        sqrt_disc = np.sqrt((1 - s)**2 + 4 * sigma * s)
        lambda1 = (1 + s + sqrt_disc) / 2
        lambda2 = (1 + s - sqrt_disc) / 2

        # Finite-N partition function for a chain preceded by a coil residue:
        # Z = [lambda1^(N+1)*(1 - lambda2) - lambda2^(N+1)*(1 - lambda1)] / (lambda1 - lambda2)
        # Evaluated in log form so long chains do not overflow
        end_term = (1 - lambda2) + (lambda2 / lambda1)**(N + 1) * (lambda1 - 1)
        return (N + 1) * np.log(lambda1) + np.log(end_term) - np.log(sqrt_disc)

    # theta = (1/N) * d(ln Z)/d(ln s), estimated by a central difference in ln s
    h = 1e-5
    theta = (ln_Z(s_vals * np.exp(h)) - ln_Z(s_vals * np.exp(-h))) / (2 * h * N)

    # Clip to valid range
    return np.clip(theta, 0, 1)

s_range = np.linspace(0.5, 1.5, 1000)

fig, axes = plt.subplots(1, 3, figsize=(18, 5.5))

# Panel 1: Effect of nucleation parameter sigma
ax = axes[0]
N = 100
for sigma, col, label in [(1e-2, '#10b981', 'sigma = 0.01'),
                            (1e-3, '#06b6d4', 'sigma = 0.001'),
                            (1e-4, '#f472b6', 'sigma = 0.0001'),
                            (1e-5, '#f59e0b', 'sigma = 0.00001')]:
    theta = helix_fraction_zb(s_range, sigma, N)
    ax.plot(s_range, theta, color=col, linewidth=2.5, label=label)

ax.axvline(1.0, color='white', alpha=0.3, linestyle=':')
ax.axhline(0.5, color='white', alpha=0.3, linestyle=':')
ax.set_xlabel('s (propagation parameter)', fontsize=12, color='white')
ax.set_ylabel('Helix Fraction (theta)', fontsize=12, color='white')
ax.set_title('Zimm-Bragg: Effect of Nucleation', fontsize=13, color='#10b981', fontweight='bold')
ax.legend(fontsize=9, facecolor='#1e293b', edgecolor='#10b981', labelcolor='white')
ax.set_facecolor('#0f172a')
ax.tick_params(colors='white')
ax.grid(True, alpha=0.2, color='#10b981')
ax.annotate('Sharper transition with\nsmaller sigma (stronger\nnucleation penalty)',
           xy=(1.1, 0.3), fontsize=9, color='#f59e0b',
           bbox=dict(boxstyle='round', facecolor='#1e293b', edgecolor='#f59e0b'))

# Panel 2: Effect of chain length N
ax = axes[1]
sigma = 1e-3
for N_val, col, label in [(20, '#f59e0b', 'N = 20'),
                            (50, '#10b981', 'N = 50'),
                            (100, '#06b6d4', 'N = 100'),
                            (500, '#f472b6', 'N = 500')]:
    theta = helix_fraction_zb(s_range, sigma, N_val)
    ax.plot(s_range, theta, color=col, linewidth=2.5, label=label)

ax.axvline(1.0, color='white', alpha=0.3, linestyle=':')
ax.axhline(0.5, color='white', alpha=0.3, linestyle=':')
ax.set_xlabel('s (propagation parameter)', fontsize=12, color='white')
ax.set_ylabel('Helix Fraction (theta)', fontsize=12, color='white')
ax.set_title('Zimm-Bragg: Effect of Chain Length', fontsize=13, color='#06b6d4', fontweight='bold')
ax.legend(fontsize=9, facecolor='#1e293b', edgecolor='#06b6d4', labelcolor='white')
ax.set_facecolor('#0f172a')
ax.tick_params(colors='white')
ax.grid(True, alpha=0.2, color='#06b6d4')

# Panel 3: Temperature-induced helix-coil transition
ax = axes[2]
# s depends on temperature: s = exp(-DeltaH*(1/T - 1/Tm)/(R))
T_range = np.linspace(270, 380, 500)
T_m = 340.0  # melting temperature
DeltaH_res = -4.0  # kJ/mol per residue
R = 0.008314  # kJ/(mol*K)

s_of_T = np.exp(-DeltaH_res * (1/T_range - 1/T_m) / R)
sigma = 5e-4

for N_val, col, label in [(30, '#f59e0b', 'N = 30'),
                            (60, '#10b981', 'N = 60'),
                            (100, '#06b6d4', 'N = 100'),
                            (200, '#f472b6', 'N = 200')]:
    theta = helix_fraction_zb(s_of_T, sigma, N_val)
    ax.plot(T_range - 273.15, theta, color=col, linewidth=2.5, label=label)

ax.axvline(T_m - 273.15, color='white', alpha=0.3, linestyle=':', label=f'T_m = {T_m-273.15:.0f} C')
ax.axhline(0.5, color='white', alpha=0.3, linestyle=':')
ax.set_xlabel('Temperature (C)', fontsize=12, color='white')
ax.set_ylabel('Helix Fraction (theta)', fontsize=12, color='white')
ax.set_title('Thermal Helix-Coil Transition', fontsize=13, color='#f472b6', fontweight='bold')
ax.legend(fontsize=9, facecolor='#1e293b', edgecolor='#f472b6', labelcolor='white')
ax.set_facecolor('#0f172a')
ax.tick_params(colors='white')
ax.grid(True, alpha=0.2, color='#f472b6')

fig.patch.set_facecolor('#0f172a')
plt.tight_layout()
plt.savefig('output.png', dpi=150, bbox_inches='tight', facecolor='#0f172a')
plt.close()
print("Zimm-Bragg helix-coil transition plots generated.")
print("\nKey observations:")
print("  - Smaller sigma => sharper (more cooperative) transition")
print("  - Longer chains => sharper transition")
print("  - s = 1 is the midpoint of the transition")
print("  - sigma ~ 10^-3 to 10^-4 for real polypeptides")


Python Simulation: Folding Kinetics & $\Phi$-Value Analysis

This simulation shows: (1) two-state folding kinetics at different denaturant concentrations, (2) a simulated $\Phi$-value analysis scatter plot mapping transition state structure, and (3) free energy profiles comparing two-state, three-state, and downhill folding mechanisms.

Folding Kinetics, Phi-Value Analysis, and Free Energy Profiles

folding_kinetics.py

import numpy as np
import matplotlib
matplotlib.use('Agg')
import matplotlib.pyplot as plt

# Protein folding kinetics: two-state and phi-value analysis

fig, axes = plt.subplots(1, 3, figsize=(18, 5.5))

# Panel 1: Two-state folding kinetics
ax = axes[0]
t = np.linspace(0, 0.5, 500)  # seconds

# Different conditions (varying denaturant)
conditions = [
    (100, 0.01, '0 M [D]', '#10b981'),
    (50, 0.05, '1 M [D]', '#06b6d4'),
    (20, 0.2, '2 M [D]', '#f59e0b'),
    (5, 1.0, '3 M [D]', '#f472b6'),
    (1, 5.0, '4 M [D]', '#ef4444'),
]

for kf, ku, label, col in conditions:
    k_obs = kf + ku
    f_eq = kf / k_obs
    # Starting from fully unfolded (f_N(0) = 0)
    f_N = f_eq * (1 - np.exp(-k_obs * t))
    ax.plot(t * 1000, f_N, color=col, linewidth=2, label=f'{label} (k_obs={k_obs:.1f})')

ax.set_xlabel('Time (ms)', fontsize=12, color='white')
ax.set_ylabel('Fraction Native', fontsize=12, color='white')
ax.set_title('Two-State Folding Kinetics', fontsize=13, color='#10b981', fontweight='bold')
ax.legend(fontsize=8, facecolor='#1e293b', edgecolor='#10b981', labelcolor='white')
ax.set_facecolor('#0f172a')
ax.tick_params(colors='white')
ax.grid(True, alpha=0.2, color='#10b981')

# Panel 2: Phi-value analysis
ax = axes[1]
# Phi = DDG_TS / DDG_eq
# Phi = 1 means mutation fully affects TS (native-like at TS)
# Phi = 0 means mutation doesn't affect TS (unfolded-like at TS)

# Simulate phi-values for a hypothetical protein
np.random.seed(42)
n_mutants = 25
DDG_eq = np.random.uniform(2, 15, n_mutants)  # kJ/mol
# Realistic phi distribution: mixture of values
phi_true = np.random.beta(2, 3, n_mutants)  # skewed toward lower values
DDG_TS = phi_true * DDG_eq
phi_calc = DDG_TS / DDG_eq

# Color by phi value
colors_phi = plt.cm.RdYlGn(phi_calc)

scatter = ax.scatter(DDG_eq, DDG_TS, c=phi_calc, cmap='RdYlGn', s=80,
                     edgecolors='white', linewidth=0.5, vmin=0, vmax=1)
cbar = plt.colorbar(scatter, ax=ax, shrink=0.8)
cbar.set_label('Phi value', color='white', fontsize=11)
cbar.ax.tick_params(colors='white')

# Reference lines
max_val = max(DDG_eq.max(), DDG_TS.max())
ax.plot([0, max_val], [0, max_val], '--', color='#10b981', alpha=0.7, label='Phi = 1.0')
ax.plot([0, max_val], [0, 0.5*max_val], '--', color='#f59e0b', alpha=0.7, label='Phi = 0.5')
ax.plot([0, max_val], [0, 0], '--', color='#ef4444', alpha=0.7, label='Phi = 0.0')

ax.set_xlabel('DDG_eq (kJ/mol)', fontsize=12, color='white')
ax.set_ylabel('DDG_TS (kJ/mol)', fontsize=12, color='white')
ax.set_title('Phi-Value Analysis', fontsize=13, color='#06b6d4', fontweight='bold')
ax.legend(fontsize=8, facecolor='#1e293b', edgecolor='#06b6d4', labelcolor='white')
ax.set_facecolor('#0f172a')
ax.tick_params(colors='white')
ax.grid(True, alpha=0.2, color='#06b6d4')

# Panel 3: Free energy profiles for different folding mechanisms
ax = axes[2]
# Reaction coordinate (abstract) vs free energy with intermediates
Q = np.linspace(0, 1, 500)

# Two-state (smooth funnel)
G_2state = -20 * Q + 8 * np.exp(-((Q - 0.3)**2) / (2 * 0.05**2)) + 15 * (1 - Q)
G_2state = G_2state - G_2state[-1]

# Three-state (with intermediate)
G_3state = (-20 * Q + 6 * np.exp(-((Q - 0.2)**2) / (2 * 0.04**2))
            - 5 * np.exp(-((Q - 0.5)**2) / (2 * 0.06**2))
            + 8 * np.exp(-((Q - 0.7)**2) / (2 * 0.04**2)) + 15 * (1 - Q))
G_3state = G_3state - G_3state[-1]

# Downhill folder (barrierless)
G_downhill = 15 * (1 - Q)**1.5

ax.plot(Q, G_2state, color='#10b981', linewidth=2.5, label='Two-state')
ax.plot(Q, G_3state, color='#f472b6', linewidth=2.5, label='Three-state (intermediate)')
ax.plot(Q, G_downhill, color='#f59e0b', linewidth=2.5, label='Downhill folder')

ax.set_xlabel('Reaction Coordinate (Q)', fontsize=12, color='white')
ax.set_ylabel('Free Energy (kJ/mol)', fontsize=12, color='white')
ax.set_title('Folding Free Energy Profiles', fontsize=13, color='#f472b6', fontweight='bold')
ax.legend(fontsize=9, facecolor='#1e293b', edgecolor='#f472b6', labelcolor='white')
ax.set_facecolor('#0f172a')
ax.tick_params(colors='white')
ax.grid(True, alpha=0.2, color='#f472b6')

# Annotate key features
ax.annotate('U', xy=(0, G_2state[0]), fontsize=11, color='#ef4444', fontweight='bold',
           xytext=(-0.05, G_2state[0]+2))
ax.annotate('N', xy=(1, 0), fontsize=11, color='#06b6d4', fontweight='bold',
           xytext=(1.02, 1))
ax.annotate('I', xy=(0.5, G_3state[250]-1), fontsize=11, color='#f472b6', fontweight='bold')

fig.patch.set_facecolor('#0f172a')
plt.tight_layout()
plt.savefig('output.png', dpi=150, bbox_inches='tight', facecolor='#0f172a')
plt.close()
print("Folding kinetics, phi-value analysis, and energy profiles plotted.")
print("\nPhi-value interpretation:")
print("  Phi = 1: Residue is native-like in transition state")
print("  Phi = 0: Residue is unfolded-like in transition state")
print("  0 < Phi < 1: Partial structure formation at TS")
print(f"  Mean Phi = {np.mean(phi_calc):.2f} for this simulated dataset")


Folding Rate Predictors & Contact Order

Relative Contact Order (RCO)

Plaxco, Simons, and Baker (1998) discovered that the folding rates of two-state proteins correlate remarkably well with a simple topological property: the relative contact order.

$$\boxed{\text{RCO} = \frac{1}{L \cdot N_c}\sum_{i < j}^{N_c} |i - j|}$$

where $L$ is the total number of residues, $N_c$ is the number of native contacts, and $|i - j|$ is the sequence separation between contacting residues $i$ and $j$. The correlation with folding rate is:

$$\ln k_f \approx a - b \times \text{RCO}$$

with correlation coefficient $r \approx 0.8$. Proteins with predominantly local contacts ($\alpha$-helical, low RCO) fold faster than those with many non-local contacts ($\beta$-sheet rich, high RCO). This supports the idea that topology is the primary determinant of folding rate.
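A minimal sketch of the RCO calculation from a list of contacting residue pairs. The two contact lists are invented toy examples for a 20-residue chain (one helix-like with $i, i+4$ contacts, one hairpin-like with contacts between the chain ends), not derived from real structures.

```python
def relative_contact_order(contacts, chain_length):
    """RCO = (1 / (L * N_c)) * sum over contacts of |i - j|."""
    if not contacts:
        raise ValueError("at least one contact is required")
    total_separation = sum(abs(i - j) for i, j in contacts)
    return total_separation / (chain_length * len(contacts))


chain_length = 20
local_contacts = [(1, 5), (2, 6), (3, 7), (4, 8)]          # helix-like toy example
nonlocal_contacts = [(1, 20), (2, 19), (3, 18), (4, 17)]   # hairpin-like toy example

print(f"local contacts:     RCO = {relative_contact_order(local_contacts, chain_length):.2f}")
print(f"non-local contacts: RCO = {relative_contact_order(nonlocal_contacts, chain_length):.2f}")
```

In practice the contact list would be built from a structure file using a distance cutoff; the cutoff and the atoms used to define a contact are choices that vary between studies.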

Chain Length Dependence

For two-state folders, the folding rate also depends on chain length:

$$\ln k_f \approx c - d \cdot L^{0.6}$$

The exponent $\sim 0.6$ is consistent with polymer theory: the conformational search time scales with the number of effective chain segments raised to a power related to the Flory exponent.

Nucleation-Condensation Mechanism

The dominant mechanism for two-state folding is nucleation-condensation (Fersht, 1995). In contrast to the earlier framework model (secondary structure forms first) and hydrophobic collapse model (collapse occurs first), nucleation-condensation proposes that:

A diffuse folding nucleus forms in the transition state, comprising elements of secondary and tertiary structure simultaneously
The nucleus is stabilized by a combination of local (secondary structure) and non-local (tertiary) interactions
Once the nucleus forms, the rest of the chain rapidly condenses around it
$\Phi$-values for nucleus residues are typically 0.3–0.7 (fractional, not fully native-like)

Diffusion-Collision Model

For larger proteins, the diffusion-collision model (Karplus and Weaver) describes folding as a hierarchical process: pre-formed microdomains (secondary structure elements) diffuse and collide to form the native tertiary structure. The rate depends on the diffusion rate of the microdomains and the probability that a collision is productive:

$$k_{\text{fold}} = k_{\text{diff}} \times P_{\text{productive}} \times P_{\text{correct}}$$

where $P_{\text{productive}}$ is the probability that colliding microdomains are correctly oriented and $P_{\text{correct}}$ accounts for the combinatorics of forming all necessary contacts.

Experimental Methods for Studying Folding

Stopped-Flow Kinetics

The workhorse for measuring folding/unfolding kinetics on the millisecond timescale. Two solutions (protein + denaturant at different concentrations) are rapidly mixed (dead time ~1 ms), and the signal (fluorescence, CD, absorbance) is monitored as a function of time. The observed rate constant is extracted by fitting to single or multi-exponential functions:

$$S(t) = S_\infty + \sum_i A_i \exp(-k_i t)$$
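For the single-exponential case, the rate constant can be recovered with a simple log-linear fit. The sketch below is deliberately simplified: it assumes the plateau $S_\infty$ is already known and that the trace is noise-free, and it uses synthetic data with made-up parameters. Real stopped-flow traces are normally fitted by nonlinear least squares with $S_\infty$ as a free parameter.

```python
import math


def fit_single_exponential(times, signal, s_inf):
    """Return (k, |A|) from a straight-line fit of ln|S - S_inf| against t."""
    ys = [math.log(abs(value - s_inf)) for value in signal]
    n = len(times)
    t_mean = sum(times) / n
    y_mean = sum(ys) / n
    slope = (sum((t - t_mean) * (y - y_mean) for t, y in zip(times, ys))
             / sum((t - t_mean) ** 2 for t in times))
    intercept = y_mean - slope * t_mean
    return -slope, math.exp(intercept)


# Synthetic trace (hypothetical values): S_inf = 1, A = -1, k = 30 s^-1
k_true, amplitude, s_inf = 30.0, -1.0, 1.0
times = [0.002 * i for i in range(1, 101)]  # 2 ms to 200 ms
signal = [s_inf + amplitude * math.exp(-k_true * t) for t in times]

k_fit, amp_fit = fit_single_exponential(times, signal, s_inf)
print(f"k = {k_fit:.3f} s^-1, |A| = {amp_fit:.3f}")
```

Repeating such a fit at each denaturant concentration yields the $k_{\text{obs}}$ values that make up a chevron plot.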

Temperature Jump (T-Jump)

Ultrafast heating (nanoseconds) using infrared laser pulses or electrical discharge perturbs the folding equilibrium, enabling the study of folding dynamics on the microsecond timescale. The temperature change is typically 5–15°C. Combined with fluorescence or IR spectroscopy, T-jump can reveal the earliest events in folding: helix formation, hydrophobic collapse, and the formation of the folding nucleus.

Hydrogen/Deuterium Exchange (HDX)

Backbone amide hydrogens exchange with solvent D$_2$O at rates that depend on their structural environment. In a native protein, amides involved in H-bonds or buried in the core exchange slowly (protection factors of $10^3$ to $10^8$). The exchange rate is:

$$k_{\text{ex}} = k_{\text{int}} \cdot \frac{k_{\text{op}}}{k_{\text{op}} + k_{\text{cl}}} \approx \frac{k_{\text{int}} \cdot k_{\text{op}}}{k_{\text{cl}}} = \frac{k_{\text{int}}}{P_f}$$

where $k_{\text{int}}$ is the intrinsic (unprotected) exchange rate, $k_{\text{op}}$ and $k_{\text{cl}}$ are the local opening and closing rates, and the protection factor $P_f = k_{\text{cl}}/k_{\text{op}}$. Under EX2 conditions (most physiological):

$$\Delta G_{\text{HDX}} = -RT\ln\!\left(\frac{k_{\text{ex}}}{k_{\text{int}}}\right) = RT \ln P_f$$
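As a worked example of the EX2 relation, the sketch below converts a measured exchange rate into a protection factor and an opening free energy. The rate values are hypothetical, and `hdx_free_energy` is an illustrative helper name.

```python
import math

RT = 2.479  # kJ/mol at 25 C


def hdx_free_energy(k_ex, k_int):
    """Return (dG_HDX in kJ/mol, protection factor) under the EX2 approximation."""
    protection_factor = k_int / k_ex
    return RT * math.log(protection_factor), protection_factor


# Hypothetical amide: intrinsic rate 10 s^-1, observed rate 1e-4 s^-1
dg_hdx, pf = hdx_free_energy(k_ex=1e-4, k_int=10.0)
print(f"P_f = {pf:.1e}, dG_HDX = {dg_hdx:.1f} kJ/mol")
```

Each factor of ten in protection corresponds to $RT\ln 10 \approx 5.7$ kJ/mol at 25 °C, so the protection-factor range quoted above spans roughly 17 to 46 kJ/mol.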

HDX monitored by NMR provides residue-level information; HDX-MS provides peptide-level resolution for larger proteins and complexes.

Single-Molecule FRET

Fluorescence resonance energy transfer between donor and acceptor dyes attached to specific sites on the protein reports on intramolecular distances in real time. The FRET efficiency is:

$$E = \frac{1}{1 + (r/R_0)^6}$$

where $r$ is the donor-acceptor distance and $R_0$ is the Förster radius (typically 40–60 Å). Single-molecule experiments reveal conformational heterogeneity and rare folding intermediates that are hidden in ensemble-averaged experiments.
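The steep distance dependence is easiest to see numerically. The sketch below evaluates the efficiency and its inverse for an assumed $R_0$ of 50 Å; the function names and the chosen distances are illustrative.

```python
def fret_efficiency(distance, r0):
    """E = 1 / (1 + (r / R0)^6); distance and r0 in the same units."""
    return 1.0 / (1.0 + (distance / r0) ** 6)


def distance_from_efficiency(efficiency, r0):
    """Invert the efficiency formula: r = R0 * (1/E - 1)^(1/6)."""
    return r0 * (1.0 / efficiency - 1.0) ** (1.0 / 6.0)


r0 = 50.0  # Angstrom (assumed Forster radius)
for distance in (30.0, 50.0, 70.0):
    print(f"r = {distance:.0f} A -> E = {fret_efficiency(distance, r0):.3f}")
```

Because of the sixth-power dependence, the efficiency is most sensitive to distance changes near $r = R_0$, where $E = 0.5$. This is why dye positions are usually chosen so that the distances of interest bracket the Förster radius.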

Key Equations Summary

Two-State Equilibrium

$$K = \frac{[\text{U}]}{[\text{N}]} = \frac{k_u}{k_f} = \exp\!\left(\frac{\Delta G}{RT}\right)$$

$\Phi$-Value Definition

$$\Phi = \frac{\Delta\Delta G^\ddagger}{\Delta\Delta G°} = \frac{-RT\ln(k_f^{\text{mut}}/k_f^{\text{wt}})}{-RT\ln(K_{\text{eq}}^{\text{mut}}/K_{\text{eq}}^{\text{wt}})}$$

Zimm-Bragg Helix Fraction

$$\theta \approx \frac{1}{2}\left(1 + \frac{s - 1}{\sqrt{(1-s)^2 + 4\sigma s}}\right)$$

Tanford $\beta$-Value

$$\beta_T = \frac{m_f}{m_f - m_u}$$

FRET Efficiency

$$E = \frac{1}{1 + (r/R_0)^6}$$

Relative Contact Order

$$\text{RCO} = \frac{1}{L \cdot N_c}\sum_{i < j}^{N_c} |i - j|$$
