Chapter 5

Inflationary Models

A comprehensive survey of single-field slow-roll models — from chaotic inflation through Starobinsky, natural inflation, alpha-attractors, and Higgs inflation — with full derivations of observational predictions and confrontation with CMB data.

5.1 The Landscape of Inflationary Models

The inflationary paradigm posits a period of quasi-exponential expansion in the very early universe, driven by the potential energy of a scalar field — the inflaton. While the general framework of slow-roll inflation makes robust predictions (a nearly scale-invariant, nearly Gaussian spectrum of adiabatic perturbations), the specific predictions for the spectral index $n_s$ and the tensor-to-scalar ratio $r$ depend critically on the choice of inflaton potential $V(\phi)$.

The observational constraints from the Planck satellite (2018) and BICEP/Keck Array place tight bounds:

$$n_s = 0.9649 \pm 0.0042 \quad (68\%\;\text{CL}), \qquad r < 0.036 \quad (95\%\;\text{CL, BICEP/Keck 2021})$$

These constraints already rule out several simple models and strongly favor plateau-like potentials. The key observables are related to the slow-roll parameters via:

$$n_s = 1 - 6\epsilon_V + 2\eta_V, \qquad r = 16\epsilon_V$$

where the potential slow-roll parameters are:

$$\epsilon_V = \frac{M_P^2}{2}\left(\frac{V'}{V}\right)^2, \qquad \eta_V = M_P^2 \frac{V''}{V}$$
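As a rough illustration of these definitions, the sketch below evaluates $\epsilon_V$, $\eta_V$, $n_s$ and $r$ for an arbitrary potential using central finite differences. The function names, the step size `h`, and the choice of units with $M_P = 1$ are assumptions made for this example only.

```python
M_P = 1.0  # assumption: work in reduced Planck units

def slow_roll(V, phi, h=1e-4):
    """Return (epsilon_V, eta_V) for a potential V at field value phi.

    Derivatives use central finite differences with step h, an
    illustrative choice rather than a tuned one.
    """
    V0 = V(phi)
    dV = (V(phi + h) - V(phi - h)) / (2.0 * h)
    d2V = (V(phi + h) - 2.0 * V0 + V(phi - h)) / h**2
    eps = 0.5 * M_P**2 * (dV / V0) ** 2
    eta = M_P**2 * d2V / V0
    return eps, eta

def observables(V, phi):
    """First-order slow-roll n_s and r at field value phi."""
    eps, eta = slow_roll(V, phi)
    return 1.0 - 6.0 * eps + 2.0 * eta, 16.0 * eps

def quadratic(phi):
    """Example potential V = phi^2 / 2 (overall scale drops out)."""
    return 0.5 * phi**2

n_s, r = observables(quadratic, 15.0)
print(f"n_s = {n_s:.4f}, r = {r:.4f}")
```

Only ratios of the potential and its derivatives enter, so the overall normalization of $V$ does not affect the result.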

In this chapter, we derive the predictions of five major classes of inflationary models from first principles, compute their $n_s$ and $r$ values, and compare them against the data. We will see a remarkable convergence: many successful models share the universal prediction $n_s \approx 1 - 2/N$, differing only in $r$.

Key Notation

● $M_P = (8\pi G)^{-1/2} \approx 2.435 \times 10^{18}$ GeV — reduced Planck mass
● $N$ — number of e-folds before the end of inflation when the pivot scale exits the horizon (typically 50–60)
● $\phi_*$ — field value when the pivot scale $k_* = 0.05\;\text{Mpc}^{-1}$ exits the horizon
● $\phi_\text{end}$ — field value at the end of inflation, defined by $\epsilon_V(\phi_\text{end}) = 1$

5.2 Chaotic Inflation: $m^2\phi^2$

The simplest inflationary model is chaotic inflation with a massive free field, proposed by Andrei Linde in 1983. The potential is:

$$V(\phi) = \frac{1}{2}m^2\phi^2$$

Step 1: Compute the slow-roll parameters. We need the first and second derivatives of the potential:

$$V' = m^2\phi, \qquad V'' = m^2$$

Substituting into the slow-roll parameter definitions:

$$\epsilon_V = \frac{M_P^2}{2}\left(\frac{m^2\phi}{\frac{1}{2}m^2\phi^2}\right)^2 = \frac{M_P^2}{2}\cdot\frac{4}{\phi^2} = \frac{2M_P^2}{\phi^2}$$

$$\eta_V = M_P^2\frac{m^2}{\frac{1}{2}m^2\phi^2} = \frac{2M_P^2}{\phi^2}$$

Note the special feature of $m^2\phi^2$: we have $\epsilon_V = \eta_V$ exactly.

Step 2: Find the end of inflation. Inflation ends when $\epsilon_V = 1$:

$$\frac{2M_P^2}{\phi_\text{end}^2} = 1 \quad \Longrightarrow \quad \phi_\text{end} = \sqrt{2}\,M_P$$

Step 3: Compute the number of e-folds. The e-fold integral is:

$$N = \frac{1}{M_P^2}\int_{\phi_\text{end}}^{\phi_*}\frac{V}{V'}\,d\phi = \frac{1}{M_P^2}\int_{\phi_\text{end}}^{\phi_*}\frac{\frac{1}{2}m^2\phi^2}{m^2\phi}\,d\phi = \frac{1}{2M_P^2}\int_{\phi_\text{end}}^{\phi_*}\phi\,d\phi$$

$$N = \frac{1}{2M_P^2}\left[\frac{\phi^2}{2}\right]_{\phi_\text{end}}^{\phi_*} = \frac{\phi_*^2 - \phi_\text{end}^2}{4M_P^2} = \frac{\phi_*^2 - 2M_P^2}{4M_P^2}$$

Step 4: Invert for $\phi_*(N)$. Solving for $\phi_*$:

$$\phi_*^2 = 4NM_P^2 + 2M_P^2 = (4N + 2)M_P^2 \approx 4NM_P^2 \quad (\text{for } N \gg 1)$$

Step 5: Derive $n_s$ and $r$. Substituting $\phi_*^2 \approx 4NM_P^2$ into the slow-roll parameters:

$$\epsilon_V = \frac{2M_P^2}{4NM_P^2} = \frac{1}{2N}, \qquad \eta_V = \frac{1}{2N}$$

Therefore:

$$\boxed{n_s = 1 - 6\epsilon_V + 2\eta_V = 1 - \frac{6}{2N} + \frac{2}{2N} = 1 - \frac{2}{N}}$$

$$\boxed{r = 16\epsilon_V = \frac{8}{N}}$$

Numerical Values for $m^2\phi^2$

$N = 50$:

$n_s = 1 - 2/50 = 0.960$

$r = 8/50 = 0.160$

$N = 60$:

$n_s = 1 - 2/60 = 0.967$

$r = 8/60 = 0.133$

Both values of $r$ significantly exceed the BICEP/Keck bound $r < 0.036$. The $m^2\phi^2$ model is now ruled out at high confidence.
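The size of the $N \gg 1$ approximation can be checked with a few lines of Python. The sketch below compares the leading-order result with the one obtained by keeping the $\phi_\text{end}$ term, $\phi_*^2 = (4N+2)M_P^2$; the function name and the `keep_phi_end` flag are hypothetical, and units with $M_P = 1$ are assumed.

```python
def quadratic_predictions(N, keep_phi_end=True):
    """Slow-roll n_s and r for V = m^2 phi^2 / 2, in units with M_P = 1."""
    # phi_*^2 = (4N + 2) M_P^2 keeps the phi_end term; 4N M_P^2 drops it
    phi_star_sq = 4.0 * N + 2.0 if keep_phi_end else 4.0 * N
    eps = 2.0 / phi_star_sq            # epsilon_V = eta_V = 2 M_P^2 / phi^2
    n_s = 1.0 - 6.0 * eps + 2.0 * eps  # eta_V equals epsilon_V for this model
    r = 16.0 * eps
    return n_s, r

R_BOUND = 0.036  # BICEP/Keck 95% CL upper limit quoted in the text

for N in (50, 60):
    ns_lead, r_lead = quadratic_predictions(N, keep_phi_end=False)
    ns_full, r_full = quadratic_predictions(N, keep_phi_end=True)
    print(f"N={N}: leading n_s={ns_lead:.4f}, r={r_lead:.4f}; "
          f"with phi_end n_s={ns_full:.4f}, r={r_full:.4f}; "
          f"r above bound: {r_full > R_BOUND}")
```

Keeping $\phi_\text{end}$ shifts $r$ by roughly one percent, far too little to bring the model back under the bound.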

The $\phi^4$ Case (Also Ruled Out)

For completeness, consider $V(\phi) = \lambda\phi^4/4$. We have $V' = \lambda\phi^3$ and $V'' = 3\lambda\phi^2$. Then:

$$\epsilon_V = \frac{M_P^2}{2}\left(\frac{4}{\phi}\right)^2 = \frac{8M_P^2}{\phi^2}, \qquad \eta_V = M_P^2\frac{12}{\phi^2} = \frac{12M_P^2}{\phi^2}$$

The end of inflation: $\epsilon_V = 1$ gives $\phi_\text{end}^2 = 8M_P^2$. The e-fold integral yields $N = (\phi_*^2 - 8M_P^2)/(8M_P^2)$, so $\phi_*^2 \approx 8NM_P^2$ for large $N$. Thus:

$$\epsilon_V = \frac{1}{N}, \quad \eta_V = \frac{3}{2N} \quad \Longrightarrow \quad \boxed{n_s = 1 - \frac{3}{N}, \quad r = \frac{16}{N}}$$

For $N = 60$: $n_s = 0.950$, $r = 0.267$ — both $n_s$ and $r$ are far outside the Planck contours. The $\phi^4$ model was already under pressure from WMAP and is now decisively ruled out.
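The quadratic and quartic cases follow the same pattern, which can be sketched for a general monomial $V \propto \phi^p$. This generalization is not derived in the text; it assumes $\epsilon_V = p^2 M_P^2/(2\phi^2)$, $\eta_V = p(p-1)M_P^2/\phi^2$ and the leading-order relation $\phi_*^2 \approx 2pNM_P^2$, which reduce to the formulas above for $p = 2$ and $p = 4$. The function name is hypothetical.

```python
def monomial_predictions(p, N):
    """Leading-order n_s and r for V proportional to phi^p (M_P = 1)."""
    phi_star_sq = 2.0 * p * N           # from N ~ phi_*^2 / (2 p M_P^2)
    eps = p**2 / (2.0 * phi_star_sq)    # epsilon_V = p^2 / (2 phi^2)
    eta = p * (p - 1) / phi_star_sq     # eta_V = p (p - 1) / phi^2
    return 1.0 - 6.0 * eps + 2.0 * eta, 16.0 * eps

for p in (2, 4):
    n_s, r = monomial_predictions(p, 60)
    print(f"p={p}, N=60: n_s={n_s:.3f}, r={r:.3f}")
```

Both $1 - n_s$ and $r$ grow with $p$ at fixed $N$, so steeper monomials sit further from the data.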

5.3 Starobinsky $R^2$ Inflation

The Starobinsky model (1980) was historically the first inflationary model, proposed even before Guth's 1981 paper. It modifies the Einstein-Hilbert action by adding a quadratic curvature term:

$$S = \frac{M_P^2}{2}\int d^4x\,\sqrt{-g}\,f(R), \qquad f(R) = R + \frac{R^2}{6M^2}$$

where $M$ is a mass scale to be fixed by the amplitude of perturbations ($M \sim 10^{13}$ GeV).

Step 1: Conformal Transformation to Einstein Frame

Any $f(R)$ theory can be recast as standard Einstein gravity plus a scalar field via a conformal transformation. Define:

$$F \equiv \frac{df}{dR} = 1 + \frac{R}{3M^2}$$

Perform the conformal (Weyl) rescaling of the metric:

$$\tilde{g}_{\mu\nu} = F\,g_{\mu\nu}$$

In the Einstein frame (with tildes), the action becomes:

$$\tilde{S} = \int d^4x\,\sqrt{-\tilde{g}}\left[\frac{M_P^2}{2}\tilde{R} - \frac{1}{2}\tilde{g}^{\mu\nu}\partial_\mu\phi\,\partial_\nu\phi - V(\phi)\right]$$

Step 2: Identify the Scalar Field

The canonical scalar field is related to $F$ by:

$$\phi = \sqrt{\frac{3}{2}}\,M_P\,\ln F = \sqrt{\frac{3}{2}}\,M_P\,\ln\!\left(1 + \frac{R}{3M^2}\right)$$

Inverting: $F = e^{\sqrt{2/3}\,\phi/M_P}$, so $R = 3M^2(F - 1) = 3M^2(e^{\sqrt{2/3}\,\phi/M_P} - 1)$.

Step 3: Derive the Plateau Potential

The Einstein-frame potential is given by the general $f(R)$ formula:

$$V(\phi) = \frac{M_P^2}{2}\,\frac{RF - f(R)}{F^2}$$

Substituting $f(R) = R + R^2/(6M^2)$:

$$RF - f = R\!\left(1 + \frac{R}{3M^2}\right) - R - \frac{R^2}{6M^2} = \frac{R^2}{3M^2} - \frac{R^2}{6M^2} = \frac{R^2}{6M^2}$$

Now express $R$ in terms of $F$: $R = 3M^2(F-1)$, so $R^2 = 9M^4(F-1)^2$:

$$V = \frac{M_P^2}{2}\cdot\frac{9M^4(F-1)^2}{6M^2 F^2} = \frac{3M^2 M_P^2}{4}\,\frac{(F-1)^2}{F^2}$$

Using $F = e^{\sqrt{2/3}\,\phi/M_P}$, we write $(F-1)/F = 1 - 1/F = 1 - e^{-\sqrt{2/3}\,\phi/M_P}$:

$$\boxed{V(\phi) = \frac{3M^2 M_P^2}{4}\left(1 - e^{-\sqrt{2/3}\,\phi/M_P}\right)^2}$$

This is the celebrated Starobinsky plateau potential. For large $\phi$, the exponential is negligible and $V \to 3M^2 M_P^2/4 = \text{const}$: a flat plateau that naturally gives slow-roll inflation.
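The algebra of Steps 2 and 3 can be spot-checked numerically. The sketch below builds the potential from the general $f(R)$ formula and compares it with the boxed plateau form at a few field values. The value of $M$ is simply the order-of-magnitude estimate $10^{13}$ GeV quoted above, expressed in units of $M_P$; function names are hypothetical.

```python
import math

M_P = 1.0                 # assumption: reduced Planck units
M = 1.0e13 / 2.435e18     # M ~ 10^13 GeV in units of M_P (order of magnitude only)
S = math.sqrt(2.0 / 3.0)

def f_of_R(R):
    """Starobinsky f(R) = R + R^2 / (6 M^2)."""
    return R + R**2 / (6.0 * M**2)

def V_from_fR(phi):
    """Einstein-frame potential from V = (M_P^2 / 2)(R F - f) / F^2."""
    F = math.exp(S * phi / M_P)
    R = 3.0 * M**2 * (F - 1.0)
    return 0.5 * M_P**2 * (R * F - f_of_R(R)) / F**2

def V_plateau(phi):
    """Closed form (3/4) M^2 M_P^2 (1 - exp(-sqrt(2/3) phi / M_P))^2."""
    x = math.exp(-S * phi / M_P)
    return 0.75 * M**2 * M_P**2 * (1.0 - x) ** 2

for phi in (0.5, 1.0, 3.0, 5.5, 8.0):
    a, b = V_from_fR(phi), V_plateau(phi)
    print(f"phi={phi:4.1f}: relative difference = {abs(a - b) / b:.2e}")
```

The two expressions should agree to floating-point precision, since the second is an exact rewriting of the first.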

Step 4: Slow-Roll Parameters on the Plateau

Define $x = e^{-\sqrt{2/3}\,\phi/M_P}$ for convenience (during inflation, $\phi$ is large and $x \ll 1$). Then:

$$V' = \frac{3M^2 M_P^2}{4}\cdot 2(1 - x)\cdot\sqrt{\frac{2}{3}}\,\frac{x}{M_P} = \sqrt{\frac{3}{2}}\,M^2 M_P\,x(1-x)$$

More directly, we compute $V'/V$:

$$\frac{V'}{V} = \frac{2\sqrt{2/3}}{M_P}\cdot\frac{x}{1 - x} \approx \frac{2\sqrt{2/3}}{M_P}\,x \quad (x \ll 1)$$

$$\epsilon_V = \frac{M_P^2}{2}\left(\frac{V'}{V}\right)^2 = \frac{4}{3}\,\frac{x^2}{(1-x)^2} \approx \frac{4}{3}\,x^2$$

For $\eta_V$, we differentiate once more, $V'' = -M^2 x(1 - 2x)$, and divide by $V$:

$$\eta_V = -\frac{4}{3}\,\frac{x(1-2x)}{(1-x)^2} \approx -\frac{4x}{3} \quad (x \ll 1)$$

Step 5: Relate to e-folds and Derive $n_s$, $r$

The number of e-folds (for $x \ll 1$) is:

$$N \approx \frac{3}{4}\,\frac{1}{x_*} \quad \Longrightarrow \quad x_* \approx \frac{3}{4N}$$

Substituting into the slow-roll parameters:

$$\epsilon_V \approx \frac{4}{3}\left(\frac{3}{4N}\right)^2 = \frac{3}{4N^2}$$

$$\eta_V \approx -\frac{4}{3}\cdot\frac{3}{4N} = -\frac{1}{N}$$

Therefore:

$$\boxed{n_s \approx 1 - 6\cdot\frac{3}{4N^2} + 2\cdot\left(-\frac{1}{N}\right) \approx 1 - \frac{2}{N}} \quad (\text{dominant term})$$

$$\boxed{r = 16\epsilon_V \approx \frac{12}{N^2}}$$

Numerical Values for Starobinsky

$N = 50$:

$n_s = 1 - 2/50 = 0.960$

$r = 12/2500 = 0.0048$

$N = 60$:

$n_s = 1 - 2/60 = 0.967$

$r = 12/3600 = 0.0033$

Excellent agreement with Planck data! The Starobinsky model sits right in the sweet spot of the $n_s$–$r$ plane.
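The leading-order formulas drop terms suppressed by $x_*$ and by $\phi_\text{end}$. A minimal sketch of how to gauge their size: keep the full slow-roll expressions $\epsilon_V = \tfrac{4}{3}x^2/(1-x)^2$ and $\eta_V = -\tfrac{4}{3}x(1-2x)/(1-x)^2$ (the latter obtained by differentiating the boxed potential twice), integrate $V/V'$ exactly, and solve for $\phi_*$ by bisection. Units with $M_P = 1$, the bracketing interval, and all function names are assumptions of this example.

```python
import math

S = math.sqrt(2.0 / 3.0)  # exponent coefficient, with M_P = 1

def eps_eta(phi):
    """Full first-order slow-roll parameters on the Starobinsky potential."""
    x = math.exp(-S * phi)
    eps = (4.0 / 3.0) * x**2 / (1.0 - x) ** 2
    eta = -(4.0 / 3.0) * x * (1.0 - 2.0 * x) / (1.0 - x) ** 2
    return eps, eta

def efold_primitive(phi):
    """Antiderivative of V/V' = (exp(S*phi) - 1) / (2*S)."""
    return 0.75 * math.exp(S * phi) - phi / (2.0 * S)

# End of inflation: eps = 1  <=>  x / (1 - x) = sqrt(3) / 2
x_end = math.sqrt(3.0) / (2.0 + math.sqrt(3.0))
phi_end = -math.log(x_end) / S

def phi_star(N, hi=20.0, tol=1e-12):
    """Bisection for the field value N e-folds before the end of inflation."""
    lo = phi_end
    target = N + efold_primitive(phi_end)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if efold_primitive(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

for N in (50, 55, 60):
    eps, eta = eps_eta(phi_star(N))
    n_s, r = 1.0 - 6.0 * eps + 2.0 * eta, 16.0 * eps
    print(f"N={N}: n_s={n_s:.4f} (vs {1 - 2/N:.4f}), r={r:.5f} (vs {12/N**2:.5f})")
```

The corrections are at the level of $10^{-3}$ in $n_s$ and roughly ten percent in $r$, so the conclusion about agreement with the data does not depend on the approximation.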

5.4 Natural Inflation

Natural inflation (Freese, Frieman & Olinto, 1990) uses a pseudo-Nambu-Goldstone boson as the inflaton, providing a natural mechanism for the flatness of the potential. The potential arises from explicit breaking of a shift symmetry:

$$V(\phi) = \Lambda^4\left[1 + \cos\!\left(\frac{\phi}{f}\right)\right]$$

where $\Lambda$ is the energy scale of the potential and $f$ is the axion decay constant (the symmetry-breaking scale).

Step 1: Slow-Roll Parameters

Computing the derivatives:

$$V' = -\frac{\Lambda^4}{f}\sin\!\left(\frac{\phi}{f}\right), \qquad V'' = -\frac{\Lambda^4}{f^2}\cos\!\left(\frac{\phi}{f}\right)$$

The slow-roll parameters become:

$$\epsilon_V = \frac{M_P^2}{2f^2}\,\frac{\sin^2(\phi/f)}{[1 + \cos(\phi/f)]^2} = \frac{M_P^2}{2f^2}\,\frac{1 - \cos(\phi/f)}{1 + \cos(\phi/f)}$$

where we used $\sin^2\theta/(1+\cos\theta)^2 = (1-\cos\theta)/(1+\cos\theta)$ (the identity $\sin^2\theta = (1-\cos\theta)(1+\cos\theta)$).

$$\eta_V = -\frac{M_P^2}{f^2}\,\frac{\cos(\phi/f)}{1 + \cos(\phi/f)}$$

Step 2: Predictions as Functions of $N$ and $f$

The e-fold integral can be evaluated analytically. The field rolls toward the minimum at $\phi = \pi f$, so $\phi_* < \phi_\text{end}$ and:

$$N = \frac{f^2}{M_P^2}\ln\!\left[\frac{1 - \cos(\phi_\text{end}/f)}{1 - \cos(\phi_*/f)}\right]$$

In the limit $f \gg M_P$ (large decay constant), the potential near its minimum at $\phi = \pi f$ becomes approximately quadratic: $V \approx \Lambda^4(\phi - \pi f)^2/(2f^2)$, recovering chaotic inflation. In the opposite limit $f \lesssim M_P$, the potential is too steep: enough e-folds are possible only if the field starts very close to the maximum, and the resulting spectrum is far too red.

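Eliminating $\phi_*$ and $\phi_\text{end}$ using the e-fold relation and the condition $\epsilon_V(\phi_\text{end}) = 1$, the spectral index and tensor-to-scalar ratio are:

$$n_s = 1 - \frac{M_P^2}{f^2}\,\frac{1 + M_P^2/(2f^2) + e^{-NM_P^2/f^2}}{1 + M_P^2/(2f^2) - e^{-NM_P^2/f^2}}, \qquad r = \frac{8M_P^2}{f^2}\,\frac{e^{-NM_P^2/f^2}}{1 + M_P^2/(2f^2) - e^{-NM_P^2/f^2}}$$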

In the large-$f$ limit, expanding to leading order in $M_P^2/f^2$:

$$n_s \xrightarrow{f \gg M_P} 1 - \frac{2}{N}, \qquad r \xrightarrow{f \gg M_P} \frac{8}{N}$$

recovering the $m^2\phi^2$ predictions, as expected.
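The dependence on $f$ is easy to explore numerically. The sketch below evaluates $\epsilon_V$ and $\eta_V$ at horizon exit, using $\epsilon_V(\phi_\text{end}) = 1$ to fix $\cos(\phi_\text{end}/f)$ and the e-fold relation in the form $1 - \cos(\phi_*/f) = [1 - \cos(\phi_\text{end}/f)]\,e^{-NM_P^2/f^2}$ (the field at horizon exit sits further from the minimum at $\phi = \pi f$ than at the end of inflation). The function name is hypothetical and `f` is measured in units of $M_P$.

```python
import math

def natural_inflation(N, f):
    """First-order slow-roll n_s and r for V = Lambda^4 [1 + cos(phi/f)].

    f is the decay constant in units of M_P.
    """
    mu = 1.0 / f**2                          # M_P^2 / f^2
    ratio = 2.0 / mu                         # (1 - cos)/(1 + cos) at eps_V = 1
    cos_end = (1.0 - ratio) / (1.0 + ratio)  # close to -1, i.e. near the minimum
    cos_star = 1.0 - (1.0 - cos_end) * math.exp(-N * mu)
    eps = 0.5 * mu * (1.0 - cos_star) / (1.0 + cos_star)
    eta = -mu * cos_star / (1.0 + cos_star)
    return 1.0 - 6.0 * eps + 2.0 * eta, 16.0 * eps

for f in (5, 7, 10, 20, 1000):
    n_s, r = natural_inflation(60, f)
    print(f"f = {f:>4} M_P: n_s = {n_s:.4f}, r = {r:.4f}")
```

As $f$ grows, the output approaches the quadratic values $1 - 2/N$ and $8/N$; as $f$ decreases toward a few $M_P$, $r$ falls but $n_s$ becomes redder.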

The $f > M_P$ Requirement and Weak Gravity Conjecture

For natural inflation to produce $N \geq 50$ e-folds with $n_s$ consistent with Planck, one needs:

$$f \gtrsim 5\,M_P$$

This super-Planckian decay constant is problematic from the perspective of quantum gravity. The weak gravity conjecture (Arkani-Hamed, Motl, Nicolis & Vafa, 2007) suggests that in any consistent theory of quantum gravity, the axion decay constant should satisfy $f \lesssim M_P$. If correct, this would rule out natural inflation in its simplest form.

Various mechanisms have been proposed to circumvent this tension, including alignment of multiple axions (Kim, Nilles & Peloso, 2005) and monodromy (Silverstein & Westphal, 2008), though each introduces additional model-building complications. Current Planck+BICEP data place natural inflation under significant observational pressure even for large $f$.

5.5 $\alpha$-Attractors

The $\alpha$-attractor models (Kallosh & Linde, 2013; Kallosh, Linde & Roest, 2013) represent a broad universality class of inflationary potentials motivated by supergravity and the geometry of the Kähler manifold. They come in two families:

T-models:

$$V = V_0\tanh^{2n}\!\left(\frac{\phi}{\sqrt{6\alpha}\,M_P}\right)$$

E-models:

$$V = V_0\left(1 - e^{-\sqrt{2/(3\alpha)}\,\phi/M_P}\right)^{2n}$$

The parameter $\alpha$ controls the curvature of the Kähler manifold, $n$ determines the power, and $V_0$ sets the energy scale. For $n = 1$, the E-model with $\alpha = 1$ reproduces exactly the Starobinsky potential.

Universal Predictions in the Large-Field Limit

The remarkable property of $\alpha$-attractors is that for any value of $n$, the predictions for $n_s$ and $r$ converge to universal values in the limit of large field excursion (large $N$). To see this, consider the T-model with general $n$. Define $y = \tanh(\phi/(\sqrt{6\alpha}\,M_P))$. For large $\phi$, $y \to 1$ and we write $y = 1 - \delta$ with $\delta \ll 1$.

The slow-roll parameter $\epsilon_V$ is:

$$\epsilon_V = \frac{M_P^2}{2}\left(\frac{V'}{V}\right)^2 = \frac{4n^2}{3\alpha}\,\frac{(1-y^2)^2}{4y^2} \approx \frac{4n^2}{3\alpha}\,\delta^2 \quad (\delta \ll 1)$$

The e-fold calculation gives:

$$N \approx \frac{3\alpha}{4n}\,\frac{1}{\delta_*} \quad \Longrightarrow \quad \delta_* \approx \frac{3\alpha}{4nN}$$

Substituting back:

$$\epsilon_V \approx \frac{4n^2}{3\alpha}\left(\frac{3\alpha}{4nN}\right)^2 = \frac{3\alpha}{4N^2}$$

Note that $n$ has completely cancelled! Similarly, $\eta_V \approx -1/N$. Therefore:

$$\boxed{n_s \approx 1 - \frac{2}{N}, \qquad r \approx \frac{12\alpha}{N^2}}$$

This is the attractor behavior: the spectral index is universal ($n$-independent), while $r$ depends only on $\alpha$. The Starobinsky model corresponds to $\alpha = 1$ ($r = 12/N^2$).
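The cancellation of $n$ can be checked without the $\delta \ll 1$ expansion. The sketch below is one way to do this for the T-model, assuming $M_P = 1$ and using the exact slow-roll e-fold integral $N = \frac{3\alpha}{2n}\left[\frac{1}{1-y_*^2} - \frac{1}{1-y_\text{end}^2}\right]$, which follows from $V/V'$ and $dy/d\phi = (1-y^2)/\sqrt{6\alpha}$. The function name is hypothetical.

```python
import math

def t_model(N, alpha, n):
    """Slow-roll n_s and r for V = V0 tanh^(2n)(phi / sqrt(6 alpha)), M_P = 1."""
    b = math.sqrt(3.0 * alpha) / n
    # eps_V = 1  <=>  (1 - y^2) / y = b  <=>  y^2 + b y - 1 = 0
    y_end = 0.5 * (-b + math.sqrt(b * b + 4.0))
    # Invert the e-fold integral for u = 1 - y_*^2
    u = 1.0 / (2.0 * n * N / (3.0 * alpha) + 1.0 / (1.0 - y_end**2))
    y_sq = 1.0 - u
    eps = n**2 * u**2 / (3.0 * alpha * y_sq)
    eta = (2.0 * n * (2.0 * n - 1.0) * u**2 / y_sq - 4.0 * n * u) / (6.0 * alpha)
    return 1.0 - 6.0 * eps + 2.0 * eta, 16.0 * eps

N = 55
for alpha in (0.1, 1.0):
    for n in (1, 2, 3):
        n_s, r = t_model(N, alpha, n)
        print(f"alpha={alpha}, n={n}: n_s={n_s:.4f}, r={r:.5f} "
              f"(universal: {1 - 2/N:.4f}, {12*alpha/N**2:.5f})")
```

For fixed $\alpha$ the rows with different $n$ should differ only in the fourth decimal place of $n_s$, while $r$ tracks $12\alpha/N^2$.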

Universality Explained

The universality arises because for large field values, all $\alpha$-attractor potentials approach a plateau. The approach to the plateau is controlled by a single parameter $\alpha$ (the curvature of the Kähler manifold in the supergravity embedding), which fixes the shape of the potential near the plateau edge. The specific form of the potential away from the plateau (encoded in $n$) becomes irrelevant because the observable perturbations are generated while the field is near the plateau.

● $\alpha = 1$: Starobinsky / $R^2$ inflation ($r \approx 0.004$ for $N = 55$)
● $\alpha \ll 1$: Very small $r$, deep in the plateau regime
● $\alpha \gg 1$: Approaches monomial inflation, large $r$

5.6 Higgs Inflation

Higgs inflation (Bezrukov & Shaposhnikov, 2008) is the remarkable idea that the Standard Model Higgs boson itself can serve as the inflaton, provided it has a large non-minimal coupling to gravity. The Jordan-frame action is:

$$S_J = \int d^4x\,\sqrt{-g}\left[\frac{M_P^2 + \xi h^2}{2}\,R - \frac{1}{2}(\partial h)^2 - \frac{\lambda}{4}(h^2 - v^2)^2\right]$$

where $h$ is the Higgs field (in unitary gauge), $\xi$ is the non-minimal coupling ($\xi \sim 10^4$ for the correct amplitude of perturbations), $\lambda$ is the Higgs quartic coupling, and $v = 246$ GeV is the electroweak vacuum expectation value.

Step 1: Conformal Transformation

Define the conformal factor:

$$\Omega^2 = 1 + \frac{\xi h^2}{M_P^2}$$

Perform the Weyl rescaling $\tilde{g}_{\mu\nu} = \Omega^2 g_{\mu\nu}$ to go to the Einstein frame:

$$\tilde{g}_{\mu\nu} = \left(1 + \frac{\xi h^2}{M_P^2}\right)g_{\mu\nu}$$

Step 2: Canonical Scalar Field

After the conformal transformation, the kinetic term for $h$ is non-canonical. The canonically normalized field $\chi$ satisfies:

$$\frac{d\chi}{dh} = \sqrt{\frac{1}{\Omega^2} + \frac{6\xi^2 h^2}{M_P^2\,\Omega^4}} = \sqrt{\frac{1 + \xi h^2/M_P^2 + 6\xi^2 h^2/M_P^2}{(1 + \xi h^2/M_P^2)^2}}$$

In the inflationary regime where $h \gg M_P/\sqrt{\xi}$ (so $\Omega^2 \approx \xi h^2/M_P^2 \gg 1$ and $6\xi \gg 1$):

$$\frac{d\chi}{dh} \approx \frac{\sqrt{6}\,\xi h/M_P}{\xi h^2/M_P^2} = \frac{\sqrt{6}\,M_P}{h}$$

Integrating: $\chi = \sqrt{6}\,M_P\,\ln(h\sqrt{\xi}/M_P)$, or equivalently:

$$h = \frac{M_P}{\sqrt{\xi}}\,e^{\chi/(\sqrt{6}\,M_P)}$$

Step 3: The Einstein-Frame Plateau Potential

The Einstein-frame potential is:

$$V_E(\chi) = \frac{1}{\Omega^4}\cdot\frac{\lambda}{4}h^4 = \frac{\lambda h^4/4}{(1 + \xi h^2/M_P^2)^2}$$

For $h \gg M_P/\sqrt{\xi}$:

$$V_E \approx \frac{\lambda h^4/4}{\xi^2 h^4/M_P^4} = \frac{\lambda M_P^4}{4\xi^2}$$

This is a constant — a plateau! More precisely, keeping the correction:

$$\boxed{V_E(\chi) = \frac{\lambda M_P^4}{4\xi^2}\left(1 - e^{-\sqrt{2/3}\,\chi/M_P}\right)^2}$$

This has exactly the same functional form as the Starobinsky potential! Therefore, the predictions are identical:

$$\boxed{n_s \approx 1 - \frac{2}{N}, \qquad r \approx \frac{12}{N^2}}$$

The mass scale is fixed by the amplitude of scalar perturbations: $\lambda/\xi^2 \sim 10^{-10}$, which for $\lambda \sim 0.01$ (the Higgs self-coupling at the inflationary scale) gives $\xi \sim 10^4$.
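The normalization can be made slightly more explicit. The sketch below assumes the standard slow-roll amplitude $A_s = V/(24\pi^2\epsilon_V M_P^4)$ with the plateau height $V = \lambda M_P^4/(4\xi^2)$ and $\epsilon_V = 3/(4N^2)$, and takes $A_s \approx 2.1 \times 10^{-9}$ as a representative measured value. Neither the amplitude formula nor this number is quoted in the text, so both should be read as assumptions of the example.

```python
import math

A_S = 2.1e-9  # assumed scalar amplitude (representative value, not from the text)

def xi_from_amplitude(lam, N, A_s=A_S):
    """Non-minimal coupling implied by A_s = lam N^2 / (72 pi^2 xi^2)."""
    return N * math.sqrt(lam / (72.0 * math.pi**2 * A_s))

lam = 0.01
xi = xi_from_amplitude(lam, N=55)
print(f"xi = {xi:.0f}, lambda / xi^2 = {lam / xi**2:.1e}")
```

With these inputs $\xi$ comes out at a few thousand, consistent with the order-of-magnitude statement $\xi \sim 10^4$.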

Unitarity Concerns

The main theoretical concern with Higgs inflation is perturbative unitarity. With $\xi \sim 10^4$, the effective cutoff scale of the Jordan-frame theory in the vacuum ($h \sim v$) is:

$$\Lambda_\text{cutoff} \sim \frac{M_P}{\xi} \sim 10^{14}\;\text{GeV}$$

This is only about one order of magnitude above the inflationary Hubble scale $H_\text{inf} \sim 10^{13}$ GeV. The concern is that new physics entering at or below this cutoff could invalidate the inflationary calculation. However, Bezrukov and Shaposhnikov argued that the relevant cutoff during inflation is background-dependent and higher: $\Lambda \sim M_P/\sqrt{\xi} \sim 10^{16}$ GeV, which is safely above $H_\text{inf}$. The debate on this subtle point continues in the literature.
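The hierarchy of scales can be laid out with simple arithmetic. The sketch below uses the reduced Planck mass from the notation box and estimates the Hubble rate from $H^2 = V/(3M_P^2)$ with the plateau height $V = \lambda M_P^4/(4\xi^2)$; the inputs $\xi = 10^4$ and $\lambda = 0.01$ are the illustrative values used in this section, and the function name is hypothetical.

```python
import math

M_P_GEV = 2.435e18  # reduced Planck mass in GeV

def higgs_inflation_scales(xi, lam):
    """Return (vacuum cutoff, background cutoff, Hubble rate) in GeV."""
    vacuum_cutoff = M_P_GEV / xi
    background_cutoff = M_P_GEV / math.sqrt(xi)
    # H = sqrt(V / 3) / M_P with V = lam M_P^4 / (4 xi^2)
    hubble = M_P_GEV * math.sqrt(lam / 12.0) / xi
    return vacuum_cutoff, background_cutoff, hubble

vac, bkg, hub = higgs_inflation_scales(xi=1.0e4, lam=0.01)
print(f"M_P/xi       = {vac:.2e} GeV")
print(f"M_P/sqrt(xi) = {bkg:.2e} GeV")
print(f"H_inf        = {hub:.2e} GeV")
```

The ordering $H_\text{inf} < M_P/\xi < M_P/\sqrt{\xi}$ holds for these inputs, with the first gap being the narrow one.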

Despite these concerns, Higgs inflation remains attractive as a minimal model: it requires no new particles beyond the Standard Model, only a single new coupling constant $\xi$.

5.7 Model Comparison and Bayesian Selection

The $n_s$–$r$ Plane

The most informative way to compare models is in the $n_s$–$r$ plane. Each model traces a curve or region parameterized by the number of e-folds $N$ (and sometimes additional parameters like $f/M_P$ or $\alpha$). The Planck 2018 + BICEP/Keck 2021 data define confidence contours in this plane. Here is a summary:

Model	$n_s$ formula	$r$ formula	$n_s$ ($N\!=\!55$)	$r$ ($N\!=\!55$)	Status
$m^2\phi^2$	$1 - 2/N$	$8/N$	0.964	0.145	Ruled out
$\lambda\phi^4$	$1 - 3/N$	$16/N$	0.945	0.291	Ruled out
Starobinsky / $R^2$	$1 - 2/N$	$12/N^2$	0.964	0.004	Excellent fit
Natural ($f = 10\,M_P$)	Sec. 5.4	Sec. 5.4	0.963	0.108	Tension
$\alpha$-attractor ($\alpha$)	$1 - 2/N$	$12\alpha/N^2$	0.964	$0.004\alpha$	Excellent
Higgs (large $\xi$)	$1 - 2/N$	$12/N^2$	0.964	0.004	Excellent fit

Bayesian Model Selection with Planck Data

Bayesian model comparison computes the evidence (marginal likelihood) for each model:

$$\mathcal{Z}_i = \int d\boldsymbol{\theta}\,\mathcal{L}(\text{data}|\boldsymbol{\theta}, M_i)\,\pi(\boldsymbol{\theta}|M_i)$$

The Bayes factor between models $M_i$ and $M_j$ is $B_{ij} = \mathcal{Z}_i/\mathcal{Z}_j$. Planck 2018 analyses using this framework consistently find:

● Strong preference for Starobinsky/$R^2$ and $\alpha$-attractor models with small $\alpha$
● Decisive evidence against $\phi^4$ and monomial models with $p \geq 2$
● Moderate tension for natural inflation even with $f = 10\,M_P$
● Higgs inflation is statistically indistinguishable from Starobinsky (identical predictions for $n_s$, $r$)
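To make the evidence integral concrete, here is a deliberately crude toy version, not the Planck analysis. It assumes a Gaussian likelihood in $n_s$ with the quoted mean and width, a Gaussian in $r$ centered on zero whose width is half the 95% upper limit, a flat prior on $N$ between 50 and 60 as the only model parameter, and a simple grid average in place of a proper sampler. All names are hypothetical.

```python
import math

NS_MEAN, NS_SIGMA = 0.9649, 0.0042  # Planck 2018 value quoted in the text
R_SIGMA = 0.036 / 2.0               # assumption: treat the 95% limit as 2 sigma

def log_like(n_s, r):
    """Toy Gaussian log-likelihood (common normalization dropped)."""
    return -0.5 * ((n_s - NS_MEAN) / NS_SIGMA) ** 2 - 0.5 * (r / R_SIGMA) ** 2

def evidence(model, n_lo=50.0, n_hi=60.0, samples=1001):
    """Average likelihood over a flat prior on N (uniform grid)."""
    total = 0.0
    for i in range(samples):
        N = n_lo + (n_hi - n_lo) * i / (samples - 1)
        total += math.exp(log_like(*model(N)))
    return total / samples

def quadratic(N):
    return 1.0 - 2.0 / N, 8.0 / N

def starobinsky(N):
    return 1.0 - 2.0 / N, 12.0 / N**2

ln_B = math.log(evidence(starobinsky) / evidence(quadratic))
print(f"ln B (Starobinsky vs m^2 phi^2) = {ln_B:.1f}")
```

Because both models share the same $n_s(N)$, the Bayes factor in this toy setup is driven entirely by the $r$ term of the likelihood.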

The key discriminant between surviving models is the tensor-to-scalar ratio $r$. Future CMB experiments (CMB-S4, LiteBIRD) aim for sensitivity $\sigma(r) \sim 10^{-3}$, which would definitively test Starobinsky-class models.

5.8 Historical Context

1980 — Starobinsky

Alexei Starobinsky proposed the $R + R^2/(6M^2)$ modification of gravity as a model for the early universe, motivated by quantum corrections to gravity in a de Sitter background. This was the first model of cosmic inflation, predating Guth's work by a year, though it was not initially framed in the language of the “inflationary paradigm.” Starobinsky's model has proven remarkably prescient: four decades later, it remains the best fit to Planck data.

1983 — Linde's Chaotic Inflation

Andrei Linde proposed “chaotic inflation,” arguing that inflation does not require special initial conditions or a phase transition. With a simple $V = m^2\phi^2/2$ potential, inflation occurs naturally for any initial field value $\phi \gtrsim \text{few} \times M_P$. This model served as the workhorse of inflationary cosmology for three decades before being ruled out by Planck + BICEP data.

1990 — Natural Inflation (Freese, Frieman & Olinto)

Katherine Freese, Joshua Frieman, and Angela Olinto proposed using a pseudo-Nambu-Goldstone boson as the inflaton, providing a natural explanation for the flatness of the potential via an approximate shift symmetry. The cosine potential $V = \Lambda^4[1 + \cos(\phi/f)]$ introduced the axion decay constant $f$ as a key parameter, connecting inflation to particle physics.

2008 — Higgs Inflation (Bezrukov & Shaposhnikov)

Fedor Bezrukov and Mikhail Shaposhnikov showed that the Standard Model Higgs boson, with a large non-minimal coupling $\xi h^2 R$ to gravity ($\xi \sim 10^4$), can drive inflation. The Einstein-frame potential develops a plateau identical to Starobinsky's model. This minimalist approach requires no new particles, only a single new coupling constant.

2013 — $\alpha$-Attractors (Kallosh & Linde)

Renata Kallosh and Andrei Linde discovered that a broad class of supergravity-motivated inflationary models share universal predictions: $n_s \approx 1 - 2/N$ and $r \approx 12\alpha/N^2$, where $\alpha$ parameterizes the curvature of the Kähler manifold. The Starobinsky model is the $\alpha = 1$ member of this family. This unifying framework showed that the agreement of many models with Planck data is not coincidental but reflects a deep geometric attractor mechanism.

5.9 Python Simulation

The following simulation plots the $n_s$–$r$ plane with approximate Planck 2018 + BICEP/Keck confidence contours, overlaid with predictions from all five models discussed in this chapter. It also plots the potential $V(\phi)$ for each model.

Inflationary Models: n_s-r Plane and Potentials


import numpy as np
import matplotlib
matplotlib.use('Agg')
import matplotlib.pyplot as plt

# ============================================================
# FIGURE 1: n_s - r plane with Planck/BICEP contours
# ============================================================
fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(16, 7))

# --- Approximate Planck 2018 + BICEP/Keck contours (ellipses) ---
# Central values from Planck 2018: n_s = 0.9649, r ~ 0
ns_center = 0.9649
r_center = 0.0

# 68% and 95% CL approximate ellipses
theta = np.linspace(0, 2*np.pi, 300)

# 68% CL ellipse
sigma_ns_68 = 0.0042
sigma_r_68 = 0.015
ns_68 = ns_center + sigma_ns_68 * np.cos(theta)
r_68 = r_center + sigma_r_68 * np.sin(theta)
r_68 = np.clip(r_68, 0, None)

# 95% CL ellipse
sigma_ns_95 = 0.0084
sigma_r_95 = 0.036
ns_95 = ns_center + sigma_ns_95 * np.cos(theta)
r_95 = r_center + sigma_r_95 * np.sin(theta)
r_95 = np.clip(r_95, 0, None)

ax1.fill(ns_95, r_95, color='#4169E1', alpha=0.15, label='Planck+BICEP 95% CL')
ax1.fill(ns_68, r_68, color='#4169E1', alpha=0.3, label='Planck+BICEP 68% CL')
ax1.plot(ns_95, r_95, color='#4169E1', alpha=0.5, linewidth=1)
ax1.plot(ns_68, r_68, color='#4169E1', alpha=0.7, linewidth=1)

# --- Model 1: m^2 phi^2 (Chaotic) ---
N_arr = np.linspace(40, 70, 100)
ns_chaotic = 1 - 2.0/N_arr
r_chaotic = 8.0/N_arr
ax1.plot(ns_chaotic, r_chaotic, 'r-', linewidth=2.5, label=r'$m^2\phi^2$ (Chaotic)')
for N_mark in [50, 60]:
    ax1.plot(1 - 2.0/N_mark, 8.0/N_mark, 'ro', markersize=8)
    ax1.annotate(f'N={N_mark}', (1 - 2.0/N_mark, 8.0/N_mark),
                 textcoords='offset points', xytext=(8, 5), fontsize=8, color='red')

# --- Model 2: phi^4 ---
ns_phi4 = 1 - 3.0/N_arr
r_phi4 = 16.0/N_arr
ax1.plot(ns_phi4, r_phi4, 'r--', linewidth=2, alpha=0.7, label=r'$\lambda\phi^4$')
ax1.plot(1 - 3.0/60, 16.0/60, 'r^', markersize=8)

# --- Model 3: Starobinsky R^2 ---
ns_star = 1 - 2.0/N_arr
r_star = 12.0/N_arr**2
ax1.plot(ns_star, r_star, 'g-', linewidth=2.5, label=r'Starobinsky $R^2$')
for N_mark in [50, 60]:
    ax1.plot(1 - 2.0/N_mark, 12.0/N_mark**2, 'gs', markersize=8)
    ax1.annotate(f'N={N_mark}', (1 - 2.0/N_mark, 12.0/N_mark**2),
                 textcoords='offset points', xytext=(8, 5), fontsize=8, color='green')

# --- Model 4: Natural Inflation for various f/M_P ---
for f_over_MP in [5, 7, 10, 20]:
    ns_nat_list = []
    r_nat_list = []
    for N_val in N_arr:
        # First-order slow-roll evaluation for natural inflation.
        # In the limit of large f this approaches m^2 phi^2.
        mu2 = 1.0 / f_over_MP**2  # M_P^2 / f^2
        # End of inflation: epsilon_V = (mu2/2) * (1 - cos x)/(1 + cos x) = 1,
        # so (1 - cos x_end)/(1 + cos x_end) = 2*f^2/M_P^2 = ratio and
        # cos x_end = (1 - ratio)/(1 + ratio), close to -1 (near the minimum).
        ratio = 2.0 * f_over_MP**2
        cos_end = (1.0 - ratio) / (1.0 + ratio)
        # e-folds: N = (f^2/M_P^2) * ln[(1 - cos x_end)/(1 - cos x_*)],
        # so 1 - cos x_* = (1 - cos x_end) * exp(-N * M_P^2/f^2)
        one_m_cos_end = 1.0 - cos_end
        one_m_cos_star = one_m_cos_end * np.exp(-N_val * mu2)
        cos_star = 1.0 - one_m_cos_star
        # epsilon and eta at horizon exit (1 + cos_star > 0 by construction)
        eps = (mu2 / 2.0) * (1.0 - cos_star) / (1.0 + cos_star)
        eta = -mu2 * cos_star / (1.0 + cos_star)
        ns_val = 1.0 - 6.0*eps + 2.0*eta
        r_val = 16.0 * eps
        ns_nat_list.append(ns_val)
        r_nat_list.append(r_val)
    ns_nat = np.array(ns_nat_list)
    r_nat = np.array(r_nat_list)
    mask = (r_nat < 0.4) & (ns_nat > 0.92) & (r_nat > 0)
    if np.any(mask):
        ax1.plot(ns_nat[mask], r_nat[mask], '--', linewidth=1.5, color='orange',
                 alpha=0.5 + 0.15*(f_over_MP/20))
        # Label at N=55
        idx_55 = np.argmin(np.abs(N_arr - 55))
        if mask[idx_55]:
            ax1.annotate(f'f={f_over_MP}Mp', (ns_nat[idx_55], r_nat[idx_55]),
                         textcoords='offset points', xytext=(5, 3), fontsize=7, color='orange')

ax1.plot([], [], '--', color='orange', linewidth=1.5, label='Natural (various f)')

# --- Model 5: alpha-attractors for various alpha ---
for alpha_val in [0.1, 0.5, 1, 3, 7]:
    ns_alpha = 1 - 2.0/N_arr
    r_alpha = 12.0 * alpha_val / N_arr**2
    # Clamp the opacity: matplotlib rejects alpha values outside [0, 1]
    ax1.plot(ns_alpha, r_alpha, '-', linewidth=1.5, color='purple',
             alpha=min(1.0, 0.4 + 0.12*alpha_val))
    idx_55 = np.argmin(np.abs(N_arr - 55))
    ax1.annotate(f'a={alpha_val}', (ns_alpha[idx_55], r_alpha[idx_55]),
                 textcoords='offset points', xytext=(5, 3), fontsize=7, color='purple')

ax1.plot([], [], '-', color='purple', linewidth=1.5, label=r'$\alpha$-attractors')

# --- Model 6: Higgs inflation (same as Starobinsky, just mark it) ---
ax1.plot(1 - 2.0/55, 12.0/55**2, 'b*', markersize=14, label='Higgs inflation', zorder=5)

ax1.set_xlabel(r'$n_s$', fontsize=14)
ax1.set_ylabel(r'$r$', fontsize=14)
ax1.set_title(r'Inflationary Models in the $n_s$-$r$ Plane', fontsize=14)
ax1.set_xlim(0.93, 0.985)
ax1.set_ylim(0, 0.30)
ax1.legend(loc='upper right', fontsize=8, framealpha=0.9)
ax1.grid(True, alpha=0.3)
ax1.tick_params(labelsize=11)

# ============================================================
# FIGURE 2: V(phi) for each model
# ============================================================
phi = np.linspace(0.01, 8, 500)  # in units of M_P

# m^2 phi^2 (normalized so V = 1 at the right edge of the plot, phi = 8 M_P)
V_chaotic = 0.5 * phi**2
V_chaotic = V_chaotic / V_chaotic[-1]

# Starobinsky
V_star = (1 - np.exp(-np.sqrt(2.0/3.0) * phi))**2
# Already normalized to approach 1

# Natural inflation (f = 5 M_P)
f_nat = 5.0
V_nat = 1 + np.cos(phi / f_nat)
V_nat = V_nat / 2.0  # Normalize to max = 1

# alpha-attractor T-model (alpha = 1, n = 1)
V_alpha = np.tanh(phi / np.sqrt(6.0))**2

# Higgs inflation (same shape as Starobinsky)
V_higgs = (1 - np.exp(-np.sqrt(2.0/3.0) * phi))**2

ax2.plot(phi, V_chaotic, 'r-', linewidth=2.5, label=r'$m^2\phi^2$ (Chaotic)')
ax2.plot(phi, V_star, 'g-', linewidth=2.5, label=r'Starobinsky $R^2$')
ax2.plot(phi, V_nat, '--', color='orange', linewidth=2, label=r'Natural ($f=5\,M_P$)')
ax2.plot(phi, V_alpha, '-', color='purple', linewidth=2, label=r'$\alpha$-attractor ($\alpha=1$)')
ax2.plot(phi, V_higgs, 'b--', linewidth=2, alpha=0.7, label='Higgs inflation')

# Shade the slow-roll region
ax2.axvspan(3, 8, alpha=0.05, color='yellow')
ax2.text(5.5, 0.15, 'Slow-roll\nregion', fontsize=10, color='yellow', alpha=0.6, ha='center')

ax2.set_xlabel(r'$\phi / M_P$', fontsize=14)
ax2.set_ylabel(r'$V(\phi) / V_0$', fontsize=14)
ax2.set_title('Inflationary Potentials (Normalized)', fontsize=14)
ax2.set_xlim(0, 8)
ax2.set_ylim(0, 1.15)
ax2.legend(loc='lower right', fontsize=9, framealpha=0.9)
ax2.grid(True, alpha=0.3)
ax2.tick_params(labelsize=11)

plt.tight_layout()
plt.savefig('output.png', dpi=150, bbox_inches='tight', facecolor='white')
plt.close()
print("Plot saved: n_s-r plane with Planck/BICEP contours and model predictions,")
print("alongside V(phi) for all five inflationary models.")
print()
print("KEY RESULTS:")
print("="*60)
print(f"{'Model':<25} {'n_s (N=55)':<15} {'r (N=55)':<15} {'Status'}")
print("-"*60)
models = [
    ("m^2 phi^2", 1-2/55, 8/55, "Ruled out"),
    ("lambda phi^4", 1-3/55, 16/55, "Ruled out"),
    ("Starobinsky R^2", 1-2/55, 12/55**2, "Excellent fit"),
    ("alpha-attr (a=1)", 1-2/55, 12*1/55**2, "Excellent fit"),
    ("Higgs (large xi)", 1-2/55, 12/55**2, "Excellent fit"),
]
for name, ns, r, status in models:
    print(f"{name:<25} {ns:<15.4f} {r:<15.5f} {status}")
print("="*60)
print(f"Planck 2018:  n_s = 0.9649 +/- 0.0042")
print(f"BICEP/Keck:   r < 0.036 (95% CL)")
