Module 1 Β· Weeks 2β3 Β· Undergraduate β Graduate
Acoustics & Fourier Analysis
Sound is vibration; vibration is governed by differential equations; and differential equations are solved by decomposing functions into sinusoidal components. This is the Fourier programme, and it connects the physics of a vibrating string directly to the mathematics of Hilbert spaces.
1. The Vibrating String
Consider a string of length \(L\), fixed at both ends, with linear mass density \(\mu\) under tension \(T\). The transverse displacement \(u(x,t)\) satisfies the one-dimensional wave equation:
with boundary conditions \(u(0,t) = u(L,t) = 0\) (fixed ends) and initial conditions \(u(x,0) = f(x)\), \(\dot{u}(x,0) = g(x)\).
D'Alembert's Solution
On the infinite line, the general solution takes the elegant form:
This reveals that solutions are superpositions of left-moving and right-moving waves travelling at speed \(c\). On the finite string with fixed ends, reflections create standing waves β which leads us to the eigenvalue problem.
2. The Eigenvalue Problem
Separation of variables \(u(x,t) = X(x)T(t)\) yields the SturmβLiouville eigenvalue problem:
Normal Mode Frequencies
Each eigenfunction oscillates at its own natural frequency:
The frequencies form an arithmetic progression: \(f_1, 2f_1, 3f_1, \ldots\)This is precisely the harmonic series from Module 0. The eigenvalue problem thus provides the mathematical explanation for why vibrating strings produce harmonics.
Physical interpretation
The \(n\)-th mode has \(n-1\) nodes (stationary points) along the string. The fundamental (\(n=1\)) has no internal nodes; the second harmonic (\(n=2\)) has one node at the midpoint, etc. A real string vibrates as a superposition of all these modes, with amplitudes determined by the initial pluck shape.
3. Fourier Series
The normal modes \(\{\sin(n\pi x/L)\}\) form an orthogonal basis for \(L^2([0,L])\). Any square-integrable function on \([0,L]\)can be expanded:
More generally, Fourier series on \([-\pi, \pi]\) use both sines and cosines:
Parseval's Identity
Energy is conserved in the Fourier domain:
In acoustic terms: the total power of a sound equals the sum of the powers of its individual harmonics. This is why the amplitudes \(a_n, b_n\) (or equivalently, the power spectrum \(|c_n|^2\)) completely characterize the timbre.
Gibbs Phenomenon
At jump discontinuities, partial Fourier sums overshoot by approximately 9% of the jump β no matter how many terms are included. This Gibbs phenomenon is audible as a βringingβ artifact when synthesizing square waves with finitely many harmonics. The overshoot converges to \(\frac{1}{\pi}\int_0^{\pi}\frac{\sin t}{t}\,dt - \frac{1}{2} \approx 0.0895\), or about 8.95% of the jump height.
Simulation: Fourier Series Convergence & Gibbs Phenomenon
The simulation below demonstrates square-wave and sawtooth-wave partial-sum convergence, the Gibbs overshoot at discontinuities, Parseval energy convergence, and the power-spectrum envelope for a square wave.
Click Run to execute the Python code
Code will be executed with Python 3 on the server
4. Fourier Transform & the Spectrogram
For non-periodic signals, the Fourier series generalizes to the Fourier transform:
Short-Time Fourier Transform (STFT)
Music changes in time, so we need time-frequency analysis. The STFT applies the Fourier transform through a sliding window function \(w(t)\):
The squared modulus \(|\text{STFT}(\tau,\xi)|^2\) is the spectrogram β the standard visual representation of music.
HeisenbergβGabor Uncertainty
You cannot simultaneously localize a signal precisely in time and in frequency. A short window gives good time resolution but poor frequency resolution, and vice versa. This is the time-frequency uncertainty principle, the signal-processing analogue of Heisenberg's uncertainty in quantum mechanics. The Gaussian window (Gabor atom) achieves the theoretical minimum product.
5. Interactive Fourier Synthesizer
Build a timbre from its harmonic components. Each slider controls the amplitude of the\(n\)-th harmonic (\(n \cdot f_0\) Hz). The waveform is \(y(t) = \sum_{n=1}^{8} a_n \sin(2\pi n f_0 t)\). Try the presets to hear how different amplitude profiles create familiar timbres.
Interactive Fourier Synthesizer
Adjust harmonic amplitudes and click Play. The waveform updates in real time via the Web Audio API analyser node.
Why these presets?
- Square wave: only odd harmonics, amplitudes \(1/n\)
- Sawtooth: all harmonics, amplitudes \(1/n\)
- Triangle: odd harmonics only, amplitudes \(1/n^2\)
- Clarinet: predominantly odd harmonics (cylindrical bore resonance)
- Cello: rich spectrum with all harmonics, slowly decaying
6. Psychoacoustics
The ear is not a perfect Fourier analyser. The cochlea performs a mechanical frequency decomposition via the basilar membrane, but with finite resolution determined by critical bands.
Place Theory
Different frequencies excite different positions along the basilar membrane. High frequencies resonate near the base (stiff end); low frequencies near the apex (flexible end). Each position responds to a band of frequencies approximately 1/3 octave wide β these are the critical bands (Bark scale).
Helmholtz Consonance Theory
Hermann von Helmholtz (1863) proposed that dissonance arises when partials of two tones fall within the same critical band, producing audible beating. Consonant intervals have partials that either coincide or are well-separated on the basilar membrane.
The PlompβLevelt curve
Plomp and Levelt (1965) measured the perceived dissonance of two pure tones as a function of their frequency separation. Maximum dissonance occurs at about 25% of the critical bandwidth; negligible dissonance occurs beyond one critical bandwidth. For complex tones, total dissonance is the sum over all partial pairs β reproducing the traditional ranking of intervals.
Modern theories (Terhardt, Parncutt) add virtual pitch: the auditory system infers a missing fundamental from its harmonics, explaining why a phone speaker with no bass output still conveys the pitch of a low cello note. This pattern-matching mechanism is distinct from place coding and involves central neural processing.