Module 7: Song & Acoustic Biophysics
Birdsong is among the most complex acoustic signals produced by any animal. The syrinx โ unique to birds โ can generate two simultaneous, independently modulated voices. Understanding its biophysics requires fluid dynamics, wave mechanics, and neuroscience. This module derives the physics of sound production, propagation, and the neural circuits for vocal learning.
1. The Syrinx: Two-Voice Sound Source
1.1 Anatomy & Position
Unlike the mammalian larynx (at the top of the trachea), the avian syrinx is located at the tracheo-bronchial junction โ where the trachea bifurcates into two bronchi. Most songbirds have a tracheobronchial syrinx with intrinsic musculature in both bronchial branches, enabling independent bilateral control of the two sound sources.
Key structural elements (using oscine/songbird terminology):
- MTMMedial tympaniform membranes โ thin, compliant membranes on the medial wall of each bronchus. The primary vibrating element. Thickness โผ5โ20 ฮผm; area โผ1โ5 mm\(^2\).
- MLMedial labia โ paired fleshy lips adjacent to MTM in some species. Co-vibrate with MTM.
- LLLateral labia โ lateral valve controlling airflow. Tension affects both amplitude and frequency.
- RingsTracheal + bronchial rings โ cartilaginous support. The pessulus is a central cartilage at the tracheo-bronchial junction.
- MusclesSyringeal muscles (6โ9 pairs in oscines) โ nXIIts (hypoglossal nerve) innervation. Control membrane tension, glottal aperture, and labial position.
1.2 Sound Production: Bernoulli & Membrane Vibration
Air flow from the air sacs through the bronchi past the MTM/labia follows the Bernoulli principle: in the narrow channel between the membrane and the bronchial wall, flow velocity increases and pressure decreases (the Bernoulli effect), drawing the membranes inward. The restoring elastic force then pushes them back outward, establishing oscillation.
The fundamental frequency of the membrane oscillation is determined by its tension \(T\), area density \(\rho_s\) (mass per unit area), and effective length scale \(L\). For a thin membrane under tension, the resonant frequency:
This is analogous to a string under tension, but in 2D. \(T\) is controlled by syringeal muscle activity; \(\rho_s\) is set by membrane thickness and material properties (elastic modulus \(E \sim 10\text{--}100\) kPa for biological membranes).
Frequency modulation during song occurs on timescales of milliseconds. The rate of frequency change depends on muscle force: for a finch producing a frequency sweep from 3โ8 kHz over 50 ms:
The required rate of tension change is \(dT/dt \approx 2T f_0^{-1} \times 10^5\) โ achievable by fast-twitch syringeal muscles with contraction times โผ5โ10 ms.
1.3 Dual Voice: Two Independent Sources
The most striking capability of the oscine syrinx is the production of two simultaneous, harmonically unrelated frequencies. Wood thrushes (Hylocichla mustelina) produce complex chords by using each bronchial half of the syrinx independently. The two sides can operate:
- โ At different fundamental frequencies (independent pitch control)
- โ With different amplitude envelopes (independent volume)
- โ With independent spectral profiles (different harmonic content per side)
- โ With one side producing while the other is silent (unilateral phonation)
Nerve cutting experiments (severing the right or left n.XII branch) silence the corresponding side, confirming independent neural control. The left side tends to dominate in many species (left-hemisphere lateralization for song motor control โ analogous to language lateralization in humans).
Figure 1: Syrinx Cross-Section Diagram
2. Source-Filter Theory & Tracheal Resonance
2.1 Source: Syrinx Harmonic Series
The vibrating membrane generates a harmonic series: the fundamental frequency \(f_0\) plus overtones at \(2f_0, 3f_0, \ldots, nf_0\). For a purely sinusoidal (single-frequency) vibration, only \(f_0\) is produced. The nonlinearity of membrane vibration under Bernoulli forcing generates harmonic distortion, rich in higher partials. The spectrum of the source signal:
Amplitude envelope \(a_n\) typically falls off as \(a_n \propto n^{-\alpha}\)(spectral roll-off), with \(\alpha \approx 1\text{--}2\) for voiced sounds.
2.2 Filter: Tracheal Tube Resonances (Formants)
The trachea acts as a quarter-wave resonator: open at the beak and approximately closed at the syrinx. Resonance condition:
\(c_{\text{sound}} \approx 350\) m/s at 37ยฐC (inside trachea, warm humid air),\(L\) = tracheal length, \(n = 1, 2, 3, \ldots\) gives formants at odd multiples. For a zebra finch (\(L \approx 15\) mm):\(f_1 = 350/(4 \times 0.015) \approx 5833\) Hz โ in the middle of their song range (2โ9 kHz).
The beak acts as a radiation filter. A closed beak attenuates high frequencies (low-pass), while an open beak emphasizes higher harmonics. Dynamic beak opening during song modifies the effective tube length and hence formant positions:
Air sacs (connected to the syrinx) add parallel resonant cavities, further shaping the spectral output. The complete filter transfer function \(H(f)\) is the product of tracheal, beak, and air sac transfer functions. The output song spectrum:\(P_{\text{out}}(f) = |H(f)|^2 \cdot P_{\text{source}}(f)\).
Figure 2: Schematic Spectrogram of Bird Song
3. Vocal Learning: Neural Circuits & FOXP2
Vocal learning โ acquiring vocalizations through auditory experience and practice โ is taxonomically rare: among birds it is found only in three independent lineages (oscine passerines/songbirds, parrots, hummingbirds), plus cetaceans, bats, elephants, seals, and humans among mammals. This convergent evolution suggests strong selective pressure and shared neurobiological mechanisms.
3.1 Song System Neural Circuits
Motor Pathway (Song Production)
Anterior Forebrain Pathway (AFP, Learning)
During sensory phase (โผ10โ65 days post-hatch in zebra finch), the juvenile memorizes the tutor song into an auditory template in HVC and LMAN. During sensorimotor phase (โผ25โ90 dpn), the bird practices singing, comparing its output to the template via error-correcting plasticity in the AFP circuit (reinforcement learning). Dopaminergic projections from VTA to Area X signal reward and guide synaptic modification (BDNF/NTRK2-dependent LTP in RA).
3.2 FOXP2: The Language Gene
FOXP2 (Forkhead Box P2) is a transcription factor containing a forkhead DNA-binding domain. It is expressed in HVC and Area X in songbirds, and its expression level changes seasonally (higher during song learning periods). Key findings:
- โKnockdown of FOXP2 in Area X of zebra finch during sensorimotor learning produces abnormal, inaccurate song that fails to match the tutor template.
- โThe human FOXP2 sequence differs from mouse at only 3 amino acids, but two of these are specific to the human lineage and arose after the human-chimpanzee split (โผ6 Mya).
- โMutations in human FOXP2 cause a severe speech and language disorder (verbal dyspraxia), establishing it as critical for the fine oral-motor control underlying speech.
- โFOXP2 regulates downstream targets including CNTNAP2 (axon guidance), SLIT1 (neuronal migration), and MAP1B (synaptic plasticity) in both birds and humans.
- โHumanized mice (carrying human FOXP2 amino acid substitutions) show altered ultrasonic vocalizations and enhanced basal ganglia synaptic plasticity โ suggesting FOXP2 drove changes in motor learning circuitry.
4. Python: FM Synthesis, Spectrogram & Tracheal Filtering
Generating synthetic bird song using frequency modulation (FM) synthesis, computing a spectrogram, and demonstrating tracheal formant filtering on the harmonic source spectrum.
Click Run to execute the Python code
Code will be executed with Python 3 on the server
References
- Suthers, R. A. (2004). How birds sing and why it matters. In Nature's Music: The Science of Birdsong (ed. P. Marler & H. Slabbekoorn), pp. 272โ295. Academic Press.
- Elemans, C. P. H. et al. (2015). Universal mechanisms of sound production and control in birds and mammals. Nature Communications, 6, 8978.
- Goller, F. & Larsen, O. N. (1997). A new mechanism of sound generation in songbirds. Proceedings of the National Academy of Sciences, 94, 14787โ14791.
- Nowicki, S. & Searcy, W. A. (2014). The evolution of vocal learning. Current Opinion in Neurobiology, 28, 48โ53.
- Fee, M. S. & Scharff, C. (2010). The songbird as a model for the generation and learning of complex sequential behaviors. ILAR Journal, 51, 362โ377.
- Catchpole, C. K. & Slater, P. J. B. (2008). Bird Song: Biological Themes and Variations, 2nd ed. Cambridge University Press.
- Gill, F. B. (2007). Ornithology, 3rd ed. W. H. Freeman.