Module 3: Echolocation & Sonar
Only two major mammalian lineages evolved biological sonar: bats and toothed whales (odontocetes). In odontocetes the system is considerably more powerful, emitting clicks at up to 230 dB re 1 μPa (sperm whale) — the loudest biological sound ever recorded — and reaching effective ranges of hundreds of meters for fish-sized prey and kilometers for large schools. The peak frequencies of 100–200 kHz give wavelengths of 1 cm or less in seawater, enabling discrimination of targets differing by 1 mm at 5 m range. This module derives the acoustic physics of click production, the melon acoustic lens, the sonar equation, and the exquisite signal-processing capabilities revealed by behavioral experiments on bottlenose dolphins.
1. Click Production: Phonic Lips and the Museau de Singe
For most of the 20th century the laryngeal origin of dolphin clicks was assumed by analogy with human phonation. Careful anatomical and behavioral work from the 1990s onward (Cranford, Amundin, Norris) established that odontocete sound production occurs in the phonic lips (also called dorsal bursae or “monkey lips” — museau de singe, for the ape-face appearance of the structure in sperm whale dissections). These are paired elastic structures embedded in the upper nasal passages.
Driven air flow from the lungs causes the phonic lips to open and close at a rate set by their elastic properties. Each closure generates a pressure pulse: the “click” itself. For toothed whales other than sperm whales, clicks are broadband with peak frequency ~100–200 kHz and duration ~30–70 µs. Sperm whales produce long (~10 ms) multi-pulse clicks where the initial phonic-lip pulse reflects repeatedly off distal and frontal air sacs to form the famous “inter-pulse interval” signature used to estimate body length via the bent-horn model.
1.1 Source Level and Radiated Power
The source level (SL) is the on-axis sound pressure level at a nominal reference distance of 1 m. For a bottlenose dolphin, SL ≈ 220 dB re 1 μPa. For a sperm whale, SL reaches 230 dB re 1 μPa (broadside) or even 236 dB on-axis (Møhl et al. 2003). The radiated intensity is:
\[ I = \frac{p^2}{\rho c},\quad p = p_{ref}\cdot 10^{SL/20} \]
For SL = 230 dB, \(p = 10^{-6}\cdot 10^{11.5} = 3.16\times10^{5}\,\text{Pa}\)— over 3 atmospheres of peak acoustic pressure. Intensity:\(I = (3.16\times10^5)^2/(1025\cdot 1500) \approx 6.5\times10^4\,\text{W/m}^2\). Integrated over the ~5° beamwidth, the sperm whale radiates roughly 25 kW of peak acoustic power per click — more than a large pop-concert loudspeaker.
2. The Melon: Biological Acoustic Lens
The melon — encountered already in Module 0 — occupies the forehead of toothed whales and functions as an acoustic lens. Its graded lipid composition produces a smoothly varying sound speed \(c(\mathbf{x})\) from ~1310 m/s at the center to ~1500 m/s at the edges, setting up a refractive-index profile that focuses sound from the phonic lips into a narrow forward beam.
2.1 Eikonal Ray Equation
In the high-frequency (ray optics) limit, sound propagation is governed by the eikonal equation:
\[ |\nabla S|^2 = n^2(\mathbf{x}) = (c_0/c(\mathbf{x}))^2 \]
where \(S(\mathbf{x})\) is the phase (travel-time) surface and \(n(\mathbf{x})\) is the acoustic index of refraction.
The ray trajectory \(\mathbf{r}(s)\) satisfies:
\[ \frac{d}{ds}\left(n\,\frac{d\mathbf{r}}{ds}\right) = \nabla n \]
Rays bend toward regions of higher n (lower c), i.e. toward the melon center.
With the center of the melon having the lowest sound speed, rays emitted from the phonic lips are bent toward the optical axis. The melon acts as a positive (converging) lens. If the composition is tuned carefully, the lens is nearly aplanatic: all rays from the phonic lips emerge parallel (a collimated beam) rather than diverging.
2.2 Sperm Whale Bent-Horn Model
The sperm whale's spermaceti organ complicates the picture. Sound is generated at the museau de singe at the front of the head and directed backward through the spermaceti organ (the low-density oil-filled component), bouncing off a frontal air sac and then forward through the junk (denser, segmented organ below spermaceti) out the front of the head. The multiple internal reflections both shape the temporal structure of the click (the characteristic multi-pulse form) and amplify the on-axis intensity. The inter-pulse interval (IPI) between the first and second pulses of a click is proportional to twice the spermaceti organ length:
\[ \text{IPI} = \frac{2 L_{spermaceti}}{c_{oil}} \]
With \(c_{oil} \approx 1370\,\text{m/s}\), an IPI of 5 ms corresponds to \(L \approx 3.4\,\text{m}\).
Whaling records correlate spermaceti organ length with whole body length (~23%), so passively recording clicks allows acoustic “measurement” of sperm whales in the wild without capture.
3. The Sonar Equation
Echolocation performance is captured by the passive sonar equation, borrowed directly from ASW (anti-submarine warfare) acoustics:
\[ \text{EL} = \text{SL} - 2\,\text{TL} + \text{TS} \]
with detection requiring EL > NL + DT (noise level + detection threshold)
Terms: EL echo level at receiver (dB), SL source level (dB re 1 μPa @ 1 m), TL one-way transmission loss (dB), TStarget strength (dB). The factor of 2 on TL accounts for the round trip.
3.1 Transmission Loss
Transmission loss has two components: geometric spreading and absorption:
\[ \text{TL}(r) = 20 \log_{10}(r/r_0) + \alpha(f)\,r \]
Spherical spreading: 20 dB per decade. Absorption \(\alpha(f)\) rises sharply with frequency.
Seawater absorption is dominated by two relaxation processes (boric acid at ~1 kHz and magnesium sulfate at ~100 kHz) plus viscous bulk absorption at very high frequency. Typical values:
- \(f = 20\,\text{Hz}\) (blue whale): \(\alpha \approx 1\,\text{dB/1000 km}\)
- \(f = 2\,\text{kHz}\): \(\alpha \approx 0.1\,\text{dB/km}\)
- \(f = 20\,\text{kHz}\) (sperm whale): \(\alpha \approx 3\,\text{dB/km}\)
- \(f = 120\,\text{kHz}\) (bottlenose): \(\alpha \approx 40\,\text{dB/km}\)
Thus high-frequency dolphin clicks are limited to ~100 m effective range (the absorption alone accounts for 4 dB at that distance), while low-frequency whale calls can propagate for thousands of kilometers. This constraint shapes the acoustic behavior of each species: dolphin sonar is short-range & high resolution; large-whale vocalizations are long-range & low resolution.
3.2 Target Strength
The target strength quantifies how much of the incident sound a target reflects back toward the source:
\[ \text{TS} = 10 \log_{10}\!\left(\frac{\sigma_{bs}}{4\pi\,r_0^2}\right) \]
\(\sigma_{bs}\): backscattering cross section (m2).
For fish with gas-filled swimbladders, TS at the swimbladder resonance frequency can exceed −25 dB; at other frequencies TS is dominated by flesh impedance contrast (low). Cephalopods (squid, octopus) lack gas inclusions and have tissue impedance nearly matched to seawater — they are acoustically “cryptic,” with TS often below −55 dB. Despite this, sperm whales subsist almost entirely on deep-sea squid, demonstrating extraordinary acoustic discrimination.
4. Click Trains, ICI, and the Buzz Phase
A foraging odontocete does not emit a continuous acoustic stream but rather discrete clicks separated by an inter-click interval (ICI). The ICI is constrained by the round-trip time to the most distant echo:
\[ \text{ICI} \gtrsim \frac{2 d_{target}}{c} \]
ensuring each echo returns before the next click is emitted.
At \(d = 100\,\text{m}\), the round-trip time is \(2\cdot 100/1500 = 133\,\text{ms}\): the animal can emit ~7 clicks/s. As the whale approaches its prey, ICI shrinks proportionally. In the final stage of a capture, the toothed whale transitions from ordinary clicks to a very rapid sequence called the buzz phase, with ICIs of 2 ms or less (corresponding to distances of ~1.5 m). The buzz is an acoustic analog of the rapid fire-control burst a sonar would use on a closing target.
4.1 Discrimination Limits
Bottlenose dolphin psychophysical experiments (Au 1993, Branstetter et al. 2007) show remarkable discrimination:
- Distinguish solid vs hollow targets differing by 1-mm wall thickness at 5 m range
- Discriminate cylinders of the same size but differing materials (Al vs PVC vs steel) — apparently using material resonance signatures
- Classify targets through sediment, clay, or water-bottom clutter — target strength discrimination of ~1 dB
- Identify individual objects in dense fish schools
4.2 Signature Whistles
Beyond echolocation, dolphins produce tonal whistles in the 1–25 kHz band. The signature whistle, described by Caldwell & Caldwell (1965), is a frequency-modulated pattern unique to each individual — functionally an acoustic “name.” Mother dolphins appear to teach their calves signature whistles, and dolphins use each other's signature whistles to address specific individuals (King & Janik 2013). This is one of the clearest demonstrations of referential vocal learning outside humans.
5. Hearing: The Auditory Pathway
The cetacean auditory system bears little resemblance to that of a terrestrial mammal. There is no pinna (external ear); the external auditory meatus is reduced to a tiny canal filled with wax. Sound reaches the inner ear via a novel pathway: entering through the lower jaw, propagating through a fat-filled channel along the mandible, and coupling into the middle ear bones. This “acoustic fat” has impedance closely matched to seawater, serving as an efficient sound conductor from the exterior into the acoustically isolated ear bones.
5.1 Tympano-Periotic Complex
The middle and inner ears of cetaceans sit in a bony capsule (the tympano-periotic complex) that is isolated from the rest of the skull by air-filled sinuses and a ligamentous suspension. This isolation is crucial for directional hearing: without it, sound arriving through the water-filled skull would reach both ears almost simultaneously, destroying the interaural time difference cues that mammals normally use for sound localization. By suspending each ear capsule in its own acoustic enclosure, cetaceans preserve binaural directional hearing at frequencies up to 150 kHz.
5.2 Ultrasonic Hearing Range
Audiograms for bottlenose dolphins show a hearing range from ~75 Hz to ~150 kHz, with peak sensitivity at 40–80 kHz. Humans hear up to ~20 kHz; dogs and cats reach ~50–60 kHz; only bats (to ~200 kHz) and toothed whales approach these upper limits. Low-frequency hearing is less developed; this limits cetacean sensitivity to many anthropogenic noise sources in the 100–1000 Hz shipping band.
5.3 Interaural Time and Intensity Differences
With the two ears separated by ~20 cm on a bottlenose dolphin, interaural time differences (ITD) for sounds arriving from the side are:
\[ \Delta t_{ITD} = \frac{d_{ears}}{c} \approx \frac{0.2}{1500} \approx 130\,\mu\text{s} \]
At 120 kHz (echo frequency), the period is 8 µs; ITD resolution finer than this has been demonstrated behaviorally, indicating that dolphins exploit phase differences at very fine temporal resolution, much as humans do at lower frequencies. Interaural intensity differences (IID) from head-shadow effects are also used; combined, these provide angular resolution of ~2°.
6. Convergent Evolution with Bats
Bats (order Chiroptera) are the other major mammalian lineage that evolved biological sonar. Despite diverging from cetaceans roughly 85 Mya at the base of the Laurasiatheria, the two groups show extraordinary convergence in their echolocation systems:
- Both use high-frequency, short-duration broadband clicks or chirps
- Both exhibit “buzz phases” with shrinking inter-pulse intervals during prey capture
- Both have modified middle ear ossicles and isolated inner ear capsules
- Both possess specialized laryngeal (bats) or nasal (whales) structures for click production
- Both use target strength and spectral content to identify prey
Remarkably, molecular studies by Liu et al. (2010) demonstrated that the gene Prestin — encoding the motor protein driving outer hair cell motility that determines cochlear frequency selectivity — shows near-identical amino acid substitutions in bats and toothed whales. This is one of the clearest documented cases of molecular convergence in mammalian evolution: echolocation selected for the same biochemistry independently in two lineages separated by 85 Myr.
5. Echolocation System Diagram
6. Simulation: Sonar Physics and Melon Ray-Tracing
This simulation (i) computes echo level vs range from the sonar equation for bottlenose and sperm whales including absorption; (ii) plots inter-click interval vs target range and identifies the buzz-phase regime; (iii) lists target strengths of representative cetacean prey; and (iv) traces acoustic rays through a simple axisymmetric graded-index melon model, showing how the graded lipid profile focuses sound into a forward beam.
Click Run to execute the Python code
Code will be executed with Python 3 on the server
Key Observations
- Panel 1: At 120 kHz, absorption kills bottlenose signal beyond ~150 m; sperm whale's 20 kHz clicks retain signal past 1 km.
- Panel 2: ICI shrinks linearly with target distance. Buzz phase (ICI < 2 ms) corresponds to prey within ~1.5 m.
- Panel 3: Squid have very low TS (no swimbladder); sperm whales still detect them reliably via exquisite SNR.
- Panel 4: The graded-index melon bends diverging rays from the phonic lips into a collimated forward beam — a biological Luneburg lens.
Module Summary
Click Generation
Phonic lips (museau de singe) driven by pressurized air from lungs; not laryngeal
Peak Frequencies
Bottlenose 100–200 kHz; sperm whale broadband 5–25 kHz; λ ≈ 1–25 cm in seawater
Source Levels
Bottlenose 220 dB; sperm whale up to 236 dB (loudest biological sound on Earth)
Melon Lens
Graded lipid (1310–1500 m/s) focuses sound into forward beam — biological Luneburg lens
Sonar Equation
EL = SL – 2TL + TS; TL = 20 log(r) + α(f)·r; detection when EL > NL + DT
Absorption
Rises sharply with f: 120 kHz → 40 dB/km; 20 Hz → 1 dB/1000 km
ICI & Buzz
ICI = 2d/c; buzz phase at ICI ≤ 2 ms during final prey pursuit
Signature Whistles
Individual acoustic “names”; vocally learned — referential labeling outside humans
References
- Au, W.W.L. (1993). The Sonar of Dolphins. Springer.
- Au, W.W.L. & Hastings, M.C. (2008). Principles of Marine Bioacoustics. Springer.
- Cranford, T.W., Amundin, M. & Norris, K.S. (1996). Functional morphology and homology in the odontocete nasal complex. Journal of Morphology, 228(3), 223–285.
- Norris, K.S. & Harvey, G.W. (1972). A theory for the function of the spermaceti organ of the sperm whale. NASA Special Publication 262, 397–417.
- Møhl, B., Wahlberg, M., Madsen, P.T., Heerfordt, A. & Lund, A. (2003). The monopulsed nature of sperm whale clicks. JASA, 114(2), 1143–1154.
- Madsen, P.T. & Surlykke, A. (2013). Functional convergence in bat and toothed whale biosonars. Physiology, 28(5), 276–283.
- Branstetter, B.K. & Finneran, J.J. (2008). Comodulation masking release in bottlenose dolphins. JASA, 124(1), 625–633.
- Caldwell, M.C. & Caldwell, D.K. (1965). Individualized whistle contours in bottle-nosed dolphins. Nature, 207, 434–435.
- King, S.L. & Janik, V.M. (2013). Bottlenose dolphins can use learned vocal labels to address each other. PNAS, 110(32), 13216–13221.
- Madsen, P.T. et al. (2002). Sperm whale sound production studied with ultrasound time/depth-recording tags. JEB, 205, 1899–1906.