courseshub.world ยท Ecoacoustics ยท Module 06

Passive Acoustic Monitoring

The instrumentation, deployment design, data-management pipeline, and machine-learning species classifiers that make landscape-scale audio biodiversity surveys possible.

A grid of autonomous recorders sampling a landscape continuouslyโ†’ TB of audio per month โ†’ indices, embeddings, occupancy models

Passive acoustic monitoring (PAM) is the practice of deploying autonomous recorders to capture sound continuously over weeks or years โ€” turning biodiversity into a streamable time series. AudioMoth, SongMeter, and Wildlife Acoustics SM4 are the dominant hardware; BirdNET and Perch are the leading deep-learning classifiers; Arbimon, ARBIMON-PAM and Rainforest Connection provide cloud infrastructure. The discipline is moving rapidly from boutique fieldwork to global infrastructure.

Module 06

Passive Acoustic Monitoring

Passive Acoustic Monitoring (PAM) is the deployment of autonomous recording units (ARUs) to capture soundscapes continuously, without human presence.

DeviceFrequencyFeatureUse
Song Meter SM420 Hz โ€“ 192 kHzIndustry standardTerrestrial multi-taxa
AudioMoth8 Hz โ€“ 384 kHzOpen-source, ยฃ40Mass deployment, bats
Swift20 Hz โ€“ 48 kHzLong autonomyRemote sites, birds
AMAR-G41 Hz โ€“ 180 kHzMarine hydrophoneCetaceans, fish
HARP10 Hz โ€“ 100 kHzSeafloor deploymentDeep-sea, whales

Machine Learning for Soundscape Analysis

CNN architectures applied to mel-spectrograms: BirdNET (Cornell, 6000+ species), Perch (Google), AVES (self-supervised). Unsupervised methods include NMF:

Non-negative Matrix Factorization
\[ \mathbf{V} \approx \mathbf{W}\mathbf{H}, \quad \mathbf{W},\mathbf{H} \geq 0 \]

$\mathbf{V}$ spectrogram, $\mathbf{W}$ spectral basis (acoustic atoms), $\mathbf{H}$ temporal activations. Enables discovery of novel sound types.

The end-to-end pipeline

  1. Field deployment: choose number, spacing, schedule of recorders to balance area coverage against per-site density.
  2. Data offload: SD card swap, cellular telemetry, or LoRa-mesh. Bandwidth quickly becomes the bottleneck.
  3. Pre-processing: noise reduction, segmentation into vocalisation events, normalisation.
  4. Classification: deep-learning models identify species per snippet with class-conditional posteriors.
  5. Ecological inference: occupancy models (Royle-Nichols, MacKenzie et al.) translate detections into species presence under imperfect observation.
Share:XRedditLinkedIn