Data

Multi-observer passive acoustic recordings, analyst annotations, and automated detections for Antarctic blue and fin whale calls, Casey Station 2019

Australian Antarctic Division
Aulich, M., Balcazar, N., Collins, K. and Miller, B.S. ; AULICH, MEGHAN ; BALCAZAR, NAYSA ; REEVE, KYM ; MILLER, BRIAN SETH
Publisher: Australian Antarctic Data Centre
Date: 2026
Type: dataset
Language: English
Subjects: oceans; biota; EARTH SCIENCE > BIOLOGICAL CLASSIFICATION > ANIMALS/VERTEBRATES > MAMMALS > CETACEANS > BALEEN WHALES; EARTH SCIENCE > OCEANS > OCEAN ACOUSTICS; EARTH SCIENCE > OCEANS > OCEAN ACOUSTICS > AMBIENT NOISE; ANIMAL DETECTION; PASSIVE ACOUSTIC MONITORING; BIOACOUSTICS; CAPTURE-RECAPTURE; DETECTION PROBABILITY; CALL DENSITY ESTIMATION; AUTOMATED DETECTION; ANNOTATED ACOUSTIC LIBRARY; ANTARCTIC BLUE WHALE; BLUE WHALE; LONG-TERM UNDERWATER ACOUSTIC RECORDING; FIN WHALE; DEEP LEARNING; HYDROPHONES; Passive Acoustic Recorder; MOORINGS; AMD/AU; CEOS; AMD; OCEAN > SOUTHERN OCEAN; OCEAN > SOUTHERN OCEAN > EAST ANTARCTIC CONTINENTAL SHELF; GEOGRAPHIC REGION > POLAR

Licence & Rights:

Open Licence
CC-BY

Attribution 4.0 International (CC BY 4.0)
https://creativecommons.org/licenses/by/4.0/legalcode

These data are publicly available for download from the provided URL.

No formal restrictions beyond the CCBY Attribution License apply. These data are under active use for ongoing analyses; we welcome collaboration and ask that you contact the data originator before beginning work that may duplicate or closely relate to analyses already underway.

This data set conforms to the CCBY Attribution License (http://creativecommons.org/licenses/by/4.0/).

Please follow instructions listed in the citation reference provided at http://data.aad.gov.au/aadc/metadata/citation.cfm?entry_id=AAS_4636_Common_Ground_Annotated_Acoustic_Dataset when using these data.

This metadata record is publicly available.

Access:

Other

Full description

This dataset supports a multi-observer capture-recapture analysis of passive acoustic detection of Antarctic blue whale (Balaenoptera musculus intermedia) ABZ-unit calls at Casey Station, East Antarctica, during 2019. ABZ-unit calls are low-frequency (~25–29 Hz) repeated tonal pulses forming part of the Antarctic blue whale's characteristic three-part song (A-, B-, and Z-units). The dataset is associated with the publication: Miller, B.S. et al. (in press). Common ground: efficient, consistent, observer-independent bioacoustic call density estimation with adjudicated ground truth and capture-recapture detection functions. Methods in Ecology and Evolution. doi:[to be assigned].

The dataset consists of three components:
Acoustic recordings. Approximately 200 hours of underwater acoustic recordings collected at Casey Station (63°48'S, 111°47'E) between 23 December 2018 and 13 December 2019, downsampled to 1000 Hz. These recordings are a subset of the full Casey 2019 dataset, which is available in the related archive (Miller, B.S., Milnes, M. and Whiteside, S. 2025. Long-term underwater acoustic recordings in the Southern Ocean 2013–2024, Ver. 9, Australian Antarctic Data Centre. doi:10.26179/fhsv-ft93). The subset was selected following the AAD Blue and Fin Annotated Acoustic Library (AAD-BAFAAL) protocol: approximately 2% of total recording effort, distributed approximately evenly throughout the year to be broadly representative of annual recording conditions. Recordings were made using an AAD Moored Acoustic Recorder (MAR) deployed at approximately 2700 m depth. The recorder was equipped with a hydrophone of sensitivity –165.9 dB re 1 V/µPa and an analogue front-end with frequency-dependent gain (see the accompanying metaDataCasey2019.m for full calibration). Recordings are in 16-bit WAV format with timestamps encoded in the filename (yyyy-mm-dd_HH-MM-SS.wav).
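
The filename convention above can be turned into absolute times with a few lines of Python. This is a minimal sketch, not part of the dataset: the function names and the example filename are hypothetical, and it assumes the encoded timestamp marks the start of each recording. The record does not state a time zone, so the result is left as a naive datetime.

```python
from datetime import datetime, timedelta
from pathlib import Path

# Pattern taken from the record: yyyy-mm-dd_HH-MM-SS.wav
FILENAME_FORMAT = "%Y-%m-%d_%H-%M-%S"


def recording_start(path):
    """Return the timestamp encoded in a recording's filename.

    Assumption: the timestamp is the start of the recording.
    """
    return datetime.strptime(Path(path).stem, FILENAME_FORMAT)


def event_time(path, offset_seconds):
    """Convert an offset within a file (seconds) to an absolute time."""
    return recording_start(path) + timedelta(seconds=offset_seconds)


# Hypothetical filename, used only to illustrate the pattern.
example_start = recording_start("2019-06-01_12-00-00.wav")
example_event = event_time("2019-06-01_12-00-00.wav", 90.5)
```

An annotation's begin time in seconds can be passed as offset_seconds to place it on the calendar.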

Raven Pro annotations. Time-frequency bounding box annotations made independently by three human analysts (Analysts 1–3, identities anonymised) using Raven Pro 1.5 (Cornell Lab of Ornithology, Ithaca NY). Annotations cover the following call types, consistent with the AAD-BAFAAL protocol:

Bm-Ant-A, Bm-Ant-B, Bm-Ant-Z — Antarctic blue whale song units (~25–100 Hz)
Bm-D — Antarctic blue whale D-calls (~50–90 Hz, short FM downsweeps)
Bp-20, Bp-20Plus — Antarctic fin whale 20 Hz song pulses
Bp-Downsweep — fin whale 40 Hz downsweep calls
Unidentified — unidentified low-frequency biological sounds
Unid-BlueOrFin — detections of low-frequency biological sounds identified as likely blue or fin whale but not distinguishable between the two species.

Annotations are provided as Raven Pro Selection Tables (tab-delimited text format), one file per analyst per call type, directly importable into Raven Pro.
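
Because the selection tables are tab-delimited text, they can also be read outside Raven Pro. The sketch below uses Python's standard csv module and assumes the files carry Raven's usual default column headers ("Begin Time (s)", "End Time (s)", "Low Freq (Hz)", "High Freq (Hz)"); the actual headers in these files should be checked before use. The two example rows are invented for illustration and are not taken from the dataset.

```python
import csv
import io


def read_selection_table(lines):
    """Parse tab-delimited selection-table lines into a list of dicts.

    Column names are assumed Raven Pro defaults, not confirmed
    against the files in this dataset.
    """
    reader = csv.DictReader(lines, delimiter="\t")
    rows = []
    for row in reader:
        rows.append({
            "begin_s": float(row["Begin Time (s)"]),
            "end_s": float(row["End Time (s)"]),
            "low_hz": float(row["Low Freq (Hz)"]),
            "high_hz": float(row["High Freq (Hz)"]),
        })
    return rows


# Invented example content, standing in for one selection-table file.
EXAMPLE_TABLE = (
    "Selection\tView\tChannel\tBegin Time (s)\tEnd Time (s)"
    "\tLow Freq (Hz)\tHigh Freq (Hz)\n"
    "1\tSpectrogram 1\t1\t12.5\t21.0\t25.0\t28.5\n"
    "2\tSpectrogram 1\t1\t80.0\t89.5\t25.5\t28.0\n"
)

selections = read_selection_table(io.StringIO(EXAMPLE_TABLE))
```

For a real file, an open file handle (opened with newline="") can be passed in place of the StringIO object.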

Multi-observer detection capture history. A comma-separated values (CSV) file consolidating detections from five independent observers: the three human analysts described above, an Ishmael spectral correlation coefficient (SCC) automated detector (Observer 4), and a Koogu deep neural network (DNN) automated detector (Observer 5). Observers 4 and 5 were applied to 250 Hz downsampled versions of the same recordings; their detections were matched to analyst annotations by temporal overlap as described in Miller et al. (in press). Each row in the capture history represents a candidate detection event. Columns record: detection flags for each observer; time and frequency bounds of each observer's annotation; an adjudicated true/false positive verdict (assigned by BSM) for a subset of rows; and per-observer signal-to-noise ratio (SNR) estimates computed using the spectrogramSlices method with noise windows placed symmetrically before and after each annotation. Full column descriptions are provided in the supplementary material of Miller et al. (in press). Note: the capture history covers ABZ-unit calls only, as these are the focus of Miller et al. (in press); annotations for all other call types are provided as Raven Pro Selection Tables only.
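
To illustrate what a capture history built by temporal overlap looks like, the following Python sketch flags, for each candidate event, which observers have a detection overlapping it in time. It is a generic illustration only: the actual matching procedure is the one described in Miller et al. (in press), and the "any overlap counts" rule, the function names, and the example intervals are all assumptions made here.

```python
def overlaps(a, b):
    """True if two (begin_s, end_s) intervals share any time."""
    return a[0] < b[1] and b[0] < a[1]


def capture_history(events, detections_by_observer):
    """Build one tuple of 0/1 detection flags per candidate event.

    Flags are ordered by sorted observer ID. Assumption: any amount
    of temporal overlap counts as a match.
    """
    observers = sorted(detections_by_observer)
    history = []
    for event in events:
        flags = tuple(
            int(any(overlaps(event, det) for det in detections_by_observer[obs]))
            for obs in observers
        )
        history.append(flags)
    return history


# Invented intervals in seconds; observer IDs follow the record's numbering.
events = [(10.0, 20.0), (50.0, 58.0)]
detections = {
    1: [(11.0, 19.0)],                # a human analyst
    2: [(12.0, 21.0), (49.0, 57.0)],  # a second human analyst
    4: [(55.0, 60.0)],                # an automated detector
}

history = capture_history(events, detections)
```

Each tuple corresponds to one row of detection flags; the published CSV additionally carries time and frequency bounds, adjudication verdicts, and SNR estimates.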

Lineage

Progress Code: completed
Statement: The acoustic recordings contain periods of elevated ambient noise from sea ice, wind, ship traffic, and Casey Station operations. These conditions affect detection probability and SNR estimates but are not removed from the dataset; the multi-observer capture-recapture design explicitly accounts for variable detection probability across observers and recording conditions. The 200-hour annotated subset was selected to be approximately representative of the full year's recording conditions but may not capture all noise conditions or call rate variability present in the complete dataset. Automated detectors (Observers 4 and 5) were applied to 250 Hz downsampled recordings; the temporal matching procedure used to align automated detections with analyst annotations is described in Miller et al. (in press). Analyst identities are anonymised with numeric IDs (1–3) consistent with the AAD-BAFAAL protocol. The Unid-BlueOrFin annotation category was inherited from the IWC-SORP Acoustic Trends Annotated Library protocol, where it was largely unused in practice. Usage of this category was improved in AAD-BAFAAL, though it should still be treated as an incomplete record; its absence in a given file does not reliably indicate absence of unidentifiable calls.

Notes

Purpose
To support reproducible capture-recapture analysis of detection probability and call density for Antarctic blue and fin whales using passive acoustic monitoring. The multi-observer design allows simultaneous estimation of per-observer detection functions and call density without requiring complete enumeration of all calls — analogous to mark-recapture methods in wildlife ecology, here applied to acoustic detections. The dataset enables independent replication of the analysis in Miller et al. (in press) and reuse for detector performance evaluation, passive acoustic monitoring methodology, and studies of Antarctic blue whale acoustic behaviour and distribution. The broader annotation set (blue whale D-calls, fin whale calls) is included for completeness and consistency with the AAD-BAFAAL protocol, and to support future studies of Antarctic blue and fin whale co-occurrence, call rate estimation, and detector development across multiple call types at this site.
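
The mark-recapture analogy can be made concrete with the classical two-sample Chapman estimator, N = (n1 + 1)(n2 + 1)/(m + 1) - 1, where n1 and n2 are the calls detected by each of two observers and m is the number detected by both. The Python sketch below is a textbook simplification, not the method of Miller et al. (in press), which fits detection functions across five observers. It assumes independent observers and equal detectability of all calls, and the counts are invented.

```python
def two_observer_counts(history):
    """Summarise (flag_1, flag_2) capture histories as (n1, n2, m)."""
    n1 = sum(a for a, _ in history)
    n2 = sum(b for _, b in history)
    m = sum(1 for a, b in history if a and b)
    return n1, n2, m


def chapman_estimate(n1, n2, m):
    """Chapman's bias-corrected Lincoln-Petersen estimate of total calls."""
    return (n1 + 1) * (n2 + 1) / (m + 1) - 1


# Invented capture histories: 48 calls found by both observers,
# 32 by observer 1 only, 12 by observer 2 only.
example_history = [(1, 1)] * 48 + [(1, 0)] * 32 + [(0, 1)] * 12

n1, n2, m = two_observer_counts(example_history)
estimated_calls = chapman_estimate(n1, n2, m)

# Naive per-observer detection probabilities under the same assumptions.
p_observer_1 = m / n2
p_observer_2 = m / n1
```

With these invented counts the estimate is close to 100 calls, more than the 92 distinct calls seen by either observer, which is the sense in which complete enumeration is not required.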

Data time period: 2018-12-23 to 2019-12-13

This dataset is part of a larger collection

111.7833,-63.8

text: westlimit=111.7833; southlimit=-63.8; eastlimit=111.7833; northlimit=-63.8

text: uplimit=2700; downlimit=

Other Information
Download the dataset. (GET DATA > DIRECT DOWNLOAD)

url : https://data.aad.gov.au/eds/6184/download

Public information for AAS project AAS_4636 (PROJECT HOME PAGE)

url : https://projects.aad.gov.au/report_project_public.cfm?project_no=AAS_4636

Citation reference for this metadata record and dataset. (VIEW RELATED INFORMATION)

url : https://data.aad.gov.au/aadc/metadata/citation.cfm?entry_id=AAS_4636_Common_Ground_Annotated_Acoustic_Dataset

Identifiers
  • global : AAS_4636_Common_Ground_Annotated_Acoustic_Dataset