1
|
Gruden P, Jang J, Kügler A, Kropfreiter T, Tenorio-Hallé L, Lammers MO, Thode A, Meyer F. Automating multi-target tracking of singing humpback whales recorded with vector sensors. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 154:2579-2593. [PMID: 37874222 DOI: 10.1121/10.0021972] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 10/05/2023] [Indexed: 10/25/2023]
Abstract
Passive acoustic monitoring is widely used for detection and localization of marine mammals. Typically, pressure sensors are used, although several studies utilized acoustic vector sensors (AVSs), that measure acoustic pressure and particle velocity and can estimate azimuths to acoustic sources. The AVSs can localize sources using a reduced number of sensors and do not require precise time synchronization between sensors. However, when multiple animals are calling concurrently, automated tracking of individual sources still poses a challenge, and manual methods are typically employed to link together sequences of measurements from a given source. This paper extends the method previously reported by Tenorio-Hallé, Thode, Lammers, Conrad, and Kim [J. Acoust. Soc. Am. 151(1), 126-137 (2022)] by employing and comparing two fully-automated approaches for azimuthal tracking based on the AVS data. One approach is based on random finite set statistics and the other on message passing algorithms, but both approaches utilize the underlying Bayesian statistical framework. The proposed methods are tested on several days of AVS data obtained off the coast of Maui and results show that both approaches successfully and efficiently track multiple singing humpback whales. The proposed methods thus made it possible to develop a fully-automated AVS tracking approach applicable to all species of baleen whales.
Collapse
Affiliation(s)
- Pina Gruden
- Cooperative Institute for Marine and Atmospheric Research, Research Corporation of the University of Hawai'i, Honolulu, Hawaii 96822, USA
| | - Junsu Jang
- Marine Physical Laboratory, Scripps Institution of Oceanography, University of California San Diego, La Jolla, California 92093, USA
| | - Anke Kügler
- Marine Biology Graduate Program, University of Hawai'i at Mānoa, Honolulu, Hawaii 96822, USA
- Bioacoustics and Behavioral Ecology Lab, Syracuse University, Syracuse, New York 13244, USA
| | - Thomas Kropfreiter
- Marine Physical Laboratory, Scripps Institution of Oceanography, University of California San Diego, La Jolla, California 92093, USA
| | - Ludovic Tenorio-Hallé
- Marine Physical Laboratory, Scripps Institution of Oceanography, University of California San Diego, La Jolla, California 92093, USA
| | - Marc O Lammers
- Hawaiian Islands Humpback Whale National Marine Sanctuary, Kihei, Hawaii 96753, USA
| | - Aaron Thode
- Marine Physical Laboratory, Scripps Institution of Oceanography, University of California San Diego, La Jolla, California 92093, USA
| | - Florian Meyer
- Marine Physical Laboratory, Scripps Institution of Oceanography, University of California San Diego, La Jolla, California 92093, USA
| |
Collapse
|
2
|
Pu W, Liu S, Qing X, Qiao G, Mazhar S, Ma T. Automated extraction of baleen whale calls based on the pseudo-Wigner-Ville distribution. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 153:1564. [PMID: 37002084 DOI: 10.1121/10.0017457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 02/14/2023] [Indexed: 05/18/2023]
Abstract
Baleen whales produce a wide variety of frequency-modulated calls. Extraction of the time-frequency (TF) structures of these calls forms the basis for many applications, including abundance estimation and species recognition. Typical methods to extract the contours of whale calls from a spectrogram are based on the short-time Fourier transform and are, thus, restricted by a fixed TF resolution. Considering the low-frequency nature of baleen whale calls, this work represents the contours using a pseudo-Wigner-Ville distribution for a higher TF resolution at the cost of introducing cross terms. An adaptive threshold is proposed followed by a modified Gaussian mixture probability hypothesis density filter to extract the contours. Finally, the artificial contours, which are caused by the cross terms, can be removed in post-processing. Simulations were conducted to explore how the signal-to-noise ratio influences the performance of the proposed method. Then, in experiments based on real data, the contours of the calls of three kinds of baleen whales were extracted in a highly accurate manner (with mean deviations of 5.4 and 0.051 Hz from the ground-truth contours at sampling rates of 4000 and 100 Hz, respectively) with a recall of 75% and a precision of 78.5%.
Collapse
Affiliation(s)
- Wangyi Pu
- Acoustic Science and Technology Laboratory, Harbin Engineering University, Harbin 150001, China
| | - Songzuo Liu
- Acoustic Science and Technology Laboratory, Harbin Engineering University, Harbin 150001, China
| | - Xin Qing
- Acoustic Science and Technology Laboratory, Harbin Engineering University, Harbin 150001, China
| | - Gang Qiao
- Acoustic Science and Technology Laboratory, Harbin Engineering University, Harbin 150001, China
| | - Suleman Mazhar
- Acoustic Science and Technology Laboratory, Harbin Engineering University, Harbin 150001, China
| | - Tianlong Ma
- Acoustic Science and Technology Laboratory, Harbin Engineering University, Harbin 150001, China
| |
Collapse
|
3
|
Conant PC, Li P, Liu X, Klinck H, Fleishman E, Gillespie D, Nosal EM, Roch MA. Silbido profundo: An open source package for the use of deep learning to detect odontocete whistles. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 152:3800. [PMID: 36586843 DOI: 10.1121/10.0016631] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Accepted: 12/08/2022] [Indexed: 06/17/2023]
Abstract
This work presents an open-source matlab software package for exploiting recent advances in extracting tonal signals from large acoustic data sets. A whistle extraction algorithm published by Li, Liu, Palmer, Fleishman, Gillespie, Nosal, Shiu, Klinck, Cholewiak, Helble, and Roch [(2020). Proceedings of the International Joint Conference on Neural Networks, July 19-24, Glasgow, Scotland, p. 10] is incorporated into silbido, an established software package for extraction of cetacean tonal calls. The precision and recall of the new system were over 96% and nearly 80%, respectively, when applied to a whistle extraction task on a challenging two-species subset of a conference-benchmark data set. A second data set was examined to assess whether the algorithm generalized to data that were collected across different recording devices and locations. These data included 487 h of weakly labeled, towed array data collected in the Pacific Ocean on two National Oceanographic and Atmospheric Administration (NOAA) cruises. Labels for these data consisted of regions of toothed whale presence for at least 15 species that were based on visual and acoustic observations and not limited to whistles. Although the lack of per whistle-level annotations prevented measurement of precision and recall, there was strong concurrence of automatic detections and the NOAA annotations, suggesting that the algorithm generalizes well to new data.
Collapse
Affiliation(s)
- Peter C Conant
- Department of Computer Science, San Diego State University, San Diego, California 92182, USA
| | - Pu Li
- Department of Computer Science, San Diego State University, San Diego, California 92182, USA
| | - Xiaobai Liu
- Department of Computer Science, San Diego State University, San Diego, California 92182, USA
| | - Holger Klinck
- K. Lisa Yang Center for Conservation Bioacoustics, Cornell Lab of Ornithology, Cornell University, New York, New York 14850, USA
| | - Erica Fleishman
- College of Earth, Ocean, and Atmospheric Sciences, Oregon State University, Corvallis, Oregon 97331, USA
| | - Douglas Gillespie
- Sea Mammal Research Unit, Scottish Oceans Institute, University of St. Andrews, St. Andrews, KY16 9AJ, United Kingdom
| | - Eva-Marie Nosal
- Department of Ocean and Resources Engineering, University of Hawai'i at Mānoa, Honolulu, Hawaii 96822, USA
| | - Marie A Roch
- Department of Computer Science, San Diego State University, San Diego, California 92182, USA
| |
Collapse
|
4
|
Li L, Qiao G, Qing X, Zhang H, Liu X, Liu S. Robust unsupervised Tursiops aduncus whistle-event detection using gammatone multi-channel Savitzky-Golay based whistle enhancement. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 151:3509. [PMID: 35649921 DOI: 10.1121/10.0011402] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Accepted: 05/02/2022] [Indexed: 06/15/2023]
Abstract
Detecting whistle events is essential when studying the population density and behavior of cetaceans. After eight months of passive acoustic monitoring in Xiamen, we obtained long calls from two Tursiops aduncus individuals. In this paper, we propose an algorithm with an unbiased gammatone multi-channel Savitzky-Golay for smoothing dynamic continuous background noise and interference from long click trains. The algorithm uses the method of least squares to perform a local polynomial regression on the time-frequency representation of multi-frequency resolution call measurements, which can effectively retain the whistle profiles while filtering out noise and interference. We prove that it is better at separating out whistles and has lower computational complexity than other smoothing methods. In order to further extract whistle features in enhanced spectrograms, we also propose a set of multi-scale and multi-directional moving filter banks for various whistle durations and contour shapes. The final binary adaptive decisions at frame level for whistle events are obtained from the histograms of multi-scale and multi-directional spectrograms. Finally, we explore the entire data set and find that the proposed scheme achieves the highest frame-level F1-scores when detecting T. aduncus whistles than the baseline schemes, with an improvement of more than 6%.
Collapse
Affiliation(s)
- Lei Li
- Acoustic Science and Technology Laboratory, Harbin Engineering University, Harbin 150001, China
| | - Gang Qiao
- Acoustic Science and Technology Laboratory, Harbin Engineering University, Harbin 150001, China
| | - Xin Qing
- Acoustic Science and Technology Laboratory, Harbin Engineering University, Harbin 150001, China
| | - Huaying Zhang
- Acoustic Science and Technology Laboratory, Harbin Engineering University, Harbin 150001, China
| | - Xinyu Liu
- Acoustic Science and Technology Laboratory, Harbin Engineering University, Harbin 150001, China
| | - Songzuo Liu
- Acoustic Science and Technology Laboratory, Harbin Engineering University, Harbin 150001, China
| |
Collapse
|
5
|
Gruden P, Nosal EM, Oleson E. Tracking time differences of arrivals of multiple sound sources in the presence of clutter and missed detections. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:3399. [PMID: 34852628 DOI: 10.1121/10.0006780] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Accepted: 08/19/2021] [Indexed: 06/13/2023]
Abstract
Acoustic line transect surveys are often used in combination with visual methods to estimate the abundance of marine mammal populations. These surveys typically use towed linear hydrophone arrays and estimate the time differences of arrival (TDOAs) of the signal of interest between the pairs of hydrophones. The signal source TDOAs or bearings are then tracked through time to estimate the animal position, often manually. The process of estimating TDOAs from data and tracking them through time can be especially challenging in the presence of multiple acoustically active sources, missed detections, and clutter (false TDOAs). This study proposes a multi-target tracking method to automate TDOA tracking. The problem formulation is based on the Gaussian mixture probability hypothesis density filter and includes multiple sources, source appearance and disappearance, missed detections, and false alarms. It is shown that by using an extended measurement model and combining measurements from broadband echolocation clicks and narrowband whistles, more information can be extracted from the acoustic encounters. The method is demonstrated on false killer whale (Pseudorca crassidens) recordings from Hawaiian waters.
Collapse
Affiliation(s)
- Pina Gruden
- Joint Institute for Marine and Atmospheric Research, Research Corporation of the University of Hawai'i, Honolulu, Hawaii 96822, USA
| | - Eva-Marie Nosal
- Ocean and Resources Engineering, University of Hawai'i at Mānoa, Honolulu, Hawaii 96822, USA
| | - Erin Oleson
- Pacific Islands Fisheries Science Center, National Oceanic and Atmospheric Administration (NOAA), Honolulu, Hawaii 96818, USA
| |
Collapse
|
6
|
Gruden P, White PR. Automated extraction of dolphin whistles-A sequential Monte Carlo probability hypothesis density approach. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 148:3014. [PMID: 33261403 DOI: 10.1121/10.0002257] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2020] [Accepted: 09/28/2020] [Indexed: 06/12/2023]
Abstract
The need for automated methods to detect and extract marine mammal vocalizations from acoustic data has increased in the last few decades due to the increased availability of long-term recording systems. Automated dolphin whistle extraction represents a challenging problem due to the time-varying number of overlapping whistles present in, potentially, noisy recordings. Typical methods utilize image processing techniques or single target tracking, but often result in fragmentation of whistle contours and/or partial whistle detection. This study casts the problem into a more general statistical multi-target tracking framework and uses the probability hypothesis density filter as a practical approximation to the optimal Bayesian multi-target filter. In particular, a particle version, referred to as a sequential Monte Carlo probability hypothesis density (SMC-PHD) filter, is adapted for frequency tracking and specific models are developed for this application. Based on these models, two versions of the SMC-PHD filter are proposed and the performance of these versions is investigated on an extensive real-world dataset of dolphin acoustic recordings. The proposed filters are shown to be efficient tools for automated extraction of whistles, suitable for real-time implementation.
Collapse
Affiliation(s)
- Pina Gruden
- Institute of Sound and Vibration Research, University of Southampton, Highfield, Hants, SO17 1BJ, United Kingdom
| | - Paul R White
- Institute of Sound and Vibration Research, University of Southampton, Highfield, Hants, SO17 1BJ, United Kingdom
| |
Collapse
|
7
|
Giard S, Simard Y, Roy N. Decadal passive acoustics time series of St. Lawrence estuary beluga. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:1874. [PMID: 32237843 DOI: 10.1121/10.0000922] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Accepted: 02/27/2020] [Indexed: 06/11/2023]
Abstract
Passive acoustics is used to monitor the threatened St. Lawrence estuary beluga between 2007 and 2017 from a site downstream of the beluga summer habitat. Acoustic metrics of presence and occurrence based on beluga acoustic band activity (BABA) are extracted by a dedicated algorithm adapted for the shipping noise from the St. Lawrence Seaway. A formal optimization process is used to set the algorithm parameters. Results evidence a year-round occurrence of belugas in the region, seasonal and diel patterns, and significant inter-annual variations. This study shows how passive acoustics methodology can be applied to monitor a loquacious species over multi-year periods in a shipping-noise-dominated environment, in order to understand its use of the habitat over the continuum of ecologically significant time scales.
Collapse
Affiliation(s)
- Samuel Giard
- Fisheries and Oceans Canada, Maurice Lamontagne Institute, Mont-Joli, Québec, Canada
| | - Yvan Simard
- Fisheries and Oceans Canada, Maurice Lamontagne Institute, Mont-Joli, Québec, Canada
| | - Nathalie Roy
- Fisheries and Oceans Canada, Maurice Lamontagne Institute, Mont-Joli, Québec, Canada
| |
Collapse
|