1
|
Conant PC, Li P, Liu X, Klinck H, Fleishman E, Gillespie D, Nosal EM, Roch MA. Silbido profundo: An open source package for the use of deep learning to detect odontocete whistles. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 152:3800. [PMID: 36586843 DOI: 10.1121/10.0016631] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Accepted: 12/08/2022] [Indexed: 06/17/2023]
Abstract
This work presents an open-source matlab software package for exploiting recent advances in extracting tonal signals from large acoustic data sets. A whistle extraction algorithm published by Li, Liu, Palmer, Fleishman, Gillespie, Nosal, Shiu, Klinck, Cholewiak, Helble, and Roch [(2020). Proceedings of the International Joint Conference on Neural Networks, July 19-24, Glasgow, Scotland, p. 10] is incorporated into silbido, an established software package for extraction of cetacean tonal calls. The precision and recall of the new system were over 96% and nearly 80%, respectively, when applied to a whistle extraction task on a challenging two-species subset of a conference-benchmark data set. A second data set was examined to assess whether the algorithm generalized to data that were collected across different recording devices and locations. These data included 487 h of weakly labeled, towed array data collected in the Pacific Ocean on two National Oceanographic and Atmospheric Administration (NOAA) cruises. Labels for these data consisted of regions of toothed whale presence for at least 15 species that were based on visual and acoustic observations and not limited to whistles. Although the lack of per whistle-level annotations prevented measurement of precision and recall, there was strong concurrence of automatic detections and the NOAA annotations, suggesting that the algorithm generalizes well to new data.
Collapse
Affiliation(s)
- Peter C Conant
- Department of Computer Science, San Diego State University, San Diego, California 92182, USA
| | - Pu Li
- Department of Computer Science, San Diego State University, San Diego, California 92182, USA
| | - Xiaobai Liu
- Department of Computer Science, San Diego State University, San Diego, California 92182, USA
| | - Holger Klinck
- K. Lisa Yang Center for Conservation Bioacoustics, Cornell Lab of Ornithology, Cornell University, New York, New York 14850, USA
| | - Erica Fleishman
- College of Earth, Ocean, and Atmospheric Sciences, Oregon State University, Corvallis, Oregon 97331, USA
| | - Douglas Gillespie
- Sea Mammal Research Unit, Scottish Oceans Institute, University of St. Andrews, St. Andrews, KY16 9AJ, United Kingdom
| | - Eva-Marie Nosal
- Department of Ocean and Resources Engineering, University of Hawai'i at Mānoa, Honolulu, Hawaii 96822, USA
| | - Marie A Roch
- Department of Computer Science, San Diego State University, San Diego, California 92182, USA
| |
Collapse
|
2
|
Wang Y, Ye J, Borchers DL. Automated call detection for acoustic surveys with structured calls of varying length. Methods Ecol Evol 2022. [DOI: 10.1111/2041-210x.13873] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Yuheng Wang
- Centre for Research into Ecological and Environmental Modelling School of Mathematics and Statistics University of St Andrews, The Observatory, St Andrews Fife Scotland
| | - Juan Ye
- School of Computer Science University of St Andrews, North Haugh, St Andrews Fife Scotland
| | - David L. Borchers
- Centre for Research into Ecological and Environmental Modelling School of Mathematics and Statistics University of St Andrews, The Observatory, St Andrews Fife Scotland
- Centre for Statistics in Ecology, the Environment, and Conservation University of Cape Town Cape Town South Africa
| |
Collapse
|
3
|
|
4
|
Biomimicking Covert Communication by Time-Frequency Shift Modulation for Increasing Mimicking and BER Performances. SENSORS 2021; 21:s21062184. [PMID: 33804747 PMCID: PMC8004036 DOI: 10.3390/s21062184] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 03/10/2021] [Accepted: 03/17/2021] [Indexed: 11/17/2022]
Abstract
Underwater acoustic (UWA) biomimicking communications have been developed for covert communications. For the UWA covert communications, it is difficult to achieve the bit error rate (BER) and the degree of mimic (DoM) performances at the same time. This paper proposes a biomimicking covert communication method to increase both BER and DoM (degree of mimic) performances based on the Time Frequency Shift Keying (TFSK). To increase DoM and BER performances, the orthogonality requirements of the time- and frequency-shifting units of the TFSK are theoretically derived, and the whistles are multiplied by the sequence with a large correlation. Two-step DoM assessments are also developed for the long-term whistle signals. Computer simulations and practical lake and ocean experiments demonstrate that the proposed method increases the DoM by 35% and attains a zero BER at −6 dB of Signal to Noise Ratio (SNR).
Collapse
|
5
|
Lin TH, Akamatsu T, Tsao Y. Sensing ecosystem dynamics via audio source separation: A case study of marine soundscapes off northeastern Taiwan. PLoS Comput Biol 2021; 17:e1008698. [PMID: 33600436 PMCID: PMC7891715 DOI: 10.1371/journal.pcbi.1008698] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2020] [Accepted: 01/12/2021] [Indexed: 11/28/2022] Open
Abstract
Remote acquisition of information on ecosystem dynamics is essential for conservation management, especially for the deep ocean. Soundscape offers unique opportunities to study the behavior of soniferous marine animals and their interactions with various noise-generating activities at a fine temporal resolution. However, the retrieval of soundscape information remains challenging owing to limitations in audio analysis techniques that are effective in the face of highly variable interfering sources. This study investigated the application of a seafloor acoustic observatory as a long-term platform for observing marine ecosystem dynamics through audio source separation. A source separation model based on the assumption of source-specific periodicity was used to factorize time-frequency representations of long-duration underwater recordings. With minimal supervision, the model learned to discriminate source-specific spectral features and prove to be effective in the separation of sounds made by cetaceans, soniferous fish, and abiotic sources from the deep-water soundscapes off northeastern Taiwan. Results revealed phenological differences among the sound sources and identified diurnal and seasonal interactions between cetaceans and soniferous fish. The application of clustering to source separation results generated a database featuring the diversity of soundscapes and revealed a compositional shift in clusters of cetacean vocalizations and fish choruses during diurnal and seasonal cycles. The source separation model enables the transformation of single-channel audio into multiple channels encoding the dynamics of biophony, geophony, and anthropophony, which are essential for characterizing the community of soniferous animals, quality of acoustic habitat, and their interactions. Our results demonstrated the application of source separation could facilitate acoustic diversity assessment, which is a crucial task in soundscape-based ecosystem monitoring. Future implementation of soundscape information retrieval in long-term marine observation networks will lead to the use of soundscapes as a new tool for conservation management in an increasingly noisy ocean.
Collapse
Affiliation(s)
- Tzu-Hao Lin
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan (R.O.C)
| | - Tomonari Akamatsu
- The Ocean Policy Research Institute, The Sasakawa Peace Foundation, Tokyo, Japan
| | - Yu Tsao
- Research Center for Information Technology Innovation, Academia Sinica, Taipei, Taiwan (R.O.C)
| |
Collapse
|
6
|
|
7
|
Machine Learning Based Biomimetic Underwater Covert Acoustic Communication Method Using Dolphin Whistle Contours. SENSORS 2020; 20:s20216166. [PMID: 33138137 PMCID: PMC7662371 DOI: 10.3390/s20216166] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Revised: 10/27/2020] [Accepted: 10/28/2020] [Indexed: 11/17/2022]
Abstract
For underwater acoustic covert communications, biomimetic covert communications have been developed using dolphin whistles. The conventional biomimetic covert communication methods transmit slightly different signal patterns from real dolphin whistles, which results in a low degree of mimic (DoM). In this paper, we propose a novel biomimetic communication method that preserves the large DoM with a low bit error rate (BER). For the transmission, the proposed method utilizes the various contours of real dolphin whistles with the link information among consecutive whistles, and the proposed receiver uses machine learning based whistle detectors with the aid of the link information. Computer simulations and practical ocean experiments were executed to demonstrate the better BER performance of the proposed method. Ocean experiments demonstrate that the BER of the proposed method was 0.002, while the BER of the conventional Deep Neural Network (DNN) based detector showed 0.36.
Collapse
|
8
|
Mooney TA, Di Iorio L, Lammers M, Lin TH, Nedelec SL, Parsons M, Radford C, Urban E, Stanley J. Listening forward: approaching marine biodiversity assessments using acoustic methods. ROYAL SOCIETY OPEN SCIENCE 2020; 7:201287. [PMID: 32968541 PMCID: PMC7481698 DOI: 10.1098/rsos.201287] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Accepted: 08/05/2020] [Indexed: 05/08/2023]
Abstract
Ecosystems and the communities they support are changing at alarmingly rapid rates. Tracking species diversity is vital to managing these stressed habitats. Yet, quantifying and monitoring biodiversity is often challenging, especially in ocean habitats. Given that many animals make sounds, these cues travel efficiently under water, and emerging technologies are increasingly cost-effective, passive acoustics (a long-standing ocean observation method) is now a potential means of quantifying and monitoring marine biodiversity. Properly applying acoustics for biodiversity assessments is vital. Our goal here is to provide a timely consideration of emerging methods using passive acoustics to measure marine biodiversity. We provide a summary of the brief history of using passive acoustics to assess marine biodiversity and community structure, a critical assessment of the challenges faced, and outline recommended practices and considerations for acoustic biodiversity measurements. We focused on temperate and tropical seas, where much of the acoustic biodiversity work has been conducted. Overall, we suggest a cautious approach to applying current acoustic indices to assess marine biodiversity. Key needs are preliminary data and sampling sufficiently to capture the patterns and variability of a habitat. Yet with new analytical tools including source separation and supervised machine learning, there is substantial promise in marine acoustic diversity assessment methods.
Collapse
Affiliation(s)
- T. Aran Mooney
- Biology Department, Woods Hole Oceanographic Institution, 266 Woods Hole Road, Woods Hole, MA 02543, USA
- Author for correspondence: T. Aran Mooney e-mail:
| | - Lucia Di Iorio
- CHORUS Institute, Phelma Minatec, 3 parvis Louis Néel, 38000 Grenoble, France
| | - Marc Lammers
- Hawaiian Islands Humpback Whale National Marine Sanctuary, 726 South Kihei Road, Kihei, HI 96753, USA
| | - Tzu-Hao Lin
- Biodiversity Research Center, Academia Sinica, 128 Academia Road, Section 2, Nankang, Taipei 11529, Taiwan
| | - Sophie L. Nedelec
- Biosciences, College of Life and Environmental Sciences, Hatherly Laboratories, University of Exeter, Prince of Wales Road, Exeter EX4 4PS, UK
| | - Miles Parsons
- Australian Institute of Marine Science, Perth, Western Australia 6009, Australia
| | - Craig Radford
- Institute of Marine Science, Leigh Marine Laboratory, University of Auckland, PO Box 349, Warkworth 0941, New Zealand
| | - Ed Urban
- Scientific Committee on Oceanic Research, University of Delaware, Newark, DE 19716, USA
| | - Jenni Stanley
- Biology Department, Woods Hole Oceanographic Institution, 266 Woods Hole Road, Woods Hole, MA 02543, USA
| |
Collapse
|
9
|
Harakawa R, Ogawa T, Haseyama M, Akamatsu T. Automatic detection of fish sounds based on multi-stage classification including logistic regression via adaptive feature weighting. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:2709. [PMID: 30522274 DOI: 10.1121/1.5067373] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2017] [Accepted: 10/16/2018] [Indexed: 06/09/2023]
Abstract
This paper presents a method for automatic detection of fish sounds in an underwater environment. There exist two difficulties: (i) features and classifiers that provide good detection results differ depending on the underwater environment and (ii) there are cases where a large amount of training data that is necessary for supervised machine learning cannot be prepared. A method presented in this paper (the proposed hybrid method) overcomes these difficulties as follows. First, novel logistic regression (NLR) is derived via adaptive feature weighting by focusing on the accuracy of classification results by multiple classifiers, support vector machine (SVM), and k-nearest neighbors (k-NN). Although there are cases where SVM or k-NN cannot work well due to divergence of useful features, NLR can produce complementary results. Second, the proposed hybrid method performs multi-stage classification with consideration of the accuracy of SVM, k-NN, and NLR. The multi-stage acquisition of reliable results works adaptively according to the underwater environment to reduce performance degradation due to diversity of useful classifiers even if abundant training data cannot be prepared. Experiments on underwater recordings including sounds of Sciaenidae such as silver croakers (Pennahia argentata) and blue drums (Nibea mitsukurii) show the effectiveness of the proposed hybrid method.
Collapse
Affiliation(s)
- Ryosuke Harakawa
- Graduate School of Information Science and Technology, Hokkaido University, Sapporo, Hokkaido 060-0814, Japan
| | - Takahiro Ogawa
- Graduate School of Information Science and Technology, Hokkaido University, Sapporo, Hokkaido 060-0814, Japan
| | - Miki Haseyama
- Graduate School of Information Science and Technology, Hokkaido University, Sapporo, Hokkaido 060-0814, Japan
| | - Tomonari Akamatsu
- National Research Institute of Fisheries Science, Fisheries Research Agency, Yokohama, Kanagawa 236-8648, Japan
| |
Collapse
|
10
|
Lin TH, Chou LS. Automatic classification of delphinids based on the representative frequencies of whistles. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2015; 138:1003-1011. [PMID: 26328716 DOI: 10.1121/1.4927695] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Classification of odontocete species remains a challenging task for passive acoustic monitoring. Classifiers that have been developed use spectral features extracted from echolocation clicks and whistle contours. Most of these contour-based classifiers require complete contours to reduce measurement errors. Therefore, overlapping contours and partially detected contours in an automatic detection algorithm may increase the bias for contour-based classifiers. In this study, classification was conducted on each recording section without extracting individual contours. The local-max detector was used to extract representative frequencies of delphinid whistles and each section was divided into multiple non-overlapping fragments. Three acoustical parameters were measured from the distribution of representative frequencies in each fragment. By using the statistical features of the acoustical parameters and the percentage of overlapping whistles, correct classification rate of 70.3% was reached for the recordings of seven species (Tursiops truncatus, Delphinus delphis, Delphinus capensis, Peponocephala electra, Grampus griseus, Stenella longirostris longirostris, and Stenella attenuata) archived in MobySound.org. In addition, correct classification rate was not dramatically reduced in various simulated noise conditions. This algorithm can be employed in acoustic observatories to classify different delphinid species and facilitate future studies on the community ecology of odontocetes.
Collapse
Affiliation(s)
- Tzu-Hao Lin
- Institute of Ecology and Evolutionary Biology, National Taiwan University, Number 1, Section 4, Roosevelt Road, Taipei 10617, Taiwan
| | - Lien-Siang Chou
- Institute of Ecology and Evolutionary Biology, National Taiwan University, Number 1, Section 4, Roosevelt Road, Taipei 10617, Taiwan
| |
Collapse
|
11
|
Lin TH, Yu HY, Chen CF, Chou LS. Passive acoustic monitoring of the temporal variability of odontocete tonal sounds from a long-term marine observatory. PLoS One 2015; 10:e0123943. [PMID: 25923338 PMCID: PMC4414466 DOI: 10.1371/journal.pone.0123943] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2014] [Accepted: 03/09/2015] [Indexed: 11/23/2022] Open
Abstract
The developments of marine observatories and automatic sound detection algorithms have facilitated the long-term monitoring of multiple species of odontocetes. Although classification remains difficult, information on tonal sound in odontocetes (i.e., toothed whales, including dolphins and porpoises) can provide insights into the species composition and group behavior of these species. However, the approach to measure whistle contour parameters for detecting the variability of odontocete vocal behavior may be biased when the signal-to-noise ratio is low. Thus, methods for analyzing the whistle usage of an entire group are necessary. In this study, a local-max detector was used to detect burst pulses and representative frequencies of whistles within 4.5–48 kHz. Whistle contours were extracted and classified using an unsupervised method. Whistle characteristics and usage pattern were quantified based on the distribution of representative frequencies and the composition of whistle repertoires. Based on the one year recordings collected from the Marine Cable Hosted Observatory off northeastern Taiwan, odontocete burst pulses and whistles were primarily detected during the nighttime, especially after sunset. Whistle usage during the nighttime was more complex, and whistles with higher frequency were mainly detected during summer and fall. According to the multivariate analysis, the diurnal variation of whistle usage was primarily related to the change of mode frequency, diversity of representative frequency, and sequence complexity. The seasonal variation of whistle usage involved the previous three parameters, in addition to the diversity of whistle clusters. Our results indicated that the species and behavioral composition of the local odontocete community may vary among seasonal and diurnal cycles. The current monitoring platform facilitates the evaluation of whistle usage based on group behavior and provides feature vectors for species and behavioral classification in future studies.
Collapse
Affiliation(s)
- Tzu-Hao Lin
- Institute of Ecology and Evolutionary Biology, National Taiwan University, No. 1, Sec. 4, Roosevelt Road, Taipei 10617, Taiwan (R.O.C.)
| | - Hsin-Yi Yu
- Institute of Ecology and Evolutionary Biology, National Taiwan University, No. 1, Sec. 4, Roosevelt Road, Taipei 10617, Taiwan (R.O.C.)
| | - Chi-Fang Chen
- Department of Engineering Science and Ocean Engineering, National Taiwan University, No. 1, Sec. 4, Roosevelt Road, Taipei 10617, Taiwan (R.O.C.)
| | - Lien-Siang Chou
- Institute of Ecology and Evolutionary Biology, National Taiwan University, No. 1, Sec. 4, Roosevelt Road, Taipei 10617, Taiwan (R.O.C.)
- * E-mail:
| |
Collapse
|