A machine learning pipeline for classification of cetacean echolocation clicks in large underwater acoustic datasets

Machine learning algorithms, including recent advances in deep learning, are promising for tools for detection and classification of broadband high frequency signals in passive acoustic recordings. However, these methods are generally data-hungry and progress has been limited by challenges related t...

Full description

Bibliographic Details
Main Author: Frasier, K.E (Author)
Format: Article
Language:English
Published: Public Library of Science 2021
Subjects:
Online Access:View Fulltext in Publisher
LEADER 03595nam a2200577Ia 4500
001 10.1371-JOURNAL.PCBI.1009613
008 220427s2021 CNT 000 0 und d
020 |a 1553734X (ISSN) 
245 1 0 |a A machine learning pipeline for classification of cetacean echolocation clicks in large underwater acoustic datasets 
260 0 |b Public Library of Science  |c 2021 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1371/JOURNAL.PCBI.1009613 
520 3 |a Machine learning algorithms, including recent advances in deep learning, are promising for tools for detection and classification of broadband high frequency signals in passive acoustic recordings. However, these methods are generally data-hungry and progress has been limited by challenges related to the lack of labeled datasets adequate for training and testing. Large quantities of known and as yet unidentified broadband signal types mingle in marine recordings, with variability introduced by acoustic propagation, source depths and orientations, and interacting signals. Manual classification of these datasets is unmanageable without an in-depth knowledge of the acoustic context of each recording location. A signal classification pipeline is presented which combines unsupervised and supervised learning phases with opportunities for expert oversight to label signals of interest. The method is illustrated with a case study using unsupervised clustering to identify five toothed whale echolocation click types and two anthropogenic signal categories. These categories are used to train a deep network to classify detected signals in either averaged time bins or as individual detections, in two independent datasets. Bin-level classification achieved higher overall precision (>99%) than click-level classification. However, click-level classification had the advantage of providing a label for every signal, and achieved higher overall recall, with overall precision from 92 to 94%. The results suggest that unsupervised learning is a viable solution for efficiently generating the large, representative training sets needed for applications of deep learning in passive acoustics. Copyright: © 2021 Kaitlin E. Frasier. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. 
650 0 4 |a acoustics 
650 0 4 |a acoustics 
650 0 4 |a Acoustics 
650 0 4 |a algorithm 
650 0 4 |a Algorithms 
650 0 4 |a animal 
650 0 4 |a Animals 
650 0 4 |a article 
650 0 4 |a biology 
650 0 4 |a California 
650 0 4 |a California 
650 0 4 |a Cetacea 
650 0 4 |a Cetacea 
650 0 4 |a cluster analysis 
650 0 4 |a Cluster Analysis 
650 0 4 |a Computational Biology 
650 0 4 |a Data Interpretation, Statistical 
650 0 4 |a Databases, Factual 
650 0 4 |a deep learning 
650 0 4 |a Deep Learning 
650 0 4 |a echolocation 
650 0 4 |a echolocation 
650 0 4 |a Echolocation 
650 0 4 |a factual database 
650 0 4 |a machine learning 
650 0 4 |a Machine Learning 
650 0 4 |a nonhuman 
650 0 4 |a physiology 
650 0 4 |a pipeline 
650 0 4 |a recall 
650 0 4 |a software design 
650 0 4 |a Software Design 
650 0 4 |a statistical analysis 
650 0 4 |a toothed whale 
650 0 4 |a unsupervised machine learning 
650 0 4 |a Unsupervised Machine Learning 
650 0 4 |a Whales 
700 1 |a Frasier, K.E.  |e author 
773 |t PLoS Computational Biology