Automated identification of avian vocalizations with deep convolutional neural networks

Abstract Passive acoustic monitoring is an emerging approach to wildlife monitoring that leverages recent improvements in automated recording units and other technologies. A central challenge of this approach is the task of locating and identifying target species vocalizations in large volumes of au...

Full description

Bibliographic Details
Main Authors: Zachary J. Ruff, Damon B. Lesmeister, Leila S. Duchac, Bharath K. Padmaraju, Christopher M. Sullivan
Format: Article
Language:English
Published: Wiley 2020-03-01
Series:Remote Sensing in Ecology and Conservation
Subjects:
Online Access:https://doi.org/10.1002/rse2.125
id doaj-a014e061e6074d228d5c23933f652c03
record_format Article
spelling doaj-a014e061e6074d228d5c23933f652c032020-11-25T02:04:50ZengWileyRemote Sensing in Ecology and Conservation2056-34852020-03-0161799210.1002/rse2.125Automated identification of avian vocalizations with deep convolutional neural networksZachary J. Ruff0Damon B. Lesmeister1Leila S. Duchac2Bharath K. Padmaraju3Christopher M. Sullivan4Pacific Northwest Research Station USDA Forest Service Corvallis OregonPacific Northwest Research Station USDA Forest Service Corvallis OregonPacific Northwest Research Station USDA Forest Service Corvallis OregonCenter for Genome Research and Biocomputing Oregon State University Corvallis OregonCenter for Genome Research and Biocomputing Oregon State University Corvallis OregonAbstract Passive acoustic monitoring is an emerging approach to wildlife monitoring that leverages recent improvements in automated recording units and other technologies. A central challenge of this approach is the task of locating and identifying target species vocalizations in large volumes of audio data. To address this issue, we developed an efficient data processing pipeline using a deep convolutional neural network (CNN) to automate the detection of owl vocalizations in spectrograms generated from unprocessed field recordings. While the project was initially focused on spotted and barred owls, we also trained the network to recognize northern saw‐whet owl, great horned owl, northern pygmy‐owl, and western screech‐owl. Although classification performance varies across species, initial results are promising. Recall, or the proportion of calls in the dataset that are detected and correctly identified, ranged from 63.1% for barred owl to 91.5% for spotted owl based on raw network output. Precision, the rate of true positives among apparent detections, ranged from 0.4% for spotted owl to 77.1% for northern saw‐whet owl based on raw output. In limited tests, the CNN performed as well as or better than human technicians at detecting owl calls. Our model output is suitable for developing species encounter histories for occupancy models and other analyses. We believe our approach is sufficiently general to support long‐term, large‐scale monitoring of a broad range of species beyond our target species list, including birds, mammals, and others.https://doi.org/10.1002/rse2.125Acoustic monitoringavian vocalizationBioacousticsmachine learningneural networksspotted owls
collection DOAJ
language English
format Article
sources DOAJ
author Zachary J. Ruff
Damon B. Lesmeister
Leila S. Duchac
Bharath K. Padmaraju
Christopher M. Sullivan
spellingShingle Zachary J. Ruff
Damon B. Lesmeister
Leila S. Duchac
Bharath K. Padmaraju
Christopher M. Sullivan
Automated identification of avian vocalizations with deep convolutional neural networks
Remote Sensing in Ecology and Conservation
Acoustic monitoring
avian vocalization
Bioacoustics
machine learning
neural networks
spotted owls
author_facet Zachary J. Ruff
Damon B. Lesmeister
Leila S. Duchac
Bharath K. Padmaraju
Christopher M. Sullivan
author_sort Zachary J. Ruff
title Automated identification of avian vocalizations with deep convolutional neural networks
title_short Automated identification of avian vocalizations with deep convolutional neural networks
title_full Automated identification of avian vocalizations with deep convolutional neural networks
title_fullStr Automated identification of avian vocalizations with deep convolutional neural networks
title_full_unstemmed Automated identification of avian vocalizations with deep convolutional neural networks
title_sort automated identification of avian vocalizations with deep convolutional neural networks
publisher Wiley
series Remote Sensing in Ecology and Conservation
issn 2056-3485
publishDate 2020-03-01
description Abstract Passive acoustic monitoring is an emerging approach to wildlife monitoring that leverages recent improvements in automated recording units and other technologies. A central challenge of this approach is the task of locating and identifying target species vocalizations in large volumes of audio data. To address this issue, we developed an efficient data processing pipeline using a deep convolutional neural network (CNN) to automate the detection of owl vocalizations in spectrograms generated from unprocessed field recordings. While the project was initially focused on spotted and barred owls, we also trained the network to recognize northern saw‐whet owl, great horned owl, northern pygmy‐owl, and western screech‐owl. Although classification performance varies across species, initial results are promising. Recall, or the proportion of calls in the dataset that are detected and correctly identified, ranged from 63.1% for barred owl to 91.5% for spotted owl based on raw network output. Precision, the rate of true positives among apparent detections, ranged from 0.4% for spotted owl to 77.1% for northern saw‐whet owl based on raw output. In limited tests, the CNN performed as well as or better than human technicians at detecting owl calls. Our model output is suitable for developing species encounter histories for occupancy models and other analyses. We believe our approach is sufficiently general to support long‐term, large‐scale monitoring of a broad range of species beyond our target species list, including birds, mammals, and others.
topic Acoustic monitoring
avian vocalization
Bioacoustics
machine learning
neural networks
spotted owls
url https://doi.org/10.1002/rse2.125
work_keys_str_mv AT zacharyjruff automatedidentificationofavianvocalizationswithdeepconvolutionalneuralnetworks
AT damonblesmeister automatedidentificationofavianvocalizationswithdeepconvolutionalneuralnetworks
AT leilasduchac automatedidentificationofavianvocalizationswithdeepconvolutionalneuralnetworks
AT bharathkpadmaraju automatedidentificationofavianvocalizationswithdeepconvolutionalneuralnetworks
AT christophermsullivan automatedidentificationofavianvocalizationswithdeepconvolutionalneuralnetworks
_version_ 1724940793412583424