Automated identification of avian vocalizations with deep convolutional neural networks
Abstract Passive acoustic monitoring is an emerging approach to wildlife monitoring that leverages recent improvements in automated recording units and other technologies. A central challenge of this approach is the task of locating and identifying target species vocalizations in large volumes of au...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2020-03-01
|
Series: | Remote Sensing in Ecology and Conservation |
Subjects: | |
Online Access: | https://doi.org/10.1002/rse2.125 |
id |
doaj-a014e061e6074d228d5c23933f652c03 |
---|---|
record_format |
Article |
spelling |
doaj-a014e061e6074d228d5c23933f652c032020-11-25T02:04:50ZengWileyRemote Sensing in Ecology and Conservation2056-34852020-03-0161799210.1002/rse2.125Automated identification of avian vocalizations with deep convolutional neural networksZachary J. Ruff0Damon B. Lesmeister1Leila S. Duchac2Bharath K. Padmaraju3Christopher M. Sullivan4Pacific Northwest Research Station USDA Forest Service Corvallis OregonPacific Northwest Research Station USDA Forest Service Corvallis OregonPacific Northwest Research Station USDA Forest Service Corvallis OregonCenter for Genome Research and Biocomputing Oregon State University Corvallis OregonCenter for Genome Research and Biocomputing Oregon State University Corvallis OregonAbstract Passive acoustic monitoring is an emerging approach to wildlife monitoring that leverages recent improvements in automated recording units and other technologies. A central challenge of this approach is the task of locating and identifying target species vocalizations in large volumes of audio data. To address this issue, we developed an efficient data processing pipeline using a deep convolutional neural network (CNN) to automate the detection of owl vocalizations in spectrograms generated from unprocessed field recordings. While the project was initially focused on spotted and barred owls, we also trained the network to recognize northern saw‐whet owl, great horned owl, northern pygmy‐owl, and western screech‐owl. Although classification performance varies across species, initial results are promising. Recall, or the proportion of calls in the dataset that are detected and correctly identified, ranged from 63.1% for barred owl to 91.5% for spotted owl based on raw network output. Precision, the rate of true positives among apparent detections, ranged from 0.4% for spotted owl to 77.1% for northern saw‐whet owl based on raw output. In limited tests, the CNN performed as well as or better than human technicians at detecting owl calls. Our model output is suitable for developing species encounter histories for occupancy models and other analyses. We believe our approach is sufficiently general to support long‐term, large‐scale monitoring of a broad range of species beyond our target species list, including birds, mammals, and others.https://doi.org/10.1002/rse2.125Acoustic monitoringavian vocalizationBioacousticsmachine learningneural networksspotted owls |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zachary J. Ruff Damon B. Lesmeister Leila S. Duchac Bharath K. Padmaraju Christopher M. Sullivan |
spellingShingle |
Zachary J. Ruff Damon B. Lesmeister Leila S. Duchac Bharath K. Padmaraju Christopher M. Sullivan Automated identification of avian vocalizations with deep convolutional neural networks Remote Sensing in Ecology and Conservation Acoustic monitoring avian vocalization Bioacoustics machine learning neural networks spotted owls |
author_facet |
Zachary J. Ruff Damon B. Lesmeister Leila S. Duchac Bharath K. Padmaraju Christopher M. Sullivan |
author_sort |
Zachary J. Ruff |
title |
Automated identification of avian vocalizations with deep convolutional neural networks |
title_short |
Automated identification of avian vocalizations with deep convolutional neural networks |
title_full |
Automated identification of avian vocalizations with deep convolutional neural networks |
title_fullStr |
Automated identification of avian vocalizations with deep convolutional neural networks |
title_full_unstemmed |
Automated identification of avian vocalizations with deep convolutional neural networks |
title_sort |
automated identification of avian vocalizations with deep convolutional neural networks |
publisher |
Wiley |
series |
Remote Sensing in Ecology and Conservation |
issn |
2056-3485 |
publishDate |
2020-03-01 |
description |
Abstract Passive acoustic monitoring is an emerging approach to wildlife monitoring that leverages recent improvements in automated recording units and other technologies. A central challenge of this approach is the task of locating and identifying target species vocalizations in large volumes of audio data. To address this issue, we developed an efficient data processing pipeline using a deep convolutional neural network (CNN) to automate the detection of owl vocalizations in spectrograms generated from unprocessed field recordings. While the project was initially focused on spotted and barred owls, we also trained the network to recognize northern saw‐whet owl, great horned owl, northern pygmy‐owl, and western screech‐owl. Although classification performance varies across species, initial results are promising. Recall, or the proportion of calls in the dataset that are detected and correctly identified, ranged from 63.1% for barred owl to 91.5% for spotted owl based on raw network output. Precision, the rate of true positives among apparent detections, ranged from 0.4% for spotted owl to 77.1% for northern saw‐whet owl based on raw output. In limited tests, the CNN performed as well as or better than human technicians at detecting owl calls. Our model output is suitable for developing species encounter histories for occupancy models and other analyses. We believe our approach is sufficiently general to support long‐term, large‐scale monitoring of a broad range of species beyond our target species list, including birds, mammals, and others. |
topic |
Acoustic monitoring avian vocalization Bioacoustics machine learning neural networks spotted owls |
url |
https://doi.org/10.1002/rse2.125 |
work_keys_str_mv |
AT zacharyjruff automatedidentificationofavianvocalizationswithdeepconvolutionalneuralnetworks AT damonblesmeister automatedidentificationofavianvocalizationswithdeepconvolutionalneuralnetworks AT leilasduchac automatedidentificationofavianvocalizationswithdeepconvolutionalneuralnetworks AT bharathkpadmaraju automatedidentificationofavianvocalizationswithdeepconvolutionalneuralnetworks AT christophermsullivan automatedidentificationofavianvocalizationswithdeepconvolutionalneuralnetworks |
_version_ |
1724940793412583424 |