Environment Sound Classification System Based on Hybrid Feature and Convolutional Neural Network

At present, the environment sound recognition system mainly identifies environment sounds with deep neural networks and a wide variety of auditory features. Therefore, it is necessary to analyze which auditory features are more suitable for deep neural networks based ESCR systems. In this paper, we...

Full description

Bibliographic Details
Format: Article
Language:zho
Published: The Northwestern Polytechnical University 2020-02-01
Series:Xibei Gongye Daxue Xuebao
Subjects:
Online Access:https://www.jnwpu.org/articles/jnwpu/full_html/2020/01/jnwpu2020381p162/jnwpu2020381p162.html
id doaj-e133f03b38f64bbb823ab09bf844509d
record_format Article
spelling doaj-e133f03b38f64bbb823ab09bf844509d2021-05-02T19:19:38ZzhoThe Northwestern Polytechnical UniversityXibei Gongye Daxue Xuebao1000-27582609-71252020-02-0138116216910.1051/jnwpu/20203810162jnwpu2020381p162Environment Sound Classification System Based on Hybrid Feature and Convolutional Neural NetworkAt present, the environment sound recognition system mainly identifies environment sounds with deep neural networks and a wide variety of auditory features. Therefore, it is necessary to analyze which auditory features are more suitable for deep neural networks based ESCR systems. In this paper, we chose three sound features which based on two widely used filters:the Mel and Gammatone filter banks. Subsequently, the hybrid feature MGCC is presented. Finally, a deep convolutional neural network is proposed to verify which features are more suitable for environment sound classification and recognition tasks. The experimental results show that the signal processing features are better than the spectrogram features in the deep neural network based environmental sound recognition system. Among all the acoustic features, the MGCC feature achieves the best performance than other features. Finally, the MGCC-CNN model proposed in this paper is compared with the state-of-the-art environmental sound classification models on the UrbanSound 8K dataset. The results show that the proposed model has the best classification accuracy.https://www.jnwpu.org/articles/jnwpu/full_html/2020/01/jnwpu2020381p162/jnwpu2020381p162.htmlenvironment soundhybrid featuresound classificationconvolutional neural networkfilter
collection DOAJ
language zho
format Article
sources DOAJ
title Environment Sound Classification System Based on Hybrid Feature and Convolutional Neural Network
spellingShingle Environment Sound Classification System Based on Hybrid Feature and Convolutional Neural Network
Xibei Gongye Daxue Xuebao
environment sound
hybrid feature
sound classification
convolutional neural network
filter
title_short Environment Sound Classification System Based on Hybrid Feature and Convolutional Neural Network
title_full Environment Sound Classification System Based on Hybrid Feature and Convolutional Neural Network
title_fullStr Environment Sound Classification System Based on Hybrid Feature and Convolutional Neural Network
title_full_unstemmed Environment Sound Classification System Based on Hybrid Feature and Convolutional Neural Network
title_sort environment sound classification system based on hybrid feature and convolutional neural network
publisher The Northwestern Polytechnical University
series Xibei Gongye Daxue Xuebao
issn 1000-2758
2609-7125
publishDate 2020-02-01
description At present, the environment sound recognition system mainly identifies environment sounds with deep neural networks and a wide variety of auditory features. Therefore, it is necessary to analyze which auditory features are more suitable for deep neural networks based ESCR systems. In this paper, we chose three sound features which based on two widely used filters:the Mel and Gammatone filter banks. Subsequently, the hybrid feature MGCC is presented. Finally, a deep convolutional neural network is proposed to verify which features are more suitable for environment sound classification and recognition tasks. The experimental results show that the signal processing features are better than the spectrogram features in the deep neural network based environmental sound recognition system. Among all the acoustic features, the MGCC feature achieves the best performance than other features. Finally, the MGCC-CNN model proposed in this paper is compared with the state-of-the-art environmental sound classification models on the UrbanSound 8K dataset. The results show that the proposed model has the best classification accuracy.
topic environment sound
hybrid feature
sound classification
convolutional neural network
filter
url https://www.jnwpu.org/articles/jnwpu/full_html/2020/01/jnwpu2020381p162/jnwpu2020381p162.html
_version_ 1721488582517981184