Anomaly Detection in Industrial Machine Sounds Using High-Frequency Features and Gate Recurrent Unit Networks

Detecting anomalies in industrial sound is critical for maintaining operational efficiency, preventing costly equipment failures, and ensuring workplace safety. However, it presents significant challenges due to the complexity and variability of industrial environments, including background noise an...

全面介紹

書目詳細資料
發表在:IEEE Access
Main Authors: Thi-Thu-Huong Le, Andro Aprila Adiputra, Jiwon Yun, Howon Kim
格式: Article
語言:英语
出版: IEEE 2025-01-01
主題:
在線閱讀:https://ieeexplore.ieee.org/document/10980326/
實物特徵
總結:Detecting anomalies in industrial sound is critical for maintaining operational efficiency, preventing costly equipment failures, and ensuring workplace safety. However, it presents significant challenges due to the complexity and variability of industrial environments, including background noise and fluctuating operating conditions. This paper proposes a comprehensive approach that leverages machine learning (ML) and deep learning (DL) techniques to address these challenges. Using three datasets, the Malfunctioning Industrial Machine Investigation and Inspection for Domain Generalization 2022 (MIMII DG 2022), the Detection and Classification of Acoustic Scenes and Events (DCASE) 2022, and DCASE 2024, we evaluate the performance of various ML and DL models under different experimental conditions. Our study focuses on feature extraction methods such as Mel-spectrograms and Mel-frequency Cepstral Coefficients (MFCCs) to capture critical acoustic characteristics of industrial machinery. Both supervised ML and DL techniques are employed to explore effective anomaly detection strategies. Extensive experimentation and evaluation using metrics such as confusion matrix, accuracy, precision, recall, and F1 score highlight the effectiveness of our approach in real-world industrial scenarios. Experimental results demonstrate that eXtreme Gradient Boosting (XGBoost) outperforms Support Vector Machine (SVM) and Decision Tree (DT) models in the ML approach across both feature sets. In the DL approach, Gated Recurrent Units (GRU) perform better on MFCC features than Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM) networks. GRU emerges as the best-performing model on testing datasets, achieving a precision of 99.56% and an F1 score of 99.55% on the DCASE 2024 dataset, a precision of 94% and an F1 score of 93.95% on the DCASE 2022 dataset, and a precision of 92.2% and an F1 score of 92.06% on the MIMII DG 2022 dataset. These results underscore the potential of DL for real-time industrial sound analysis in predictive maintenance, offering significant improvements over traditional methods.
ISSN:2169-3536