A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering Algorithm
K-Means clustering algorithm is a typical unsupervised learning method, and it has been widely used in the field of fault diagnosis. However, the traditional serial K-Means clustering algorithm is difficult to efficiently and accurately perform clustering analysis on the massive running-state monito...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9353541/ |
id |
doaj-5e7cfef24bd142f8bed58ca86e9d7b49 |
---|---|
record_format |
Article |
spelling |
doaj-5e7cfef24bd142f8bed58ca86e9d7b492021-05-19T23:03:30ZengIEEEIEEE Access2169-35362021-01-019287532876810.1109/ACCESS.2021.30592219353541A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering AlgorithmLanjun Wan0https://orcid.org/0000-0001-7236-3589Gen Zhang1Hongyang Li2Changyun Li3School of Computer Science, Hunan University of Technology, Zhuzhou, ChinaSchool of Computer Science, Hunan University of Technology, Zhuzhou, ChinaSchool of Computer Science, Hunan University of Technology, Zhuzhou, ChinaHunan Key Laboratory of Intelligent Information Perception and Processing Technology, Hunan University of Technology, Zhuzhou, ChinaK-Means clustering algorithm is a typical unsupervised learning method, and it has been widely used in the field of fault diagnosis. However, the traditional serial K-Means clustering algorithm is difficult to efficiently and accurately perform clustering analysis on the massive running-state monitoring data of rolling bearing. Therefore, a novel fault diagnosis method of rolling bearing using Spark-based parallel ant colony optimization (ACO)-K-Means clustering algorithm is proposed. Firstly, a Spark-based three-layer wavelet packet decomposition approach is developed to efficiently preprocess the running-state monitoring data to obtain eigenvectors, which are stored in Hadoop Distributed File System (HDFS) and served as the input of ACO-K-Means clustering algorithm. Secondly, ACO-K-Means clustering algorithm suitable for rolling bearing fault diagnosis is proposed to improve the diagnosis accuracy. ACO algorithm is adopted to obtain the global optimal initial clustering centers of K-Means from all eigenvectors, and the K-Means clustering algorithm based on weighted Euclidean distance is used to perform clustering analysis on all eigenvectors to obtain a rolling bearing fault diagnosis model. Thirdly, the efficient parallelization of ACO-K-Means clustering algorithm is implemented on a Spark platform, which can make full use of the computing resources of a cluster to efficiently process large-scale rolling bearing datasets in parallel. Extensive experiments are conducted to verify the effectiveness of the proposed fault diagnosis method. Experimental results show that the proposed method can not only achieve good fault diagnosis accuracy but also provide high model training efficiency and fault diagnosis efficiency in a big data environment.https://ieeexplore.ieee.org/document/9353541/Ant colony optimizationfault diagnosisK-Means clusteringrolling bearingsparkwavelet packet decomposition |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Lanjun Wan Gen Zhang Hongyang Li Changyun Li |
spellingShingle |
Lanjun Wan Gen Zhang Hongyang Li Changyun Li A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering Algorithm IEEE Access Ant colony optimization fault diagnosis K-Means clustering rolling bearing spark wavelet packet decomposition |
author_facet |
Lanjun Wan Gen Zhang Hongyang Li Changyun Li |
author_sort |
Lanjun Wan |
title |
A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering Algorithm |
title_short |
A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering Algorithm |
title_full |
A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering Algorithm |
title_fullStr |
A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering Algorithm |
title_full_unstemmed |
A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering Algorithm |
title_sort |
novel bearing fault diagnosis method using spark-based parallel aco-k-means clustering algorithm |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2021-01-01 |
description |
K-Means clustering algorithm is a typical unsupervised learning method, and it has been widely used in the field of fault diagnosis. However, the traditional serial K-Means clustering algorithm is difficult to efficiently and accurately perform clustering analysis on the massive running-state monitoring data of rolling bearing. Therefore, a novel fault diagnosis method of rolling bearing using Spark-based parallel ant colony optimization (ACO)-K-Means clustering algorithm is proposed. Firstly, a Spark-based three-layer wavelet packet decomposition approach is developed to efficiently preprocess the running-state monitoring data to obtain eigenvectors, which are stored in Hadoop Distributed File System (HDFS) and served as the input of ACO-K-Means clustering algorithm. Secondly, ACO-K-Means clustering algorithm suitable for rolling bearing fault diagnosis is proposed to improve the diagnosis accuracy. ACO algorithm is adopted to obtain the global optimal initial clustering centers of K-Means from all eigenvectors, and the K-Means clustering algorithm based on weighted Euclidean distance is used to perform clustering analysis on all eigenvectors to obtain a rolling bearing fault diagnosis model. Thirdly, the efficient parallelization of ACO-K-Means clustering algorithm is implemented on a Spark platform, which can make full use of the computing resources of a cluster to efficiently process large-scale rolling bearing datasets in parallel. Extensive experiments are conducted to verify the effectiveness of the proposed fault diagnosis method. Experimental results show that the proposed method can not only achieve good fault diagnosis accuracy but also provide high model training efficiency and fault diagnosis efficiency in a big data environment. |
topic |
Ant colony optimization fault diagnosis K-Means clustering rolling bearing spark wavelet packet decomposition |
url |
https://ieeexplore.ieee.org/document/9353541/ |
work_keys_str_mv |
AT lanjunwan anovelbearingfaultdiagnosismethodusingsparkbasedparallelacokmeansclusteringalgorithm AT genzhang anovelbearingfaultdiagnosismethodusingsparkbasedparallelacokmeansclusteringalgorithm AT hongyangli anovelbearingfaultdiagnosismethodusingsparkbasedparallelacokmeansclusteringalgorithm AT changyunli anovelbearingfaultdiagnosismethodusingsparkbasedparallelacokmeansclusteringalgorithm AT lanjunwan novelbearingfaultdiagnosismethodusingsparkbasedparallelacokmeansclusteringalgorithm AT genzhang novelbearingfaultdiagnosismethodusingsparkbasedparallelacokmeansclusteringalgorithm AT hongyangli novelbearingfaultdiagnosismethodusingsparkbasedparallelacokmeansclusteringalgorithm AT changyunli novelbearingfaultdiagnosismethodusingsparkbasedparallelacokmeansclusteringalgorithm |
_version_ |
1721436118886383616 |