A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering Algorithm

K-Means clustering algorithm is a typical unsupervised learning method, and it has been widely used in the field of fault diagnosis. However, the traditional serial K-Means clustering algorithm is difficult to efficiently and accurately perform clustering analysis on the massive running-state monito...

Full description

Bibliographic Details
Main Authors: Lanjun Wan, Gen Zhang, Hongyang Li, Changyun Li
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9353541/
id doaj-5e7cfef24bd142f8bed58ca86e9d7b49
record_format Article
spelling doaj-5e7cfef24bd142f8bed58ca86e9d7b492021-05-19T23:03:30ZengIEEEIEEE Access2169-35362021-01-019287532876810.1109/ACCESS.2021.30592219353541A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering AlgorithmLanjun Wan0https://orcid.org/0000-0001-7236-3589Gen Zhang1Hongyang Li2Changyun Li3School of Computer Science, Hunan University of Technology, Zhuzhou, ChinaSchool of Computer Science, Hunan University of Technology, Zhuzhou, ChinaSchool of Computer Science, Hunan University of Technology, Zhuzhou, ChinaHunan Key Laboratory of Intelligent Information Perception and Processing Technology, Hunan University of Technology, Zhuzhou, ChinaK-Means clustering algorithm is a typical unsupervised learning method, and it has been widely used in the field of fault diagnosis. However, the traditional serial K-Means clustering algorithm is difficult to efficiently and accurately perform clustering analysis on the massive running-state monitoring data of rolling bearing. Therefore, a novel fault diagnosis method of rolling bearing using Spark-based parallel ant colony optimization (ACO)-K-Means clustering algorithm is proposed. Firstly, a Spark-based three-layer wavelet packet decomposition approach is developed to efficiently preprocess the running-state monitoring data to obtain eigenvectors, which are stored in Hadoop Distributed File System (HDFS) and served as the input of ACO-K-Means clustering algorithm. Secondly, ACO-K-Means clustering algorithm suitable for rolling bearing fault diagnosis is proposed to improve the diagnosis accuracy. ACO algorithm is adopted to obtain the global optimal initial clustering centers of K-Means from all eigenvectors, and the K-Means clustering algorithm based on weighted Euclidean distance is used to perform clustering analysis on all eigenvectors to obtain a rolling bearing fault diagnosis model. Thirdly, the efficient parallelization of ACO-K-Means clustering algorithm is implemented on a Spark platform, which can make full use of the computing resources of a cluster to efficiently process large-scale rolling bearing datasets in parallel. Extensive experiments are conducted to verify the effectiveness of the proposed fault diagnosis method. Experimental results show that the proposed method can not only achieve good fault diagnosis accuracy but also provide high model training efficiency and fault diagnosis efficiency in a big data environment.https://ieeexplore.ieee.org/document/9353541/Ant colony optimizationfault diagnosisK-Means clusteringrolling bearingsparkwavelet packet decomposition
collection DOAJ
language English
format Article
sources DOAJ
author Lanjun Wan
Gen Zhang
Hongyang Li
Changyun Li
spellingShingle Lanjun Wan
Gen Zhang
Hongyang Li
Changyun Li
A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering Algorithm
IEEE Access
Ant colony optimization
fault diagnosis
K-Means clustering
rolling bearing
spark
wavelet packet decomposition
author_facet Lanjun Wan
Gen Zhang
Hongyang Li
Changyun Li
author_sort Lanjun Wan
title A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering Algorithm
title_short A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering Algorithm
title_full A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering Algorithm
title_fullStr A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering Algorithm
title_full_unstemmed A Novel Bearing Fault Diagnosis Method Using Spark-Based Parallel ACO-K-Means Clustering Algorithm
title_sort novel bearing fault diagnosis method using spark-based parallel aco-k-means clustering algorithm
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description K-Means clustering algorithm is a typical unsupervised learning method, and it has been widely used in the field of fault diagnosis. However, the traditional serial K-Means clustering algorithm is difficult to efficiently and accurately perform clustering analysis on the massive running-state monitoring data of rolling bearing. Therefore, a novel fault diagnosis method of rolling bearing using Spark-based parallel ant colony optimization (ACO)-K-Means clustering algorithm is proposed. Firstly, a Spark-based three-layer wavelet packet decomposition approach is developed to efficiently preprocess the running-state monitoring data to obtain eigenvectors, which are stored in Hadoop Distributed File System (HDFS) and served as the input of ACO-K-Means clustering algorithm. Secondly, ACO-K-Means clustering algorithm suitable for rolling bearing fault diagnosis is proposed to improve the diagnosis accuracy. ACO algorithm is adopted to obtain the global optimal initial clustering centers of K-Means from all eigenvectors, and the K-Means clustering algorithm based on weighted Euclidean distance is used to perform clustering analysis on all eigenvectors to obtain a rolling bearing fault diagnosis model. Thirdly, the efficient parallelization of ACO-K-Means clustering algorithm is implemented on a Spark platform, which can make full use of the computing resources of a cluster to efficiently process large-scale rolling bearing datasets in parallel. Extensive experiments are conducted to verify the effectiveness of the proposed fault diagnosis method. Experimental results show that the proposed method can not only achieve good fault diagnosis accuracy but also provide high model training efficiency and fault diagnosis efficiency in a big data environment.
topic Ant colony optimization
fault diagnosis
K-Means clustering
rolling bearing
spark
wavelet packet decomposition
url https://ieeexplore.ieee.org/document/9353541/
work_keys_str_mv AT lanjunwan anovelbearingfaultdiagnosismethodusingsparkbasedparallelacokmeansclusteringalgorithm
AT genzhang anovelbearingfaultdiagnosismethodusingsparkbasedparallelacokmeansclusteringalgorithm
AT hongyangli anovelbearingfaultdiagnosismethodusingsparkbasedparallelacokmeansclusteringalgorithm
AT changyunli anovelbearingfaultdiagnosismethodusingsparkbasedparallelacokmeansclusteringalgorithm
AT lanjunwan novelbearingfaultdiagnosismethodusingsparkbasedparallelacokmeansclusteringalgorithm
AT genzhang novelbearingfaultdiagnosismethodusingsparkbasedparallelacokmeansclusteringalgorithm
AT hongyangli novelbearingfaultdiagnosismethodusingsparkbasedparallelacokmeansclusteringalgorithm
AT changyunli novelbearingfaultdiagnosismethodusingsparkbasedparallelacokmeansclusteringalgorithm
_version_ 1721436118886383616