Density Peaks Clustering Based on Weighted Local Density Sequence and Nearest Neighbor Assignment

Density peaks clustering (DPC) is a density-based clustering algorithm with excellent clustering performance including accuracy, automatically detecting the number of clusters, and identifying center points. However, the local density of DPC strongly depends on the cutoff distance which must be pres...

Full description

Bibliographic Details
Main Authors: Donghua Yu, Guojun Liu, Maozu Guo, Xiaoyan Liu, Shuang Yao
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8665962/
id doaj-d3818ba7d48f4f3baa2103b8b8de794e
record_format Article
spelling doaj-d3818ba7d48f4f3baa2103b8b8de794e2021-03-29T22:55:22ZengIEEEIEEE Access2169-35362019-01-017343013431710.1109/ACCESS.2019.29042548665962Density Peaks Clustering Based on Weighted Local Density Sequence and Nearest Neighbor AssignmentDonghua Yu0https://orcid.org/0000-0003-0464-9205Guojun Liu1Maozu Guo2Xiaoyan Liu3Shuang Yao4School of Computer Science and Technology, Harbin Institute of Technology, Harbin, ChinaSchool of Computer Science and Technology, Harbin Institute of Technology, Harbin, ChinaSchool of Computer Science and Technology, Harbin Institute of Technology, Harbin, ChinaSchool of Computer Science and Technology, Harbin Institute of Technology, Harbin, ChinaCollege of Economics and Management, China Jiliang University, Hangzhou, ChinaDensity peaks clustering (DPC) is a density-based clustering algorithm with excellent clustering performance including accuracy, automatically detecting the number of clusters, and identifying center points. However, the local density of DPC strongly depends on the cutoff distance which must be prespecified; in addition, the strategy assigns each remaining point to the same cluster as its nearest neighbor of higher density in descending order of local density, which is likely to cause cluster label error propagation. To overcome these limitations, we propose an improved DPC by introducing weighted local density sequence and two-stage assignment strategies, called DPCSA. Many previous improved DPC algorithms neglect additional complexity, whereas DPCSA incorporates the nearest neighbor dynamic table to enhance clustering efficiency. The experimental results for 12 artificial and 11 real-world datasets, including Olivetti face, verify that the DPCSA clustering performance is significantly superior to DPC and DPC via heat diffusion (HDDPC), and slightly superior to fuzzy weighted k-nearest neighbors density peak clustering (FKNNDPC). In addition, the DPCSA is more computationally efficient than FKNNDPC and HDDPC, but less than DPC. The source code of DPCSA is available at https://www.github.com/Yu123456/DPCSA.https://ieeexplore.ieee.org/document/8665962/Cluster analysisdensity peaksK-nearest neighborslocal densitynearest neighbor dynamic table
collection DOAJ
language English
format Article
sources DOAJ
author Donghua Yu
Guojun Liu
Maozu Guo
Xiaoyan Liu
Shuang Yao
spellingShingle Donghua Yu
Guojun Liu
Maozu Guo
Xiaoyan Liu
Shuang Yao
Density Peaks Clustering Based on Weighted Local Density Sequence and Nearest Neighbor Assignment
IEEE Access
Cluster analysis
density peaks
K-nearest neighbors
local density
nearest neighbor dynamic table
author_facet Donghua Yu
Guojun Liu
Maozu Guo
Xiaoyan Liu
Shuang Yao
author_sort Donghua Yu
title Density Peaks Clustering Based on Weighted Local Density Sequence and Nearest Neighbor Assignment
title_short Density Peaks Clustering Based on Weighted Local Density Sequence and Nearest Neighbor Assignment
title_full Density Peaks Clustering Based on Weighted Local Density Sequence and Nearest Neighbor Assignment
title_fullStr Density Peaks Clustering Based on Weighted Local Density Sequence and Nearest Neighbor Assignment
title_full_unstemmed Density Peaks Clustering Based on Weighted Local Density Sequence and Nearest Neighbor Assignment
title_sort density peaks clustering based on weighted local density sequence and nearest neighbor assignment
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Density peaks clustering (DPC) is a density-based clustering algorithm with excellent clustering performance including accuracy, automatically detecting the number of clusters, and identifying center points. However, the local density of DPC strongly depends on the cutoff distance which must be prespecified; in addition, the strategy assigns each remaining point to the same cluster as its nearest neighbor of higher density in descending order of local density, which is likely to cause cluster label error propagation. To overcome these limitations, we propose an improved DPC by introducing weighted local density sequence and two-stage assignment strategies, called DPCSA. Many previous improved DPC algorithms neglect additional complexity, whereas DPCSA incorporates the nearest neighbor dynamic table to enhance clustering efficiency. The experimental results for 12 artificial and 11 real-world datasets, including Olivetti face, verify that the DPCSA clustering performance is significantly superior to DPC and DPC via heat diffusion (HDDPC), and slightly superior to fuzzy weighted k-nearest neighbors density peak clustering (FKNNDPC). In addition, the DPCSA is more computationally efficient than FKNNDPC and HDDPC, but less than DPC. The source code of DPCSA is available at https://www.github.com/Yu123456/DPCSA.
topic Cluster analysis
density peaks
K-nearest neighbors
local density
nearest neighbor dynamic table
url https://ieeexplore.ieee.org/document/8665962/
work_keys_str_mv AT donghuayu densitypeaksclusteringbasedonweightedlocaldensitysequenceandnearestneighborassignment
AT guojunliu densitypeaksclusteringbasedonweightedlocaldensitysequenceandnearestneighborassignment
AT maozuguo densitypeaksclusteringbasedonweightedlocaldensitysequenceandnearestneighborassignment
AT xiaoyanliu densitypeaksclusteringbasedonweightedlocaldensitysequenceandnearestneighborassignment
AT shuangyao densitypeaksclusteringbasedonweightedlocaldensitysequenceandnearestneighborassignment
_version_ 1724190601538174976