How the Outliers Influence the Quality of Clustering?

In this article, we evaluate the efficiency and performance of two clustering algorithms: AHC (Agglomerative Hierarchical Clustering) and K − Means. We are aware that there are various linkage options and distance measures that influence the clustering results. We assess the quality of clustering us...

Full description

Bibliographic Details
Main Authors: Gaibei, I. (Author), Nowak-Brzezińska, A. (Author)
Format: Article
Language:English
Published: MDPI 2022
Subjects:
AHC
Online Access:View Fulltext in Publisher
LEADER 01617nam a2200205Ia 4500
001 10.3390-e24070917
008 220718s2022 CNT 000 0 und d
020 |a 10994300 (ISSN) 
245 1 0 |a How the Outliers Influence the Quality of Clustering? 
260 0 |b MDPI  |c 2022 
856 |z View Fulltext in Publisher  |u https://doi.org/10.3390/e24070917 
520 3 |a In this article, we evaluate the efficiency and performance of two clustering algorithms: AHC (Agglomerative Hierarchical Clustering) and K − Means. We are aware that there are various linkage options and distance measures that influence the clustering results. We assess the quality of clustering using the Davies–Bouldin and Dunn cluster validity indexes. The main contribution of this research is to verify whether the quality of clusters without outliers is higher than those with outliers in the data. To do this, we compare and analyze outlier detection algorithms depending on the applied clustering algorithm. In our research, we use and compare the LOF (Local Outlier Factor) and COF (Connectivity-based Outlier Factor) algorithms for detecting outliers before and after removing 1%, 5%, and 10% of outliers. Next, we analyze how the quality of clustering has improved. In the experiments, three real data sets were used with a different number of instances. © 2022 by the authors. Licensee MDPI, Basel, Switzerland. 
650 0 4 |a AHC 
650 0 4 |a clustering 
650 0 4 |a clustering quality indexes 
650 0 4 |a k-Means 
650 0 4 |a outlier detection 
700 1 |a Gaibei, I.  |e author 
700 1 |a Nowak-Brzezińska, A.  |e author 
773 |t Entropy