Laplacian Eigenmaps Dimensionality Reduction Based on Clustering-Adjusted Similarity

Euclidean distance between instances is widely used to capture the manifold structure of data and for graph-based dimensionality reduction. However, in some circumstances, the basic Euclidean distance cannot accurately capture the similarity between instances; some instances from different classes b...

Full description

Bibliographic Details
Main Authors: Honghu Zhou, Jun Wang
Format: Article
Language:English
Published: MDPI AG 2019-10-01
Series:Algorithms
Subjects:
Online Access:https://www.mdpi.com/1999-4893/12/10/210
id doaj-73926eb285fe4aec9a84fd22b64b00bb
record_format Article
spelling doaj-73926eb285fe4aec9a84fd22b64b00bb2020-11-24T22:09:34ZengMDPI AGAlgorithms1999-48932019-10-01121021010.3390/a12100210a12100210Laplacian Eigenmaps Dimensionality Reduction Based on Clustering-Adjusted SimilarityHonghu Zhou0Jun Wang1College of Computer and Information Science, Southwest University, Chongqing 400715, ChinaCollege of Computer and Information Science, Southwest University, Chongqing 400715, ChinaEuclidean distance between instances is widely used to capture the manifold structure of data and for graph-based dimensionality reduction. However, in some circumstances, the basic Euclidean distance cannot accurately capture the similarity between instances; some instances from different classes but close to the decision boundary may be close to each other, which may mislead the graph-based dimensionality reduction and compromise the performance. To mitigate this issue, in this paper, we proposed an approach called Laplacian Eigenmaps based on Clustering-Adjusted Similarity (LE-CAS). LE-CAS first performs clustering on all instances to explore the global structure and discrimination of instances, and quantifies the similarity between cluster centers. Then, it adjusts the similarity between pairwise instances by multiplying the similarity between centers of clusters, which these two instances respectively belong to. In this way, if two instances are from different clusters, the similarity between them is reduced; otherwise, it is unchanged. Finally, LE-CAS performs graph-based dimensionality reduction (via Laplacian Eigenmaps) based on the adjusted similarity. We conducted comprehensive empirical studies on UCI datasets and show that LE-CAS not only has a better performance than other relevant comparing methods, but also is more robust to input parameters.https://www.mdpi.com/1999-4893/12/10/210laplacian eigenmapsdimensionality reductionclustering-adjusted similarity
collection DOAJ
language English
format Article
sources DOAJ
author Honghu Zhou
Jun Wang
spellingShingle Honghu Zhou
Jun Wang
Laplacian Eigenmaps Dimensionality Reduction Based on Clustering-Adjusted Similarity
Algorithms
laplacian eigenmaps
dimensionality reduction
clustering-adjusted similarity
author_facet Honghu Zhou
Jun Wang
author_sort Honghu Zhou
title Laplacian Eigenmaps Dimensionality Reduction Based on Clustering-Adjusted Similarity
title_short Laplacian Eigenmaps Dimensionality Reduction Based on Clustering-Adjusted Similarity
title_full Laplacian Eigenmaps Dimensionality Reduction Based on Clustering-Adjusted Similarity
title_fullStr Laplacian Eigenmaps Dimensionality Reduction Based on Clustering-Adjusted Similarity
title_full_unstemmed Laplacian Eigenmaps Dimensionality Reduction Based on Clustering-Adjusted Similarity
title_sort laplacian eigenmaps dimensionality reduction based on clustering-adjusted similarity
publisher MDPI AG
series Algorithms
issn 1999-4893
publishDate 2019-10-01
description Euclidean distance between instances is widely used to capture the manifold structure of data and for graph-based dimensionality reduction. However, in some circumstances, the basic Euclidean distance cannot accurately capture the similarity between instances; some instances from different classes but close to the decision boundary may be close to each other, which may mislead the graph-based dimensionality reduction and compromise the performance. To mitigate this issue, in this paper, we proposed an approach called Laplacian Eigenmaps based on Clustering-Adjusted Similarity (LE-CAS). LE-CAS first performs clustering on all instances to explore the global structure and discrimination of instances, and quantifies the similarity between cluster centers. Then, it adjusts the similarity between pairwise instances by multiplying the similarity between centers of clusters, which these two instances respectively belong to. In this way, if two instances are from different clusters, the similarity between them is reduced; otherwise, it is unchanged. Finally, LE-CAS performs graph-based dimensionality reduction (via Laplacian Eigenmaps) based on the adjusted similarity. We conducted comprehensive empirical studies on UCI datasets and show that LE-CAS not only has a better performance than other relevant comparing methods, but also is more robust to input parameters.
topic laplacian eigenmaps
dimensionality reduction
clustering-adjusted similarity
url https://www.mdpi.com/1999-4893/12/10/210
work_keys_str_mv AT honghuzhou laplacianeigenmapsdimensionalityreductionbasedonclusteringadjustedsimilarity
AT junwang laplacianeigenmapsdimensionalityreductionbasedonclusteringadjustedsimilarity
_version_ 1725811182014562304