Applying Simplified Swarm Optimization for Solving Clustering Problem

博士 === 國立清華大學 === 工業工程與工程管理學系 === 104 === Data clustering is commonly employed in many disciplines. The aim of clustering is to partition a set of data into clusters, in which objects within the same cluster are similar and dissimilar to other objects that belong to different clusters. K-means (KM)...

Full description

Bibliographic Details
Main Authors: Lai, Chyh Ming, 賴智明
Other Authors: Yeh, Wei Chang
Format: Others
Language:en_US
Published: 2016
Online Access:http://ndltd.ncl.edu.tw/handle/39333513883467888886
id ndltd-TW-104NTHU5031036
record_format oai_dc
spelling ndltd-TW-104NTHU50310362017-07-16T04:29:10Z http://ndltd.ncl.edu.tw/handle/39333513883467888886 Applying Simplified Swarm Optimization for Solving Clustering Problem 應用簡群最佳化演算法求解資料分群問題 Lai, Chyh Ming 賴智明 博士 國立清華大學 工業工程與工程管理學系 104 Data clustering is commonly employed in many disciplines. The aim of clustering is to partition a set of data into clusters, in which objects within the same cluster are similar and dissimilar to other objects that belong to different clusters. K-means (KM) and K-harmonic-means (KHM) are two common and fundamental clustering methods because of their simplicity and efficiency. However, both of them suffer from some problems. This study presents two novel algorithms based on simplified swarm optimization to deal with the drawbacks of KM and KHM, respectively. In addition, with the advance of internet and information technologies, the data size is increasing explosively and many existing clustering approaches including KM and KHM are inefficiency for dealing with the large-size problem. For that, we propose a clustering framework by exploring the connection between principle component analysis and a novel random sampling technique into a procedure to increase the scalability of the proposed clustering algorithm. To empirically evaluate the performance of the proposed methods, all experiments are examined using real-world datasets, and corresponding results are compared with recent works in the literature. Yeh, Wei Chang 葉維彰 2016 學位論文 ; thesis 99 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 博士 === 國立清華大學 === 工業工程與工程管理學系 === 104 === Data clustering is commonly employed in many disciplines. The aim of clustering is to partition a set of data into clusters, in which objects within the same cluster are similar and dissimilar to other objects that belong to different clusters. K-means (KM) and K-harmonic-means (KHM) are two common and fundamental clustering methods because of their simplicity and efficiency. However, both of them suffer from some problems. This study presents two novel algorithms based on simplified swarm optimization to deal with the drawbacks of KM and KHM, respectively. In addition, with the advance of internet and information technologies, the data size is increasing explosively and many existing clustering approaches including KM and KHM are inefficiency for dealing with the large-size problem. For that, we propose a clustering framework by exploring the connection between principle component analysis and a novel random sampling technique into a procedure to increase the scalability of the proposed clustering algorithm. To empirically evaluate the performance of the proposed methods, all experiments are examined using real-world datasets, and corresponding results are compared with recent works in the literature.
author2 Yeh, Wei Chang
author_facet Yeh, Wei Chang
Lai, Chyh Ming
賴智明
author Lai, Chyh Ming
賴智明
spellingShingle Lai, Chyh Ming
賴智明
Applying Simplified Swarm Optimization for Solving Clustering Problem
author_sort Lai, Chyh Ming
title Applying Simplified Swarm Optimization for Solving Clustering Problem
title_short Applying Simplified Swarm Optimization for Solving Clustering Problem
title_full Applying Simplified Swarm Optimization for Solving Clustering Problem
title_fullStr Applying Simplified Swarm Optimization for Solving Clustering Problem
title_full_unstemmed Applying Simplified Swarm Optimization for Solving Clustering Problem
title_sort applying simplified swarm optimization for solving clustering problem
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/39333513883467888886
work_keys_str_mv AT laichyhming applyingsimplifiedswarmoptimizationforsolvingclusteringproblem
AT làizhìmíng applyingsimplifiedswarmoptimizationforsolvingclusteringproblem
AT laichyhming yīngyòngjiǎnqúnzuìjiāhuàyǎnsuànfǎqiújiězīliàofēnqúnwèntí
AT làizhìmíng yīngyòngjiǎnqúnzuìjiāhuàyǎnsuànfǎqiújiězīliàofēnqúnwèntí
_version_ 1718496194680848384