VLSI Architectures for Clustering Algorithms

博士 === 國立臺灣師範大學 === 資訊工程研究所 === 100 === In this dissertation, several hardware architectures are proposed for various clustering algorithms, including the c-means, competitive learning, fuzzy c-means, and fuzzy c-means with spatial constraint algorithms. All these architectures have been implemented...

Full description

Bibliographic Details
Main Authors:	Hui-Ya Li, 李惠雅
Other Authors:	Wen-Jyi Hwang
Format:	Others
Language:	en_US
Published:	2011
Online Access:	http://ndltd.ncl.edu.tw/handle/15049414863456041584

id	ndltd-TW-100NTNU5392039
record_format	oai_dc
spelling	ndltd-TW-100NTNU53920392016-03-28T04:20:21Z http://ndltd.ncl.edu.tw/handle/15049414863456041584 VLSI Architectures for Clustering Algorithms 分群演算法之超大型積體電路架構研究 Hui-Ya Li 李惠雅博士國立臺灣師範大學資訊工程研究所 100 In this dissertation, several hardware architectures are proposed for various clustering algorithms, including the c-means, competitive learning, fuzzy c-means, and fuzzy c-means with spatial constraint algorithms. All these architectures have been implemented on field programmable gate array (FPGA) devices to construct system on programmable chip (SOPC) systems for clustering. Both the partitioning and centroid computation operations in the proposed c-means architecture are fully pipelined thus multiple training vectors can be concurrently processed. A lookup table based divider is employed to reduce the area cost and latency for centroid computation. Two kinds of hardware realization of competitive learning algorithm with k-winners-take-all (kWTA) activation are presented. In the first architecture, the k winners associating with an input vector are identified by a module performing partial distance search (PDS) in the wavelet domain. The neuron updating process is based on a hardware divider with simple table lookup operations. Both the partial distance search module and hardware divider adopt finite precision calculation for area cost reduction. Subspace search and multiple-coefficient accumulation techniques are also employed to reduce the computation latency for the PDS search. The second architecture is based on an efficient pipeline allowing kWTA competition processes associated with different training vectors to be performed concurrently. The pipeline architecture employs a novel codeword swapping scheme so that neurons failing the competition for a training vector are immediately available for the competitions for subsequent training vectors. The proposed fuzzy c-means architecture is an efficient parallel solution. The architecture reduces the area cost and computational complexity for membership coefficients and centroid computation by employing lookup table based dividers. The usual iterative operations for updating the membership matrix and cluster centroid are merged into one single updating process to evade the large storage requirement. Such architecture is also extended to for the implementation of fuzzy c-means with spatial constraint. In the architecture, lookup table based root operators are adapted to relax the restriction on the degree of fuzziness. Experimental results show that the proposed architectures are cost-effective, and can attain high speedup over other hardware or software implementations for large data sets and/or large number of clusters. Wen-Jyi Hwang 黃文吉 2011 學位論文 ; thesis 183 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	博士 === 國立臺灣師範大學 === 資訊工程研究所 === 100 === In this dissertation, several hardware architectures are proposed for various clustering algorithms, including the c-means, competitive learning, fuzzy c-means, and fuzzy c-means with spatial constraint algorithms. All these architectures have been implemented on field programmable gate array (FPGA) devices to construct system on programmable chip (SOPC) systems for clustering. Both the partitioning and centroid computation operations in the proposed c-means architecture are fully pipelined thus multiple training vectors can be concurrently processed. A lookup table based divider is employed to reduce the area cost and latency for centroid computation. Two kinds of hardware realization of competitive learning algorithm with k-winners-take-all (kWTA) activation are presented. In the first architecture, the k winners associating with an input vector are identified by a module performing partial distance search (PDS) in the wavelet domain. The neuron updating process is based on a hardware divider with simple table lookup operations. Both the partial distance search module and hardware divider adopt finite precision calculation for area cost reduction. Subspace search and multiple-coefficient accumulation techniques are also employed to reduce the computation latency for the PDS search. The second architecture is based on an efficient pipeline allowing kWTA competition processes associated with different training vectors to be performed concurrently. The pipeline architecture employs a novel codeword swapping scheme so that neurons failing the competition for a training vector are immediately available for the competitions for subsequent training vectors. The proposed fuzzy c-means architecture is an efficient parallel solution. The architecture reduces the area cost and computational complexity for membership coefficients and centroid computation by employing lookup table based dividers. The usual iterative operations for updating the membership matrix and cluster centroid are merged into one single updating process to evade the large storage requirement. Such architecture is also extended to for the implementation of fuzzy c-means with spatial constraint. In the architecture, lookup table based root operators are adapted to relax the restriction on the degree of fuzziness. Experimental results show that the proposed architectures are cost-effective, and can attain high speedup over other hardware or software implementations for large data sets and/or large number of clusters.
author2	Wen-Jyi Hwang
author_facet	Wen-Jyi Hwang Hui-Ya Li 李惠雅
author	Hui-Ya Li 李惠雅
spellingShingle	Hui-Ya Li 李惠雅 VLSI Architectures for Clustering Algorithms
author_sort	Hui-Ya Li
title	VLSI Architectures for Clustering Algorithms
title_short	VLSI Architectures for Clustering Algorithms
title_full	VLSI Architectures for Clustering Algorithms
title_fullStr	VLSI Architectures for Clustering Algorithms
title_full_unstemmed	VLSI Architectures for Clustering Algorithms
title_sort	vlsi architectures for clustering algorithms
publishDate	2011
url	http://ndltd.ncl.edu.tw/handle/15049414863456041584
work_keys_str_mv	AT huiyali vlsiarchitecturesforclusteringalgorithms AT lǐhuìyǎ vlsiarchitecturesforclusteringalgorithms AT huiyali fēnqúnyǎnsuànfǎzhīchāodàxíngjītǐdiànlùjiàgòuyánjiū AT lǐhuìyǎ fēnqúnyǎnsuànfǎzhīchāodàxíngjītǐdiànlùjiàgòuyánjiū
_version_	1718213016003018752

VLSI Architectures for Clustering Algorithms

Similar Items