VLSI Architectures for Clustering Algorithms

博士 === 國立臺灣師範大學 === 資訊工程研究所 === 100 === In this dissertation, several hardware architectures are proposed for various clustering algorithms, including the c-means, competitive learning, fuzzy c-means, and fuzzy c-means with spatial constraint algorithms. All these architectures have been implemented...

Full description

Bibliographic Details
Main Authors: Hui-Ya Li, 李惠雅
Other Authors: Wen-Jyi Hwang
Format: Others
Language:en_US
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/15049414863456041584
id ndltd-TW-100NTNU5392039
record_format oai_dc
spelling ndltd-TW-100NTNU53920392016-03-28T04:20:21Z http://ndltd.ncl.edu.tw/handle/15049414863456041584 VLSI Architectures for Clustering Algorithms 分群演算法之超大型積體電路架構研究 Hui-Ya Li 李惠雅 博士 國立臺灣師範大學 資訊工程研究所 100 In this dissertation, several hardware architectures are proposed for various clustering algorithms, including the c-means, competitive learning, fuzzy c-means, and fuzzy c-means with spatial constraint algorithms. All these architectures have been implemented on field programmable gate array (FPGA) devices to construct system on programmable chip (SOPC) systems for clustering. Both the partitioning and centroid computation operations in the proposed c-means architecture are fully pipelined thus multiple training vectors can be concurrently processed. A lookup table based divider is employed to reduce the area cost and latency for centroid computation. Two kinds of hardware realization of competitive learning algorithm with k-winners-take-all (kWTA) activation are presented. In the first architecture, the k winners associating with an input vector are identified by a module performing partial distance search (PDS) in the wavelet domain. The neuron updating process is based on a hardware divider with simple table lookup operations. Both the partial distance search module and hardware divider adopt finite precision calculation for area cost reduction. Subspace search and multiple-coefficient accumulation techniques are also employed to reduce the computation latency for the PDS search. The second architecture is based on an efficient pipeline allowing kWTA competition processes associated with different training vectors to be performed concurrently. The pipeline architecture employs a novel codeword swapping scheme so that neurons failing the competition for a training vector are immediately available for the competitions for subsequent training vectors. The proposed fuzzy c-means architecture is an efficient parallel solution. The architecture reduces the area cost and computational complexity for membership coefficients and centroid computation by employing lookup table based dividers. The usual iterative operations for updating the membership matrix and cluster centroid are merged into one single updating process to evade the large storage requirement. Such architecture is also extended to for the implementation of fuzzy c-means with spatial constraint. In the architecture, lookup table based root operators are adapted to relax the restriction on the degree of fuzziness. Experimental results show that the proposed architectures are cost-effective, and can attain high speedup over other hardware or software implementations for large data sets and/or large number of clusters. Wen-Jyi Hwang 黃文吉 2011 學位論文 ; thesis 183 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 博士 === 國立臺灣師範大學 === 資訊工程研究所 === 100 === In this dissertation, several hardware architectures are proposed for various clustering algorithms, including the c-means, competitive learning, fuzzy c-means, and fuzzy c-means with spatial constraint algorithms. All these architectures have been implemented on field programmable gate array (FPGA) devices to construct system on programmable chip (SOPC) systems for clustering. Both the partitioning and centroid computation operations in the proposed c-means architecture are fully pipelined thus multiple training vectors can be concurrently processed. A lookup table based divider is employed to reduce the area cost and latency for centroid computation. Two kinds of hardware realization of competitive learning algorithm with k-winners-take-all (kWTA) activation are presented. In the first architecture, the k winners associating with an input vector are identified by a module performing partial distance search (PDS) in the wavelet domain. The neuron updating process is based on a hardware divider with simple table lookup operations. Both the partial distance search module and hardware divider adopt finite precision calculation for area cost reduction. Subspace search and multiple-coefficient accumulation techniques are also employed to reduce the computation latency for the PDS search. The second architecture is based on an efficient pipeline allowing kWTA competition processes associated with different training vectors to be performed concurrently. The pipeline architecture employs a novel codeword swapping scheme so that neurons failing the competition for a training vector are immediately available for the competitions for subsequent training vectors. The proposed fuzzy c-means architecture is an efficient parallel solution. The architecture reduces the area cost and computational complexity for membership coefficients and centroid computation by employing lookup table based dividers. The usual iterative operations for updating the membership matrix and cluster centroid are merged into one single updating process to evade the large storage requirement. Such architecture is also extended to for the implementation of fuzzy c-means with spatial constraint. In the architecture, lookup table based root operators are adapted to relax the restriction on the degree of fuzziness. Experimental results show that the proposed architectures are cost-effective, and can attain high speedup over other hardware or software implementations for large data sets and/or large number of clusters.
author2 Wen-Jyi Hwang
author_facet Wen-Jyi Hwang
Hui-Ya Li
李惠雅
author Hui-Ya Li
李惠雅
spellingShingle Hui-Ya Li
李惠雅
VLSI Architectures for Clustering Algorithms
author_sort Hui-Ya Li
title VLSI Architectures for Clustering Algorithms
title_short VLSI Architectures for Clustering Algorithms
title_full VLSI Architectures for Clustering Algorithms
title_fullStr VLSI Architectures for Clustering Algorithms
title_full_unstemmed VLSI Architectures for Clustering Algorithms
title_sort vlsi architectures for clustering algorithms
publishDate 2011
url http://ndltd.ncl.edu.tw/handle/15049414863456041584
work_keys_str_mv AT huiyali vlsiarchitecturesforclusteringalgorithms
AT lǐhuìyǎ vlsiarchitecturesforclusteringalgorithms
AT huiyali fēnqúnyǎnsuànfǎzhīchāodàxíngjītǐdiànlùjiàgòuyánjiū
AT lǐhuìyǎ fēnqúnyǎnsuànfǎzhīchāodàxíngjītǐdiànlùjiàgòuyánjiū
_version_ 1718213016003018752