An effective and efficient approach to detect arbitrary patterns in clusters with noises in very large databases

碩士 === 國立屏東科技大學 === 資訊管理系 === 94 === Student ID:N9356001 Title of Thesis:An effective and efficient approach to detect arbitrary patterns in clusters with noises in very large databases Page:80 Name of Institute:Graduate Institute of Management Information Systems, National Pingtung University of S...

Full description

Bibliographic Details
Main Authors: Tzu-Ping Wang, 王子評
Other Authors: Cheng-Fa Tsai
Format: Others
Language:zh-TW
Published: 2006
Online Access:http://ndltd.ncl.edu.tw/handle/17182283599418965293
id ndltd-TW-094NPUST396025
record_format oai_dc
spelling ndltd-TW-094NPUST3960252016-12-22T04:10:53Z http://ndltd.ncl.edu.tw/handle/17182283599418965293 An effective and efficient approach to detect arbitrary patterns in clusters with noises in very large databases 一種有效及有效率的方法從含有雜訊的大型資料集中偵測任意圖樣的群集 Tzu-Ping Wang 王子評 碩士 國立屏東科技大學 資訊管理系 94 Student ID:N9356001 Title of Thesis:An effective and efficient approach to detect arbitrary patterns in clusters with noises in very large databases Page:80 Name of Institute:Graduate Institute of Management Information Systems, National Pingtung University of Science and Technology Graduate Date:June, 2006 Degree Conferred:Master Name of Student:Tzu-Ping Wang Advisor:Dr. Cheng-Fa Tsai Abstract: In the domain of data mining, data clustering technology involves numerous famous algorithms. However, these algorithms suffer some limitations. For instance: 1) scalability, 2) the ability of processing noises, or 3) the detective ability of arbitrary patterns cluster, and so on. This thesis proposes a new clustering algorithm named GDH; meanwhile, it can solve the difficulties of various well-known algorithms which encounters in the above three difficulties. The thesis integrates into theories proposed by many scholars, and raises 1) the concept of gradient decrease and 2) the technology of sliding window for increasing the effectiveness of detecting arbitrary patterns cluster. The thesis compares the time complexity with the proposed GDH algorithm and K-MEANS (Partitioning-Based Method), CLIQUE (Grid-Based Method) as well as DBSCAN (Density–Based Method) which were published in the famous SCI journals. Furthermore, the thesis compares the clustering quality and the effectiveness with the proposed GDH algorithm and BIRCH, CURE (Hierarchical Method) as well as CLIQUE. According to the simulation results, it is observed that the proposed GDH approach can deal with clusters with arbitrary patterns effectively through the fine-tuning technology and gradient decrease method. In addition, it is exactly better than K-MEANS, BIRCH, CURE, CLIQUE, and DBSCAN in effectiveness and efficiency comparisons. Keyword:data mining , data clustering, gradient decrease, sliding window Cheng-Fa Tsai 蔡正發 2006 學位論文 ; thesis 80 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立屏東科技大學 === 資訊管理系 === 94 === Student ID:N9356001 Title of Thesis:An effective and efficient approach to detect arbitrary patterns in clusters with noises in very large databases Page:80 Name of Institute:Graduate Institute of Management Information Systems, National Pingtung University of Science and Technology Graduate Date:June, 2006 Degree Conferred:Master Name of Student:Tzu-Ping Wang Advisor:Dr. Cheng-Fa Tsai Abstract: In the domain of data mining, data clustering technology involves numerous famous algorithms. However, these algorithms suffer some limitations. For instance: 1) scalability, 2) the ability of processing noises, or 3) the detective ability of arbitrary patterns cluster, and so on. This thesis proposes a new clustering algorithm named GDH; meanwhile, it can solve the difficulties of various well-known algorithms which encounters in the above three difficulties. The thesis integrates into theories proposed by many scholars, and raises 1) the concept of gradient decrease and 2) the technology of sliding window for increasing the effectiveness of detecting arbitrary patterns cluster. The thesis compares the time complexity with the proposed GDH algorithm and K-MEANS (Partitioning-Based Method), CLIQUE (Grid-Based Method) as well as DBSCAN (Density–Based Method) which were published in the famous SCI journals. Furthermore, the thesis compares the clustering quality and the effectiveness with the proposed GDH algorithm and BIRCH, CURE (Hierarchical Method) as well as CLIQUE. According to the simulation results, it is observed that the proposed GDH approach can deal with clusters with arbitrary patterns effectively through the fine-tuning technology and gradient decrease method. In addition, it is exactly better than K-MEANS, BIRCH, CURE, CLIQUE, and DBSCAN in effectiveness and efficiency comparisons. Keyword:data mining , data clustering, gradient decrease, sliding window
author2 Cheng-Fa Tsai
author_facet Cheng-Fa Tsai
Tzu-Ping Wang
王子評
author Tzu-Ping Wang
王子評
spellingShingle Tzu-Ping Wang
王子評
An effective and efficient approach to detect arbitrary patterns in clusters with noises in very large databases
author_sort Tzu-Ping Wang
title An effective and efficient approach to detect arbitrary patterns in clusters with noises in very large databases
title_short An effective and efficient approach to detect arbitrary patterns in clusters with noises in very large databases
title_full An effective and efficient approach to detect arbitrary patterns in clusters with noises in very large databases
title_fullStr An effective and efficient approach to detect arbitrary patterns in clusters with noises in very large databases
title_full_unstemmed An effective and efficient approach to detect arbitrary patterns in clusters with noises in very large databases
title_sort effective and efficient approach to detect arbitrary patterns in clusters with noises in very large databases
publishDate 2006
url http://ndltd.ncl.edu.tw/handle/17182283599418965293
work_keys_str_mv AT tzupingwang aneffectiveandefficientapproachtodetectarbitrarypatternsinclusterswithnoisesinverylargedatabases
AT wángzipíng aneffectiveandefficientapproachtodetectarbitrarypatternsinclusterswithnoisesinverylargedatabases
AT tzupingwang yīzhǒngyǒuxiàojíyǒuxiàolǜdefāngfǎcónghányǒuzáxùndedàxíngzīliàojízhōngzhēncèrènyìtúyàngdeqúnjí
AT wángzipíng yīzhǒngyǒuxiàojíyǒuxiàolǜdefāngfǎcónghányǒuzáxùndedàxíngzīliàojízhōngzhēncèrènyìtúyàngdeqúnjí
AT tzupingwang effectiveandefficientapproachtodetectarbitrarypatternsinclusterswithnoisesinverylargedatabases
AT wángzipíng effectiveandefficientapproachtodetectarbitrarypatternsinclusterswithnoisesinverylargedatabases
_version_ 1718401910660136960