Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation

博士 === 國立中興大學 === 資訊科學與工程學系所 === 101 === In many real-life applications nowadays, the data are present in the form of continuous streams, which has made information retrieval and knowledge discovery more difficult and challenging. In this dissertation, we study the practical problem of discovering f...

Full description

Bibliographic Details
Main Authors: Chao-Wei Li, 李兆偉
Other Authors: Kuen-Fang Jea
Format: Others
Language:en_US
Published: 2013
Online Access:http://ndltd.ncl.edu.tw/handle/74852547458965418836
id ndltd-TW-101NCHU5394041
record_format oai_dc
spelling ndltd-TW-101NCHU53940412015-10-13T22:35:49Z http://ndltd.ncl.edu.tw/handle/74852547458965418836 Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation 使用支持度估算方法於動態資料串流探勘頻繁樣式 Chao-Wei Li 李兆偉 博士 國立中興大學 資訊科學與工程學系所 101 In many real-life applications nowadays, the data are present in the form of continuous streams, which has made information retrieval and knowledge discovery more difficult and challenging. In this dissertation, we study the practical problem of discovering frequent patterns (large itemsets) from streaming data, namely data-stream frequent pattern mining. We also tackle some important issues concerning data stream mining, which include data overloads and concept drifts. Our methodology to the studied problem is called support approximation. It obtains the frequency counts of itemsets by fast estimation instead of enumeration and counting. The basis of support approximation is the frequency relationships between itemsets and itemsets. We have proposed approaches of modeling the frequency relationships in order to construct support approximation functions. As to the issue of data overload, we devise two mechanisms with respective ideas for lessening the workload of the mining system. With regard to the issue of concept drift, we deal with it by means of concept description and concept learning, which are related to the principle of support approximation. According to our experimental evaluations, the proposed method could have high efficiency in streaming data processing, and it could achieve reasonable mining accuracy. Besides, to a certain extent, the issues of concept drift and data overload could be addressed adequately. Kuen-Fang Jea 賈坤芳 2013 學位論文 ; thesis 93 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 博士 === 國立中興大學 === 資訊科學與工程學系所 === 101 === In many real-life applications nowadays, the data are present in the form of continuous streams, which has made information retrieval and knowledge discovery more difficult and challenging. In this dissertation, we study the practical problem of discovering frequent patterns (large itemsets) from streaming data, namely data-stream frequent pattern mining. We also tackle some important issues concerning data stream mining, which include data overloads and concept drifts. Our methodology to the studied problem is called support approximation. It obtains the frequency counts of itemsets by fast estimation instead of enumeration and counting. The basis of support approximation is the frequency relationships between itemsets and itemsets. We have proposed approaches of modeling the frequency relationships in order to construct support approximation functions. As to the issue of data overload, we devise two mechanisms with respective ideas for lessening the workload of the mining system. With regard to the issue of concept drift, we deal with it by means of concept description and concept learning, which are related to the principle of support approximation. According to our experimental evaluations, the proposed method could have high efficiency in streaming data processing, and it could achieve reasonable mining accuracy. Besides, to a certain extent, the issues of concept drift and data overload could be addressed adequately.
author2 Kuen-Fang Jea
author_facet Kuen-Fang Jea
Chao-Wei Li
李兆偉
author Chao-Wei Li
李兆偉
spellingShingle Chao-Wei Li
李兆偉
Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation
author_sort Chao-Wei Li
title Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation
title_short Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation
title_full Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation
title_fullStr Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation
title_full_unstemmed Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation
title_sort mining frequent patterns from dynamic data streams using support approximation
publishDate 2013
url http://ndltd.ncl.edu.tw/handle/74852547458965418836
work_keys_str_mv AT chaoweili miningfrequentpatternsfromdynamicdatastreamsusingsupportapproximation
AT lǐzhàowěi miningfrequentpatternsfromdynamicdatastreamsusingsupportapproximation
AT chaoweili shǐyòngzhīchídùgūsuànfāngfǎyúdòngtàizīliàochuànliútànkānpínfányàngshì
AT lǐzhàowěi shǐyòngzhīchídùgūsuànfāngfǎyúdòngtàizīliàochuànliútànkānpínfányàngshì
_version_ 1718078669808730112