Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation

博士 === 國立中興大學 === 資訊科學與工程學系所 === 101 === In many real-life applications nowadays, the data are present in the form of continuous streams, which has made information retrieval and knowledge discovery more difficult and challenging. In this dissertation, we study the practical problem of discovering f...

Full description

Bibliographic Details
Main Authors:	Chao-Wei Li, 李兆偉
Other Authors:	Kuen-Fang Jea
Format:	Others
Language:	en_US
Published:	2013
Online Access:	http://ndltd.ncl.edu.tw/handle/74852547458965418836

id	ndltd-TW-101NCHU5394041
record_format	oai_dc
spelling	ndltd-TW-101NCHU53940412015-10-13T22:35:49Z http://ndltd.ncl.edu.tw/handle/74852547458965418836 Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation 使用支持度估算方法於動態資料串流探勘頻繁樣式 Chao-Wei Li 李兆偉博士國立中興大學資訊科學與工程學系所 101 In many real-life applications nowadays, the data are present in the form of continuous streams, which has made information retrieval and knowledge discovery more difficult and challenging. In this dissertation, we study the practical problem of discovering frequent patterns (large itemsets) from streaming data, namely data-stream frequent pattern mining. We also tackle some important issues concerning data stream mining, which include data overloads and concept drifts. Our methodology to the studied problem is called support approximation. It obtains the frequency counts of itemsets by fast estimation instead of enumeration and counting. The basis of support approximation is the frequency relationships between itemsets and itemsets. We have proposed approaches of modeling the frequency relationships in order to construct support approximation functions. As to the issue of data overload, we devise two mechanisms with respective ideas for lessening the workload of the mining system. With regard to the issue of concept drift, we deal with it by means of concept description and concept learning, which are related to the principle of support approximation. According to our experimental evaluations, the proposed method could have high efficiency in streaming data processing, and it could achieve reasonable mining accuracy. Besides, to a certain extent, the issues of concept drift and data overload could be addressed adequately. Kuen-Fang Jea 賈坤芳 2013 學位論文 ; thesis 93 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	博士 === 國立中興大學 === 資訊科學與工程學系所 === 101 === In many real-life applications nowadays, the data are present in the form of continuous streams, which has made information retrieval and knowledge discovery more difficult and challenging. In this dissertation, we study the practical problem of discovering frequent patterns (large itemsets) from streaming data, namely data-stream frequent pattern mining. We also tackle some important issues concerning data stream mining, which include data overloads and concept drifts. Our methodology to the studied problem is called support approximation. It obtains the frequency counts of itemsets by fast estimation instead of enumeration and counting. The basis of support approximation is the frequency relationships between itemsets and itemsets. We have proposed approaches of modeling the frequency relationships in order to construct support approximation functions. As to the issue of data overload, we devise two mechanisms with respective ideas for lessening the workload of the mining system. With regard to the issue of concept drift, we deal with it by means of concept description and concept learning, which are related to the principle of support approximation. According to our experimental evaluations, the proposed method could have high efficiency in streaming data processing, and it could achieve reasonable mining accuracy. Besides, to a certain extent, the issues of concept drift and data overload could be addressed adequately.
author2	Kuen-Fang Jea
author_facet	Kuen-Fang Jea Chao-Wei Li 李兆偉
author	Chao-Wei Li 李兆偉
spellingShingle	Chao-Wei Li 李兆偉 Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation
author_sort	Chao-Wei Li
title	Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation
title_short	Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation
title_full	Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation
title_fullStr	Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation
title_full_unstemmed	Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation
title_sort	mining frequent patterns from dynamic data streams using support approximation
publishDate	2013
url	http://ndltd.ncl.edu.tw/handle/74852547458965418836
work_keys_str_mv	AT chaoweili miningfrequentpatternsfromdynamicdatastreamsusingsupportapproximation AT lǐzhàowěi miningfrequentpatternsfromdynamicdatastreamsusingsupportapproximation AT chaoweili shǐyòngzhīchídùgūsuànfāngfǎyúdòngtàizīliàochuànliútànkānpínfányàngshì AT lǐzhàowěi shǐyòngzhīchídùgūsuànfāngfǎyúdòngtàizīliàochuànliútànkānpínfányàngshì
_version_	1718078669808730112

Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation

Similar Items