Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation
博士 === 國立中興大學 === 資訊科學與工程學系所 === 101 === In many real-life applications nowadays, the data are present in the form of continuous streams, which has made information retrieval and knowledge discovery more difficult and challenging. In this dissertation, we study the practical problem of discovering f...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2013
|
Online Access: | http://ndltd.ncl.edu.tw/handle/74852547458965418836 |
id |
ndltd-TW-101NCHU5394041 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-101NCHU53940412015-10-13T22:35:49Z http://ndltd.ncl.edu.tw/handle/74852547458965418836 Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation 使用支持度估算方法於動態資料串流探勘頻繁樣式 Chao-Wei Li 李兆偉 博士 國立中興大學 資訊科學與工程學系所 101 In many real-life applications nowadays, the data are present in the form of continuous streams, which has made information retrieval and knowledge discovery more difficult and challenging. In this dissertation, we study the practical problem of discovering frequent patterns (large itemsets) from streaming data, namely data-stream frequent pattern mining. We also tackle some important issues concerning data stream mining, which include data overloads and concept drifts. Our methodology to the studied problem is called support approximation. It obtains the frequency counts of itemsets by fast estimation instead of enumeration and counting. The basis of support approximation is the frequency relationships between itemsets and itemsets. We have proposed approaches of modeling the frequency relationships in order to construct support approximation functions. As to the issue of data overload, we devise two mechanisms with respective ideas for lessening the workload of the mining system. With regard to the issue of concept drift, we deal with it by means of concept description and concept learning, which are related to the principle of support approximation. According to our experimental evaluations, the proposed method could have high efficiency in streaming data processing, and it could achieve reasonable mining accuracy. Besides, to a certain extent, the issues of concept drift and data overload could be addressed adequately. Kuen-Fang Jea 賈坤芳 2013 學位論文 ; thesis 93 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
博士 === 國立中興大學 === 資訊科學與工程學系所 === 101 === In many real-life applications nowadays, the data are present in the form of continuous streams, which has made information retrieval and knowledge discovery more difficult and challenging. In this dissertation, we study the practical problem of discovering frequent patterns (large itemsets) from streaming data, namely data-stream frequent pattern mining. We also tackle some important issues concerning data stream mining, which include data overloads and concept drifts. Our methodology to the studied problem is called support approximation. It obtains the frequency counts of itemsets by fast estimation instead of enumeration and counting. The basis of support approximation is the frequency relationships between itemsets and itemsets. We have proposed approaches of modeling the frequency relationships in order to construct support approximation functions. As to the issue of data overload, we devise two mechanisms with respective ideas for lessening the workload of the mining system. With regard to the issue of concept drift, we deal with it by means of concept description and concept learning, which are related to the principle of support approximation. According to our experimental evaluations, the proposed method could have high efficiency in streaming data processing, and it could achieve reasonable mining accuracy. Besides, to a certain extent, the issues of concept drift and data overload could be addressed adequately.
|
author2 |
Kuen-Fang Jea |
author_facet |
Kuen-Fang Jea Chao-Wei Li 李兆偉 |
author |
Chao-Wei Li 李兆偉 |
spellingShingle |
Chao-Wei Li 李兆偉 Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation |
author_sort |
Chao-Wei Li |
title |
Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation |
title_short |
Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation |
title_full |
Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation |
title_fullStr |
Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation |
title_full_unstemmed |
Mining Frequent Patterns from Dynamic Data Streams Using Support Approximation |
title_sort |
mining frequent patterns from dynamic data streams using support approximation |
publishDate |
2013 |
url |
http://ndltd.ncl.edu.tw/handle/74852547458965418836 |
work_keys_str_mv |
AT chaoweili miningfrequentpatternsfromdynamicdatastreamsusingsupportapproximation AT lǐzhàowěi miningfrequentpatternsfromdynamicdatastreamsusingsupportapproximation AT chaoweili shǐyòngzhīchídùgūsuànfāngfǎyúdòngtàizīliàochuànliútànkānpínfányàngshì AT lǐzhàowěi shǐyòngzhīchídùgūsuànfāngfǎyúdòngtàizīliàochuànliútànkānpínfányàngshì |
_version_ |
1718078669808730112 |