Time-Sensitive Stream Pattern Mining under Bounded Memory Constraints

碩士 === 逢甲大學 === 資訊工程所 === 99 === Mining frequent itemsets over data streams is a challenging problem in recent year. The mining in the sliding window model is more desirable since people generally have more interest in the change of recent data. The transaction-sensitive sliding window model conside...

Full description

Bibliographic Details
Main Authors: Chien-Chan Lee, 李建誠
Other Authors: Ming-Yen Lin
Format: Others
Language:en_US
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/15010926316533204469
id ndltd-TW-099FCU05392040
record_format oai_dc
spelling ndltd-TW-099FCU053920402015-10-23T06:50:32Z http://ndltd.ncl.edu.tw/handle/15010926316533204469 Time-Sensitive Stream Pattern Mining under Bounded Memory Constraints 在有限的記憶體中探勘時間敏感資料串流之頻繁項目 Chien-Chan Lee 李建誠 碩士 逢甲大學 資訊工程所 99 Mining frequent itemsets over data streams is a challenging problem in recent year. The mining in the sliding window model is more desirable since people generally have more interest in the change of recent data. The transaction-sensitive sliding window model considers the window of fixed numbered transactions. In contrast, the time-sensitive sliding window model handles the data stream during a user-defined time interval and is more practical. Most time-sensitive sliding window algorithms assume that the memory is always available and enough for holding the entire transaction data in a window. Nevertheless, these algorithms would fail if bulk data comes in a short burst for the latest sliding window, a common scenario for a particular event in data stream applications. In this thesis, we propose an algorithm, called Tisbom, to solve the problem of mining frequent patterns in time-sensitive stream data using a bounded memory. The Tisbom algorithm uses a condensed representation for each block for efficient window sliding. A block allocation mechanism is invoked to re-adjust the block sizes in the window when the memory is insufficient for the new incoming block. The Tisbom algorithm is characterized in two phases, approximation phase and mining phase. The approximation is to merge similar transactions controlled by a difference threshold. The mining phase is invoked when a user submits a query with a specified support threshold for the frequent patterns in the sliding window. The experiment results show that Tisbom is efficient and has good precisions and recalls for mining time-sensitive stream using a bounded memory. Ming-Yen Lin 林明言 2011 學位論文 ; thesis 87 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 逢甲大學 === 資訊工程所 === 99 === Mining frequent itemsets over data streams is a challenging problem in recent year. The mining in the sliding window model is more desirable since people generally have more interest in the change of recent data. The transaction-sensitive sliding window model considers the window of fixed numbered transactions. In contrast, the time-sensitive sliding window model handles the data stream during a user-defined time interval and is more practical. Most time-sensitive sliding window algorithms assume that the memory is always available and enough for holding the entire transaction data in a window. Nevertheless, these algorithms would fail if bulk data comes in a short burst for the latest sliding window, a common scenario for a particular event in data stream applications. In this thesis, we propose an algorithm, called Tisbom, to solve the problem of mining frequent patterns in time-sensitive stream data using a bounded memory. The Tisbom algorithm uses a condensed representation for each block for efficient window sliding. A block allocation mechanism is invoked to re-adjust the block sizes in the window when the memory is insufficient for the new incoming block. The Tisbom algorithm is characterized in two phases, approximation phase and mining phase. The approximation is to merge similar transactions controlled by a difference threshold. The mining phase is invoked when a user submits a query with a specified support threshold for the frequent patterns in the sliding window. The experiment results show that Tisbom is efficient and has good precisions and recalls for mining time-sensitive stream using a bounded memory.
author2 Ming-Yen Lin
author_facet Ming-Yen Lin
Chien-Chan Lee
李建誠
author Chien-Chan Lee
李建誠
spellingShingle Chien-Chan Lee
李建誠
Time-Sensitive Stream Pattern Mining under Bounded Memory Constraints
author_sort Chien-Chan Lee
title Time-Sensitive Stream Pattern Mining under Bounded Memory Constraints
title_short Time-Sensitive Stream Pattern Mining under Bounded Memory Constraints
title_full Time-Sensitive Stream Pattern Mining under Bounded Memory Constraints
title_fullStr Time-Sensitive Stream Pattern Mining under Bounded Memory Constraints
title_full_unstemmed Time-Sensitive Stream Pattern Mining under Bounded Memory Constraints
title_sort time-sensitive stream pattern mining under bounded memory constraints
publishDate 2011
url http://ndltd.ncl.edu.tw/handle/15010926316533204469
work_keys_str_mv AT chienchanlee timesensitivestreampatternminingunderboundedmemoryconstraints
AT lǐjiànchéng timesensitivestreampatternminingunderboundedmemoryconstraints
AT chienchanlee zàiyǒuxiàndejìyìtǐzhōngtànkānshíjiānmǐngǎnzīliàochuànliúzhīpínfánxiàngmù
AT lǐjiànchéng zàiyǒuxiàndejìyìtǐzhōngtànkānshíjiānmǐngǎnzīliàochuànliúzhīpínfánxiàngmù
_version_ 1718110406750240768