An efficient interval-based sequential pattern mining
碩士 === 國立交通大學 === 資訊科學與工程研究所 === 98 === Existing sequential pattern mining algorithms assume that events occur instantaneously. However, events in real world applications usually have durations which are called interval-based events. But complex relationship among event intervals causes difficulty i...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2010
|
Online Access: | http://ndltd.ncl.edu.tw/handle/96142387729499523193 |
id |
ndltd-TW-098NCTU5394053 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-098NCTU53940532016-04-25T04:28:33Z http://ndltd.ncl.edu.tw/handle/96142387729499523193 An efficient interval-based sequential pattern mining 時間區間事件之有效率的循序樣式探勘 Jiang, Ji-Chiang 姜季強 碩士 國立交通大學 資訊科學與工程研究所 98 Existing sequential pattern mining algorithms assume that events occur instantaneously. However, events in real world applications usually have durations which are called interval-based events. But complex relationship among event intervals causes difficulty in designing an efficient interval-based event mining algorithm. Therefore, the concept of “coincidence-slice” is proposed to solve the problem caused by the complex relationship among event intervals. The event intervals are incised to disjoint smaller “event slices” according to the coincidences among event intervals, that is, several event slices may occur in the same time period called “coincidence”. Therefore, an original event sequence can be represented as a list of ordered “coincidences” which contains event slices. This new representation proposed is called “coincidence sequence representation”. We transform the problem of complex relationship among event interval to consider the simple relationship among event slices. The proposed interval-based sequential pattern mining algorithm called CTMiner is based on the “coincidence sequence representation”. The CTMier also uses the concept of well-known sequential pattern mining algorithm PrefixSpan to find temporal patterns without candidate generation. Finally, to comprehend the frequent temporal pattern represented by “coincidence sequence representation”, we discover and use relation list to present all the relationships in a pattern. We also implement some pruning strategies to improve the performance of CTMiner by considering the characteristics of the “Coincidence-slice”. Experiments on both synthetic datasets and real dataset of library lending indicate the efficiency and scalability of the proposed algorithm. Lee, Suh-Yin 李素瑛 2010 學位論文 ; thesis 62 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立交通大學 === 資訊科學與工程研究所 === 98 === Existing sequential pattern mining algorithms assume that events occur instantaneously. However, events in real world applications usually have durations which are called interval-based events. But complex relationship among event intervals causes difficulty in designing an efficient interval-based event mining algorithm. Therefore, the concept of “coincidence-slice” is proposed to solve the problem caused by the complex relationship among event intervals. The event intervals are incised to disjoint smaller “event slices” according to the coincidences among event intervals, that is, several event slices may occur in the same time period called “coincidence”. Therefore, an original event sequence can be represented as a list of ordered “coincidences” which contains event slices. This new representation proposed is called “coincidence sequence representation”. We transform the problem of complex relationship among event interval to consider the simple relationship among event slices. The proposed interval-based sequential pattern mining algorithm called CTMiner is based on the “coincidence sequence representation”. The CTMier also uses the concept of well-known sequential pattern mining algorithm PrefixSpan to find temporal patterns without candidate generation. Finally, to comprehend the frequent temporal pattern represented by “coincidence sequence representation”, we discover and use relation list to present all the relationships in a pattern. We also implement some pruning strategies to improve the performance of CTMiner by considering the characteristics of the “Coincidence-slice”. Experiments on both synthetic datasets and real dataset of library lending indicate the efficiency and scalability of the proposed algorithm.
|
author2 |
Lee, Suh-Yin |
author_facet |
Lee, Suh-Yin Jiang, Ji-Chiang 姜季強 |
author |
Jiang, Ji-Chiang 姜季強 |
spellingShingle |
Jiang, Ji-Chiang 姜季強 An efficient interval-based sequential pattern mining |
author_sort |
Jiang, Ji-Chiang |
title |
An efficient interval-based sequential pattern mining |
title_short |
An efficient interval-based sequential pattern mining |
title_full |
An efficient interval-based sequential pattern mining |
title_fullStr |
An efficient interval-based sequential pattern mining |
title_full_unstemmed |
An efficient interval-based sequential pattern mining |
title_sort |
efficient interval-based sequential pattern mining |
publishDate |
2010 |
url |
http://ndltd.ncl.edu.tw/handle/96142387729499523193 |
work_keys_str_mv |
AT jiangjichiang anefficientintervalbasedsequentialpatternmining AT jiāngjìqiáng anefficientintervalbasedsequentialpatternmining AT jiangjichiang shíjiānqūjiānshìjiànzhīyǒuxiàolǜdexúnxùyàngshìtànkān AT jiāngjìqiáng shíjiānqūjiānshìjiànzhīyǒuxiàolǜdexúnxùyàngshìtànkān AT jiangjichiang efficientintervalbasedsequentialpatternmining AT jiāngjìqiáng efficientintervalbasedsequentialpatternmining |
_version_ |
1718233398473916416 |