An efficient interval-based sequential pattern mining

碩士 === 國立交通大學 === 資訊科學與工程研究所 === 98 === Existing sequential pattern mining algorithms assume that events occur instantaneously. However, events in real world applications usually have durations which are called interval-based events. But complex relationship among event intervals causes difficulty i...

Full description

Bibliographic Details
Main Authors: Jiang, Ji-Chiang, 姜季強
Other Authors: Lee, Suh-Yin
Format: Others
Language:en_US
Published: 2010
Online Access:http://ndltd.ncl.edu.tw/handle/96142387729499523193
id ndltd-TW-098NCTU5394053
record_format oai_dc
spelling ndltd-TW-098NCTU53940532016-04-25T04:28:33Z http://ndltd.ncl.edu.tw/handle/96142387729499523193 An efficient interval-based sequential pattern mining 時間區間事件之有效率的循序樣式探勘 Jiang, Ji-Chiang 姜季強 碩士 國立交通大學 資訊科學與工程研究所 98 Existing sequential pattern mining algorithms assume that events occur instantaneously. However, events in real world applications usually have durations which are called interval-based events. But complex relationship among event intervals causes difficulty in designing an efficient interval-based event mining algorithm. Therefore, the concept of “coincidence-slice” is proposed to solve the problem caused by the complex relationship among event intervals. The event intervals are incised to disjoint smaller “event slices” according to the coincidences among event intervals, that is, several event slices may occur in the same time period called “coincidence”. Therefore, an original event sequence can be represented as a list of ordered “coincidences” which contains event slices. This new representation proposed is called “coincidence sequence representation”. We transform the problem of complex relationship among event interval to consider the simple relationship among event slices. The proposed interval-based sequential pattern mining algorithm called CTMiner is based on the “coincidence sequence representation”. The CTMier also uses the concept of well-known sequential pattern mining algorithm PrefixSpan to find temporal patterns without candidate generation. Finally, to comprehend the frequent temporal pattern represented by “coincidence sequence representation”, we discover and use relation list to present all the relationships in a pattern. We also implement some pruning strategies to improve the performance of CTMiner by considering the characteristics of the “Coincidence-slice”. Experiments on both synthetic datasets and real dataset of library lending indicate the efficiency and scalability of the proposed algorithm. Lee, Suh-Yin 李素瑛 2010 學位論文 ; thesis 62 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 資訊科學與工程研究所 === 98 === Existing sequential pattern mining algorithms assume that events occur instantaneously. However, events in real world applications usually have durations which are called interval-based events. But complex relationship among event intervals causes difficulty in designing an efficient interval-based event mining algorithm. Therefore, the concept of “coincidence-slice” is proposed to solve the problem caused by the complex relationship among event intervals. The event intervals are incised to disjoint smaller “event slices” according to the coincidences among event intervals, that is, several event slices may occur in the same time period called “coincidence”. Therefore, an original event sequence can be represented as a list of ordered “coincidences” which contains event slices. This new representation proposed is called “coincidence sequence representation”. We transform the problem of complex relationship among event interval to consider the simple relationship among event slices. The proposed interval-based sequential pattern mining algorithm called CTMiner is based on the “coincidence sequence representation”. The CTMier also uses the concept of well-known sequential pattern mining algorithm PrefixSpan to find temporal patterns without candidate generation. Finally, to comprehend the frequent temporal pattern represented by “coincidence sequence representation”, we discover and use relation list to present all the relationships in a pattern. We also implement some pruning strategies to improve the performance of CTMiner by considering the characteristics of the “Coincidence-slice”. Experiments on both synthetic datasets and real dataset of library lending indicate the efficiency and scalability of the proposed algorithm.
author2 Lee, Suh-Yin
author_facet Lee, Suh-Yin
Jiang, Ji-Chiang
姜季強
author Jiang, Ji-Chiang
姜季強
spellingShingle Jiang, Ji-Chiang
姜季強
An efficient interval-based sequential pattern mining
author_sort Jiang, Ji-Chiang
title An efficient interval-based sequential pattern mining
title_short An efficient interval-based sequential pattern mining
title_full An efficient interval-based sequential pattern mining
title_fullStr An efficient interval-based sequential pattern mining
title_full_unstemmed An efficient interval-based sequential pattern mining
title_sort efficient interval-based sequential pattern mining
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/96142387729499523193
work_keys_str_mv AT jiangjichiang anefficientintervalbasedsequentialpatternmining
AT jiāngjìqiáng anefficientintervalbasedsequentialpatternmining
AT jiangjichiang shíjiānqūjiānshìjiànzhīyǒuxiàolǜdexúnxùyàngshìtànkān
AT jiāngjìqiáng shíjiānqūjiānshìjiànzhīyǒuxiàolǜdexúnxùyàngshìtànkān
AT jiangjichiang efficientintervalbasedsequentialpatternmining
AT jiāngjìqiáng efficientintervalbasedsequentialpatternmining
_version_ 1718233398473916416