Mining Weighted Frequent Closed Episodes over Multiple Sequences
Frequent episode discovery is introduced to mine useful and interesting temporal patterns from sequential data. The existing episode mining methods mainly focused on mining from a single long sequence consisting of events with time constraints. However, there can be multiple sequences of different i...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Faculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek
2018-01-01
|
Series: | Tehnički Vjesnik |
Subjects: | |
Online Access: | https://hrcak.srce.hr/file/293209 |
id |
doaj-1dbcf9fa341e4700b84609e3d7222fe4 |
---|---|
record_format |
Article |
spelling |
doaj-1dbcf9fa341e4700b84609e3d7222fe42020-11-24T21:49:55ZengFaculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek Tehnički Vjesnik1330-36511848-63392018-01-01252510518Mining Weighted Frequent Closed Episodes over Multiple SequencesGuoqiong Liao0Xiaoting Yang1Sihong Xie2Philip S. Yu3Changxuan Wan4School of Information Technology, Jiangxi University of Finance and Economics, Nanchang 330013, ChinaSchool of Information Technology, Jiangxi University of Finance and Economics, Nanchang 330013, ChinaComputer Science and Engineering Department, Lehigh University, Bethlehem, PA 18015, USADepartment of Computer Science, University of Illinois at Chicago, Chicago, IL, 60607, USASchool of Information Technology, Jiangxi University of Finance and Economics, Nanchang 330013, ChinaFrequent episode discovery is introduced to mine useful and interesting temporal patterns from sequential data. The existing episode mining methods mainly focused on mining from a single long sequence consisting of events with time constraints. However, there can be multiple sequences of different importance as the persons or entities associated with each sequence can be of different importance. Aiming to mine episodes in multiple sequences of different importance, we first define a new kind of episodes, i.e., the weighted frequent closed episodes, to take sequence importance, episode distribution and occurrence frequency into account together. Secondly, to facilitate the mining of such new episodes, we present a new concept called maximal duration serial episodes to cut a whole sequence into multiple maximum episodes using duration constraints, and discuss its properties for episode shrinking processing. Finally, based on the theoretical properties, we propose a two-phase approach to efficiently mine these new episodes. In Phase I, we adopt a level-wise episode shrinking framework to discover the candidate frequent closed episodes with the same prefixes, and in Phase II, we match the candidates with different prefixes to find the frequent close episodes. Experiments on simulated and real datasets demonstrate that the proposed episode mining strategy has good mining effectiveness and efficiency.https://hrcak.srce.hr/file/293209closed episodesepisode miningfrequent episodesmultiple sequences |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Guoqiong Liao Xiaoting Yang Sihong Xie Philip S. Yu Changxuan Wan |
spellingShingle |
Guoqiong Liao Xiaoting Yang Sihong Xie Philip S. Yu Changxuan Wan Mining Weighted Frequent Closed Episodes over Multiple Sequences Tehnički Vjesnik closed episodes episode mining frequent episodes multiple sequences |
author_facet |
Guoqiong Liao Xiaoting Yang Sihong Xie Philip S. Yu Changxuan Wan |
author_sort |
Guoqiong Liao |
title |
Mining Weighted Frequent Closed Episodes over Multiple Sequences |
title_short |
Mining Weighted Frequent Closed Episodes over Multiple Sequences |
title_full |
Mining Weighted Frequent Closed Episodes over Multiple Sequences |
title_fullStr |
Mining Weighted Frequent Closed Episodes over Multiple Sequences |
title_full_unstemmed |
Mining Weighted Frequent Closed Episodes over Multiple Sequences |
title_sort |
mining weighted frequent closed episodes over multiple sequences |
publisher |
Faculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek |
series |
Tehnički Vjesnik |
issn |
1330-3651 1848-6339 |
publishDate |
2018-01-01 |
description |
Frequent episode discovery is introduced to mine useful and interesting temporal patterns from sequential data. The existing episode mining methods mainly focused on mining from a single long sequence consisting of events with time constraints. However, there can be multiple sequences of different importance as the persons or entities associated with each sequence can be of different importance. Aiming to mine episodes in multiple sequences of different importance, we first define a new kind of episodes, i.e., the weighted frequent closed episodes, to take sequence importance, episode distribution and occurrence frequency into account together. Secondly, to facilitate the mining of such new episodes, we present a new concept called maximal duration serial episodes to cut a whole sequence into multiple maximum episodes using duration constraints, and discuss its properties for episode shrinking processing. Finally, based on the theoretical properties, we propose a two-phase approach to efficiently mine these new episodes. In Phase I, we adopt a level-wise episode shrinking framework to discover the candidate frequent closed episodes with the same prefixes, and in Phase II, we match the candidates with different prefixes to find the frequent close episodes. Experiments on simulated and real datasets demonstrate that the proposed episode mining strategy has good mining effectiveness and efficiency. |
topic |
closed episodes episode mining frequent episodes multiple sequences |
url |
https://hrcak.srce.hr/file/293209 |
work_keys_str_mv |
AT guoqiongliao miningweightedfrequentclosedepisodesovermultiplesequences AT xiaotingyang miningweightedfrequentclosedepisodesovermultiplesequences AT sihongxie miningweightedfrequentclosedepisodesovermultiplesequences AT philipsyu miningweightedfrequentclosedepisodesovermultiplesequences AT changxuanwan miningweightedfrequentclosedepisodesovermultiplesequences |
_version_ |
1725886500304846848 |