Mining Weighted Frequent Closed Episodes over Multiple Sequences

Frequent episode discovery is introduced to mine useful and interesting temporal patterns from sequential data. The existing episode mining methods mainly focused on mining from a single long sequence consisting of events with time constraints. However, there can be multiple sequences of different i...

Full description

Bibliographic Details
Main Authors: Guoqiong Liao, Xiaoting Yang, Sihong Xie, Philip S. Yu, Changxuan Wan
Format: Article
Language:English
Published: Faculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek 2018-01-01
Series:Tehnički Vjesnik
Subjects:
Online Access:https://hrcak.srce.hr/file/293209
id doaj-1dbcf9fa341e4700b84609e3d7222fe4
record_format Article
spelling doaj-1dbcf9fa341e4700b84609e3d7222fe42020-11-24T21:49:55ZengFaculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek Tehnički Vjesnik1330-36511848-63392018-01-01252510518Mining Weighted Frequent Closed Episodes over Multiple SequencesGuoqiong Liao0Xiaoting Yang1Sihong Xie2Philip S. Yu3Changxuan Wan4School of Information Technology, Jiangxi University of Finance and Economics, Nanchang 330013, ChinaSchool of Information Technology, Jiangxi University of Finance and Economics, Nanchang 330013, ChinaComputer Science and Engineering Department, Lehigh University, Bethlehem, PA 18015, USADepartment of Computer Science, University of Illinois at Chicago, Chicago, IL, 60607, USASchool of Information Technology, Jiangxi University of Finance and Economics, Nanchang 330013, ChinaFrequent episode discovery is introduced to mine useful and interesting temporal patterns from sequential data. The existing episode mining methods mainly focused on mining from a single long sequence consisting of events with time constraints. However, there can be multiple sequences of different importance as the persons or entities associated with each sequence can be of different importance. Aiming to mine episodes in multiple sequences of different importance, we first define a new kind of episodes, i.e., the weighted frequent closed episodes, to take sequence importance, episode distribution and occurrence frequency into account together. Secondly, to facilitate the mining of such new episodes, we present a new concept called maximal duration serial episodes to cut a whole sequence into multiple maximum episodes using duration constraints, and discuss its properties for episode shrinking processing. Finally, based on the theoretical properties, we propose a two-phase approach to efficiently mine these new episodes. In Phase I, we adopt a level-wise episode shrinking framework to discover the candidate frequent closed episodes with the same prefixes, and in Phase II, we match the candidates with different prefixes to find the frequent close episodes. Experiments on simulated and real datasets demonstrate that the proposed episode mining strategy has good mining effectiveness and efficiency.https://hrcak.srce.hr/file/293209closed episodesepisode miningfrequent episodesmultiple sequences
collection DOAJ
language English
format Article
sources DOAJ
author Guoqiong Liao
Xiaoting Yang
Sihong Xie
Philip S. Yu
Changxuan Wan
spellingShingle Guoqiong Liao
Xiaoting Yang
Sihong Xie
Philip S. Yu
Changxuan Wan
Mining Weighted Frequent Closed Episodes over Multiple Sequences
Tehnički Vjesnik
closed episodes
episode mining
frequent episodes
multiple sequences
author_facet Guoqiong Liao
Xiaoting Yang
Sihong Xie
Philip S. Yu
Changxuan Wan
author_sort Guoqiong Liao
title Mining Weighted Frequent Closed Episodes over Multiple Sequences
title_short Mining Weighted Frequent Closed Episodes over Multiple Sequences
title_full Mining Weighted Frequent Closed Episodes over Multiple Sequences
title_fullStr Mining Weighted Frequent Closed Episodes over Multiple Sequences
title_full_unstemmed Mining Weighted Frequent Closed Episodes over Multiple Sequences
title_sort mining weighted frequent closed episodes over multiple sequences
publisher Faculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek
series Tehnički Vjesnik
issn 1330-3651
1848-6339
publishDate 2018-01-01
description Frequent episode discovery is introduced to mine useful and interesting temporal patterns from sequential data. The existing episode mining methods mainly focused on mining from a single long sequence consisting of events with time constraints. However, there can be multiple sequences of different importance as the persons or entities associated with each sequence can be of different importance. Aiming to mine episodes in multiple sequences of different importance, we first define a new kind of episodes, i.e., the weighted frequent closed episodes, to take sequence importance, episode distribution and occurrence frequency into account together. Secondly, to facilitate the mining of such new episodes, we present a new concept called maximal duration serial episodes to cut a whole sequence into multiple maximum episodes using duration constraints, and discuss its properties for episode shrinking processing. Finally, based on the theoretical properties, we propose a two-phase approach to efficiently mine these new episodes. In Phase I, we adopt a level-wise episode shrinking framework to discover the candidate frequent closed episodes with the same prefixes, and in Phase II, we match the candidates with different prefixes to find the frequent close episodes. Experiments on simulated and real datasets demonstrate that the proposed episode mining strategy has good mining effectiveness and efficiency.
topic closed episodes
episode mining
frequent episodes
multiple sequences
url https://hrcak.srce.hr/file/293209
work_keys_str_mv AT guoqiongliao miningweightedfrequentclosedepisodesovermultiplesequences
AT xiaotingyang miningweightedfrequentclosedepisodesovermultiplesequences
AT sihongxie miningweightedfrequentclosedepisodesovermultiplesequences
AT philipsyu miningweightedfrequentclosedepisodesovermultiplesequences
AT changxuanwan miningweightedfrequentclosedepisodesovermultiplesequences
_version_ 1725886500304846848