Applying Sequential Pattern Mining Technique to Build an Optimized Classification Model for Time-interval Sequences

碩士 === 元智大學 === 工業工程與管理學系 === 98 === To survive in current competitive and fast-changed business environment, enterprises need to know their customers behavior in depth to provide the right services to customers at right time. However, the customer databases in enterprises are usually large and diso...

Full description

Bibliographic Details
Main Authors: Chun-Ju Chien, 簡君如
Other Authors: Chieh-Yuan Tsai
Format: Others
Language:en_US
Published: 2010
Online Access:http://ndltd.ncl.edu.tw/handle/82350116054653654561
id ndltd-TW-098YZU05031062
record_format oai_dc
spelling ndltd-TW-098YZU050310622015-10-13T18:20:43Z http://ndltd.ncl.edu.tw/handle/82350116054653654561 Applying Sequential Pattern Mining Technique to Build an Optimized Classification Model for Time-interval Sequences 應用序列樣式探勘技術建構一個時間間隔序列最佳化分類模型 Chun-Ju Chien 簡君如 碩士 元智大學 工業工程與管理學系 98 To survive in current competitive and fast-changed business environment, enterprises need to know their customers behavior in depth to provide the right services to customers at right time. However, the customer databases in enterprises are usually large and disordered which makes customer behavior analysis difficult. Sequential pattern mining and sequence classification are two popular data mining methods to explore customer behavior. The former can discover frequent occurring patterns, while the latter can assign a most probable class label to a given sequence based on the characteristics of the sequence. However, previous researches seldom discussed sequence classification problem related to time information. Without time information, two sequences with the same itemsets but different time-intervals will be classified as the same class, which is inappropriate in customer behavior analysis. For this reason, this research presents a time-interval sequence classification methodology to help decision makers make better business strategies to satisfy their various customers. The proposed sequence classification methodology includes two main stages. The first stage is time-interval sequential pattern mining, which employs I-PrefixSpan algorithm to discover time-interval sequential patterns in the large database. The second stage is time-interval sequence classification method, which contains sequence similarity measure, time-interval sequence classification model establishment, and model optimization procedure. A simple case and NorthWind database which is a large scale database are employed to test the classification method. The experiment results indicate the proposed time-interval sequence classification method is feasible and efficient. Chieh-Yuan Tsai 蔡介元 2010 學位論文 ; thesis 110 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 元智大學 === 工業工程與管理學系 === 98 === To survive in current competitive and fast-changed business environment, enterprises need to know their customers behavior in depth to provide the right services to customers at right time. However, the customer databases in enterprises are usually large and disordered which makes customer behavior analysis difficult. Sequential pattern mining and sequence classification are two popular data mining methods to explore customer behavior. The former can discover frequent occurring patterns, while the latter can assign a most probable class label to a given sequence based on the characteristics of the sequence. However, previous researches seldom discussed sequence classification problem related to time information. Without time information, two sequences with the same itemsets but different time-intervals will be classified as the same class, which is inappropriate in customer behavior analysis. For this reason, this research presents a time-interval sequence classification methodology to help decision makers make better business strategies to satisfy their various customers. The proposed sequence classification methodology includes two main stages. The first stage is time-interval sequential pattern mining, which employs I-PrefixSpan algorithm to discover time-interval sequential patterns in the large database. The second stage is time-interval sequence classification method, which contains sequence similarity measure, time-interval sequence classification model establishment, and model optimization procedure. A simple case and NorthWind database which is a large scale database are employed to test the classification method. The experiment results indicate the proposed time-interval sequence classification method is feasible and efficient.
author2 Chieh-Yuan Tsai
author_facet Chieh-Yuan Tsai
Chun-Ju Chien
簡君如
author Chun-Ju Chien
簡君如
spellingShingle Chun-Ju Chien
簡君如
Applying Sequential Pattern Mining Technique to Build an Optimized Classification Model for Time-interval Sequences
author_sort Chun-Ju Chien
title Applying Sequential Pattern Mining Technique to Build an Optimized Classification Model for Time-interval Sequences
title_short Applying Sequential Pattern Mining Technique to Build an Optimized Classification Model for Time-interval Sequences
title_full Applying Sequential Pattern Mining Technique to Build an Optimized Classification Model for Time-interval Sequences
title_fullStr Applying Sequential Pattern Mining Technique to Build an Optimized Classification Model for Time-interval Sequences
title_full_unstemmed Applying Sequential Pattern Mining Technique to Build an Optimized Classification Model for Time-interval Sequences
title_sort applying sequential pattern mining technique to build an optimized classification model for time-interval sequences
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/82350116054653654561
work_keys_str_mv AT chunjuchien applyingsequentialpatternminingtechniquetobuildanoptimizedclassificationmodelfortimeintervalsequences
AT jiǎnjūnrú applyingsequentialpatternminingtechniquetobuildanoptimizedclassificationmodelfortimeintervalsequences
AT chunjuchien yīngyòngxùlièyàngshìtànkānjìshùjiàngòuyīgèshíjiānjiāngéxùlièzuìjiāhuàfēnlèimóxíng
AT jiǎnjūnrú yīngyòngxùlièyàngshìtànkānjìshùjiàngòuyīgèshíjiānjiāngéxùlièzuìjiāhuàfēnlèimóxíng
_version_ 1718030077121265664