A Study of Mining Sequential Pattern in Large Dynamic Transaction Database

碩士 === 國立臺南大學 === 資訊教育研究所碩士班 === 92 === Mining sequential pattern plays an important role in data mining. Numerous algorithms have been proposed to mining sequential patterns efficiently in a static database. However, the maintenance of such discovered sequential patterns is nontrivial in large da...

Full description

Bibliographic Details
Main Authors: Shen-jung Hsu, 徐勝榮
Other Authors: Chien-I Lee
Format: Others
Language:zh-TW
Published: 2004
Online Access:http://ndltd.ncl.edu.tw/handle/44019394452474051290
id ndltd-TW-092NTNT5395033
record_format oai_dc
spelling ndltd-TW-092NTNT53950332015-11-23T04:03:31Z http://ndltd.ncl.edu.tw/handle/44019394452474051290 A Study of Mining Sequential Pattern in Large Dynamic Transaction Database 在大型動態交易資料庫中探勘循序樣式之研究 Shen-jung Hsu 徐勝榮 碩士 國立臺南大學 資訊教育研究所碩士班 92 Mining sequential pattern plays an important role in data mining. Numerous algorithms have been proposed to mining sequential patterns efficiently in a static database. However, the maintenance of such discovered sequential patterns is nontrivial in large database. In real world, a transaction database may allow the users to insert/delete the transaction data and the customer data frequently. Additionally, the development of multiple shop leads in the urgent requirement of fast mining sequential patterns in the merged database. Therefore, the efficient maintenance of the latest sequential patterns in dynamic large database becomes more and more important. First, this study presents the incremental sequential patterns mining (ISP) algorithm to maintain the discovered sequential patterns in incremental transaction database efficiently. ISP applies the information of the discovered sequential patterns in original database to reduce the number of candidates. Consequently, the performance of ISP is better than that of GSP in our experimental results. Furthermore, this study also proposes the updated sequential patterns mining (USP) algorithm, which extends the concept of reducing the number of candidates to support both insertion and deletion operations in an update database efficiently. For the issue of efficient mining sequential patterns in the merged database, this study develops the merged sequential patterns mining (MSP) algorithm. Contract to re-mine a merged large database, MSP uses two small database, which have discovered their sequential pattern, respectively to speed up the mining process. Simulation results reveal that ISP, USP and MSP are superior to GSP, respectively in several artificial datasets. The more difference of the frequent sequential pattern between original database and incremental database, the more performance difference between ISP and GSP. USP is very efficient when the proportion of the updated part of the database is small. The performance difference is greater between MSP and GSP when the proportion of the common frequent 1-items in two unmerged database is smaller. Simulation results also appear that the three algorithms use less memory than that of GSP, respectively. Chien-I Lee 李建億 2004 學位論文 ; thesis 49 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺南大學 === 資訊教育研究所碩士班 === 92 === Mining sequential pattern plays an important role in data mining. Numerous algorithms have been proposed to mining sequential patterns efficiently in a static database. However, the maintenance of such discovered sequential patterns is nontrivial in large database. In real world, a transaction database may allow the users to insert/delete the transaction data and the customer data frequently. Additionally, the development of multiple shop leads in the urgent requirement of fast mining sequential patterns in the merged database. Therefore, the efficient maintenance of the latest sequential patterns in dynamic large database becomes more and more important. First, this study presents the incremental sequential patterns mining (ISP) algorithm to maintain the discovered sequential patterns in incremental transaction database efficiently. ISP applies the information of the discovered sequential patterns in original database to reduce the number of candidates. Consequently, the performance of ISP is better than that of GSP in our experimental results. Furthermore, this study also proposes the updated sequential patterns mining (USP) algorithm, which extends the concept of reducing the number of candidates to support both insertion and deletion operations in an update database efficiently. For the issue of efficient mining sequential patterns in the merged database, this study develops the merged sequential patterns mining (MSP) algorithm. Contract to re-mine a merged large database, MSP uses two small database, which have discovered their sequential pattern, respectively to speed up the mining process. Simulation results reveal that ISP, USP and MSP are superior to GSP, respectively in several artificial datasets. The more difference of the frequent sequential pattern between original database and incremental database, the more performance difference between ISP and GSP. USP is very efficient when the proportion of the updated part of the database is small. The performance difference is greater between MSP and GSP when the proportion of the common frequent 1-items in two unmerged database is smaller. Simulation results also appear that the three algorithms use less memory than that of GSP, respectively.
author2 Chien-I Lee
author_facet Chien-I Lee
Shen-jung Hsu
徐勝榮
author Shen-jung Hsu
徐勝榮
spellingShingle Shen-jung Hsu
徐勝榮
A Study of Mining Sequential Pattern in Large Dynamic Transaction Database
author_sort Shen-jung Hsu
title A Study of Mining Sequential Pattern in Large Dynamic Transaction Database
title_short A Study of Mining Sequential Pattern in Large Dynamic Transaction Database
title_full A Study of Mining Sequential Pattern in Large Dynamic Transaction Database
title_fullStr A Study of Mining Sequential Pattern in Large Dynamic Transaction Database
title_full_unstemmed A Study of Mining Sequential Pattern in Large Dynamic Transaction Database
title_sort study of mining sequential pattern in large dynamic transaction database
publishDate 2004
url http://ndltd.ncl.edu.tw/handle/44019394452474051290
work_keys_str_mv AT shenjunghsu astudyofminingsequentialpatterninlargedynamictransactiondatabase
AT xúshèngróng astudyofminingsequentialpatterninlargedynamictransactiondatabase
AT shenjunghsu zàidàxíngdòngtàijiāoyìzīliàokùzhōngtànkānxúnxùyàngshìzhīyánjiū
AT xúshèngróng zàidàxíngdòngtàijiāoyìzīliàokùzhōngtànkānxúnxùyàngshìzhīyánjiū
AT shenjunghsu studyofminingsequentialpatterninlargedynamictransactiondatabase
AT xúshèngróng studyofminingsequentialpatterninlargedynamictransactiondatabase
_version_ 1718134283164450816