An Efficient XML Keyword Search Indexing Method Based on Modified UCIS-X Index

碩士 === 國立中興大學 === 資訊科學與工程學系 === 103 === The UCIS-X (An Updatable Compressing and Indexing Scheme for XML) indexing method uses the Dewey encoding to record the parent-child relationship of nodes in an XML file, and the branch information between XML nodes are encoded in Branch maps. UCIS-X outperfor...

Full description

Bibliographic Details
Main Authors: Yi-Xue Lin, 林宜學
Other Authors: 廖宜恩
Format: Others
Language:zh-TW
Published: 2015
Online Access:http://ndltd.ncl.edu.tw/handle/r5ktvp
id ndltd-TW-103NCHU5394071
record_format oai_dc
spelling ndltd-TW-103NCHU53940712019-05-15T22:25:04Z http://ndltd.ncl.edu.tw/handle/r5ktvp An Efficient XML Keyword Search Indexing Method Based on Modified UCIS-X Index 一個基於修改UCIS-X之快速XML關鍵字查詢索引方法 Yi-Xue Lin 林宜學 碩士 國立中興大學 資訊科學與工程學系 103 The UCIS-X (An Updatable Compressing and Indexing Scheme for XML) indexing method uses the Dewey encoding to record the parent-child relationship of nodes in an XML file, and the branch information between XML nodes are encoded in Branch maps. UCIS-X outperforms other XML indexing schemes in terms of index space and query response time. But its performance will be downgraded in case of keyword search in XML file with large contents due to linear list used the content-index. In this thesis, we modified the Content-Index of UCIS-X by designing a hashed content-index using content keywords as hash keys to improve the performance on keyword search. The experimental results show that the proposed method has about 44% performance improvement over UCIS-X on the content-rich benchmark dataset XMark, and it also has about 65.5% performance improvement over UCIS-X on structure-duplicated dataset DBLP. 廖宜恩 2015 學位論文 ; thesis 48 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立中興大學 === 資訊科學與工程學系 === 103 === The UCIS-X (An Updatable Compressing and Indexing Scheme for XML) indexing method uses the Dewey encoding to record the parent-child relationship of nodes in an XML file, and the branch information between XML nodes are encoded in Branch maps. UCIS-X outperforms other XML indexing schemes in terms of index space and query response time. But its performance will be downgraded in case of keyword search in XML file with large contents due to linear list used the content-index. In this thesis, we modified the Content-Index of UCIS-X by designing a hashed content-index using content keywords as hash keys to improve the performance on keyword search. The experimental results show that the proposed method has about 44% performance improvement over UCIS-X on the content-rich benchmark dataset XMark, and it also has about 65.5% performance improvement over UCIS-X on structure-duplicated dataset DBLP.
author2 廖宜恩
author_facet 廖宜恩
Yi-Xue Lin
林宜學
author Yi-Xue Lin
林宜學
spellingShingle Yi-Xue Lin
林宜學
An Efficient XML Keyword Search Indexing Method Based on Modified UCIS-X Index
author_sort Yi-Xue Lin
title An Efficient XML Keyword Search Indexing Method Based on Modified UCIS-X Index
title_short An Efficient XML Keyword Search Indexing Method Based on Modified UCIS-X Index
title_full An Efficient XML Keyword Search Indexing Method Based on Modified UCIS-X Index
title_fullStr An Efficient XML Keyword Search Indexing Method Based on Modified UCIS-X Index
title_full_unstemmed An Efficient XML Keyword Search Indexing Method Based on Modified UCIS-X Index
title_sort efficient xml keyword search indexing method based on modified ucis-x index
publishDate 2015
url http://ndltd.ncl.edu.tw/handle/r5ktvp
work_keys_str_mv AT yixuelin anefficientxmlkeywordsearchindexingmethodbasedonmodifieducisxindex
AT línyíxué anefficientxmlkeywordsearchindexingmethodbasedonmodifieducisxindex
AT yixuelin yīgèjīyúxiūgǎiucisxzhīkuàisùxmlguānjiànzìcháxúnsuǒyǐnfāngfǎ
AT línyíxué yīgèjīyúxiūgǎiucisxzhīkuàisùxmlguānjiànzìcháxúnsuǒyǐnfāngfǎ
AT yixuelin efficientxmlkeywordsearchindexingmethodbasedonmodifieducisxindex
AT línyíxué efficientxmlkeywordsearchindexingmethodbasedonmodifieducisxindex
_version_ 1719129431563304960