An Efficient XML Keyword Search Indexing Method Based on Modified UCIS-X Index

碩士 === 國立中興大學 === 資訊科學與工程學系 === 103 === The UCIS-X (An Updatable Compressing and Indexing Scheme for XML) indexing method uses the Dewey encoding to record the parent-child relationship of nodes in an XML file, and the branch information between XML nodes are encoded in Branch maps. UCIS-X outperfor...

Full description

Bibliographic Details
Main Authors: Yi-Xue Lin, 林宜學
Other Authors: 廖宜恩
Format: Others
Language:zh-TW
Published: 2015
Online Access:http://ndltd.ncl.edu.tw/handle/r5ktvp
Description
Summary:碩士 === 國立中興大學 === 資訊科學與工程學系 === 103 === The UCIS-X (An Updatable Compressing and Indexing Scheme for XML) indexing method uses the Dewey encoding to record the parent-child relationship of nodes in an XML file, and the branch information between XML nodes are encoded in Branch maps. UCIS-X outperforms other XML indexing schemes in terms of index space and query response time. But its performance will be downgraded in case of keyword search in XML file with large contents due to linear list used the content-index. In this thesis, we modified the Content-Index of UCIS-X by designing a hashed content-index using content keywords as hash keys to improve the performance on keyword search. The experimental results show that the proposed method has about 44% performance improvement over UCIS-X on the content-rich benchmark dataset XMark, and it also has about 65.5% performance improvement over UCIS-X on structure-duplicated dataset DBLP.