Study on the Search Engine of the XML Documents

Bibliographic Details
Main Authors: Hsun-Wen Hsiao, 蕭遜文
Other Authors: Suh-Yin Lee
Format: Others
Language: zh-TW
Published: 2006
Online Access: http://ndltd.ncl.edu.tw/handle/77800501702553229544
Description
Summary: Master's === National Chiao Tung University === College of Computer Science, In-service Master's Program, Computer Science Division === 95 === Traditional search engines mainly query by keywords. Although Boolean operations are provided, they cannot query the ordering of keywords or the attributes in XML documents. An XML document search engine must, besides offering the keyword query function of a traditional search engine, also consider the ordering of data and the hierarchical relations among data in the XML documents. XML query strings expressed in XPath, a W3C query language for XML, can query both the order of keywords and the structure of XML documents. This thesis focuses on speeding up query operations over large XML document databases. We use the Begin-End-Level (BEL) interval encoding method to build the index structure for each XML document. After the XML documents are BEL-encoded, the indexes are saved into a relational database. A query given as an XPath expression is transformed into SQL query commands. The stored records can reconstruct the original contents of the XML documents consistently. To speed up queries, a signature file index mechanism is employed to filter out unqualified documents first and avoid unnecessary query operations.
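
The abstract does not give the exact numbering scheme or table layout, so the following is only a minimal sketch of the general idea, not the thesis's implementation. It assumes a single shared counter for begin and end positions, a hypothetical table `node` with columns `begin_pos`, `end_pos`, and `level`, and hypothetical helper names `bel_encode` and `descendants`. It shows how interval containment lets an XPath descendant step such as `//chapter//title` be written as a SQL join.

```python
import sqlite3
import xml.etree.ElementTree as ET


def bel_encode(xml_text):
    """Return (tag, begin, end, level) rows for every element, in document order.

    Assumed scheme: one counter is incremented when an element opens and
    again when it closes, so a descendant's interval nests inside its
    ancestor's interval. Level is the depth, with the root at 1.
    """
    rows = []
    counter = 0

    def visit(elem, level):
        nonlocal counter
        counter += 1
        begin = counter
        index = len(rows)
        rows.append(None)  # reserve the slot so rows stay in document order
        for child in elem:
            visit(child, level + 1)
        counter += 1
        rows[index] = (elem.tag, begin, counter, level)

    visit(ET.fromstring(xml_text), 1)
    return rows


def descendants(conn, ancestor_tag, descendant_tag):
    """Hypothetical SQL form of the XPath //ancestor_tag//descendant_tag."""
    sql = """
        SELECT DISTINCT d.tag, d.begin_pos, d.end_pos, d.level
        FROM node AS a JOIN node AS d
          ON a.begin_pos < d.begin_pos AND d.end_pos < a.end_pos
        WHERE a.tag = ? AND d.tag = ?
        ORDER BY d.begin_pos
    """
    return conn.execute(sql, (ancestor_tag, descendant_tag)).fetchall()


DOC = "<book><title>XML</title><chapter><title>Index</title></chapter></book>"
ROWS = bel_encode(DOC)

# An in-memory SQLite database stands in for the relational database.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE node (tag TEXT, begin_pos INTEGER, end_pos INTEGER, level INTEGER)"
)
conn.executemany("INSERT INTO node VALUES (?, ?, ?, ?)", ROWS)

print(ROWS)
print(descendants(conn, "chapter", "title"))
```

Under the same assumptions, a child step (`/chapter/title`) would add the condition `d.level = a.level + 1` to the join, which is where the level component of the encoding becomes useful.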