Study on the Search Engine of the XML Documents

碩士 === 國立交通大學 === 資訊學院碩士在職專班資訊組 === 95 === Traditional search engine mainly query by keywords.Although Boolean opera-tions are provided, it is unable to query the ordering of keywords or attributes in XML documents. In XML document serach engine, besides the keyword query function of the traditional...

Full description

Bibliographic Details
Main Authors: Hsun-Wen Hsiao, 蕭遜文
Other Authors: Suh-Yin Lee
Format: Others
Language:zh-TW
Published: 2006
Online Access:http://ndltd.ncl.edu.tw/handle/77800501702553229544
id ndltd-TW-095NCTU5446007
record_format oai_dc
spelling ndltd-TW-095NCTU54460072016-05-27T04:18:38Z http://ndltd.ncl.edu.tw/handle/77800501702553229544 Study on the Search Engine of the XML Documents XML文件搜尋引擎的研究 Hsun-Wen Hsiao 蕭遜文 碩士 國立交通大學 資訊學院碩士在職專班資訊組 95 Traditional search engine mainly query by keywords.Although Boolean opera-tions are provided, it is unable to query the ordering of keywords or attributes in XML documents. In XML document serach engine, besides the keyword query function of the traditional search engine, the ordering of data or the hierarchical relation of data in the XML documents must also be considered. XML query strings expressed in Xpath, which is the W3C XML query language, can query the order of keywords,the structure of XML documents. In this thesis, we are focused on the speed up of query operations in large XML documents database. We use Begin-End-Level (BEL) interval encoding method to build the index structure for each XML document.After the BEL coding of the XML documents, the indexes are saved into Relation Database. The query in the XPath ex-pression is transformed into the SQL query Commands. The stored records can recon-struct the original and consistent contents of XML documents. In order to speed up the query, the index mechanism of signature file is employed to filter out the unqualified documents first and avoid nonessential query operations. Suh-Yin Lee 李素瑛 2006 學位論文 ; thesis 43 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 資訊學院碩士在職專班資訊組 === 95 === Traditional search engine mainly query by keywords.Although Boolean opera-tions are provided, it is unable to query the ordering of keywords or attributes in XML documents. In XML document serach engine, besides the keyword query function of the traditional search engine, the ordering of data or the hierarchical relation of data in the XML documents must also be considered. XML query strings expressed in Xpath, which is the W3C XML query language, can query the order of keywords,the structure of XML documents. In this thesis, we are focused on the speed up of query operations in large XML documents database. We use Begin-End-Level (BEL) interval encoding method to build the index structure for each XML document.After the BEL coding of the XML documents, the indexes are saved into Relation Database. The query in the XPath ex-pression is transformed into the SQL query Commands. The stored records can recon-struct the original and consistent contents of XML documents. In order to speed up the query, the index mechanism of signature file is employed to filter out the unqualified documents first and avoid nonessential query operations.
author2 Suh-Yin Lee
author_facet Suh-Yin Lee
Hsun-Wen Hsiao
蕭遜文
author Hsun-Wen Hsiao
蕭遜文
spellingShingle Hsun-Wen Hsiao
蕭遜文
Study on the Search Engine of the XML Documents
author_sort Hsun-Wen Hsiao
title Study on the Search Engine of the XML Documents
title_short Study on the Search Engine of the XML Documents
title_full Study on the Search Engine of the XML Documents
title_fullStr Study on the Search Engine of the XML Documents
title_full_unstemmed Study on the Search Engine of the XML Documents
title_sort study on the search engine of the xml documents
publishDate 2006
url http://ndltd.ncl.edu.tw/handle/77800501702553229544
work_keys_str_mv AT hsunwenhsiao studyonthesearchengineofthexmldocuments
AT xiāoxùnwén studyonthesearchengineofthexmldocuments
AT hsunwenhsiao xmlwénjiànsōuxúnyǐnqíngdeyánjiū
AT xiāoxùnwén xmlwénjiànsōuxúnyǐnqíngdeyánjiū
_version_ 1718283879367835648