Metadata Architecture for Digital Library Integration
博士 === 國立交通大學 === 資訊科學與工程研究所 === 95 === Metadata has been playing an essential role in integrating heterogeneous digital libraries (DL). However, conventional metadata architecture is insufficient to achieve interoperability among DL because of the heterogeneity in semantics and no structure conside...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2006
|
Online Access: | http://ndltd.ncl.edu.tw/handle/93894890887154828752 |
id |
ndltd-TW-095NCTU5394030 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-095NCTU53940302016-05-25T04:13:41Z http://ndltd.ncl.edu.tw/handle/93894890887154828752 Metadata Architecture for Digital Library Integration 數位圖書館整合之詮釋資料架構 Su-Hsien Huang 黃夙賢 博士 國立交通大學 資訊科學與工程研究所 95 Metadata has been playing an essential role in integrating heterogeneous digital libraries (DL). However, conventional metadata architecture is insufficient to achieve interoperability among DL because of the heterogeneity in semantics and no structure consideration in metadata formats. This dissertation proposes novel metadata architecture called M-Architecture@DL to integrate DL seamlessly from the perspective of metadata. M-Architecture@DL follows Model-Extraction-Query (MEQ) model to obtain more permanent and explicit knowledge in the process of DL query. M-Architecture@DL contains three layers, namely metadata modeling layer, data extraction layer, and semantic query layer. The separation of M-Architecture@DL into three-layer achieves format, protocol and semantic interoperability in each layer. Metadata modeling layer uses Metadata Modeling Language (MML) to describe real-world entities. MML adopts XML as its syntax and extends Resource Description Framework (RDF) by adding name hierarchy reference. MML provides two constructors, tuple and set constructors, to represent structures. With these two constructors, metadata can be translated by manipulating attributes of metadata with operations. In this layer, the format interoperability is achieved. Data extraction layer collects data from distributed DL and encapsulates result into MML metadata. Data from DL services with similar structure can be extracted into metadata automatically by means of the common structure. In the process of extraction, the first step is to assign level ID for the sample document and determine the common part to be extracted. Then an extraction algorithm called Metadata Extractor is implemented to extract the documents according to the common structure. This layer provides a transparent way without prearrangement with distributed DLs and saves much effort to collect information through the HTTP protocol. Therefore, the protocol interoperability is achieved. Semantic query layer retrieves metadata semantically by adding relationships in query statements. A Content and Service Inference Model (CSIM) is proposed to derive 15 relationships from two essential aspects of DL: content and services. The 15 structural relationships create operations to manipulate metadata in a query predicate and facilitate a query with as much semantics. Both content and service queries are presented to derive more semantic answers in a DL search. In this layer, the semantic interoperability is achieved. Experiments are conducted and indicate that M-Architecture@DL has excellent performance in DL integration. The experiment results have shown that both accuracy and coverage are improved to a conventional keyword-based approach. Adopting M-Architecture@DL can alleviate the administrative load. When developing novel DL services, such as library resource planning and virtual union catalog system, librarians are recommended with alternative answers to combine existent DL components. The reuse of DL services and metadata is the future trend in DL integration. Wei-Pang Yang Hao-Ren Ke 楊維邦 柯皓仁 2006 學位論文 ; thesis 75 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
博士 === 國立交通大學 === 資訊科學與工程研究所 === 95 === Metadata has been playing an essential role in integrating heterogeneous digital libraries (DL). However, conventional metadata architecture is insufficient to achieve interoperability among DL because of the heterogeneity in semantics and no structure consideration in metadata formats. This dissertation proposes novel metadata architecture called M-Architecture@DL to integrate DL seamlessly from the perspective of metadata. M-Architecture@DL follows Model-Extraction-Query (MEQ) model to obtain more permanent and explicit knowledge in the process of DL query. M-Architecture@DL contains three layers, namely metadata modeling layer, data extraction layer, and semantic query layer. The separation of M-Architecture@DL into three-layer achieves format, protocol and semantic interoperability in each layer.
Metadata modeling layer uses Metadata Modeling Language (MML) to describe real-world entities. MML adopts XML as its syntax and extends Resource Description Framework (RDF) by adding name hierarchy reference. MML provides two constructors, tuple and set constructors, to represent structures. With these two constructors, metadata can be translated by manipulating attributes of metadata with operations. In this layer, the format interoperability is achieved.
Data extraction layer collects data from distributed DL and encapsulates result into MML metadata. Data from DL services with similar structure can be extracted into metadata automatically by means of the common structure. In the process of extraction, the first step is to assign level ID for the sample document and determine the common part to be extracted. Then an extraction algorithm called Metadata Extractor is implemented to extract the documents according to the common structure. This layer provides a transparent way without prearrangement with distributed DLs and saves much effort to collect information through the HTTP protocol. Therefore, the protocol interoperability is achieved.
Semantic query layer retrieves metadata semantically by adding relationships in query statements. A Content and Service Inference Model (CSIM) is proposed to derive 15 relationships from two essential aspects of DL: content and services. The 15 structural relationships create operations to manipulate metadata in a query predicate and facilitate a query with as much semantics. Both content and service queries are presented to derive more semantic answers in a DL search. In this layer, the semantic interoperability is achieved.
Experiments are conducted and indicate that M-Architecture@DL has excellent performance in DL integration. The experiment results have shown that both accuracy and coverage are improved to a conventional keyword-based approach. Adopting M-Architecture@DL can alleviate the administrative load. When developing novel DL services, such as library resource planning and virtual union catalog system, librarians are recommended with alternative answers to combine existent DL components. The reuse of DL services and metadata is the future trend in DL integration.
|
author2 |
Wei-Pang Yang |
author_facet |
Wei-Pang Yang Su-Hsien Huang 黃夙賢 |
author |
Su-Hsien Huang 黃夙賢 |
spellingShingle |
Su-Hsien Huang 黃夙賢 Metadata Architecture for Digital Library Integration |
author_sort |
Su-Hsien Huang |
title |
Metadata Architecture for Digital Library Integration |
title_short |
Metadata Architecture for Digital Library Integration |
title_full |
Metadata Architecture for Digital Library Integration |
title_fullStr |
Metadata Architecture for Digital Library Integration |
title_full_unstemmed |
Metadata Architecture for Digital Library Integration |
title_sort |
metadata architecture for digital library integration |
publishDate |
2006 |
url |
http://ndltd.ncl.edu.tw/handle/93894890887154828752 |
work_keys_str_mv |
AT suhsienhuang metadataarchitecturefordigitallibraryintegration AT huángsùxián metadataarchitecturefordigitallibraryintegration AT suhsienhuang shùwèitúshūguǎnzhěnghézhīquánshìzīliàojiàgòu AT huángsùxián shùwèitúshūguǎnzhěnghézhīquánshìzīliàojiàgòu |
_version_ |
1718279896653889536 |