Semantic Enrichment of Ontology Mappings

Schema and ontology matching play an important part in the field of data integration and semantic web. Given two heterogeneous data sources, meta data matching usually constitutes the first step in the data integration workflow, which refers to the analysis and comparison of two input resources like...

Full description

Bibliographic Details
Main Author:	Arnold, Patrick
Other Authors:	Universität Leipzig, Fakultät für Mathematik und Informatik
Format:	Doctoral Thesis
Language:	English
Published:	Universitätsbibliothek Leipzig 2016
Subjects:	ontology mapping ontology matching semantische erweiterung datenintegration hintergrundwissen semantic enrichment data integration background knowledge ddc:500
Online Access:	http://nbn-resolving.de/urn:nbn:de:bsz:15-qucosa-192438 http://nbn-resolving.de/urn:nbn:de:bsz:15-qucosa-192438 http://www.qucosa.de/fileadmin/data/qucosa/documents/19243/Dissertation_final_final.pdf

id	ndltd-DRESDEN-oai-qucosa.de-bsz-15-qucosa-192438
record_format	oai_dc
collection	NDLTD
language	English
format	Doctoral Thesis
sources	NDLTD
topic	ontology mapping ontology matching semantische erweiterung datenintegration hintergrundwissen ontology mapping ontology matching semantic enrichment data integration background knowledge ddc:500
spellingShingle	ontology mapping ontology matching semantische erweiterung datenintegration hintergrundwissen ontology mapping ontology matching semantic enrichment data integration background knowledge ddc:500 Arnold, Patrick Semantic Enrichment of Ontology Mappings
description	Schema and ontology matching play an important part in the field of data integration and semantic web. Given two heterogeneous data sources, meta data matching usually constitutes the first step in the data integration workflow, which refers to the analysis and comparison of two input resources like schemas or ontologies. The result is a list of correspondences between the two schemas or ontologies, which is often called mapping or alignment. Many tools and research approaches have been proposed to automatically determine those correspondences. However, most match tools do not provide any information about the relation type that holds between matching concepts, for the simple but important reason that most common match strategies are too simple and heuristic to allow any sophisticated relation type determination. Knowing the specific type holding between two concepts, e.g., whether they are in an equality, subsumption (is-a) or part-of relation, is very important for advanced data integration tasks, such as ontology merging or ontology evolution. It is also very important for mappings in the biological or biomedical domain, where is-a and part-of relations may exceed the number of equality correspondences by far. Such more expressive mappings allow much better integration results and have scarcely been in the focus of research so far. In this doctoral thesis, the determination of the correspondence types in a given mapping is the focus of interest, which is referred to as semantic mapping enrichment. We introduce and present the mapping enrichment tool STROMA, which obtains a pre-calculated schema or ontology mapping and for each correspondence determines a semantic relation type. In contrast to previous approaches, we will strongly focus on linguistic laws and linguistic insights. By and large, linguistics is the key for precise matching and for the determination of relation types. We will introduce various strategies that make use of these linguistic laws and are able to calculate the semantic type between two matching concepts. The observations and insights gained from this research go far beyond the field of mapping enrichment and can be also applied to schema and ontology matching in general. Since generic strategies have certain limits and may not be able to determine the relation type between more complex concepts, like a laptop and a personal computer, background knowledge plays an important role in this research as well. For example, a thesaurus can help to recognize that these two concepts are in an is-a relation. We will show how background knowledge can be effectively used in this instance, how it is possible to draw conclusions even if a concept is not contained in it, how the relation types in complex paths can be resolved and how time complexity can be reduced by a so-called bidirectional search. The developed techniques go far beyond the background knowledge exploitation of previous approaches, and are now part of the semantic repository SemRep, a flexible and extendable system that combines different lexicographic resources. Further on, we will show how additional lexicographic resources can be developed automatically by parsing Wikipedia articles. The proposed Wikipedia relation extraction approach yields some millions of additional relations, which constitute significant additional knowledge for mapping enrichment. The extracted relations were also added to SemRep, which thus became a comprehensive background knowledge resource. To augment the quality of the repository, different techniques were used to discover and delete irrelevant semantic relations. We could show in several experiments that STROMA obtains very good results w.r.t. relation type detection. In a comparative evaluation, it was able to achieve considerably better results than related applications. This corroborates the overall usefulness and strengths of the implemented strategies, which were developed with particular emphasis on the principles and laws of linguistics.
author2	Universität Leipzig, Fakultät für Mathematik und Informatik
author_facet	Universität Leipzig, Fakultät für Mathematik und Informatik Arnold, Patrick
author	Arnold, Patrick
author_sort	Arnold, Patrick
title	Semantic Enrichment of Ontology Mappings
title_short	Semantic Enrichment of Ontology Mappings
title_full	Semantic Enrichment of Ontology Mappings
title_fullStr	Semantic Enrichment of Ontology Mappings
title_full_unstemmed	Semantic Enrichment of Ontology Mappings
title_sort	semantic enrichment of ontology mappings
publisher	Universitätsbibliothek Leipzig
publishDate	2016
url	http://nbn-resolving.de/urn:nbn:de:bsz:15-qucosa-192438 http://nbn-resolving.de/urn:nbn:de:bsz:15-qucosa-192438 http://www.qucosa.de/fileadmin/data/qucosa/documents/19243/Dissertation_final_final.pdf
work_keys_str_mv	AT arnoldpatrick semanticenrichmentofontologymappings
_version_	1718160365735378944
spelling	ndltd-DRESDEN-oai-qucosa.de-bsz-15-qucosa-1924382016-01-05T03:30:10Z Semantic Enrichment of Ontology Mappings Arnold, Patrick ontology mapping ontology matching semantische erweiterung datenintegration hintergrundwissen ontology mapping ontology matching semantic enrichment data integration background knowledge ddc:500 Schema and ontology matching play an important part in the field of data integration and semantic web. Given two heterogeneous data sources, meta data matching usually constitutes the first step in the data integration workflow, which refers to the analysis and comparison of two input resources like schemas or ontologies. The result is a list of correspondences between the two schemas or ontologies, which is often called mapping or alignment. Many tools and research approaches have been proposed to automatically determine those correspondences. However, most match tools do not provide any information about the relation type that holds between matching concepts, for the simple but important reason that most common match strategies are too simple and heuristic to allow any sophisticated relation type determination. Knowing the specific type holding between two concepts, e.g., whether they are in an equality, subsumption (is-a) or part-of relation, is very important for advanced data integration tasks, such as ontology merging or ontology evolution. It is also very important for mappings in the biological or biomedical domain, where is-a and part-of relations may exceed the number of equality correspondences by far. Such more expressive mappings allow much better integration results and have scarcely been in the focus of research so far. In this doctoral thesis, the determination of the correspondence types in a given mapping is the focus of interest, which is referred to as semantic mapping enrichment. We introduce and present the mapping enrichment tool STROMA, which obtains a pre-calculated schema or ontology mapping and for each correspondence determines a semantic relation type. In contrast to previous approaches, we will strongly focus on linguistic laws and linguistic insights. By and large, linguistics is the key for precise matching and for the determination of relation types. We will introduce various strategies that make use of these linguistic laws and are able to calculate the semantic type between two matching concepts. The observations and insights gained from this research go far beyond the field of mapping enrichment and can be also applied to schema and ontology matching in general. Since generic strategies have certain limits and may not be able to determine the relation type between more complex concepts, like a laptop and a personal computer, background knowledge plays an important role in this research as well. For example, a thesaurus can help to recognize that these two concepts are in an is-a relation. We will show how background knowledge can be effectively used in this instance, how it is possible to draw conclusions even if a concept is not contained in it, how the relation types in complex paths can be resolved and how time complexity can be reduced by a so-called bidirectional search. The developed techniques go far beyond the background knowledge exploitation of previous approaches, and are now part of the semantic repository SemRep, a flexible and extendable system that combines different lexicographic resources. Further on, we will show how additional lexicographic resources can be developed automatically by parsing Wikipedia articles. The proposed Wikipedia relation extraction approach yields some millions of additional relations, which constitute significant additional knowledge for mapping enrichment. The extracted relations were also added to SemRep, which thus became a comprehensive background knowledge resource. To augment the quality of the repository, different techniques were used to discover and delete irrelevant semantic relations. We could show in several experiments that STROMA obtains very good results w.r.t. relation type detection. In a comparative evaluation, it was able to achieve considerably better results than related applications. This corroborates the overall usefulness and strengths of the implemented strategies, which were developed with particular emphasis on the principles and laws of linguistics. Universitätsbibliothek Leipzig Universität Leipzig, Fakultät für Mathematik und Informatik Prof. Erhard Rahm Prof. Sören Auer 2016-01-04 doc-type:doctoralThesis application/pdf http://nbn-resolving.de/urn:nbn:de:bsz:15-qucosa-192438 urn:nbn:de:bsz:15-qucosa-192438 http://www.qucosa.de/fileadmin/data/qucosa/documents/19243/Dissertation_final_final.pdf eng

Semantic Enrichment of Ontology Mappings

Similar Items