Research on the Method of Extracting Domain Knowledge From the Freebase RDF Dumps

In the process of constructing a domain semantic knowledge base based on ontologies, reusing existing domain knowledge bases not only facilitates sharing, integration, and reuse of the domain semantic knowledge base but also can accelerate the construction of the domain semantic knowledge base. The...


Bibliographic Details
Main Authors: Deyan Chen, Hong Zhao
Format: Article
Language: English
Published: IEEE 2018-01-01
Series: IEEE Access
Subjects: Freebase; domain semantic knowledge base; ontology; semantic constructs; semantic models
Online Access: https://ieeexplore.ieee.org/document/8454435/
id doaj-064168a055e9489cb147c88d7d0e796d
record_format Article
spelling doaj-064168a055e9489cb147c88d7d0e796d; 2021-03-29T21:17:04Z; eng; IEEE; IEEE Access; 2169-3536; 2018-01-01; 6; 50306; 50322; 10.1109/ACCESS.2018.2868516; 8454435; Research on the Method of Extracting Domain Knowledge From the Freebase RDF Dumps; Deyan Chen (0), https://orcid.org/0000-0001-8141-662X; Hong Zhao (1); School of Computer Science & Engineering, Northeastern University, Shenyang, China; School of Computer Science & Engineering, Northeastern University, Shenyang, China; https://ieeexplore.ieee.org/document/8454435/; Freebase; domain semantic knowledge base; ontology; semantic constructs; semantic models
collection DOAJ
language English
format Article
sources DOAJ
author Deyan Chen
Hong Zhao
title Research on the Method of Extracting Domain Knowledge From the Freebase RDF Dumps
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2018-01-01
description In the process of constructing a domain semantic knowledge base based on ontologies, reusing existing domain knowledge bases not only facilitates sharing, integration, and reuse of the domain semantic knowledge base but also can accelerate the construction of the domain semantic knowledge base. The open and fast growing Freebase database is a good data source, which can be reused to construct the domain semantic knowledge base. However, extracting domain knowledge from the Freebase Resource Description Framework (RDF) dumps faces many challenges. For example, the dump package is too large to read or load; the dump package contains a lot of unnecessary and redundant facts; some ill-formed triples may cause the load to fail, and so on. In response to these obstacles and the deficiencies of existing research, this paper proposes a method to extract domain knowledge quickly, accurately, and completely from the Freebase RDF dumps and describes the domain knowledge using the semantic constructs in ontology standard description languages. Taking extracting the ontology schema and instance data of the medicine domain, including the facts pointing to semantically related domains, as an example, the principle and implementation process of the method are explained in detail and the algorithms of the key processes are described. Finally, the method of this paper is evaluated, including the comparison and analysis of related methods with work objectives, software tools used, processing results, processing performance, accuracy, completeness, and reusability.
topic Freebase
domain semantic knowledge base
ontology
semantic constructs
semantic models
url https://ieeexplore.ieee.org/document/8454435/
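The abstract names three obstacles: a dump too large to load, many facts outside the domain of interest, and ill-formed triples that break loading. The record does not reproduce the authors' algorithms, so the following is only a minimal sketch of one common way to handle such obstacles, not the paper's method. It assumes the dump is a gzip-compressed file of tab-separated N-Triples lines and that predicates live under the `http://rdf.freebase.com/ns/` namespace with the domain as the first path segment. The function names (`read_dump`, `parse_triple`, `filter_domain`) and the sample predicate are hypothetical.

```python
import gzip
from typing import Iterable, Iterator, Optional, Tuple

# Assumed namespace prefix for Freebase URIs; adjust if the dump differs.
NS = "http://rdf.freebase.com/ns/"

Triple = Tuple[str, str, str]


def read_dump(path: str) -> Iterator[str]:
    """Stream a gzip-compressed dump line by line without loading it whole."""
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as fh:
        yield from fh


def parse_triple(line: str) -> Optional[Triple]:
    """Split one tab-separated line into (subject, predicate, object).

    Returns None for lines that do not look like a well-formed triple,
    so callers can skip them instead of failing.
    """
    line = line.rstrip("\r\n")
    if not line.endswith("."):
        return None
    parts = line[:-1].rstrip().split("\t")
    if len(parts) != 3:
        return None
    s, p, o = (part.strip() for part in parts)
    if not (s.startswith("<") and s.endswith(">")):
        return None
    if not (p.startswith("<") and p.endswith(">")):
        return None
    if not o:
        return None
    return s, p, o


def filter_domain(lines: Iterable[str], domain: str) -> Iterator[Triple]:
    """Yield only triples whose predicate belongs to the given domain."""
    prefix = f"<{NS}{domain}."
    for line in lines:
        triple = parse_triple(line)
        if triple is None:
            continue  # skip ill-formed input rather than abort
        if triple[1].startswith(prefix):
            yield triple


# Small in-memory sample standing in for lines read from a real dump.
sample = [
    f"<{NS}m.01>\t<{NS}medicine.disease.symptoms>\t<{NS}m.02>\t.\n",
    f"<{NS}m.01>\t<{NS}film.film.directed_by>\t<{NS}m.03>\t.\n",
    "broken line without tabs\n",
]
medicine_triples = list(filter_domain(sample, "medicine"))
```

With a real file, `filter_domain(read_dump(path), "medicine")` would process the dump as a stream, so memory use stays roughly constant regardless of dump size. Removing redundant facts and following links into semantically related domains, both mentioned in the abstract, are not covered by this sketch.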