Research on the Method of Extracting Domain Knowledge From the Freebase RDF Dumps
In the process of constructing a domain semantic knowledge base based on ontologies, reusing existing domain knowledge bases not only facilitates sharing, integration, and reuse of the domain semantic knowledge base but also can accelerate the construction of the domain semantic knowledge base. The...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2018-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8454435/ |
id |
doaj-064168a055e9489cb147c88d7d0e796d |
---|---|
record_format |
Article |
spelling |
doaj-064168a055e9489cb147c88d7d0e796d2021-03-29T21:17:04ZengIEEEIEEE Access2169-35362018-01-016503065032210.1109/ACCESS.2018.28685168454435Research on the Method of Extracting Domain Knowledge From the Freebase RDF DumpsDeyan Chen0https://orcid.org/0000-0001-8141-662XHong Zhao1School of Computer Science & Engineering, Northeastern University, Shenyang, ChinaSchool of Computer Science & Engineering, Northeastern University, Shenyang, ChinaIn the process of constructing a domain semantic knowledge base based on ontologies, reusing existing domain knowledge bases not only facilitates sharing, integration, and reuse of the domain semantic knowledge base but also can accelerate the construction of the domain semantic knowledge base. The open and fast growing Freebase database is a good data source, which can be reused to construct the domain semantic knowledge base. However, extracting domain knowledge from the Freebase Resource Description Framework (RDF) dumps faces many challenges. For example, the dump package is too large to read or load; the dump package contains a lot of unnecessary and redundant facts; some ill-formed triples may cause the load to fail, and so on. In response to these obstacles and the deficiencies of existing research, this paper proposes a method to extract domain knowledge quickly, accurately, and completely from the Freebase RDF dumps and describes the domain knowledge using the semantic constructs in ontology standard description languages. Taking extracting the ontology schema and instance data of the medicine domain, including the facts pointing to semantically related domains, as an example, the principle and implementation process of the method are explained in detail and the algorithms of the key processes are described. Finally, the method of this paper is evaluated, including the comparison and analysis of related methods with work objectives, software tools used, processing results, processing performance, accuracy, completeness, and reusability.https://ieeexplore.ieee.org/document/8454435/Freebasedomain semantic knowledge baseontologysemantic constructssemantic models |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Deyan Chen Hong Zhao |
spellingShingle |
Deyan Chen Hong Zhao Research on the Method of Extracting Domain Knowledge From the Freebase RDF Dumps IEEE Access Freebase domain semantic knowledge base ontology semantic constructs semantic models |
author_facet |
Deyan Chen Hong Zhao |
author_sort |
Deyan Chen |
title |
Research on the Method of Extracting Domain Knowledge From the Freebase RDF Dumps |
title_short |
Research on the Method of Extracting Domain Knowledge From the Freebase RDF Dumps |
title_full |
Research on the Method of Extracting Domain Knowledge From the Freebase RDF Dumps |
title_fullStr |
Research on the Method of Extracting Domain Knowledge From the Freebase RDF Dumps |
title_full_unstemmed |
Research on the Method of Extracting Domain Knowledge From the Freebase RDF Dumps |
title_sort |
research on the method of extracting domain knowledge from the freebase rdf dumps |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2018-01-01 |
description |
In the process of constructing a domain semantic knowledge base based on ontologies, reusing existing domain knowledge bases not only facilitates sharing, integration, and reuse of the domain semantic knowledge base but also can accelerate the construction of the domain semantic knowledge base. The open and fast growing Freebase database is a good data source, which can be reused to construct the domain semantic knowledge base. However, extracting domain knowledge from the Freebase Resource Description Framework (RDF) dumps faces many challenges. For example, the dump package is too large to read or load; the dump package contains a lot of unnecessary and redundant facts; some ill-formed triples may cause the load to fail, and so on. In response to these obstacles and the deficiencies of existing research, this paper proposes a method to extract domain knowledge quickly, accurately, and completely from the Freebase RDF dumps and describes the domain knowledge using the semantic constructs in ontology standard description languages. Taking extracting the ontology schema and instance data of the medicine domain, including the facts pointing to semantically related domains, as an example, the principle and implementation process of the method are explained in detail and the algorithms of the key processes are described. Finally, the method of this paper is evaluated, including the comparison and analysis of related methods with work objectives, software tools used, processing results, processing performance, accuracy, completeness, and reusability. |
topic |
Freebase domain semantic knowledge base ontology semantic constructs semantic models |
url |
https://ieeexplore.ieee.org/document/8454435/ |
work_keys_str_mv |
AT deyanchen researchonthemethodofextractingdomainknowledgefromthefreebaserdfdumps AT hongzhao researchonthemethodofextractingdomainknowledgefromthefreebaserdfdumps |
_version_ |
1724193162329587712 |