CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge Base
Knowledge base plays an important role in machine understanding and has been widely used in various applications, such as search engine, recommendation system and question answering. However, most knowledge bases are incomplete, which can cause many downstream applications to p...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
The MIT Press
2019-06-01
|
Series: | Data Intelligence |
Online Access: | https://www.mitpressjournals.org/doi/abs/10.1162/dint_a_00017 |
id |
doaj-d95a3c822cc7410082dc28b60660cd5f |
---|---|
record_format |
Article |
spelling |
doaj-d95a3c822cc7410082dc28b60660cd5f2020-11-25T02:30:09ZengThe MIT PressData Intelligence2641-435X2019-06-011327128810.1162/dint_a_00017CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge BaseXu, BoLiang, JiaqingXie, ChenhaoLiang, BinChen, LihanXiao, Yanghua Knowledge base plays an important role in machine understanding and has been widely used in various applications, such as search engine, recommendation system and question answering. However, most knowledge bases are incomplete, which can cause many downstream applications to perform poorly because they cannot find the corresponding facts in the knowledge bases. In this paper, we propose an extraction and verification framework to enrich the knowledge bases. Specifically, based on the existing knowledge base, we first extract new facts from the description texts of entities. But not all newly-formed facts can be added directly to the knowledge base because the errors might be involved by the extraction. Then we propose a novel crowd-sourcing based verification step to verify the candidate facts. Finally, we apply this framework to the existing knowledge base CN-DBpedia and construct a new version of knowledge base CN-DBpedia2, which additionally contains the high confidence facts extracted from the description texts of entities. https://www.mitpressjournals.org/doi/abs/10.1162/dint_a_00017 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xu, Bo Liang, Jiaqing Xie, Chenhao Liang, Bin Chen, Lihan Xiao, Yanghua |
spellingShingle |
Xu, Bo Liang, Jiaqing Xie, Chenhao Liang, Bin Chen, Lihan Xiao, Yanghua CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge Base Data Intelligence |
author_facet |
Xu, Bo Liang, Jiaqing Xie, Chenhao Liang, Bin Chen, Lihan Xiao, Yanghua |
author_sort |
Xu, Bo |
title |
CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge Base |
title_short |
CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge Base |
title_full |
CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge Base |
title_fullStr |
CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge Base |
title_full_unstemmed |
CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge Base |
title_sort |
cn-dbpedia2: an extraction and verification framework for enriching chinese encyclopedia knowledge base |
publisher |
The MIT Press |
series |
Data Intelligence |
issn |
2641-435X |
publishDate |
2019-06-01 |
description |
Knowledge base plays an important role in machine understanding and has been widely used in various applications, such as search engine, recommendation system and question answering. However, most knowledge bases are incomplete, which can cause many downstream applications to perform poorly because they cannot find the corresponding facts in the knowledge bases. In this paper, we propose an extraction and verification framework to enrich the knowledge bases. Specifically, based on the existing knowledge base, we first extract new facts from the description texts of entities. But not all newly-formed facts can be added directly to the knowledge base because the errors might be involved by the extraction. Then we propose a novel crowd-sourcing based verification step to verify the candidate facts. Finally, we apply this framework to the existing knowledge base CN-DBpedia and construct a new version of knowledge base CN-DBpedia2, which additionally contains
the high confidence facts extracted from the description texts of entities. |
url |
https://www.mitpressjournals.org/doi/abs/10.1162/dint_a_00017 |
work_keys_str_mv |
AT xubo cndbpedia2anextractionandverificationframeworkforenrichingchineseencyclopediaknowledgebase AT liangjiaqing cndbpedia2anextractionandverificationframeworkforenrichingchineseencyclopediaknowledgebase AT xiechenhao cndbpedia2anextractionandverificationframeworkforenrichingchineseencyclopediaknowledgebase AT liangbin cndbpedia2anextractionandverificationframeworkforenrichingchineseencyclopediaknowledgebase AT chenlihan cndbpedia2anextractionandverificationframeworkforenrichingchineseencyclopediaknowledgebase AT xiaoyanghua cndbpedia2anextractionandverificationframeworkforenrichingchineseencyclopediaknowledgebase |
_version_ |
1724829666151235584 |