CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge Base

Knowledge base plays an important role in machine understanding and has been widely used in various applications, such as search engine, recommendation system and question answering. However, most knowledge bases are incomplete, which can cause many downstream applications to p...

Full description

Bibliographic Details
Main Authors: Xu, Bo, Liang, Jiaqing, Xie, Chenhao, Liang, Bin, Chen, Lihan, Xiao, Yanghua
Format: Article
Language:English
Published: The MIT Press 2019-06-01
Series:Data Intelligence
Online Access:https://www.mitpressjournals.org/doi/abs/10.1162/dint_a_00017
id doaj-d95a3c822cc7410082dc28b60660cd5f
record_format Article
spelling doaj-d95a3c822cc7410082dc28b60660cd5f2020-11-25T02:30:09ZengThe MIT PressData Intelligence2641-435X2019-06-011327128810.1162/dint_a_00017CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge BaseXu, BoLiang, JiaqingXie, ChenhaoLiang, BinChen, LihanXiao, Yanghua Knowledge base plays an important role in machine understanding and has been widely used in various applications, such as search engine, recommendation system and question answering. However, most knowledge bases are incomplete, which can cause many downstream applications to perform poorly because they cannot find the corresponding facts in the knowledge bases. In this paper, we propose an extraction and verification framework to enrich the knowledge bases. Specifically, based on the existing knowledge base, we first extract new facts from the description texts of entities. But not all newly-formed facts can be added directly to the knowledge base because the errors might be involved by the extraction. Then we propose a novel crowd-sourcing based verification step to verify the candidate facts. Finally, we apply this framework to the existing knowledge base CN-DBpedia and construct a new version of knowledge base CN-DBpedia2, which additionally contains the high confidence facts extracted from the description texts of entities. https://www.mitpressjournals.org/doi/abs/10.1162/dint_a_00017
collection DOAJ
language English
format Article
sources DOAJ
author Xu, Bo
Liang, Jiaqing
Xie, Chenhao
Liang, Bin
Chen, Lihan
Xiao, Yanghua
spellingShingle Xu, Bo
Liang, Jiaqing
Xie, Chenhao
Liang, Bin
Chen, Lihan
Xiao, Yanghua
CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge Base
Data Intelligence
author_facet Xu, Bo
Liang, Jiaqing
Xie, Chenhao
Liang, Bin
Chen, Lihan
Xiao, Yanghua
author_sort Xu, Bo
title CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge Base
title_short CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge Base
title_full CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge Base
title_fullStr CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge Base
title_full_unstemmed CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge Base
title_sort cn-dbpedia2: an extraction and verification framework for enriching chinese encyclopedia knowledge base
publisher The MIT Press
series Data Intelligence
issn 2641-435X
publishDate 2019-06-01
description Knowledge base plays an important role in machine understanding and has been widely used in various applications, such as search engine, recommendation system and question answering. However, most knowledge bases are incomplete, which can cause many downstream applications to perform poorly because they cannot find the corresponding facts in the knowledge bases. In this paper, we propose an extraction and verification framework to enrich the knowledge bases. Specifically, based on the existing knowledge base, we first extract new facts from the description texts of entities. But not all newly-formed facts can be added directly to the knowledge base because the errors might be involved by the extraction. Then we propose a novel crowd-sourcing based verification step to verify the candidate facts. Finally, we apply this framework to the existing knowledge base CN-DBpedia and construct a new version of knowledge base CN-DBpedia2, which additionally contains the high confidence facts extracted from the description texts of entities.
url https://www.mitpressjournals.org/doi/abs/10.1162/dint_a_00017
work_keys_str_mv AT xubo cndbpedia2anextractionandverificationframeworkforenrichingchineseencyclopediaknowledgebase
AT liangjiaqing cndbpedia2anextractionandverificationframeworkforenrichingchineseencyclopediaknowledgebase
AT xiechenhao cndbpedia2anextractionandverificationframeworkforenrichingchineseencyclopediaknowledgebase
AT liangbin cndbpedia2anextractionandverificationframeworkforenrichingchineseencyclopediaknowledgebase
AT chenlihan cndbpedia2anextractionandverificationframeworkforenrichingchineseencyclopediaknowledgebase
AT xiaoyanghua cndbpedia2anextractionandverificationframeworkforenrichingchineseencyclopediaknowledgebase
_version_ 1724829666151235584