Querying knowledge graphs in natural language

Abstract Knowledge graphs are a powerful concept for querying large amounts of data. These knowledge graphs are typically enormous and are often not easily accessible to end-users because they require specialized knowledge in query languages such as SPARQL. Moreover, end-users need a deep understand...

Full description

Bibliographic Details
Main Authors:	Shiqi Liang, Kurt Stockinger, Tarcisio Mendes de Farias, Maria Anisimova, Manuel Gil
Format:	Article
Language:	English
Published:	SpringerOpen 2021-01-01
Series:	Journal of Big Data
Subjects:	Natural language processing Query processing Knowledge graphs SPARQL
Online Access:	https://doi.org/10.1186/s40537-020-00383-w

id	doaj-42feebda03d14ce39c24152a5e85e3ab
record_format	Article
spelling	doaj-42feebda03d14ce39c24152a5e85e3ab2021-01-10T13:01:55ZengSpringerOpenJournal of Big Data2196-11152021-01-018112310.1186/s40537-020-00383-wQuerying knowledge graphs in natural languageShiqi Liang0Kurt Stockinger1Tarcisio Mendes de Farias2Maria Anisimova3Manuel Gil4ETH Swiss Federal Institute of TechnologyZurich University of Applied SciencesSIB Swiss Institute of BioinformaticsZurich University of Applied SciencesZurich University of Applied SciencesAbstract Knowledge graphs are a powerful concept for querying large amounts of data. These knowledge graphs are typically enormous and are often not easily accessible to end-users because they require specialized knowledge in query languages such as SPARQL. Moreover, end-users need a deep understanding of the structure of the underlying data models often based on the Resource Description Framework (RDF). This drawback has led to the development of Question-Answering (QA) systems that enable end-users to express their information needs in natural language. While existing systems simplify user access, there is still room for improvement in the accuracy of these systems. In this paper we propose a new QA system for translating natural language questions into SPARQL queries. The key idea is to break up the translation process into 5 smaller, more manageable sub-tasks and use ensemble machine learning methods as well as Tree-LSTM-based neural network models to automatically learn and translate a natural language question into a SPARQL query. The performance of our proposed QA system is empirically evaluated using the two renowned benchmarks-the 7th Question Answering over Linked Data Challenge (QALD-7) and the Large-Scale Complex Question Answering Dataset (LC-QuAD). Experimental results show that our QA system outperforms the state-of-art systems by 15% on the QALD-7 dataset and by 48% on the LC-QuAD dataset, respectively. In addition, we make our source code available.https://doi.org/10.1186/s40537-020-00383-wNatural language processingQuery processingKnowledge graphsSPARQL
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Shiqi Liang Kurt Stockinger Tarcisio Mendes de Farias Maria Anisimova Manuel Gil
spellingShingle	Shiqi Liang Kurt Stockinger Tarcisio Mendes de Farias Maria Anisimova Manuel Gil Querying knowledge graphs in natural language Journal of Big Data Natural language processing Query processing Knowledge graphs SPARQL
author_facet	Shiqi Liang Kurt Stockinger Tarcisio Mendes de Farias Maria Anisimova Manuel Gil
author_sort	Shiqi Liang
title	Querying knowledge graphs in natural language
title_short	Querying knowledge graphs in natural language
title_full	Querying knowledge graphs in natural language
title_fullStr	Querying knowledge graphs in natural language
title_full_unstemmed	Querying knowledge graphs in natural language
title_sort	querying knowledge graphs in natural language
publisher	SpringerOpen
series	Journal of Big Data
issn	2196-1115
publishDate	2021-01-01
description	Abstract Knowledge graphs are a powerful concept for querying large amounts of data. These knowledge graphs are typically enormous and are often not easily accessible to end-users because they require specialized knowledge in query languages such as SPARQL. Moreover, end-users need a deep understanding of the structure of the underlying data models often based on the Resource Description Framework (RDF). This drawback has led to the development of Question-Answering (QA) systems that enable end-users to express their information needs in natural language. While existing systems simplify user access, there is still room for improvement in the accuracy of these systems. In this paper we propose a new QA system for translating natural language questions into SPARQL queries. The key idea is to break up the translation process into 5 smaller, more manageable sub-tasks and use ensemble machine learning methods as well as Tree-LSTM-based neural network models to automatically learn and translate a natural language question into a SPARQL query. The performance of our proposed QA system is empirically evaluated using the two renowned benchmarks-the 7th Question Answering over Linked Data Challenge (QALD-7) and the Large-Scale Complex Question Answering Dataset (LC-QuAD). Experimental results show that our QA system outperforms the state-of-art systems by 15% on the QALD-7 dataset and by 48% on the LC-QuAD dataset, respectively. In addition, we make our source code available.
topic	Natural language processing Query processing Knowledge graphs SPARQL
url	https://doi.org/10.1186/s40537-020-00383-w
work_keys_str_mv	AT shiqiliang queryingknowledgegraphsinnaturallanguage AT kurtstockinger queryingknowledgegraphsinnaturallanguage AT tarcisiomendesdefarias queryingknowledgegraphsinnaturallanguage AT mariaanisimova queryingknowledgegraphsinnaturallanguage AT manuelgil queryingknowledgegraphsinnaturallanguage
_version_	1724341852148072448

Querying knowledge graphs in natural language

Similar Items