Querying knowledge graphs in natural language

被引:0
作者
Shiqi Liang
Kurt Stockinger
Tarcisio Mendes de Farias
Maria Anisimova
Manuel Gil
机构
[1] ETH Swiss Federal Institute of Technology,Department of Ecology and Evolution
[2] Zurich University of Applied Sciences,undefined
[3] SIB Swiss Institute of Bioinformatics,undefined
[4] University of Lausanne,undefined
来源
Journal of Big Data | / 8卷
关键词
Natural language processing; Query processing; Knowledge graphs; SPARQL;
D O I
暂无
中图分类号
学科分类号
摘要
Knowledge graphs are a powerful concept for querying large amounts of data. These knowledge graphs are typically enormous and are often not easily accessible to end-users because they require specialized knowledge in query languages such as SPARQL. Moreover, end-users need a deep understanding of the structure of the underlying data models often based on the Resource Description Framework (RDF). This drawback has led to the development of Question-Answering (QA) systems that enable end-users to express their information needs in natural language. While existing systems simplify user access, there is still room for improvement in the accuracy of these systems. In this paper we propose a new QA system for translating natural language questions into SPARQL queries. The key idea is to break up the translation process into 5 smaller, more manageable sub-tasks and use ensemble machine learning methods as well as Tree-LSTM-based neural network models to automatically learn and translate a natural language question into a SPARQL query. The performance of our proposed QA system is empirically evaluated using the two renowned benchmarks-the 7th Question Answering over Linked Data Challenge (QALD-7) and the Large-Scale Complex Question Answering Dataset (LC-QuAD). Experimental results show that our QA system outperforms the state-of-art systems by 15% on the QALD-7 dataset and by 48% on the LC-QuAD dataset, respectively. In addition, we make our source code available.
引用
收藏
相关论文
共 50 条
  • [1] Diefenbach D(2018)Core techniques of question answering systems over knowledge bases: a survey Knowl Informat syst 55 529-569
  • [2] Lopez V(2014)Constructing an interactive natural language interface for relational databases Proceed VLDB Endowment 8 73-84
  • [3] Singh K(2019)A comparative survey of recent natural language interfaces for databases VLDB J. 8 895-920
  • [4] Maret P(2017)Survey on challenges of question answering in the semantic web Semant Web 2019 baz106-249
  • [5] Li F(2019)Enabling semantic queries across federated bioinformatics databases Database. 5 225-81
  • [6] Jagadish H(1990)Natural language interfaces to databases Knowl Eng Rev 1 29-32
  • [7] Affolter K(1995)Natural language interfaces to databases-an introduction Nat Lang Eng 45 5-181
  • [8] Stockinger K(2001)Random forests Machine learn 46 157-13
  • [9] Bernstein A(2012)Dbpedia and the live extraction of structured data from wikipedia Program Electron Libr Informat Syst 21 3-146
  • [10] Höffner K(2013)Evaluating question answering over linked data Web Semant Sci Serv Agents World Wide Web 5 135-2159