SPARQL-QA-v2 system for Knowledge Base Question Answering

被引:8
作者
Borroto, Manuel A. [1 ]
Ricca, Francesco [1 ]
机构
[1] Univ Calabria, Dept Math & Comp Sci, Via Pietro Bucci,Cubo 30B, I-87036 Arcavacata Di Rende, Calabria, Italy
关键词
Knowledge base; Question answering; Neural network;
D O I
10.1016/j.eswa.2023.120383
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accessing the large volumes of information available in public knowledge bases might be complicated for those users unfamiliar with formal languages, such as the SPARQL query language and the ontology definition languages. This issue can be overcome by providing systems able to answer questions posed in natural language on a knowledge base, a task that is called Knowledge Base Question Answering (KBQA) in the literature. More in detail, many KBQA systems aim at translating automatically questions into the corresponding SPARQL queries to be executed over the knowledge base to get the answers. Effective state-of-the-art KBQA systems are based on neural-machine translation but easily fail to recognize words that are Out Of the Vocabulary (OOV) of the training set. This is a serious issue while querying large ontologies where the list of entities is huge and easily evolves over time. In this paper, we present the SPARQL-QA-v2 system that combines in an innovative way Named Entity Linking, Named Entity Recognition, and Neural Machine Translation for addressing the problem of generating SPARQL queries from questions posed in natural language. We demonstrate empirically that SPARQL-QA-v2 is effective and resilient to OOV words and delivers state-of-the-art performance in well-known datasets for question answering over DBpedia and Wikidata knowledge bases.
引用
收藏
页数:11
相关论文
共 56 条
  • [1] [Anonymous], 2014, RESOURCE DESCRIPTION
  • [2] [Anonymous], 2017, SYNTHESIS LECT HUMAN, DOI [DOI 10.2200/S00762ED1V01Y201703HLT037, 10.2200/S00762ED1V01Y201703HLT037]
  • [3] Template-based Question Answering using Recursive Neural Networks
    Athreya, Ram G.
    Bansal, Srividya K.
    Ngomo, Axel-Cyrille Ngonga
    Usbeck, Ricardo
    [J]. 2021 IEEE 15TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2021), 2021, : 195 - 198
  • [4] Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473, DOI 10.48550/ARXIV.1409.0473]
  • [5] Baramiia N., 2022, CEUR WORKSHOP P, V3196, P32
  • [6] LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT
    BENGIO, Y
    SIMARD, P
    FRASCONI, P
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02): : 157 - 166
  • [7] Bojanowski P., 2017, T ASSOC COMPUT LING, V5, P135, DOI [DOI 10.1162/TACLA00051, 10.1162/tacl\_a\_00051, 10.1162/tacl_a_00051, DOI 10.1162/TACL_A_00051]
  • [8] A Neural-Machine-Translation System Resilient to Out of Vocabulary Words for Translating Natural Language to SPARQL
    Borroto, Manuel
    Ricca, Francesco
    Cuteri, Bernardo
    [J]. AIXIA 2021 - ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13196 : 171 - 184
  • [9] Chen S, 2021, ACL-IJCNLP 2021: THE JOINT CONFERENCE OF THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE SYSTEM DEMONSTRATIONS, P325
  • [10] Chen YR, 2020, PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P3751