Tree-KGQA: An Unsupervised Approach for Question Answering Over Knowledge Graphs

被引:14
作者
Rony, M. D. Rashad A. L. Hasan [1 ,3 ]
Chaudhuri, Debanjan [1 ]
Usbeck, Ricardo [2 ]
Lehmann, Jens [1 ,3 ]
机构
[1] Univ Bonn, Smart Data Analyt Res Grp, D-53115 Bonn, Germany
[2] Univ Hamburg, Semant Syst Grp, D-22527 Hamburg, Germany
[3] Fraunhofer IAIS Dresden, D-01069 Dresden, Germany
关键词
Task analysis; Joining processes; Training; Vegetation; Training data; Licenses; Indexing; Knowledge based systems; information retrieval; question answering; entity linking; relation linking; indexing; pre-trained language models;
D O I
10.1109/ACCESS.2022.3173355
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most Knowledge Graph-based Question Answering (KGQA) systems rely on training data to reach their optimal performance. However, acquiring training data for supervised systems is both time-consuming and resource-intensive. To address this, in this paper, we propose Tree-KGQA, an unsupervised KGQA system leveraging pre-trained language models and tree-based algorithms. Entity and relation linking are essential components of any KGQA system. We employ several pre-trained language models in the entity linking task to recognize the entities mentioned in the question and obtain the contextual representation for indexing. Furthermore, for relation linking we incorporate a pre-trained language model previously trained for language inference task. Finally, we introduce a novel algorithm for extracting the answer entities from a KG, where we construct a forest of interpretations and introduce tree-walking and tree disambiguation techniques. Our algorithm uses the linked relation and predicts the tree branches that eventually lead to the potential answer entities. The proposed method achieves 4.5% and 7.1% gains in F1 score in entity linking tasks on LC-QuAD 2.0 and LC-QuAD 2.0 (KBpearl) datasets, respectively, and a 5.4% increase in the relation linking task on LC-QuAD 2.0 (KBpearl). The comprehensive evaluations demonstrate that our unsupervised KGQA approach outperforms other supervised state-of-the-art methods on the WebQSP-WD test set (1.4% increase in F1 score) - without training on the target dataset.
引用
收藏
页码:50467 / 50478
页数:12
相关论文
共 54 条
[1]  
[Anonymous], Proceedings of the 2008 ACM SIGMOD international conference on Management of data, SIGMOD '08
[2]   PNEL: Pointer Network Based End-To-End Entity Linking over Knowledge Graphs [J].
Banerjee, Debayan ;
Chaudhuri, Debanjan ;
Dubey, Mohnish ;
Lehmann, Jens .
SEMANTIC WEB - ISWC 2020, PT I, 2020, 12506 :21-38
[3]  
Bao Junwei, 2016, P COLING 2016 26 INT, P2503
[4]   Query-Driven On-The-Fly Knowledge Base Construction [J].
Dat Ba Nguyen ;
Abujabal, Abdalghani ;
Nam Khanh Tran ;
Theobald, Martin ;
Weikum, Gerhard .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 11 (01) :66-79
[5]  
Del Corro Luciano, 2013, P 22 INT C WORLD WID, P355, DOI 10.1145/2488388.2488420
[6]  
Delpeuch Antonin, 2019, arXiv preprint arXiv:1904.09131
[7]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[8]   LC-QuAD 2.0: A Large Dataset for Complex Question Answering over Wikidata and DBpedia [J].
Dubey, Mohnish ;
Banerjee, Debayan ;
Abdelkawi, Abdelrahman ;
Lehmann, Jens .
SEMANTIC WEB - ISWC 2019, PT II, 2019, 11779 :69-78
[9]   EARL: Joint Entity and Relation Linking for Question Answering over Knowledge Graphs [J].
Dubey, Mohnish ;
Banerjee, Debayan ;
Chaudhuri, Debanjan ;
Lehmann, Jens .
SEMANTIC WEB - ISWC 2018, PT I, 2018, 11136 :108-126
[10]  
Fabbri AR, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P4508