Reinforcement learning with dynamic completion for answering multi-hop questions over incomplete knowledge graph

被引：14

作者：

Cui, Hai ^{[1
]}

Peng, Tao ^{[1
,2
,3
]}

Han, Ridong ^{[1
]}

Zhu, Beibei ^{[1
]}

Bi, Haijia ^{[1
]}

Liu, Lu ^{[1
,2
,3
]}

机构：

[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Jilin, Peoples R China

[2] Jilin Univ, Coll Software, Changchun 130012, Jilin, Peoples R China

[3] Minist Educ, Key Lab Symbol Computat & Knowledge Engineer, Changchun 130012, Jilin, Peoples R China

来源：

INFORMATION PROCESSING & MANAGEMENT | 2023年 / 60卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Knowledge graph; Question answering; Reinforcement learning; Path-based reasoning; Dynamic completion;

D O I：

10.1016/j.ipm.2023.103283

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Text-enhanced and implicit reasoning methods are proposed for answering questions over incomplete knowledge graph (KG), whereas prior studies either rely on external resources or lack necessary interpretability. This article desires to extend the line of reinforcement learning (RL) methods for better interpretability and dynamically augment original KG action space with additional actions. To this end, we propose a RL framework along with a dynamic completion mechanism, namely Dynamic Completion Reasoning Network (DCRN). DCRN consists of an action space completion module and a policy network. The action space completion module exploits three sub-modules (relation selector, relation pruner and tail entity predictor) to enrich options for decision making. The policy network calculates probability distribution over joint action space and selects promising next-step actions. Simultaneously, we employ the beam search-based action selection strategy to alleviate delayed and sparse rewards. Extensive experiments conducted on WebQSP, CWQ and MetaQA demonstrate the effectiveness of DCRN. Specifically, under 50% KG setting, the Hits@1 performance improvements of DCRN on MetaQA-1H and MetaQA-3H are 2.94% and 1.18% respectively. Moreover, under 30% and 10% KG settings, DCRN prevails over all baselines by 0.9% and 1.5% on WebQSP, indicating the robustness to sparse KGs.

引用

页数：21

共 75 条

[1]

Alon T, 2018, P 2018 C N AM CHAPT, V1, P641

[2]

Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473,1409.0473, DOI 10.48550/ARXIV.1409.0473,1409.0473]

[3]

Balazevic I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P5185

[4]

Cai JY, 2021, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, P219

[5] Explicable recommendation based on knowledge graph [J].

Cai, Xingjuan ;

Xie, Lijie ;

Tian, Rui ;

Cui, Zhihua .

EXPERT SYSTEMS WITH APPLICATIONS, 2022, 200

[6] Reading Wikipedia to Answer Open-Domain Questions [J].

Chen, Danqi ;

Fisch, Adam ;

Weston, Jason ;

Bordes, Antoine .

PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, :1870-1879

[7]

Chen WJ, 2022, Arxiv, DOI arXiv:2207.07503

[8]

Chen Y., 2019, NAACL-HLT, P2913

[9]

Chen ZY, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P345

[10] Incorporating anticipation embedding into reinforcement learning framework for multi-hop knowledge graph question answering [J].

Cui, Hai ;

Peng, Tao ;

Xiao, Feng ;

Han, Jiayu ;

Han, Ridong ;

Liu, Lu .

INFORMATION SCIENCES, 2023, 619 :745-761

← 1 2 3 4 5 6 7 8 →