Reasoning for Local Graph Over Knowledge Graph With a Multi-Policy Agent

被引：2

作者：

Zhang, Jie ^{[1
]}

Pei, Zhongmin ^{[1
]}

Luo, Zhangkai ^{[1
]}

机构：

[1] Space Engn Univ, Sci & Technol Complex Elect Syst Simulat Lab, Beijing 101416, Peoples R China

来源：

IEEE ACCESS | 2021年 / 9卷 / 09期

关键词：

Cognition; Trajectory; Magnetic heads; Training; Search problems; Markov processes; Task analysis; Knowledge graph completion; local graph reasoning; Markov decision process; reinforcement learning; searching window;

D O I：

10.1109/ACCESS.2021.3083794

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The reinforcement learning framework for multi-hop relational paths is one of the effective methods for solving knowledge graph incompletion. However, these models are associated with limited performances attributed to delayed rewards and far-fetched search trajectories. To overcome these challenges, we propose the searching window and multi-policy agent. The searching window provides a large action space, so that the agent can backtrack based on the newly obtained information and establish a local graph instead of a path chain. Based on the searching window, a double long short-term memory (DBL-LSTM) policy network is introduced to encode the local graph and relation sequence, after which the encoding information is used by the agent to select a correct entity to grow the local graph. Furthermore, multi-policy agent separately infers the local graph through three different policy networks, then, all local graphs are integrated into an information-rich local graph. Experiments using the WN18RR dataset revealed that local graph reasoning with searching window had greater rewards than path reasoning, the proposed DBL-LSTM policy network improved all HITS@N(N = 1,3,5,10) compared to prior works, and that the multi-policy agent achieved higher hit rates than single-policy agent.

引用

页码：78452 / 78462

页数：11

共 31 条

[1] [Anonymous], 2019, ARXIV190900230
[2] Bordes A., 2013, P 26 INT C NEUR INF, V2, P2787
[3] Chen W., 2018, ARXIV180306581
[4] Das R., 2018, P INT C LEARN REPR, P1
[5] Das R, 2017, 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, P132
[6] Dettmers T, 2018, AAAI CONF ARTIF INTE, P1811
[7] Drumond L., 2012, P ACM S APPL COMP SA, P326, DOI DOI 10.1145/2245276.2245341
[8] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[9] Ioffe S, 2015, PR MACH LEARN RES, V37, P448
[10] Lin X.V., 2018, ARXIV180810568

← 1 2 3 4 →