Reasoning for Local Graph Over Knowledge Graph With a Multi-Policy Agent

被引:2
作者
Zhang, Jie [1 ]
Pei, Zhongmin [1 ]
Luo, Zhangkai [1 ]
机构
[1] Space Engn Univ, Sci & Technol Complex Elect Syst Simulat Lab, Beijing 101416, Peoples R China
来源
IEEE ACCESS | 2021年 / 9卷 / 09期
关键词
Cognition; Trajectory; Magnetic heads; Training; Search problems; Markov processes; Task analysis; Knowledge graph completion; local graph reasoning; Markov decision process; reinforcement learning; searching window;
D O I
10.1109/ACCESS.2021.3083794
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The reinforcement learning framework for multi-hop relational paths is one of the effective methods for solving knowledge graph incompletion. However, these models are associated with limited performances attributed to delayed rewards and far-fetched search trajectories. To overcome these challenges, we propose the searching window and multi-policy agent. The searching window provides a large action space, so that the agent can backtrack based on the newly obtained information and establish a local graph instead of a path chain. Based on the searching window, a double long short-term memory (DBL-LSTM) policy network is introduced to encode the local graph and relation sequence, after which the encoding information is used by the agent to select a correct entity to grow the local graph. Furthermore, multi-policy agent separately infers the local graph through three different policy networks, then, all local graphs are integrated into an information-rich local graph. Experiments using the WN18RR dataset revealed that local graph reasoning with searching window had greater rewards than path reasoning, the proposed DBL-LSTM policy network improved all HITS@N(N = 1,3,5,10) compared to prior works, and that the multi-policy agent achieved higher hit rates than single-policy agent.
引用
收藏
页码:78452 / 78462
页数:11
相关论文
共 31 条
  • [1] [Anonymous], 2019, ARXIV190900230
  • [2] Bordes A., 2013, P 26 INT C NEUR INF, V2, P2787
  • [3] Chen W., 2018, ARXIV180306581
  • [4] Das R., 2018, P INT C LEARN REPR, P1
  • [5] Das R, 2017, 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, P132
  • [6] Dettmers T, 2018, AAAI CONF ARTIF INTE, P1811
  • [7] Drumond L., 2012, P ACM S APPL COMP SA, P326, DOI DOI 10.1145/2245276.2245341
  • [8] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [9] Ioffe S, 2015, PR MACH LEARN RES, V37, P448
  • [10] Lin X.V., 2018, ARXIV180810568