DAPath: Distance-aware knowledge graph reasoning based on deep reinforcement learning

Cited by: 38
Authors
Tiwari, Prayag [1]
Zhu, Hongyin [2]
Pandey, Hari Mohan [3]
Affiliations
[1] Univ Padua, Dept Informat Engn, Padua, Italy
[2] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
[3] Edge Hill Univ, Dept Comp Sci, Ormskirk L39 4QP, England
Funding
EU Horizon 2020
Keywords
Knowledge graph reasoning; Reinforcement learning; Graph self-attention; GRU
DOI
10.1016/j.neunet.2020.11.012
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Knowledge graph reasoning aims to find reasoning paths for relations over incomplete knowledge graphs (KGs). Prior work may not account for the fact that the reward at each position (a vertex in the graph) can differ. We propose a distance-aware reward in the reinforcement learning framework to assign different rewards to different positions. We observe that KG embeddings are learned from independent triples and therefore cannot fully cover the information described in the local neighborhood. To this end, we integrate a graph self-attention (GSA) mechanism to capture more comprehensive entity information from the neighboring entities and relations. To let the model remember the path, we combine the GSA mechanism with a GRU to take into account the memory of relations in the path. Our approach can train the agent in one pass, thus eliminating the pre-training or fine-tuning process, which significantly reduces the problem complexity. Experimental results demonstrate the effectiveness of our method. We found that our model can mine more balanced paths for each relation. © 2020 Elsevier Ltd. All rights reserved.
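The abstract does not give the reward formula, so the following is only a minimal sketch of what a position-dependent reward could look like, not the authors' definition. It assumes that "distance" means hop count to the target entity and that the reward decays geometrically with that distance. The function names, the `decay` factor, and the `miss_penalty` value are all hypothetical.

```python
from collections import deque


def hop_distances(graph, target):
    """Breadth-first search: hop count from `target` to every reachable vertex.

    `graph` maps each vertex to a list of neighbors (assumed symmetric here).
    """
    dist = {target: 0}
    queue = deque([target])
    while queue:
        node = queue.popleft()
        for neighbor in graph.get(node, ()):
            if neighbor not in dist:
                dist[neighbor] = dist[node] + 1
                queue.append(neighbor)
    return dist


def distance_aware_reward(dist, vertex, decay=0.5, miss_penalty=-1.0):
    """Hypothetical shaping: 1.0 at the target, smaller the farther away.

    Vertices that cannot reach the target get a fixed penalty (an assumption).
    """
    if vertex not in dist:
        return miss_penalty
    return decay ** dist[vertex]


# Toy graph: a - b - c - target, plus an isolated vertex z.
graph = {
    "a": ["b"],
    "b": ["a", "c"],
    "c": ["b", "target"],
    "target": ["c"],
    "z": [],
}
dist = hop_distances(graph, "target")
rewards = {v: distance_aware_reward(dist, v) for v in graph}
```

In this toy setup an agent stopping at `c` receives a larger reward than one stopping at `a`, which illustrates the idea of assigning different rewards to different positions; the actual reward used in DAPath may be defined differently.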
Pages: 1-12
Number of pages: 12