Multi-hop reasoning over paths in temporal knowledge graphs using reinforcement learning

被引:32
作者
Bai, Luyi [1 ]
Yu, Wenting [1 ]
Chen, Mingzhuo [1 ]
Ma, Xiangnan [1 ]
机构
[1] Northeastern Univ Qinhuangdao, Sch Comp & Commun Engn, Qinhuangdao 066004, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
Temporal knowledge graph; Multi-hop reasoning; Reinforcement learning; Long Short-Term Memory Networks;
D O I
10.1016/j.asoc.2021.107144
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge graphs (KGs) are usually incomplete-many new facts can be inferred from KGs with existing information. In some traditional reasoning methods, temporal information is not taken into consideration, meaning that only triplets (head, relation, tail) are trained. In current dynamic knowledge graphs, it is a challenge to consider the temporal aspects of facts. Recent temporal reasoning methods embed temporal information into low-dimensional spaces. These methods mainly support implicitly reasoning, which means they cannot get the specific reasoning paths. These methods limit the accuracy of reasoning paths and ignore multiple explainable reasoning paths in temporal knowledge graphs (TKGs). To overcome this limitation, we propose a multi-hop reasoning model TPath in this paper. It is a reinforcement learning (RL) framework which can learn multi-hop reasoning paths and continuously adjust the reasoning paths in TKGs. More importantly, we add time vectors in reasoning paths, which further improve the accuracy of reasoning paths. Meanwhile, considering the diversity of temporal reasoning paths, we propose a new reward function. In TPath, the agent employs the Long Short-Term Memory networks (LSTM) to capture current observations from the environment, and it outputs action vectors (relation vectors and time vectors) to the environment through activation functions. Experimentally, our model outperforms other state-of-the-art reasoning methods in several aspects over two public temporal knowledge graph datasets. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:9
相关论文
共 32 条
[1]  
Bauerle A., 2020, IEEE T MULTIMEDIA, DOI DOI 10.1109/TMM.2020.2966887
[2]  
Bordes A., 2013, ADV NEURAL INFORM PR, V26, P2787, DOI DOI 10.5555/2999792.2999923
[3]  
Boschee E., 2015, ICEWS Coded Event Data Read Me
[4]  
Das R., 2018, INT C LEARN REPR
[5]  
Dasgupta Shib Sankar, 2018, P 2018 C EMP METH NA, P2001
[6]  
García-Durán A, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P4816
[7]   Introducing time into RDF [J].
Gutierrez, Claudio ;
Hurtado, Carlos A. ;
Vaisman, Alejandro .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (02) :207-218
[8]  
Heck L, 2013, INTERSPEECH, P1593
[9]  
Hildebrandt M, 2020, AAAI CONF ARTIF INTE, V34, P4123
[10]  
Ji GL, 2016, AAAI CONF ARTIF INTE, P985