Multi-hop reasoning over paths in temporal knowledge graphs using reinforcement learning

被引：40

作者：

Bai, Luyi ^{[1
]}

Yu, Wenting ^{[1
]}

Chen, Mingzhuo ^{[1
]}

Ma, Xiangnan ^{[1
]}

机构：

[1] Northeastern Univ Qinhuangdao, Sch Comp & Commun Engn, Qinhuangdao 066004, Hebei, Peoples R China

来源：

APPLIED SOFT COMPUTING | 2021年 / 103卷

基金：

中国国家自然科学基金;

关键词：

Temporal knowledge graph; Multi-hop reasoning; Reinforcement learning; Long Short-Term Memory Networks;

D O I：

10.1016/j.asoc.2021.107144

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Knowledge graphs (KGs) are usually incomplete-many new facts can be inferred from KGs with existing information. In some traditional reasoning methods, temporal information is not taken into consideration, meaning that only triplets (head, relation, tail) are trained. In current dynamic knowledge graphs, it is a challenge to consider the temporal aspects of facts. Recent temporal reasoning methods embed temporal information into low-dimensional spaces. These methods mainly support implicitly reasoning, which means they cannot get the specific reasoning paths. These methods limit the accuracy of reasoning paths and ignore multiple explainable reasoning paths in temporal knowledge graphs (TKGs). To overcome this limitation, we propose a multi-hop reasoning model TPath in this paper. It is a reinforcement learning (RL) framework which can learn multi-hop reasoning paths and continuously adjust the reasoning paths in TKGs. More importantly, we add time vectors in reasoning paths, which further improve the accuracy of reasoning paths. Meanwhile, considering the diversity of temporal reasoning paths, we propose a new reward function. In TPath, the agent employs the Long Short-Term Memory networks (LSTM) to capture current observations from the environment, and it outputs action vectors (relation vectors and time vectors) to the environment through activation functions. Experimentally, our model outperforms other state-of-the-art reasoning methods in several aspects over two public temporal knowledge graph datasets. (C) 2021 Elsevier B.V. All rights reserved.

引用

页数：9

共 32 条

[1]

[Anonymous], 2015, ACS SYM SER

[2]

Bauerle A., 2020, IEEE T MULTIMEDIA, DOI DOI 10.1109/TMM.2020.2966887

[3]

Bordes A., 2013, Advances in Neural Information Processing Systems, V26, P2787, DOI DOI 10.5555/2999792.2999923

[4]

Boschee E., 2015, ICEWS coded event data

[5]

Cohen WW, 2017, 31 ANN C NEURAL INFO, V30

[6]

Das Rajarshi, 2018, INT C LEARN REPR

[7]

Dasgupta S.S., 2018, P 2018 C EMP METH NA, P2001, DOI 10.18653/

[8]

García-Durán A, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P4816

[9] Introducing time into RDF [J].

Gutierrez, Claudio ;

Hurtado, Carlos A. ;

Vaisman, Alejandro .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (02) :207-218

[10]

Heck L, 2013, INTERSPEECH, P1593

← 1 2 3 4 →