Knowledge Reasoning Method Based on Deep Transfer Reinforcement Learning: DTRLpath

被引：0

作者：

Lin, Shiming ^{[1
,2
,3
]}

Ye, Ling ^{[2
]}

Zhuang, Yijie ^{[1
]}

Lu, Lingyun ^{[2
]}

Zheng, Shaoqiu ^{[2
]}

Huang, Chenxi ^{[1
]}

Kwee, Ng Yin ^{[4
]}

机构：

[1] Xiamen Univ, Sch Informat, Xiamen 361104, Peoples R China

[2] Nanjing Res Inst Elect Engn, Key Lab Informat Syst Requirement, Nanjing 210007, Peoples R China

[3] Changji Univ, Sch Informat Engn, Changji 831100, Peoples R China

[4] Nanyang Technol Univ, Sch Mech & Aerosp Engn, Singapore 639798, Singapore

来源：

CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 80卷 / 01期

关键词：

Intelligent agent; knowledge graph reasoning; reinforcement; transfer learning;

D O I：

10.32604/cmc.2024.051379

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In recent years, with the continuous development of deep learning and knowledge graph reasoning methods, more and more researchers have shown great interest in improving knowledge graph reasoning methods by inferring missing facts through reasoning. By searching paths on the knowledge graph and making fact and link predictions based on these paths, deep learning-based Reinforcement Learning (RL) agents can demonstrate good performance and interpretability. Therefore, deep reinforcement learning-based knowledge reasoning methods have rapidly emerged in recent years and have become a hot research topic. However, even in a small and fixed knowledge graph reasoning action space, there are still a large number of invalid actions. It often leads to the interruption of RL agents' wandering due to the selection of invalid actions, resulting in a significant decrease in the success rate of path mining. In order to improve the success rate of RL agents in the early stages of path search, this article proposes a knowledge reasoning method based on Deep Transfer Reinforcement Learning path (DTRLpath). Before supervised pre-training and retraining, a pre-task of searching for effective actions in a single step is added. The RL agent is first trained in the pre-task to improve its ability to search for effective actions. Then, the trained agent is transferred to the target reasoning task for path search training, which improves its success rate in searching for target task paths. Finally, based on the comparative experimental results on the FB15K-237 and NELL-995 datasets, it can be concluded that the proposed method significantly improves the success rate of path search and outperforms similar methods in most reasoning tasks.

引用

页码：299 / 317

页数：19

共 31 条

[1]

Ammanabrolu P, 2019, Arxiv, DOI arXiv:1908.06556

[2]

Ba J, 2014, ACS SYM SER

[3]

Bordes A., 2013, Advances in Neural Information Processing Systems, V26

[4]

Chen WH, 2018, Arxiv, DOI arXiv:1803.06581

[5]

Das R., 2018, P 6 INT C LEARN REPR, P1

[6]

Dong L, 2015, PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, P260

[7]

Gao Y, 2020, Arxiv, DOI arXiv:2004.00387

[8] A Survey on Knowledge Graph-Based Recommender Systems [J].

Guo, Qingyu ;

Zhuang, Fuzhen ;

Qin, Chuan ;

Zhu, Hengshu ;

Xie, Xing ;

Xiong, Hui ;

He, Qing .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (08) :3549-3568

[9]

Han X, 2018, CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, P139

[10]

Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]

← 1 2 3 4 →