Knowledge-guided Deep Reinforcement Learning for Interactive Recommendation

被引：297

作者：

Chen, Xiaocong ^{[1
]}

Huang, Chaoran ^{[1
]}

Yao, Lina ^{[1
]}

Wang, Xianzhi ^{[2
]}

Liu, Wei ^{[1
]}

Zhang, Wenjie ^{[1
]}

机构：

[1] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia

[2] Univ Technol Sydney, Sch Comp Sci, Sydney, NSW, Australia

来源：

2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2020年

关键词：

Recommender System; Reinforcement Learning; Deep Neural Network;

D O I：

10.1109/ijcnn48605.2020.9207010

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Interactive recommendation aims to learn from dynamic interactions between items and users to achieve responsiveness and accuracy. Reinforcement learning is inherently advantageous for coping with dynamic environments and thus has attracted increasing attention in interactive recommendation research. Inspired by knowledge-aware recommendation, we proposed Knowledge-Guided deep Reinforcement learning (KGRL) to harness the advantages of both reinforcement learning and knowledge graphs for interactive recommendation. This model is implemented upon the actor-critic network framework. It maintains a local knowledge network to guide decision-making and employs the attention mechanism to capture long-term semantics between items. We have conducted comprehensive experiments in a simulated online environment with six public real-world datasets and demonstrated the superiority of our model over several state-of-the-art methods.

引用

页数：8

共 34 条

[1]

[Anonymous], 2014, NIPS WORKSH DISTR MA

[2] Unifying Knowledge Graph Learning and Recommendation: Towards a Better Understanding of User Preferences [J].

Cao, Yixin ;

Wang, Xiang ;

He, Xiangnan ;

Hu, Zikun ;

Chua, Tat-Seng .

WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, :151-161

[3]

Chen HK, 2019, AAAI CONF ARTIF INTE, P3312

[4] Stabilizing Reinforcement Learning in Dynamic Environment with Application to Online Recommendation [J].

Chen, Shi-Yong ;

Yu, Yang ;

Da, Qing ;

Tan, Jun ;

Huang, Hai-Kuan ;

Tang, Hai-Hong .

KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :1187-1196

[5]

Dulac-Arnold Gabriel, 2015, Deep reinforcement learning in large discrete action spaces

[6] A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients [J].

Grondman, Ivo ;

Busoniu, Lucian ;

Lopes, Gabriel A. D. ;

Babuska, Robert .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06) :1291-1307

[7] Online Learning to Rank for Information Retrieval SIGIR 2016 Tutorial [J].

Grotov, Artem ;

de Rijke, Maarten .

SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, :1215-1218

[8]

He J, 2010, NINTH WUHAN INTERNATIONAL CONFERENCE ON E-BUSINESS, VOLS I-III, P1061

[9] Improving Sequential Recommendation with Knowledge-Enhanced Memory Networks [J].

Huang, Jin ;

Zhao, Wayne Xin ;

Dou, Hongjian ;

Wen, Ji-Rong ;

Chang, Edward Y. .

ACM/SIGIR PROCEEDINGS 2018, 2018, :505-514

[10]

Kang Wang-Cheng, 2018, 2018 IEEE INT C DAT

← 1 2 3 4 →