Knowledge-guided Deep Reinforcement Learning for Interactive Recommendation

被引:359
作者
Chen, Xiaocong [1 ]
Huang, Chaoran [1 ]
Yao, Lina [1 ]
Wang, Xianzhi [2 ]
Liu, Wei [1 ]
Zhang, Wenjie [1 ]
机构
[1] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia
[2] Univ Technol Sydney, Sch Comp Sci, Sydney, NSW, Australia
来源
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2020年
关键词
Recommender System; Reinforcement Learning; Deep Neural Network;
D O I
10.1109/ijcnn48605.2020.9207010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Interactive recommendation aims to learn from dynamic interactions between items and users to achieve responsiveness and accuracy. Reinforcement learning is inherently advantageous for coping with dynamic environments and thus has attracted increasing attention in interactive recommendation research. Inspired by knowledge-aware recommendation, we proposed Knowledge-Guided deep Reinforcement learning (KGRL) to harness the advantages of both reinforcement learning and knowledge graphs for interactive recommendation. This model is implemented upon the actor-critic network framework. It maintains a local knowledge network to guide decision-making and employs the attention mechanism to capture long-term semantics between items. We have conducted comprehensive experiments in a simulated online environment with six public real-world datasets and demonstrated the superiority of our model over several state-of-the-art methods.
引用
收藏
页数:8
相关论文
共 34 条
  • [1] [Anonymous], 2019, NEURIPS
  • [2] Unifying Knowledge Graph Learning and Recommendation: Towards a Better Understanding of User Preferences
    Cao, Yixin
    Wang, Xiang
    He, Xiangnan
    Hu, Zikun
    Chua, Tat-Seng
    [J]. WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 151 - 161
  • [3] Chen HK, 2019, AAAI CONF ARTIF INTE, P3312
  • [4] Stabilizing Reinforcement Learning in Dynamic Environment with Application to Online Recommendation
    Chen, Shi-Yong
    Yu, Yang
    Da, Qing
    Tan, Jun
    Huang, Hai-Kuan
    Tang, Hai-Hong
    [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1187 - 1196
  • [5] Dulac-Arnold G., 2015, arXiv preprint arXiv:1512.07679, P2
  • [6] A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients
    Grondman, Ivo
    Busoniu, Lucian
    Lopes, Gabriel A. D.
    Babuska, Robert
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06): : 1291 - 1307
  • [7] Online Learning to Rank for Information Retrieval SIGIR 2016 Tutorial
    Grotov, Artem
    de Rijke, Maarten
    [J]. SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 1215 - 1218
  • [8] He J, 2010, NINTH WUHAN INTERNATIONAL CONFERENCE ON E-BUSINESS, VOLS I-III, P1061
  • [9] Improving Sequential Recommendation with Knowledge-Enhanced Memory Networks
    Huang, Jin
    Zhao, Wayne Xin
    Dou, Hongjian
    Wen, Ji-Rong
    Chang, Edward Y.
    [J]. ACM/SIGIR PROCEEDINGS 2018, 2018, : 505 - 514
  • [10] Johnson Christopher C, 2014, NIPS WORKSH DISTR MA, V27