Social Attentive Deep Q-Networks for Recommender Systems

被引:8
作者
Lei, Yu [1 ]
Wang, Zhitao [1 ]
Li, Wenjie [1 ]
Pei, Hongbin [2 ]
Dai, Quanyu [1 ]
机构
[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
[2] Jilin Univ, Sch Comp Sci & Technol, Changchun 130012, Jilin, Peoples R China
基金
中国国家自然科学基金;
关键词
Social network services; Learning (artificial intelligence); Recommender systems; Machine learning; Task analysis; Estimation; Standards; DQN; reinforcement learning; recommender systems; social networks;
D O I
10.1109/TKDE.2020.3012346
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recommender systems aim to accurately and actively provide users with potentially interesting items (products, information or services). Deep reinforcement learning has been successfully applied to recommender systems, but still heavily suffer from data sparsity and cold-start in real-world tasks. In this work, we propose an effective way to address such issues by leveraging the pervasive social networks among users in the estimation of action-values (Q). Specifically, we develop a Social Attentive Deep Q-network (SADQN) to approximate the optimal action-value function based on the preferences of both individual users and social neighbors, by successfully utilizing a social attention layer to model the influence between them. Further, we propose an enhanced variant of SADQN, termed SADQN++, to model the complicated and diverse trade-offs between personal preferences and social influence for all involved users, making the agent more powerful and flexible in learning the optimal policies. The experimental results on real-world datasets demonstrate that the proposed SADQNs remarkably outperform the state-of-the-art deep reinforcement learning agents, with reasonable computation cost.
引用
收藏
页码:2443 / 2457
页数:15
相关论文
共 86 条
  • [11] SamWalker: Social Recommendation with Informative Sampling Strategy
    Chen, Jiawei
    Wang, Can
    Zhou, Sheng
    Shi, Qihao
    Feng, Yan
    Chen, Chun
    [J]. WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 228 - 239
  • [12] Top-K Off-Policy Correction for a REINFORCE Recommender System
    Chen, Minmin
    Beutel, Alex
    Covington, Paul
    Jain, Sagar
    Belletti, Francois
    Chi, Ed H.
    [J]. PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, : 456 - 464
  • [13] Stabilizing Reinforcement Learning in Dynamic Environment with Application to Online Recommendation
    Chen, Shi-Yong
    Yu, Yang
    Da, Qing
    Tan, Jun
    Huang, Hai-Kuan
    Tang, Hai-Hong
    [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1187 - 1196
  • [14] Chen XS, 2019, 36 INT C MACHINE LEA, V97
  • [15] Choi S.-M., 2018, ARXIV PREPRINT ARXIV
  • [16] Cremonesi Paolo, 2010, P 4 ACM C REC SYST B, P39
  • [17] Item-based top-N recommendation algorithms
    Deshpande, M
    Karypis, G
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2004, 22 (01) : 143 - 177
  • [18] Collaborative Memory Network for Recommendation Systems
    Ebesu, Travis
    Shen, Bin
    Fang, Yi
    [J]. ACM/SIGIR PROCEEDINGS 2018, 2018, : 515 - 524
  • [19] Graph Neural Networks for Social Recommendation
    Fan, Wenqi
    Ma, Yao
    Li, Qing
    He, Yuan
    Zhao, Eric
    Tang, Jiliang
    Yin, Dawei
    [J]. WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 417 - 426
  • [20] Fortunato M, 2018, P INT C REPR LEARN