Social Attentive Deep Q-Networks for Recommender Systems

被引:8
作者
Lei, Yu [1 ]
Wang, Zhitao [1 ]
Li, Wenjie [1 ]
Pei, Hongbin [2 ]
Dai, Quanyu [1 ]
机构
[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
[2] Jilin Univ, Sch Comp Sci & Technol, Changchun 130012, Jilin, Peoples R China
基金
中国国家自然科学基金;
关键词
Social network services; Learning (artificial intelligence); Recommender systems; Machine learning; Task analysis; Estimation; Standards; DQN; reinforcement learning; recommender systems; social networks;
D O I
10.1109/TKDE.2020.3012346
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recommender systems aim to accurately and actively provide users with potentially interesting items (products, information or services). Deep reinforcement learning has been successfully applied to recommender systems, but still heavily suffer from data sparsity and cold-start in real-world tasks. In this work, we propose an effective way to address such issues by leveraging the pervasive social networks among users in the estimation of action-values (Q). Specifically, we develop a Social Attentive Deep Q-network (SADQN) to approximate the optimal action-value function based on the preferences of both individual users and social neighbors, by successfully utilizing a social attention layer to model the influence between them. Further, we propose an enhanced variant of SADQN, termed SADQN++, to model the complicated and diverse trade-offs between personal preferences and social influence for all involved users, making the agent more powerful and flexible in learning the optimal policies. The experimental results on real-world datasets demonstrate that the proposed SADQNs remarkably outperform the state-of-the-art deep reinforcement learning agents, with reasonable computation cost.
引用
收藏
页码:2443 / 2457
页数:15
相关论文
共 86 条
  • [1] Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions
    Adomavicius, G
    Tuzhilin, A
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (06) : 734 - 749
  • [2] Badrul S., 2001, P 10 INT C WORLD WID, P285, DOI DOI 10.1145/371920.372071
  • [3] Bakshy E, 2012, P 21 INT C WORLD WID, P519, DOI DOI 10.1145/2187836.2187907
  • [4] DYNAMIC PROGRAMMING
    BELLMAN, R
    [J]. SCIENCE, 1966, 153 (3731) : 34 - &
  • [5] Berg R. v. d., 2017, ARXIV170602263
  • [6] A 61-million-person experiment in social influence and political mobilization
    Bond, Robert M.
    Fariss, Christopher J.
    Jones, Jason J.
    Kramer, Adamd. I.
    Marlow, Cameron
    Settle, Jaime E.
    Fowler, James H.
    [J]. NATURE, 2012, 489 (7415) : 295 - 298
  • [7] Cantador I., 2011, P 5 ACM C RECOMMENDE, P387, DOI DOI 10.1145/2043932.2044016
  • [8] Chaney A. J., 2015, 9 ACM C RECOMMENDER, P43, DOI [10.1145/2792838.2800193, DOI 10.1145/2792838.2800193]
  • [9] Social Attentional Memory Network: Modeling Aspect- and Friend-level Differences in Recommendation
    Chen, Chong
    Zhang, Min
    Liu, Yiqun
    Ma, Shaoping
    [J]. PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, : 177 - 185
  • [10] Chen HK, 2019, AAAI CONF ARTIF INTE, P3312