Social Attentive Deep Q-Networks for Recommender Systems

被引：8

作者：

Lei, Yu ^{[1
]}

Wang, Zhitao ^{[1
]}

Li, Wenjie ^{[1
]}

Pei, Hongbin ^{[2
]}

Dai, Quanyu ^{[1
]}

机构：

[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China

[2] Jilin Univ, Sch Comp Sci & Technol, Changchun 130012, Jilin, Peoples R China

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2022年 / 34卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Social network services; Learning (artificial intelligence); Recommender systems; Machine learning; Task analysis; Estimation; Standards; DQN; reinforcement learning; recommender systems; social networks;

D O I：

10.1109/TKDE.2020.3012346

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recommender systems aim to accurately and actively provide users with potentially interesting items (products, information or services). Deep reinforcement learning has been successfully applied to recommender systems, but still heavily suffer from data sparsity and cold-start in real-world tasks. In this work, we propose an effective way to address such issues by leveraging the pervasive social networks among users in the estimation of action-values (Q). Specifically, we develop a Social Attentive Deep Q-network (SADQN) to approximate the optimal action-value function based on the preferences of both individual users and social neighbors, by successfully utilizing a social attention layer to model the influence between them. Further, we propose an enhanced variant of SADQN, termed SADQN++, to model the complicated and diverse trade-offs between personal preferences and social influence for all involved users, making the agent more powerful and flexible in learning the optimal policies. The experimental results on real-world datasets demonstrate that the proposed SADQNs remarkably outperform the state-of-the-art deep reinforcement learning agents, with reasonable computation cost.

引用

页码：2443 / 2457

页数：15

共 86 条

[1] Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions
Adomavicius, G
Tuzhilin, A
[J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (06) : 734 - 749
[2] Badrul S., 2001, P 10 INT C WORLD WID, P285, DOI DOI 10.1145/371920.372071
[3] Bakshy E, 2012, P 21 INT C WORLD WID, P519, DOI DOI 10.1145/2187836.2187907
[4] DYNAMIC PROGRAMMING
BELLMAN, R
[J]. SCIENCE, 1966, 153 (3731) : 34 - &
[5] Berg R. v. d., 2017, ARXIV170602263
[6] A 61-million-person experiment in social influence and political mobilization
Bond, Robert M.
Fariss, Christopher J.
Jones, Jason J.
Kramer, Adamd. I.
Marlow, Cameron
Settle, Jaime E.
Fowler, James H.
[J]. NATURE, 2012, 489 (7415) : 295 - 298
[7] Cantador I., 2011, P 5 ACM C RECOMMENDE, P387, DOI DOI 10.1145/2043932.2044016
[8] Chaney A. J., 2015, 9 ACM C RECOMMENDER, P43, DOI [10.1145/2792838.2800193, DOI 10.1145/2792838.2800193]
[9] Social Attentional Memory Network: Modeling Aspect- and Friend-level Differences in Recommendation
Chen, Chong
Zhang, Min
Liu, Yiqun
Ma, Shaoping
[J]. PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, : 177 - 185
[10] Chen HK, 2019, AAAI CONF ARTIF INTE, P3312

← 1 2 3 4 5 6 7 8 9 →