Social Attentive Deep Q-Networks for Recommender Systems

Cited by: 8

Authors:

Lei, Yu [1]

Wang, Zhitao [1]

Li, Wenjie [1]

Pei, Hongbin [2]

Dai, Quanyu [1]

Affiliations:

[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China

[2] Jilin Univ, Sch Comp Sci & Technol, Changchun 130012, Jilin, Peoples R China

Source:

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2022 / Vol. 34 / No. 05

Funding:

National Natural Science Foundation of China;

Keywords:

Social network services; Learning (artificial intelligence); Recommender systems; Machine learning; Task analysis; Estimation; Standards; DQN; reinforcement learning; recommender systems; social networks;

DOI:

10.1109/TKDE.2020.3012346

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Recommender systems aim to accurately and actively provide users with potentially interesting items (products, information or services). Deep reinforcement learning has been successfully applied to recommender systems, but it still suffers heavily from data sparsity and cold-start problems in real-world tasks. In this work, we propose an effective way to address such issues by leveraging the pervasive social networks among users in the estimation of action-values (Q). Specifically, we develop a Social Attentive Deep Q-network (SADQN) to approximate the optimal action-value function based on the preferences of both individual users and social neighbors, by successfully utilizing a social attention layer to model the influence between them. Further, we propose an enhanced variant of SADQN, termed SADQN++, to model the complicated and diverse trade-offs between personal preferences and social influence for all involved users, making the agent more powerful and flexible in learning the optimal policies. The experimental results on real-world datasets demonstrate that the proposed SADQNs remarkably outperform the state-of-the-art deep reinforcement learning agents, at a reasonable computational cost.
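
The abstract does not give the network architecture, so the following is only an illustrative sketch of the general idea it describes: estimating action-values from a user's own preferences combined with an attention-weighted summary of social neighbors' preferences. Everything here is an assumption made for illustration and not the authors' SADQN: the function names (`softmax`, `dot`, `social_attentive_q`), the use of plain embedding vectors, dot-product attention, and the fixed `social_weight` trade-off are all hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def social_attentive_q(user_vec, neighbor_vecs, item_vecs, social_weight=0.5):
    """Hypothetical Q-value estimate for each candidate item (action).

    Assumption: attention weights come from dot-product similarity between
    the user and each neighbor; the paper's actual layer may differ.
    """
    if neighbor_vecs:
        # Attention over social neighbors.
        attn = softmax([dot(user_vec, n) for n in neighbor_vecs])
        # Attention-weighted average of neighbor preference vectors.
        social_vec = [
            sum(w * n[d] for w, n in zip(attn, neighbor_vecs))
            for d in range(len(user_vec))
        ]
    else:
        # No social neighbors: fall back to personal preferences only.
        attn = []
        social_vec = [0.0] * len(user_vec)
        social_weight = 0.0

    # Blend the personal and social estimates for every item.
    q_values = [
        (1.0 - social_weight) * dot(user_vec, item)
        + social_weight * dot(social_vec, item)
        for item in item_vecs
    ]
    return q_values, attn


# Toy usage with made-up 2-dimensional embeddings.
user = [1.0, 0.0]
neighbors = [[1.0, 0.0], [0.0, 1.0]]
items = [[1.0, 0.0], [0.0, 1.0]]
q_values, attention = social_attentive_q(user, neighbors, items)
best_item = max(range(len(items)), key=lambda i: q_values[i])
```

In this toy setup the neighbor most similar to the user receives the larger attention weight, and a greedy policy would recommend the item with the highest blended value. A fixed `social_weight` is the simplest possible choice; the abstract suggests that SADQN++ models this trade-off in a more flexible way, which this sketch does not attempt.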


Pages: 2443-2457

Page count: 15

