Sequential Recommendation Using Deep Reinforcement Learning and Multi-Head Attention

被引:1
作者
Sultan, Raneem [1 ]
Abu-Elkheir, Mervat [1 ]
机构
[1] German Univ Cairo, Media Engn & Technol, Cairo, Egypt
来源
2022 56TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS) | 2022年
关键词
Recommender Systems; Deep Reinforcement Learning; Attention;
D O I
10.1109/CISS53076.2022.9751174
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recommender Systems have become a crucial part of many of our online interactions. From shopping for clothes, planning a trip, or deciding what to watch, recommender systems are aiming to help users navigate the overwhelming amount of options available online. The problem with most of the existing recommender systems is that they treat the recommendation process as a static one and make recommendations according to a fixed greedy strategy. This is a problem because user preferences are dynamic. In this paper, we aim to address this problem by modeling the recommendation problem as a Markov Decision Process (MDP) and solving it using deep reinforcement learning. Furthermore, we use multi-head attention to improve the recommendations. We conduct extensive experiments using the MovieLens real-world dataset and achieve an improvement of 6% over the state-of-the-art approach results in terms of precision@20.
引用
收藏
页码:258 / 262
页数:5
相关论文
共 23 条
[1]  
[Anonymous], 2008, P 14 ACM SIGKDD INT, DOI DOI 10.1145/1401890.1401944
[2]   Deep Neural Networks for YouTube Recommendations [J].
Covington, Paul ;
Adams, Jay ;
Sargin, Emre .
PROCEEDINGS OF THE 10TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'16), 2016, :191-198
[3]   Translation-based Recommendation [J].
He, Ruining ;
Kang, Wang-Cheng ;
McAuley, Julian .
PROCEEDINGS OF THE ELEVENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'17), 2017, :161-169
[4]  
He RN, 2016, IEEE DATA MINING, P191, DOI [10.1109/ICDM.2016.88, 10.1109/ICDM.2016.0030]
[5]   Cumulated gain-based evaluation of IR techniques [J].
Järvelin, K ;
Kekäläinen, J .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2002, 20 (04) :422-446
[6]   Self-Attentive Sequential Recommendation [J].
Kang, Wang-Cheng ;
McAuley, Julian .
2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, :197-206
[7]  
Li JC, 2020, PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), P322, DOI 10.1145/3336191.3371786
[8]  
Li Lihong, 2010, P 19 INT C WORLD WID, P661
[9]  
Lillicrap TP., 2015, ARXIV
[10]  
Liu F, 2019, Arxiv, DOI arXiv:1810.12027