DENOISING-ORIENTED DEEP HIERARCHICAL REINFORCEMENT LEARNING FOR NEXT-BASKET RECOMMENDATION

被引:8
作者
Du, Qihan [1 ]
Yu, Li [1 ]
Li, Huiyuan [1 ]
Leng, Youfang [1 ]
Ou, Ningrui [1 ]
机构
[1] Renmin Univ China, Beijing, Peoples R China
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年
关键词
Recommender systems; Reinforcement learning; Deep learning; Next-basket recommendation;
D O I
10.1109/ICASSP43922.2022.9747757
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Next basket recommendation aims to provide users a basket of items on the next visit by considering the sequence of their historical baskets. However, since a user's purchase interests vary over time, historical baskets often contain many irrelevant items to his/her next choices. Therefore, it is necessary to denoise the sequence of historical baskets and reserve the indeed relevant items to enhance the recommendation performance. In this work, we propose a Hierarchical Reinforcement Learning framework for next Basket recommendation, named HRL4Ba, which learns the personalized inter-basket and intra-basket contexts of the user for dynamic denoising. Specifically, the high-level and the low-level agent in the denoising module perform hierarchical decisions, i.e., revise baskets and remove items; the recommendation module serves as the environment to give feedback to agents and recommends the next basket. Extensive experiments on two e-commerce datasets show the HRL4Ba outperforms existing state-of-the-art methods, and our ablation studies further show the effectiveness of each component in HRL4Ba.
引用
收藏
页码:4093 / 4097
页数:5
相关论文
共 19 条
[1]   An Attribute-aware Neural Attentive Model for Next Basket Recommendation [J].
Bai, Ting ;
Nie, Jian-Yun ;
Zhao, Wayne Xin ;
Zhu, Yutao ;
Du, Pan ;
Wen, Ji-Rong .
ACM/SIGIR PROCEEDINGS 2018, 2018, :1201-1204
[2]  
Hidasi B, 2016, Session-based Recommendations with Recurrent Neual NetWorks, DOI DOI 10.48550/ARXIV.1511.06939
[3]  
Le DT, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2808
[4]  
Lei Y, 2020, IEEE T KNOWLEDGE DAT
[5]   Recurrent Convolution Basket Map for Diversity Next-Basket Recommendation [J].
Leng, Youfang ;
Yu, Li ;
Xiong, Jie ;
Xu, Guanyu .
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT III, 2020, 12114 :638-653
[6]  
Mantha A, 2020, INT CONF ACOUST SPEE, P3807, DOI [10.1109/icassp40776.2020.9053091, 10.1109/ICASSP40776.2020.9053091]
[7]   Human-level control through deep reinforcement learning [J].
Mnih, Volodymyr ;
Kavukcuoglu, Koray ;
Silver, David ;
Rusu, Andrei A. ;
Veness, Joel ;
Bellemare, Marc G. ;
Graves, Alex ;
Riedmiller, Martin ;
Fidjeland, Andreas K. ;
Ostrovski, Georg ;
Petersen, Stig ;
Beattie, Charles ;
Sadik, Amir ;
Antonoglou, Ioannis ;
King, Helen ;
Kumaran, Dharshan ;
Wierstra, Daan ;
Legg, Shane ;
Hassabis, Demis .
NATURE, 2015, 518 (7540) :529-533
[8]   The World is Binary: Contrastive Learning for Denoising Next Basket Recommendation [J].
Qin, Yuqi ;
Wang, Pengfei ;
Li, Chenliang .
SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, :859-868
[9]  
Rendle S., 2010, P 19 INT C WORLD WID, P811, DOI DOI 10.1145/1772690.1772773
[10]  
Silver D, 2014, PR MACH LEARN RES, V32