DENOISING-ORIENTED DEEP HIERARCHICAL REINFORCEMENT LEARNING FOR NEXT-BASKET RECOMMENDATION

Cited by: 8

Authors:

Du, Qihan ^{[1]}

Yu, Li ^{[1]}

Li, Huiyuan ^{[1]}

Leng, Youfang ^{[1]}

Ou, Ningrui ^{[1]}

Affiliation:

[1] Renmin University of China, Beijing, People's Republic of China

Source:

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022

Keywords:

Recommender systems; Reinforcement learning; Deep learning; Next-basket recommendation;

DOI:

10.1109/ICASSP43922.2022.9747757

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

Next-basket recommendation aims to provide users with a basket of items on their next visit by considering the sequence of their historical baskets. However, since a user's purchase interests vary over time, historical baskets often contain many items that are irrelevant to the user's next choices. Therefore, it is necessary to denoise the sequence of historical baskets and retain only the truly relevant items to enhance recommendation performance. In this work, we propose a Hierarchical Reinforcement Learning framework for next Basket recommendation, named HRL4Ba, which learns the personalized inter-basket and intra-basket contexts of the user for dynamic denoising. Specifically, the high-level and low-level agents in the denoising module make hierarchical decisions, i.e., revising baskets and removing items; the recommendation module serves as the environment, giving feedback to the agents and recommending the next basket. Extensive experiments on two e-commerce datasets show that HRL4Ba outperforms existing state-of-the-art methods, and our ablation studies further show the effectiveness of each component in HRL4Ba.

Pages: 4093-4097

Number of pages: 5
