Intrinsically motivated reinforcement learning based recommendation with counterfactual data augmentation

Cited by: 0

Authors:

Xiaocong Chen

Siyu Wang

Lianyong Qi

Yong Li

Lina Yao

Affiliations:

[1] University of New South Wales, School of Computer Science and Engineering

[2] China University of Petroleum (East China), College of Computer Science and Technology

[3] Tsinghua University, Department of Electronic Engineering

[4] CSIRO, Data61

Source:

World Wide Web | 2023 / Vol. 26

Keywords:

Recommender systems; Deep reinforcement learning; Counterfactual reasoning

DOI:

Not available

Abstract:

Recent work has shown that deep reinforcement learning (DRL) is promising for modeling dynamic user preferences in recommender systems (RS). However, training a DRL agent in a sparse RS environment is a significant challenge, because the agent must balance exploring informative user-item interaction trajectories against exploiting existing trajectories for policy learning: the well-known exploration-exploitation trade-off. This trade-off strongly affects recommendation performance when the environment is sparse, and it is even harder to manage in DRL-based RS, where the agent must both explore informative trajectories deeply and exploit them efficiently. To address this issue, we propose a novel intrinsically motivated reinforcement learning (IMRL) method that enhances the agent’s capability to explore informative interaction trajectories in the sparse environment. We further enrich these trajectories via an adaptive counterfactual augmentation strategy with a customised threshold, so that they can be exploited more efficiently. We evaluate our approach on six offline datasets and three online simulation platforms. The extensive experiments show that IMRL outperforms existing state-of-the-art methods in recommendation performance in the sparse RS environment.
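
The abstract does not specify how the intrinsic reward is computed. As a generic illustration of intrinsic motivation only, and not the authors' IMRL method, the sketch below adds a count-based novelty bonus to a sparse extrinsic reward, so that rarely tried (state, item) pairs are worth more to the agent. The class name `CountBasedBonus`, the parameter `beta`, and the bonus form `beta / sqrt(count)` are all assumptions made for this example.

```python
from collections import Counter
import math


class CountBasedBonus:
    """Hypothetical novelty bonus: beta / sqrt(visit count of (state, item))."""

    def __init__(self, beta: float = 0.1) -> None:
        self.beta = beta          # assumed scale of the intrinsic reward
        self.counts = Counter()   # visit counts per (state, item) pair

    def total_reward(self, state, item, extrinsic: float) -> float:
        """Return the extrinsic reward plus a bonus that decays with visits."""
        key = (state, item)
        self.counts[key] += 1
        intrinsic = self.beta / math.sqrt(self.counts[key])
        return extrinsic + intrinsic


bonus = CountBasedBonus(beta=0.1)
# With no extrinsic feedback (sparse environment), the first visit to a
# (state, item) pair earns a larger bonus than a repeat visit.
first = bonus.total_reward(state="u1", item="i7", extrinsic=0.0)
second = bonus.total_reward(state="u1", item="i7", extrinsic=0.0)
```

Because the bonus shrinks as a pair is revisited, an agent trained on this combined reward is nudged toward interactions it has not yet tried, which is the general idea behind exploration bonuses in sparse-reward settings.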

Pages: 3253-3274

Page count: 21

