Intrinsically motivated reinforcement learning based recommendation with counterfactual data augmentation

Cited by: 0
Authors
Xiaocong Chen
Siyu Wang
Lianyong Qi
Yong Li
Lina Yao
Affiliations
[1] University of New South Wales,School of Computer Science and Engineering
[2] China University of Petroleum (East China),College of Computer Science and Technology
[3] Tsinghua University,Department of Electronic Engineering
[4] CSIRO,Data 61
Source
World Wide Web | 2023 / Vol. 26
Keywords
Recommender systems; Deep reinforcement learning; Counterfactual reasoning
DOI
Not available
Abstract
Deep reinforcement learning (DRL) has shown promising results in modeling dynamic user preferences in recommender systems (RS) in recent literature. However, training a DRL agent in a sparse RS environment poses a significant challenge, because the agent must balance exploring informative user-item interaction trajectories against using existing trajectories for policy learning: the well-known exploration-exploitation trade-off. This trade-off greatly affects recommendation performance when the environment is sparse. In DRL-based RS, striking the balance is even more challenging, as the agent needs to explore informative trajectories deeply and exploit them efficiently in the context of RS. To address this issue, we propose a novel intrinsically motivated reinforcement learning (IMRL) method that enhances the agent’s capability to explore informative interaction trajectories in the sparse environment. We further enrich these trajectories via an adaptive counterfactual augmentation strategy with a customised threshold to improve their efficiency in exploitation. Our approach is evaluated on six offline datasets and three online simulation platforms, demonstrating its superiority over existing state-of-the-art methods. The extensive experiments show that our IMRL method outperforms other methods in terms of recommendation performance in the sparse RS environment.
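The abstract does not state the form of the intrinsic reward, so the following is only a generic illustration of intrinsic motivation under sparse feedback, not the authors' IMRL method. It is a minimal sketch that assumes a count-based novelty bonus added to the environment reward; the class name `CountBasedBonus`, the weight `beta`, and the string identifiers for users and items are all hypothetical.

```python
from collections import defaultdict
import math


class CountBasedBonus:
    """Generic count-based novelty bonus (an assumption, not the paper's IMRL formulation)."""

    def __init__(self, beta=0.5):
        self.beta = beta  # hypothetical weight of the intrinsic term
        self.counts = defaultdict(int)  # visits per (state, item) pair

    def shaped_reward(self, state, item, extrinsic_reward):
        # Record the visit, then add a bonus that shrinks as the pair is revisited.
        self.counts[(state, item)] += 1
        bonus = self.beta / math.sqrt(self.counts[(state, item)])
        return extrinsic_reward + bonus


bonus = CountBasedBonus(beta=0.5)
# Sparse environment: the extrinsic reward is zero, so only the bonus is returned.
first = bonus.shaped_reward("user_42", "item_7", 0.0)
second = bonus.shaped_reward("user_42", "item_7", 0.0)
```

In this sketch a rarely recommended user-item pair earns a larger bonus than a frequently visited one, which nudges the agent toward unexplored trajectories even when the environment itself returns no reward.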
Pages: 3253-3274
Page count: 21