Intrinsically motivated reinforcement learning based recommendation with counterfactual data augmentation

被引:2
作者
Chen, Xiaocong [1 ]
Wang, Siyu [1 ]
Qi, Lianyong [2 ]
Li, Yong [3 ]
Yao, Lina [1 ,4 ]
机构
[1] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
[2] China Univ Petr East China, Coll Comp Sci & Technol, Dong Ying Shi, Peoples R China
[3] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[4] CSIRO, Data 61, Eveleigh, NSW 2015, Australia
来源
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2023年 / 26卷 / 05期
关键词
Recommender systems; Deep reinforcement learning; Counterfactual reasoning; CAPACITY;
D O I
10.1007/s11280-023-01187-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep reinforcement learning (DRL) has shown promising results in modeling dynamic user preferences in RS in recent literature. However, training a DRL agent in the sparse RS environment poses a significant challenge. This is because the agent must balance between exploring informative user-item interaction trajectories and using existing trajectories for policy learning, a known exploration and exploitation trade-off. This trade-off greatly affects the recommendation performance when the environment is sparse. In DRL-based RS, balancing exploration and exploitation is even more challenging as the agent needs to deeply explore informative trajectories and efficiently exploit them in the context of RS. To address this issue, we propose a novel intrinsically motivated reinforcement learning (IMRL) method that enhances the agent's capability to explore informative interaction trajectories in the sparse environment. We further enrich these trajectories via an adaptive counterfactual augmentation strategy with a customised threshold to improve their efficiency in exploitation. Our approach is evaluated on six offline datasets and three online simulation platforms, demonstrating its superiority over existing state-of-the-art methods. The extensive experiments show that our IMRL method outperforms other methods in terms of recommendation performance in the sparse RS environment.
引用
收藏
页码:3253 / 3274
页数:22
相关论文
共 50 条
  • [21] Reinforcement Learning based Recommendation with Graph Convolutional Q-network
    Lei, Yu
    Pei, Hongbin
    Yan, Hanqi
    Li, Wenjie
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1757 - 1760
  • [22] Daily Schedule Recommendation in Urban Life Based on Deep Reinforcement Learning
    Liu, Jia
    Zhai, Donghai
    Huang, Wei
    Ji, Shenggong
    Zhang, Junbo
    Li, Tianrui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (02) : 2196 - 2205
  • [23] Deep Reinforcement Learning Recommendation System based on GRU and Attention Mechanism
    Hou, Yan-e
    Gu, Wenbo
    Yang, Kang
    Dang, Lanxue
    ENGINEERING LETTERS, 2023, 31 (02) : 695 - 701
  • [24] Learning From Atypical Behavior: Temporary Interest Aware Recommendation Based on Reinforcement Learning
    Du, Ziwen
    Yang, Ning
    Yu, Zhonghua
    Yu, Philip S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (10) : 9824 - 9835
  • [25] Deep Reinforcement Learning Framework for Category-Based Item Recommendation
    Fu, Mingsheng
    Agrawal, Anubha
    Irissappane, Athirai A.
    Zhang, Jie
    Huang, Liwei
    Qu, Hong
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (11) : 12028 - 12041
  • [26] Rethinking Reinforcement Learning for Recommendation: A Prompt Perspective
    Xin, Xin
    Pimentel, Tiago
    Karatzoglou, Alexandros
    Ren, Pengjie
    Christakopoulou, Konstantina
    Ren, Zhaochun
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1347 - 1357
  • [27] Deep reinforcement learning for personalized treatment recommendation
    Liu, Mingyang
    Shen, Xiaotong
    Pan, Wei
    STATISTICS IN MEDICINE, 2022, 41 (20) : 4034 - 4056
  • [28] AUTOMATIC DATA AUGMENTATION VIA DEEP REINFORCEMENT LEARNING FOR EFFECTIVE KIDNEY TUMOR SEGMENTATION
    Qin, Tiexin
    Wang, Ziyuan
    He, Kelei
    Shi, Yinghuan
    Gao, Yang
    Shen, Dinggang
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1419 - 1423
  • [29] A General Knowledge Distillation Framework for Counterfactual Recommendation via Uniform Data
    Liu, Dugang
    Cheng, Pengxiang
    Dong, Zhenhua
    He, Xiuqiang
    Pan, Weike
    Ming, Zhong
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 831 - 840
  • [30] Multiobjective Deep Reinforcement Learning for Recommendation Systems
    Keat, Ee Yeo
    Sharef, Nurfadhlina Mohd
    Yaakob, Razali
    Kasmiran, Khairul Azhar
    Marlisah, Erzam
    Mustapha, Norwati
    Zolkepli, Maslina
    IEEE ACCESS, 2022, 10 : 65011 - 65027