Intrinsically motivated reinforcement learning based recommendation with counterfactual data augmentation

被引:2
|
作者
Chen, Xiaocong [1 ]
Wang, Siyu [1 ]
Qi, Lianyong [2 ]
Li, Yong [3 ]
Yao, Lina [1 ,4 ]
机构
[1] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
[2] China Univ Petr East China, Coll Comp Sci & Technol, Dong Ying Shi, Peoples R China
[3] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[4] CSIRO, Data 61, Eveleigh, NSW 2015, Australia
来源
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2023年 / 26卷 / 05期
关键词
Recommender systems; Deep reinforcement learning; Counterfactual reasoning; CAPACITY;
D O I
10.1007/s11280-023-01187-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep reinforcement learning (DRL) has shown promising results in modeling dynamic user preferences in RS in recent literature. However, training a DRL agent in the sparse RS environment poses a significant challenge. This is because the agent must balance between exploring informative user-item interaction trajectories and using existing trajectories for policy learning, a known exploration and exploitation trade-off. This trade-off greatly affects the recommendation performance when the environment is sparse. In DRL-based RS, balancing exploration and exploitation is even more challenging as the agent needs to deeply explore informative trajectories and efficiently exploit them in the context of RS. To address this issue, we propose a novel intrinsically motivated reinforcement learning (IMRL) method that enhances the agent's capability to explore informative interaction trajectories in the sparse environment. We further enrich these trajectories via an adaptive counterfactual augmentation strategy with a customised threshold to improve their efficiency in exploitation. Our approach is evaluated on six offline datasets and three online simulation platforms, demonstrating its superiority over existing state-of-the-art methods. The extensive experiments show that our IMRL method outperforms other methods in terms of recommendation performance in the sparse RS environment.
引用
收藏
页码:3253 / 3274
页数:22
相关论文
共 50 条
  • [1] Intrinsically motivated reinforcement learning based recommendation with counterfactual data augmentation
    Xiaocong Chen
    Siyu Wang
    Lianyong Qi
    Yong Li
    Lina Yao
    World Wide Web, 2023, 26 : 3253 - 3274
  • [2] Significance extraction based on data augmentation for reinforcement learning
    Han, Yuxi
    Li, Dequan
    Yang, Yang
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2025, : 385 - 399
  • [3] Intrinsically motivated reinforcement learning for human-robot interaction in the real-world
    Qureshi, Ahmed Hussain
    Nakamura, Yutaka
    Yoshikawa, Yuichiro
    Ishiguro, Hiroshi
    NEURAL NETWORKS, 2018, 107 : 23 - 33
  • [4] Using Data Augmentation Based Reinforcement Learning for Daily Stock Trading
    Yuan, Yuyu
    Wen, Wen
    Yang, Jincui
    ELECTRONICS, 2020, 9 (09) : 1 - 13
  • [5] Counterfactual based reinforcement learning for graph neural networks
    Pham, David
    Zhang, Yongfeng
    ANNALS OF OPERATIONS RESEARCH, 2022,
  • [6] Multimodal Counterfactual Learning Network for Multimedia-based Recommendation
    Li, Shuaiyang
    Guo, Dan
    Liu, Kang
    Hong, Richang
    Xue, Feng
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1539 - 1548
  • [7] Empowerment-driven Policy Gradient Learning with Counterfactual Augmentation in Recommender Systems
    Chen, Xiaocong
    Yao, Lina
    Chang, Xiaojun
    Wang, Siyu
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 885 - 890
  • [8] Research on big data personalised recommendation model based on deep reinforcement learning
    Shi H.
    Shang L.
    International Journal of Networking and Virtual Organisations, 2023, 28 (2-4) : 364 - 380
  • [9] Plug-and-Play Model-Agnostic Counterfactual Policy Synthesis for Deep Reinforcement Learning-Based Recommendation
    Wang, Siyu
    Chen, Xiaocong
    McAuley, Julian
    Cripps, Sally
    Yao, Lina
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1044 - 1055
  • [10] RLISR: A Deep Reinforcement Learning Based Interactive Service Recommendation Model
    Zhang, Mingwei
    Qu, Yingjie
    Li, Yage
    Wen, Xingyu
    Zhou, Yi
    IEEE ACCESS, 2024, 12 : 90204 - 90217