Scalable Deep Q-Learning for Session-Based Slate Recommendation

Cited by: 0
Authors
Roy, Aayush Singha [1, 2]
D'Amico, Edoardo [1, 2]
Tragos, Elias [1, 2]
Lawlor, Aonghus [1, 2]
Hurley, Neil [1, 2]
Affiliations
[1] Univ Coll Dublin, Dublin, Ireland
[2] Insight Ctr Data Analyt, Dublin, Ireland
Source
PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023 | 2023
Funding
Science Foundation Ireland;
Keywords
Recommender systems; Slate recommendation; Reinforcement learning;
DOI
10.1145/3604915.3608843
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Reinforcement learning (RL) has demonstrated great potential to improve slate-based recommender systems by optimizing recommendations for long-term user engagement. To handle the combinatorial action space in slate recommendation, recent works decompose the Q-value of a slate into item-wise Q-values, using an item-wise value-based policy. However, the common case where the value function is a parameterized function taking state and action as input results in a linearly increasing number of evaluations required to select an action, proportional to the number of candidate items. While slow training may be acceptable, this becomes intractable when considering the costly evaluation of the parameterized function, such as with deep neural networks, during model serving time. To address this issue, we propose an actor-based policy that reduces the evaluation of the Q-function to a subset of items, significantly reducing inference time and enabling practical deployment in real-world industrial settings. In our empirical evaluation, we demonstrate that our proposed approach achieves equivalent user session engagement to a value-based policy, while significantly reducing the slate serving time by at least 4 times.
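The evaluation-count argument in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the dot-product `CountingQ`, the `identity_actor`, the shortlist size `m`, and the nearest-by-dot-product shortlist rule are all hypothetical stand-ins chosen only to show how restricting Q-function evaluation to a subset of candidates changes the number of expensive calls per slate.

```python
import random


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


class CountingQ:
    """Hypothetical stand-in for an expensive Q-network; counts its evaluations."""

    def __init__(self):
        self.calls = 0

    def __call__(self, state, item_vec):
        self.calls += 1
        # Assumption: a dot product replaces the deep network used in practice.
        return dot(state, item_vec)


def value_based_slate(q, state, items, k):
    # Item-wise value-based policy: score every candidate, keep the k best.
    scored = [(q(state, vec), idx) for idx, vec in enumerate(items)]
    return [idx for _, idx in sorted(scored, reverse=True)[:k]]


def actor_based_slate(q, actor, state, items, k, m):
    # The actor proposes a point in item-embedding space; only the m items
    # closest to it (by a cheap dot product) are passed to the Q-function.
    proto = actor(state)
    shortlist = sorted(
        range(len(items)), key=lambda i: dot(proto, items[i]), reverse=True
    )[:m]
    scored = [(q(state, items[i]), i) for i in shortlist]
    return [idx for _, idx in sorted(scored, reverse=True)[:k]]


rng = random.Random(0)
items = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(1000)]
state = [rng.gauss(0, 1) for _ in range(8)]


def identity_actor(s):
    # Hypothetical actor: returns the state itself as the proposed point.
    return s


q_full, q_sub = CountingQ(), CountingQ()
full_slate = value_based_slate(q_full, state, items, k=5)
sub_slate = actor_based_slate(q_sub, identity_actor, state, items, k=5, m=50)
print(q_full.calls, q_sub.calls)  # prints "1000 50"
```

In this toy setup the two slates coincide only because the actor and the Q-function happen to rank items identically; with a learned actor and a learned Q-network the slates may differ, and the engagement and serving-time figures reported in the abstract come from the paper's own experiments, not from this sketch.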
Pages: 877 - 882
Page count: 6
Related papers
50 in total
  • [1] Session-based Interactive Recommendation via Deep Reinforcement Learning
    Shi, Longxiang
    Zhang, Zilin
    Wang, Shoujin
    Zhang, Qi
    Wu, Minghui
    Yang, Cheng
    Li, Shijian
    Proceedings - IEEE International Conference on Data Mining, ICDM, 2023, : 1319 - 1324
  • [2] Session-based Interactive Recommendation via Deep Reinforcement Learning
    Shi, Longxiang
    Zhang, Zilin
    Wang, Shoujin
    Zhang, Qi
    Wu, Minghui
    Yang, Cheng
    Li, Shijian
    23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 1319 - 1324
  • [3] Contrastive Learning for Session-Based Recommendation
    Chen, Yan
    Qian, Wanhui
    Liu, Dongqin
    Su, Yipeng
    Zhou, Yan
    Han, Jizhong
    Li, Ruixuan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 358 - 369
  • [4] Adaptive Learning Recommendation Strategy Based on Deep Q-learning
    Tan, Chunxi
    Han, Ruijian
    Ye, Rougang
    Chen, Kani
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2020, 44 (04) : 251 - 266
  • [5] A Study of Deep Learning-Based Approaches for Session-Based Recommendation Systems
    Dang T.K.
    Nguyen Q.P.
    Nguyen V.S.
    SN Computer Science, 2020, 1 (4)
  • [6] Collaborative Graph Learning for Session-based Recommendation
    Pan, Zhiqiang
    Cai, Fei
    Chen, Wanyu
    Chen, Chonghao
    Chen, Honghui
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2022, 40 (04)
  • [7] Improving session-based recommendation with contrastive learning
    Tai, Wenxin
    Lan, Tian
    Wu, Zufeng
    Wang, Pengyu
    Wang, Yixiang
    Zhou, Fan
    USER MODELING AND USER-ADAPTED INTERACTION, 2023, 33 (01) : 1 - 42
  • [8] Self Contrastive Learning for Session-Based Recommendation
    Shi, Zhengxiang
    Wang, Xi
    Lipani, Aldo
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608 : 3 - 20
  • [9] Dynamic Graph Learning for Session-Based Recommendation
    Pan, Zhiqiang
    Chen, Wanyu
    Chen, Honghui
    MATHEMATICS, 2021, 9 (12)
  • [10] Improving session-based recommendation with contrastive learning
    Wenxin Tai
    Tian Lan
    Zufeng Wu
    Pengyu Wang
    Yixiang Wang
    Fan Zhou
    User Modeling and User-Adapted Interaction, 2023, 33 : 1 - 42