Scalable Deep Q-Learning for Session-Based Slate Recommendation

被引:0
作者
Roy, Aayush Singha [1 ,2 ]
D'Amico, Edoardo [1 ,2 ]
Tragos, Elias [1 ,2 ]
Lawlor, Aonghus [1 ,2 ]
Hurley, Neil [1 ,2 ]
机构
[1] Univ Coll Dublin, Dublin, Ireland
[2] Insight Ctr Data Analyt, Dublin, Ireland
来源
PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023 | 2023年
基金
爱尔兰科学基金会;
关键词
Recommender systems; Slate recommendation; Reinforcement learning;
D O I
10.1145/3604915.3608843
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) has demonstrated great potential to improve slate-based recommender systems by optimizing recommendations for long-term user engagement. To handle the combinatorial action space in slate recommendation, recent works decompose the Q-value of a slate into item-wise Q-values, using an item-wise value-based policy. However, the common case where the value function is a parameterized function taking state and action as input results in a linearly increasing number of evaluations required to select an action, proportional to the number of candidate items. While slow training may be acceptable, this becomes intractable when considering the costly evaluation of the parameterized function, such as with deep neural networks, during model serving time. To address this issue, we propose an actor-based policy that reduces the evaluation of the Q-function to a subset of items, significantly reducing inference time and enabling practical deployment in real-world industrial settings. In our empirical evaluation, we demonstrate that our proposed approach achieves equivalent user session engagement to a value-based policy, while significantly reducing the slate serving time by at least 4 times.
引用
收藏
页码:877 / 882
页数:6
相关论文
共 50 条
[41]   Global Context-Aware Graph Neural Networks for Session-based Recommendation [J].
Wang, Mingfeng ;
Li, Jing ;
Chang, Jun ;
Liu, Donghua ;
Zhang, Chenyan ;
Huang, Xiaosai .
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[42]   SPARE: Shortest Path Global Item Relations for Efficient Session-based Recommendation [J].
Peintner, Andreas ;
Mohammadi, Amir Reza ;
Zangerle, Eva .
PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023, :58-69
[43]   Purpose tendency-aware diversified strategy for effective session-based recommendation [J].
Yin, Qing ;
Zhang, Danning ;
Fang, Hui ;
Sun, Zhu .
ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2023, 57
[44]   High-order attentive graph neural network for session-based recommendation [J].
Sang, Sheng ;
Liu, Nan ;
Li, Wenxuan ;
Zhang, Zhijun ;
Qin, Qianqian ;
Yuan, Weihua .
APPLIED INTELLIGENCE, 2022, 52 (14) :16975-16989
[45]   DEN-DQL: Quick Convergent Deep Q-Learning with Double Exploration Networks for News Recommendation [J].
Song, Zhanghan ;
Zhang, Dian ;
Shi, Xiaochuan ;
Li, Wei ;
Ma, Chao ;
Wu, Libing .
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[46]   News Session-Based Recommendations using Deep Neural Networks [J].
Pereira Moreira, Gabriel de Souza ;
Ferreira, Felipe ;
da Cunha, Adilson Marques .
PROCEEDINGS OF THE 3RD WORKSHOP ON DEEP LEARNING FOR RECOMMENDER SYSTEMS (DLRS), 2018, :15-23
[47]   3D Convolutional Networks for Session-based Recommendation with Content Features [J].
Trinh Xuan Tuan ;
Tu Minh Phuong .
PROCEEDINGS OF THE ELEVENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'17), 2017, :138-146
[48]   TSESRec: A transformer-facilitated set extension model for session-based recommendation [J].
Liu, Chen ;
Yu, Tianhao ;
Zhou, Xianghong ;
Zhou, Lixin ;
Gong, Xiaoyu .
JOURNAL OF SUPERCOMPUTING, 2025, 81 (01)
[49]   High-order attentive graph neural network for session-based recommendation [J].
Sheng Sang ;
Nan Liu ;
Wenxuan Li ;
Zhijun Zhang ;
Qianqian Qin ;
Weihua Yuan .
Applied Intelligence, 2022, 52 :16975-16989
[50]   Enhancing Session-Based Recommendation With Multi-Interest Hyperbolic Representation Networks [J].
Liu, Tongcun ;
Bao, Xukai ;
Zhang, Jiaxin ;
Fang, Kai ;
Feng, Hailin .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (06) :10567-10579