Dynamics-Aware Adaptation for Reinforcement Learning Based Cross-Domain Interactive Recommendation

被引:15
作者
Wu, Junda [1 ]
Xie, Zhihui [2 ]
Yu, Tong [3 ]
Zhao, Handong [3 ]
Zhang, Ruiyi [3 ]
Li, Shuai [2 ]
机构
[1] NYU, New York, NY USA
[2] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[3] Adobe Res, San Jose, CA USA
来源
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22) | 2022年
关键词
interactive recommender systems; cross-domain recommendation; reinforcement learning;
D O I
10.1145/3477495.3531969
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Interactive recommender systems (IRS) have received wide attention in recent years. To capture users' dynamic preferences and maximize their long-term engagement, IRS are usually formulated as reinforcement learning (RL) problems. Despite the promise to solve complex decision-making problems, RL-based methods generally require a large amount of online interaction, restricting their applications due to economic considerations. One possible direction to alleviate this issue is cross-domain recommendation that aims to leverage abundant logged interaction data from a source domain (e.g., adventure genre in movie recommendation) to improve the recommendation quality in the target domain (e.g., crime genre). Nevertheless, prior studies mostly focus on adapting the static representations of users/items. Few have explored how the temporally dynamic user-item interaction patterns transform across domains. Motivated by the above consideration, we propose DACIR, a novelDoubly-Adaptive deep RL-based framework forCross-domain Interactive Recommendation. We first pinpoint how users behave differently in two domains and highlight the potential to leverage the shared user dynamics to boost IRS. To transfer static user preferences across domains, DACIR enforces consistency of item representation by aligning embeddings into a shared latent space. In addition, given the user dynamics in IRS, DACIR calibrates the dynamic interaction patterns in two domains via reward correlation. Once the double adaptation narrows the cross-domain gap, we are able to learn a transferable policy for the target recommender by leveraging logged data. Experiments on real-world datasets validate the superiority of our approach, which consistently achieves significant improvements over the baselines.
引用
收藏
页码:290 / 300
页数:11
相关论文
共 50 条
[21]   A personalized cross-domain recommendation with federated meta learning [J].
Zhao, Peng ;
Jin, Yuanyang ;
Ren, Xuebin ;
Li, Yanan .
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (28) :71435-71450
[22]   Deep Graph Mutual Learning for Cross-domain Recommendation [J].
Wang, Yifan ;
Li, Yongkang ;
Li, Shuai ;
Song, Weiping ;
Fan, Jiangke ;
Gao, Shan ;
Ma, Ling ;
Cheng, Bing ;
Cai, Xunliang ;
Wang, Sheng ;
Zhang, Ming .
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT II, 2022, :298-305
[23]   A deep selective learning network for cross-domain recommendation [J].
Liu, Huiting ;
Liu, Qian ;
Li, Peipei ;
Zhao, Peng ;
Wu, Xindong .
APPLIED SOFT COMPUTING, 2022, 125
[24]   CausalCDR: Causal Embedding Learning for Cross-domain Recommendation [J].
Li, Fengxin ;
Liu, Hongyan ;
He, Jun ;
Du, Xiaoyong .
PROCEEDINGS OF THE 2024 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2024, :553-561
[25]   DisenCDR: Learning Disentangled Representations for Cross-Domain Recommendation [J].
Cao, Jiangxia ;
Lin, Xixun ;
Cong, Xin ;
Ya, Jing ;
Liu, Tingwen ;
Wang, Bin .
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, :267-277
[26]   Cross-Domain Random Pretraining With Prototypes for Reinforcement Learning [J].
Liu, Xin ;
Chen, Yaran ;
Li, Haoran ;
Li, Boyu ;
Zhao, Dongbin .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025, 55 (05) :3601-3613
[27]   Towards Universal Cross-Domain RecommendationTowards Universal Cross-Domain Recommendation [J].
Cao, Jiangxia ;
Li, Shaoshuai ;
Yu, Bowen ;
Guo, Xiaobo ;
Liu, Tingwen ;
Bin Wang .
PROCEEDINGS OF THE SIXTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, WSDM 2023, VOL 1, 2023, :78-86
[28]   Adversarial Learning of Transitive Semantic Features for Cross-Domain Recommendation [J].
Li, Zhetao ;
Qiao, Pengpeng ;
Zhang, Yuanxing ;
Bian, Kaigui .
2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
[29]   Cross-domain recommendation based on latent factor alignment [J].
Yu, Xu ;
Hu, Qiang ;
Li, Hui ;
Du, Junwei ;
Gao, Jia ;
Sun, Lijun .
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (05) :3421-3432
[30]   Cross-Domain Item Recommendation Based on User Similarity [J].
Xu, Zhenzhen ;
Jiang, Huizhen ;
Kong, Xiangjie ;
Kang, Jialiang ;
Wang, Wei ;
Xia, Feng .
COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2016, 13 (02) :359-373