xMTF: A Formula-Free Model for Reinforcement-Learning-Based Multi-Task Fusion in Recommender Systems

被引：0

作者：

Cao, Yang ^{[1
]}

Zhang, Changhao ^{[2
]}

Chen, Xiaoshuang ^{[1
]}

Zhan, Kaiqiao ^{[1
]}

Wang, Ben ^{[1
]}

机构：

[1] Kuaishou Technol, Beijing, Peoples R China

[2] Peking Univ, Beijing, Peoples R China

来源：

PROCEEDINGS OF THE ACM WEB CONFERENCE 2025, WWW 2025 | 2025年

关键词：

Multi-Task Fusion; Reinforcement Learning; Recommender System;

D O I：

10.1145/3696410.3714959

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Recommender systems need to optimize various types of user feedback, e.g., clicks, likes, and shares. A typical recommender system handling multiple types of feedback has two components: a multi-task learning (MTL) module, predicting feedback such as click-through rate and like rate; and a multi-task fusion (MTF) module, integrating these predictions into a single score for item ranking. MTF is essential for ensuring user satisfaction, as it directly influences recommendation outcomes. Recently, reinforcement learning (RL) has been applied to MTF tasks to improve long-term user satisfaction. However, existing RL-based MTF methods are formula-based methods, which only adjust limited coefficients within pre-defined formulas. The pre-defined formulas restrict the RL search space and become a bottleneck for MTF. To overcome this, we propose a formula-free MTF framework. We demonstrate that any suitable fusion function can be expressed as a composition of single-variable monotonic functions, as per the Sprecher Representation Theorem. Leveraging this, we introduce a novel learnable monotonic fusion cell (MFC) to replace pre-defined formulas. We call this new MFC-based model eXtreme MTF (xMTF). Furthermore, we employ a two-stage hybrid (TSH) learning strategy to train xMTF effectively. By expanding the MTF search space, xMTF outperforms existing methods in extensive offline and online experiments.

引用

页码：3840 / 3849

页数：10

共 37 条

[11]

Jianhua Han, 2019, 2019 International Conference on Artificial Intelligence and Advanced Manufacturing (AIAM). Proceedings, P22, DOI 10.1109/AIAM48774.2019.00011

[12]

Köppen M, 2005, ADV SOFT COMP, P202

[13]

Liashchynskyi P, 2019, Arxiv, DOI arXiv:1912.06059

[14]

Lillicrap T. P., 2015, arXiv

[15] Amazon.com recommendation - Item-to-item collaborative filtering [J].

Linden, G ;

Smith, B ;

York, J .

IEEE INTERNET COMPUTING, 2003, 7 (01) :76-80

[16]

Liu J, 2010, IUI 2010, P31

[17]

Liu P, 2024, Arxiv, DOI arXiv:2404.17589

[18] Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-Experts [J].

Ma, Jiaqi ;

Zhao, Zhe ;

Yi, Xinyang ;

Chen, Jilin ;

Hong, Lichan ;

Chi, Ed H. .

KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :1930-1939

[19]

Mokus J., 1975, Optimization Techniques IFIP Technical Conference, V27, P117

[20]

Pei Changhua, 2019, arXiv

← 1 2 3 4 →