STG-MTL: scalable task grouping for multi-task learning using data maps

被引:1
作者
Sherif, Ammar [1 ]
Abid, Abubakar [2 ]
Elattar, Mustafa [1 ]
Elhelw, Mohamed [1 ]
机构
[1] Nile Univ, El Sheikh Zayed, Egypt
[2] Hugging Face, 20 Jay St, Brooklyn, NY 11201 USA
来源
MACHINE LEARNING-SCIENCE AND TECHNOLOGY | 2024年 / 5卷 / 02期
关键词
STG-MTL; multi-task learning; MTL; task grouping; scalability; data-driven;
D O I
10.1088/2632-2153/ad4e04
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-Task Learning (MTL) is a powerful technique that has gained popularity due to its performance improvement over traditional Single-Task Learning (STL). However, MTL is often challenging because there is an exponential number of possible task groupings, which can make it difficult to choose the best one because some groupings might produce performance degradation due to negative interference between tasks. That is why existing solutions are severely suffering from scalability issues, limiting any practical application. In our paper, we propose a new data-driven method that addresses these challenges and provides a scalable and modular solution for classification task grouping based on a re-proposed data-driven features, Data Maps, which capture the training dynamics for each classification task during the MTL training. Through a theoretical comparison with other techniques, we manage to show that our approach has the superior scalability. Our experiments show a better performance and verify the method's effectiveness, even on an unprecedented number of tasks (up to 100 tasks on CIFAR100). Being the first to work on such number of tasks, our comparisons on the resulting grouping shows similar grouping to the mentioned in the dataset, CIFAR100. Finally, we provide a modular implementation 3 3 https://github.com/ammarSherif/STG-MTL. for easier integration and testing, with examples from multiple datasets and tasks.
引用
收藏
页数:17
相关论文
共 41 条
[1]  
Aribandi Vamsi, 2021, INT C LEARN REPR
[2]   COVID-MTL: Multitask learning with Shift3D and random-weighted loss for COVID-19 diagnosis and severity assessment [J].
Bao, Guoqing ;
Chen, Huai ;
Liu, Tongliang ;
Gong, Guanzhong ;
Yin, Yong ;
Wang, Lisheng ;
Wang, Xiuying .
PATTERN RECOGNITION, 2022, 124
[3]   FCM - THE FUZZY C-MEANS CLUSTERING-ALGORITHM [J].
BEZDEK, JC ;
EHRLICH, R ;
FULL, W .
COMPUTERS & GEOSCIENCES, 1984, 10 (2-3) :191-203
[4]  
Bickel S., 2008, P 25 INT C MACH LEAR, P56, DOI DOI 10.1145/1390156.1390164
[5]   Multitask learning [J].
Caruana, R .
MACHINE LEARNING, 1997, 28 (01) :41-75
[6]   Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners [J].
Chen, Zitian ;
Shen, Yikang ;
Ding, Mingyu ;
Chen, Zhenfang ;
Zhao, Hengshuang ;
Learned-Miller, Erik ;
Gan, Chuang .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :11828-11837
[7]  
Crawshaw M., 2020, arXiv
[8]  
De Brabandere Bert, 2019, BRANCHED MULTITASK N
[9]   HD-MTL: Hierarchical Deep Multi-Task Learning for Large-Scale Visual Recognition [J].
Fan, Jianping ;
Zhao, Tianyi ;
Kuang, Zhenzhong ;
Zheng, Yu ;
Zhang, Ji ;
Yu, Jun ;
Peng, Jinye .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (04) :1923-1938
[10]  
Fifty C, 2021, ADV NEUR IN, V34