Robust Multitask Learning With Sample Gradient Similarity

被引：2

作者：

Peng, Xinyu ^{[1
]}

Chang, Cheng ^{[1
]}

Wang, Fei-Yue ^{[2
]}

Li, Li ^{[3
]}

机构：

[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

[2] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100080, Peoples R China

[3] Tsinghua Univ, Dept Automat, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2024年 / 54卷 / 01期

关键词：

Deep learning; Automation; multitask learning; sample gradient; sample reweighting; task reweighting; INSTANCE SEGMENTATION; OPTIMIZATION PROBLEMS; NETWORK; FUSION; SYSTEM;

D O I：

10.1109/TSMC.2023.3315541

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Multitask learning has led to great success in many deep learning applications during the last decade. However, recent experiments have demonstrated that the performance of multitask learning depends on how to balance the relationship between different tasks. Therefore, many approaches have been proposed to adjust per-task gradient directions or design a more appropriate task reweighting scheme based on task-level statistics. In this article, we discuss how to boost the performance of multitask learning by using more fine-grained sample gradient information. To this end, we propose the concept of sample gradient similarity, which measures the agreement between the sample gradient for a task and the true gradient. Based on this concept, greater weight is assigned to more consistent tasks and more robust training samples to improve the training process of multitask learning. Extensive experimental results show that our proposed method outperforms the state-of-the-art algorithms on a series of challenging multitask datasets.

引用

页码：497 / 506

页数：10

共 88 条

[1] Alain G., 2015, arXiv
[2] Deep learning and time series-to-image encoding for financial forecasting
Barra, Silvio
Carta, Salvatore Mario
Corriga, Andrea
Podda, Alessandro Sebastian
Recupero, Diego Reforgiato
[J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 7 (03) : 683 - 692
[3] Bengio Y., 2009, Proceedings of the 26th annual international conference on machine learning, P41, DOI 10.1145/1553374.155338
[4] Multitask learning
Caruana, R
[J]. MACHINE LEARNING, 1997, 28 (01) : 41 - 75
[5] Chang HS, 2017, ADV NEUR IN, V30
[6] Solving Many-Objective Optimization Problems via Multistage Evolutionary Search
Chen, Huangke
Cheng, Ran
Pedrycz, Witold
Jin, Yaochu
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (06): : 3552 - 3564
[7] E-LSTM-D: A Deep Learning Framework for Dynamic Network Link Prediction
Chen, Jinyin
Zhang, Jian
Xu, Xuanheng
Fu, Chenbo
Zhang, Dan
Zhang, Qingpeng
Xuan, Qi
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (06): : 3699 - 3712
[8] Chen Shuxiao, 2021, arXiv
[9] Chen Z, 2020, ADV NEUR IN, V33
[10] Chen Z, 2018, PR MACH LEARN RES, V80

← 1 2 3 4 5 6 7 8 9 →