Robust Multitask Learning With Sample Gradient Similarity

被引:2
作者
Peng, Xinyu [1 ]
Chang, Cheng [1 ]
Wang, Fei-Yue [2 ]
Li, Li [3 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[2] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100080, Peoples R China
[3] Tsinghua Univ, Dept Automat, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2024年 / 54卷 / 01期
关键词
Deep learning; Automation; multitask learning; sample gradient; sample reweighting; task reweighting; INSTANCE SEGMENTATION; OPTIMIZATION PROBLEMS; NETWORK; FUSION; SYSTEM;
D O I
10.1109/TSMC.2023.3315541
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multitask learning has led to great success in many deep learning applications during the last decade. However, recent experiments have demonstrated that the performance of multitask learning depends on how to balance the relationship between different tasks. Therefore, many approaches have been proposed to adjust per-task gradient directions or design a more appropriate task reweighting scheme based on task-level statistics. In this article, we discuss how to boost the performance of multitask learning by using more fine-grained sample gradient information. To this end, we propose the concept of sample gradient similarity, which measures the agreement between the sample gradient for a task and the true gradient. Based on this concept, greater weight is assigned to more consistent tasks and more robust training samples to improve the training process of multitask learning. Extensive experimental results show that our proposed method outperforms the state-of-the-art algorithms on a series of challenging multitask datasets.
引用
收藏
页码:497 / 506
页数:10
相关论文
共 88 条
  • [1] Alain G., 2015, arXiv
  • [2] Deep learning and time series-to-image encoding for financial forecasting
    Barra, Silvio
    Carta, Salvatore Mario
    Corriga, Andrea
    Podda, Alessandro Sebastian
    Recupero, Diego Reforgiato
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 7 (03) : 683 - 692
  • [3] Bengio Y., 2009, Proceedings of the 26th annual international conference on machine learning, P41, DOI 10.1145/1553374.155338
  • [4] Multitask learning
    Caruana, R
    [J]. MACHINE LEARNING, 1997, 28 (01) : 41 - 75
  • [5] Chang HS, 2017, ADV NEUR IN, V30
  • [6] Solving Many-Objective Optimization Problems via Multistage Evolutionary Search
    Chen, Huangke
    Cheng, Ran
    Pedrycz, Witold
    Jin, Yaochu
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (06): : 3552 - 3564
  • [7] E-LSTM-D: A Deep Learning Framework for Dynamic Network Link Prediction
    Chen, Jinyin
    Zhang, Jian
    Xu, Xuanheng
    Fu, Chenbo
    Zhang, Dan
    Zhang, Qingpeng
    Xuan, Qi
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (06): : 3699 - 3712
  • [8] Chen Shuxiao, 2021, arXiv
  • [9] Chen Z, 2020, ADV NEUR IN, V33
  • [10] Chen Z, 2018, PR MACH LEARN RES, V80