Robust Multitask Learning With Sample Gradient Similarity

被引：3

作者：

Peng, Xinyu ^{[1
]}

Chang, Cheng ^{[1
]}

Wang, Fei-Yue ^{[2
]}

Li, Li ^{[3
]}

机构：

[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

[2] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100080, Peoples R China

[3] Tsinghua Univ, Dept Automat, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2024年 / 54卷 / 01期

关键词：

Deep learning; Automation; multitask learning; sample gradient; sample reweighting; task reweighting; INSTANCE SEGMENTATION; OPTIMIZATION PROBLEMS; NETWORK; FUSION; SYSTEM;

D O I：

10.1109/TSMC.2023.3315541

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Multitask learning has led to great success in many deep learning applications during the last decade. However, recent experiments have demonstrated that the performance of multitask learning depends on how to balance the relationship between different tasks. Therefore, many approaches have been proposed to adjust per-task gradient directions or design a more appropriate task reweighting scheme based on task-level statistics. In this article, we discuss how to boost the performance of multitask learning by using more fine-grained sample gradient information. To this end, we propose the concept of sample gradient similarity, which measures the agreement between the sample gradient for a task and the true gradient. Based on this concept, greater weight is assigned to more consistent tasks and more robust training samples to improve the training process of multitask learning. Extensive experimental results show that our proposed method outperforms the state-of-the-art algorithms on a series of challenging multitask datasets.

引用

页码：497 / 506

页数：10

共 88 条

[41] Sampling Methods for Efficient Training of Graph Convolutional Networks: A Survey [J].

Liu, Xin ;

Yan, Mingyu ;

Deng, Lei ;

Li, Guoqi ;

Ye, Xiaochun ;

Fan, Dongrui .

IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (02) :205-234

[42] Stacked Broad Learning System: From Incremental Flatted Structure to Deep Model [J].

Liu, Zhulin ;

Chen, C. L. Philip ;

Feng, Shuang ;

Feng, Qiying ;

Zhang, Tong .

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (01) :209-222

[43]

Loshchilov I., 2015, CoRR

[44] SwinFusion: Cross-domain Long-range Learning for General Image Fusion via Swin Transformer [J].

Ma, Jiayi ;

Tang, Linfeng ;

Fan, Fan ;

Huang, Jun ;

Mei, Xiaoguang ;

Ma, Yong .

IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (07) :1200-1217

[45] Attentive Single-Tasking of Multiple Tasks [J].

Maninis, Kevis-Kokitsi ;

Radosavovic, Ilija ;

Kokkinos, Iasonas .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1851-1860

[46]

Michel P., 2021, ARXIV

[47] Adversarial Learning and Self-Teaching Techniques for Domain Adaptation in Semantic Segmentation [J].

Michieli, Umberto ;

Biasetton, Matteo ;

Agresti, Gianluca ;

Zanuttigh, Pietro .

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2020, 5 (03) :508-518

[48] A Two-Stage Evolutionary Algorithm With Balanced Convergence and Diversity for Many-Objective Optimization [J].

Ming, Fei ;

Gong, Wenyin ;

Wang, Ling .

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (10) :6222-6234

[49] Early Lane Change Prediction for Automated Driving Systems Using Multi-Task Attention-Based Convolutional Neural Networks [J].

Mozaffari, Sajjad ;

Arnold, Eduardo ;

Dianati, Mehrdad ;

Fallah, Saber .

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2022, 7 (03) :758-770

[50] CurveNet: Curvature-Based Multitask Learning Deep Networks for 3D Object Recognition [J].

Muzahid, A. A. M. ;

Wan, Wanggen ;

Sohel, Ferdous ;

Wu, Lianyao ;

Hou, Li .

IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2021, 8 (06) :1177-1187

← 1 2 3 4 5 6 7 8 9 →