Adaptive Task-Wise Message Passing for Multi-Task Learning: A Spatial Interaction Perspective

被引：0

作者：

Sirejiding, Shalayiding ^{[1
]}

Bayramli, Bayram ^{[1
]}

Lu, Yuxiang ^{[1
]}

Huang, Suizhi ^{[1
]}

Lu, Hongtao ^{[1
]}

Ding, Yue ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 10期

关键词：

Task analysis; Decoding; Multitasking; Message passing; Adaptation models; Feature extraction; Estimation; Multi-task learning; dense prediction; self-attention mechanism; graph neural network;

D O I：

10.1109/TCSVT.2024.3399613

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Recent advancements have facilitated the simultaneous processing of multiple dense prediction tasks, utilizing diverse correlations between these tasks. However, many of these advances predominantly focus on a singular or fixed task interaction, leading to negative transfer effects. In this paper, we introduce an end-to-end model called the Adaptive Task-Wise Message Passing Network (ATMPNet) for multi-task learning. Our proposed model focuses on excavating comprehensive spatial messages among tasks in an adaptive manner. To achieve this, ATMPNet incorporates the Adaptive Spatial Message Interaction (ASMI) module, which models various local spatial message interactions and global interactions among tasks. ASMI explores potential spatial relationships by generating a task-specific message pool for each target task. Furthermore, we propose an Adaptive Task Message Passing (ATMP) module, a novel method for aggregating messages. The ATMP module generates refined global-local messages from each message pool and adaptively transfers them to the corresponding target tasks through a well-designed message passing scheme. We conduct extensive experiments on the NYUD-v2 and PASCAL-Context datasets to evaluate the effectiveness of ATMPNet. The results demonstrate the state-of-the-art performance of our proposed model in handling multi-task learning scenarios. Code will be publicly available in here.

引用

页码：9499 / 9514

页数：16

共 59 条

[1] Ballas N., 2016, arXiv
[2] Exploring Relational Context for Multi-Task Dense Prediction
Bruggemann, David
Kanakis, Menelaos
Obukhov, Anton
Georgoulis, Stamatios
Van Gool, Luc
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15849 - 15858
[3] Detect What You Can: Detecting and Representing Objects using Holistic Models and Body Parts
Chen, Xianjie
Mottaghi, Roozbeh
Liu, Xiaobai
Fidler, Sanja
Urtasun, Raquel
Yuille, Alan
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1979 - 1986
[4] Chen Z, 2018, PR MACH LEARN RES, V80
[5] Dosovitskiy A., 2021, IMAGE IS WORTH 16 16, DOI DOI 10.48550/ARXIV.2010.11929
[6] The Pascal Visual Object Classes (VOC) Challenge
Everingham, Mark
Van Gool, Luc
Williams, Christopher K. I.
Winn, John
Zisserman, Andrew
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338
[7] Dual Attention Network for Scene Segmentation
Fu, Jun
Liu, Jing
Tian, Haijie
Li, Yong
Bao, Yongjun
Fang, Zhiwei
Lu, Hanqing
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149
[8] G„hlert N, 2020, Arxiv, DOI arXiv:2006.07864
[9] NDDR-CNN: Layerwise Feature Fusing in Multi-Task CNNs by Neural Discriminative Dimensionality Reduction
Gao, Yuan
Ma, Jiayi
Zhao, Mingbo
Liu, Wei
Yuille, Alan L.
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3200 - 3209
[10] Gilmer J, 2017, PR MACH LEARN RES, V70

← 1 2 3 4 5 6 →