Adaptive Task-Wise Message Passing for Multi-Task Learning: A Spatial Interaction Perspective

被引:0
作者
Sirejiding, Shalayiding [1 ]
Bayramli, Bayram [1 ]
Lu, Yuxiang [1 ]
Huang, Suizhi [1 ]
Lu, Hongtao [1 ]
Ding, Yue [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
关键词
Task analysis; Decoding; Multitasking; Message passing; Adaptation models; Feature extraction; Estimation; Multi-task learning; dense prediction; self-attention mechanism; graph neural network;
D O I
10.1109/TCSVT.2024.3399613
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recent advancements have facilitated the simultaneous processing of multiple dense prediction tasks, utilizing diverse correlations between these tasks. However, many of these advances predominantly focus on a singular or fixed task interaction, leading to negative transfer effects. In this paper, we introduce an end-to-end model called the Adaptive Task-Wise Message Passing Network (ATMPNet) for multi-task learning. Our proposed model focuses on excavating comprehensive spatial messages among tasks in an adaptive manner. To achieve this, ATMPNet incorporates the Adaptive Spatial Message Interaction (ASMI) module, which models various local spatial message interactions and global interactions among tasks. ASMI explores potential spatial relationships by generating a task-specific message pool for each target task. Furthermore, we propose an Adaptive Task Message Passing (ATMP) module, a novel method for aggregating messages. The ATMP module generates refined global-local messages from each message pool and adaptively transfers them to the corresponding target tasks through a well-designed message passing scheme. We conduct extensive experiments on the NYUD-v2 and PASCAL-Context datasets to evaluate the effectiveness of ATMPNet. The results demonstrate the state-of-the-art performance of our proposed model in handling multi-task learning scenarios. Code will be publicly available in here.
引用
收藏
页码:9499 / 9514
页数:16
相关论文
共 59 条
  • [1] Ballas N., 2016, arXiv
  • [2] Exploring Relational Context for Multi-Task Dense Prediction
    Bruggemann, David
    Kanakis, Menelaos
    Obukhov, Anton
    Georgoulis, Stamatios
    Van Gool, Luc
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15849 - 15858
  • [3] Detect What You Can: Detecting and Representing Objects using Holistic Models and Body Parts
    Chen, Xianjie
    Mottaghi, Roozbeh
    Liu, Xiaobai
    Fidler, Sanja
    Urtasun, Raquel
    Yuille, Alan
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1979 - 1986
  • [4] Chen Z, 2018, PR MACH LEARN RES, V80
  • [5] Dosovitskiy A., 2021, IMAGE IS WORTH 16 16, DOI DOI 10.48550/ARXIV.2010.11929
  • [6] The Pascal Visual Object Classes (VOC) Challenge
    Everingham, Mark
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338
  • [7] Dual Attention Network for Scene Segmentation
    Fu, Jun
    Liu, Jing
    Tian, Haijie
    Li, Yong
    Bao, Yongjun
    Fang, Zhiwei
    Lu, Hanqing
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149
  • [8] G„hlert N, 2020, Arxiv, DOI arXiv:2006.07864
  • [9] NDDR-CNN: Layerwise Feature Fusing in Multi-Task CNNs by Neural Discriminative Dimensionality Reduction
    Gao, Yuan
    Ma, Jiayi
    Zhao, Mingbo
    Liu, Wei
    Yuille, Alan L.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3200 - 3209
  • [10] Gilmer J, 2017, PR MACH LEARN RES, V70