Does Thermal Really Always Matter for RGB-T Salient Object Detection?

被引:46
作者
Cong, Runmin [1 ,2 ,3 ]
Zhang, Kepu [1 ,2 ]
Zhang, Chen [1 ,2 ]
Zheng, Feng [4 ,5 ]
Zhao, Yao [1 ,2 ]
Huang, Qingming [6 ,7 ,8 ]
Kwong, Sam [3 ,9 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[2] Network Technol, Beijing Key Lab Adv Informat Sci, Beijing 100044, Peoples R China
[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[4] Southern Univ Sci & Technol, Dept Comp Sci & Technol, Shenzhen 518055, Peoples R China
[5] Res Inst Trustworthy Autonomous Syst, Shenzhen 518055, Peoples R China
[6] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 101408, Peoples R China
[7] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
[8] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[9] City Univ Hong Kong, Shenzhen Res Inst, Shenzhen 51800, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金; 国家重点研发计划;
关键词
Task analysis; Decoding; Semantics; Object detection; Location awareness; Lighting; Feature extraction; RGB-T images; salient object detection; global illumination estimation; semantic constraint provider; localization and complementation; FUSION NETWORK;
D O I
10.1109/TMM.2022.3216476
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, RGB-T salient object detection (SOD) has attracted continuous attention, which makes it possible to identify salient objects in environments such as low light by introducing thermal image. However, most of the existing RGB-T SOD models focus on how to perform cross-modality feature fusion, ignoring whether thermal image is really always matter in SOD task. Starting from the definition and nature of this task, this paper rethinks the connotation of thermal modality, and proposes a network named TNet to solve the RGB-T SOD task. In this paper, we introduce a global illumination estimation module to predict the global illuminance score of the image, so as to regulate the role played by the two modalities. In addition, considering the role of thermal modality, we set up different cross-modality interaction mechanisms in the encoding phase and the decoding phase. On the one hand, we introduce a semantic constraint provider to enrich the semantics of thermal images in the encoding phase, which makes thermal modality more suitable for the SOD task. On the other hand, we introduce a two-stage localization and complementation module in the decoding phase to transfer object localization cue and internal integrity cue in thermal features to the RGB modality. Extensive experiments on three datasets show that the proposed TNet achieves competitive performance compared with 20 state-of-the-art methods.
引用
收藏
页码:6971 / 6982
页数:12
相关论文
共 50 条
  • [41] Multi-Prior Driven Network for RGB-D Salient Object Detection
    Zhang, Xiaoqin
    Xu, Yuewang
    Wang, Tao
    Liao, Tangfei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9209 - 9222
  • [42] MSEDNet: Multi-scale fusion and edge-supervised network for RGB-T salient object detection
    Peng, Daogang
    Zhou, Weiyi
    Pan, Junzhen
    Wang, Danhao
    NEURAL NETWORKS, 2024, 171 : 410 - 422
  • [43] RGB-D Salient Object Detection With Ubiquitous Target Awareness
    Zhao, Yifan
    Zhao, Jiawei
    Li, Jia
    Chen, Xiaowu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 7717 - 7731
  • [44] Cross-Collaboration Weighted Fusion Network for RGB-T Salient Detection
    Wang, Yumei
    Dongye, Changlei
    Zhao, Wenxiu
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024, 2024, 14865 : 301 - 312
  • [45] Frequency-aware feature aggregation network with dual-task consistency for RGB-T salient object detection
    Zhou, Heng
    Tian, Chunna
    Zhang, Zhenxi
    Li, Chengyang
    Xie, Yongqiang
    Li, Zhongbo
    PATTERN RECOGNITION, 2024, 146
  • [46] Highly Efficient RGB-D Salient Object Detection With Adaptive Fusion and Attention Regulation
    Gao, Haoran
    Wang, Fasheng
    Wang, Mengyin
    Sun, Fuming
    Li, Haojie
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (04) : 3104 - 3118
  • [47] Cross-Collaborative Fusion-Encoder Network for Robust RGB-Thermal Salient Object Detection
    Liao, Guibiao
    Gao, Wei
    Li, Ge
    Wang, Junle
    Kwong, Sam
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7646 - 7661
  • [48] Deep Hypersphere Feature Regularization for Weakly Supervised RGB-D Salient Object Detection
    Liu, Zhiyu
    Hayat, Munawar
    Yang, Hong
    Peng, Duo
    Lei, Yinjie
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5423 - 5437
  • [49] Modal-Adaptive Gated Recoding Network for RGB-D Salient Object Detection
    Zhu, Jinchao
    Zhang, Xiaoyu
    Fang, Xian
    Dong, Feng
    Qiu, Yu
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 359 - 363
  • [50] Adaptive Depth Enhancement Network for RGB-D Salient Object Detection
    Yi, Kang
    Li, Yumeng
    Tang, Haoran
    Xu, Jing
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 176 - 180