Dual-branch deep cross-modal interaction network for semantic segmentation with thermal images

Cited by: 1
Authors
Dai, Kang [1]
Chen, Suting [1]
Affiliations
[1] Nanjing Univ Informat Sci Technol, Sch Elect & Informat Engn, Nanjing 210044, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Thermal images; Semantic segmentation; Cross-modal feature; Deep interaction; FUSION NETWORK; RGB;
DOI
10.1016/j.engappai.2024.108820
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Semantic segmentation using RGB (Red-Green-Blue) images and thermal data is an indispensable component of autonomous driving. The key to RGB-Thermal semantic segmentation is achieving interaction and fusion between the features of RGB and thermal images. We therefore propose a dual-branch deep cross-modal interaction network (DCIT) based on an Encoder-Decoder structure. The framework consists of two parallel networks that extract features from RGB and thermal data. Specifically, in each feature extraction stage of the Encoder, we design a Cross Feature Regulation Module (CFRM) that aligns and corrects modality-specific features by reducing inter-modality feature differences and eliminating intra-modality noise. The modality features are then aggregated by a Cross Modal Feature Fusion Module (CMFFM), which is based on cross linear attention and captures global information from the modality features. Finally, an Adaptive Multi-Scale Cross-positional Fusion Module (AMCFM) uses the fused features to integrate consistent semantic information in the Decoder stage. Our framework can improve the interaction of cross-modal features. Extensive experiments on urban scene datasets demonstrate that the proposed framework outperforms other RGB-Thermal semantic segmentation methods in terms of objective metrics and subjective visual assessments.
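The abstract names cross linear attention as the basis of the CMFFM but gives no formulation. The sketch below is only an illustration of the general idea, not the authors' module: it assumes the common kernelized form of linear attention with an elu(x) + 1 feature map, takes queries from one modality and keys/values from the other, and fuses the two directions by simple addition. The function names (`feature_map`, `linear_cross_attention`, `fuse`) and the additive fusion are hypothetical choices made for this example.

```python
import math


def feature_map(x):
    """elu(x) + 1, a positive feature map often used in linear attention (assumed here)."""
    return x + 1.0 if x > 0 else math.exp(x)


def linear_cross_attention(queries, keys, values):
    """Attend from one modality (queries) to another (keys/values).

    queries: N token vectors of length d
    keys:    M token vectors of length d
    values:  M token vectors of length e
    Returns N vectors of length e.
    """
    d = len(keys[0])
    e = len(values[0])
    phi_q = [[feature_map(x) for x in q] for q in queries]
    phi_k = [[feature_map(x) for x in k] for k in keys]

    # Summaries over all key/value tokens are computed once, so the cost
    # grows linearly with the number of tokens instead of quadratically.
    kv = [[0.0] * e for _ in range(d)]  # sum_j phi(k_j) v_j^T
    k_sum = [0.0] * d                   # sum_j phi(k_j)
    for k, v in zip(phi_k, values):
        for a in range(d):
            k_sum[a] += k[a]
            for b in range(e):
                kv[a][b] += k[a] * v[b]

    out = []
    for q in phi_q:
        # Normalizer makes each output a weighted average of the values.
        norm = sum(q[a] * k_sum[a] for a in range(d))
        out.append([sum(q[a] * kv[a][b] for a in range(d)) / norm
                    for b in range(e)])
    return out


def fuse(rgb_tokens, thermal_tokens):
    """Hypothetical fusion: each modality queries the other, results are summed."""
    rgb_ctx = linear_cross_attention(rgb_tokens, thermal_tokens, thermal_tokens)
    thermal_ctx = linear_cross_attention(thermal_tokens, rgb_tokens, rgb_tokens)
    return [
        [r + rc + t + tc for r, rc, t, tc in zip(r_row, rc_row, t_row, tc_row)]
        for r_row, rc_row, t_row, tc_row
        in zip(rgb_tokens, rgb_ctx, thermal_tokens, thermal_ctx)
    ]
```

Here each token stands for the feature vector at one spatial position of a flattened feature map. Because the feature map is strictly positive, every output row is a weighted average of the value vectors, which is the property that lets one modality pull in global context from the other.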
Pages: 15
Related papers
50 in total
  • [21] Compact interactive dual-branch network for real-time semantic segmentation
    Yongsheng Dong
    Haotian Yang
    Yuanhua Pei
    Longchao Shen
    Lintao Zheng
    Peiluan Li
    Complex & Intelligent Systems, 2023, 9 : 6177 - 6190
  • [22] A SAM-based dual-branch network for remote sensing semantic segmentation
    Zhang, Hui
    REMOTE SENSING LETTERS, 2025, 16 (04) : 365 - 375
  • [23] Food image segmentation based on deep and shallow dual-branch network
    Xiao, Zhiyong
    Li, Yang
    Deng, Zhaohong
    MULTIMEDIA SYSTEMS, 2025, 31 (01)
  • [24] Semantic deep cross-modal hashing
    Lin, Qiubin
    Cao, Wenming
    He, Zhihai
    He, Zhiquan
    NEUROCOMPUTING, 2020, 396 (396) : 113 - 122
  • [25] A dual-branch network for ultrasound image segmentation
    Zhu, Zhiqin
    Zhang, Zimeng
    Qi, Guanqiu
    Li, Yuanyuan
    Li, Yuzhen
    Mu, Lan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 103
  • [26] Graph Neural Network Enhanced Dual-Branch Network for lesion segmentation in ultrasound images
    Wang, Yaqi
    Jiang, Cunang
    Luo, Shixin
    Dai, Yu
    Zhang, Jiangxun
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 256
  • [27] Parallel Dual-Branch Polyp Segmentation Network
    Sun, Kunjie
    Cheng, Li
    Yuan, Haiwen
    Li, Xuan
    IEEE ACCESS, 2024, 12 : 192051 - 192061
  • [28] A Dual-Branch Deep Learning Architecture for Multisensor and Multitemporal Remote Sensing Semantic Segmentation
    Bergamasco, Luca
    Bovolo, Francesca
    Bruzzone, Lorenzo
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 2147 - 2162
  • [29] Deep Graph Convolutional Network with Dual-Branch and Multi-interaction
    Lou J.
    Ye H.
    Yang B.
    Li M.
    Cao F.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (08) : 754 - 763
  • [30] STDBNet: Shared Trunk and Dual-Branch Network for Real-Time Semantic Segmentation
    Ren, Fenglei
    Zhou, Haibo
    Yang, Lu
    Bai, Yiwen
    Xu, Wenxue
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 770 - 774