Dual-branch deep cross-modal interaction network for semantic segmentation with thermal images

被引:1
|
作者
Dai, Kang [1 ]
Chen, Suting [1 ]
机构
[1] Nanjing Univ Informat Sci Technol, Sch Elect & Informat Engn, Nanjing 210044, Peoples R China
基金
中国国家自然科学基金;
关键词
Thermal images; Semantic segmentation; Cross-modal feature; Deep interaction; FUSION NETWORK; RGB;
D O I
10.1016/j.engappai.2024.108820
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic segmentation using RGB (Red-Green-Blue) images and thermal datas is an indispensable component of autonomous driving. The key to RGB-Thermal (RGB and Thermal) semantic segmentation is achieving the interaction and fusion of features between RGB and thermal images. Therefore, we propose a dual-branch deep cross-modal interaction network (DCIT) based on Encoder-Decoder structure. This framework consists of two parallel networks for feature extraction from RGB and Thermal data. Specifically, in each feature extraction stage of the Encoder, we design a Cross Feature Regulation Modules (CFRM) to align and correct modality specific features by reducing the inter-modality feature differences and eliminating intra-modality noise. Then, the modality features are aggregated through Cross Modal Feature Fusion Module (CMFFM) based on cross linear attention to capture global information from modality features. Finally, Adaptive Multi-Scale Cross- positional Fusion Module (AMCFM) utilizes the fused features to integrate consistent semantic information in the Decoder stage. Our framework can improve the interaction of cross modal features. Extensive experiments on urban scene datasets demonstrate that our proposed framework outperforms other RGB-Thermal semantic segmentation methods in terms of objective metrics and subjective visual assessments.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Lightweight dual-branch network for vehicle exhausts segmentation
    Sheng, Chiyun
    Hu, Bin
    Meng, Fanjun
    Yin, Dong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (12) : 17785 - 17806
  • [42] Dual-Branch Network for Cloud and Cloud Shadow Segmentation
    Lu, Chen
    Xia, Min
    Qian, Ming
    Chen, Binyu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [43] Dual-branch residual network for lung nodule segmentation
    Cao, Haichao
    Liu, Hong
    Song, Enmin
    Hung, Chih-Cheng
    Ma, Guangzhi
    Xu, Xiangyang
    Jin, Renchao
    Lu, Jianguo
    APPLIED SOFT COMPUTING, 2020, 86
  • [44] A Dual-Branch Fusion Network for Surgical Instrument Segmentation
    Yang, Lei
    Zhai, Chenxu
    Wang, Hongyong
    Liu, Yanhong
    Bian, Guibin
    IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS, 2024, 6 (04): : 1542 - 1554
  • [45] Dual-branch image projection network for geographic atrophy segmentation in retinal OCT images
    Liu, Xiaoming
    Li, Jieyang
    Zhang, Ying
    Yao, Junping
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [46] DANet: Dual-Branch Activation Network for Small Object Instance Segmentation of Ship Images
    Sun, Yuxin
    Su, Li
    Yuan, Shouzheng
    Meng, Hao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 6708 - 6720
  • [47] Lightweight dual-branch network for vehicle exhausts segmentation
    Chiyun Sheng
    Bin Hu
    Fanjun Meng
    Dong Yin
    Multimedia Tools and Applications, 2021, 80 : 17785 - 17806
  • [48] DBCG-Net: Dual Branch Calibration Guided Deep Network for UAV Images Semantic Segmentation
    Mai, Chaoyun
    Wu, Yibo
    Zhai, Yikui
    Quan, Hao
    Zhou, Jianhong
    Genovese, Angelo
    Piuri, Vincenzo
    Scotti, Fabio
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 7932 - 7945
  • [49] Dual-branch residual network for lung nodule segmentation
    Cao, Haichao
    Liu, Hong
    Song, Enmin
    Hung, Chih-Cheng
    Ma, Guangzhi
    Xu, Xiangyang
    Jin, Renchao
    Lu, Jianguo
    Liu, Hong (hl.cbib@gmail.com), 1600, Elsevier Ltd (86):
  • [50] A dual-branch hybrid network of CNN and transformer with adaptive keyframe scheduling for video semantic segmentation
    Liang, Zhixue
    Dong, Wenyong
    Zhang, Bo
    MULTIMEDIA SYSTEMS, 2024, 30 (02)