Semantic Region Adaptive Fusion of Infrared and Visible Images via Dual-DeepLab Guidance

Cited by: 4
Authors
Cao, Wenzi [1]
Zheng, Minghui [1]
Liao, Qing [1]
Affiliations
[1] Wuhan Institute of Technology, Hubei Key Laboratory of Optical Information and Pattern Recognition, Wuhan 430205, People's Republic of China
Keywords
Semantics; Task analysis; Feature extraction; Lighting; Training; Semantic segmentation; Image reconstruction; High-level semantic perception; image fusion; infrared; prioritized preservation scheme; region adaptation; visible; GENERATIVE ADVERSARIAL NETWORK; CLASSIFICATION; PERFORMANCE; NEST
DOI
10.1109/TIM.2023.3318709
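Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract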
Previous fusion methods apply maximum or weighted intensity losses that are not region-specific, which can diminish semantic information under different lighting conditions. To address this, we propose a novel high-level semantic perception fusion framework, termed semantic region adaptive fusion (SRAFusion). We establish a prioritized preservation scheme for high-level semantic information (HLSI), gradient information, and intensity information, ranked in descending order of priority. Based on this prioritization scheme and prior knowledge of the semantic distribution in the source images, we construct a ground truth for the fusion task. Specifically, we capture the HLSI distribution of the source images using two independent semantic segmentation networks. Subsequently, we introduce a semantic region decision block (SRDB) to partition the original scene into regions with bimodal HLSI, regions with unimodal HLSI, and regions lacking HLSI. We then design specific loss functions to constrain each of these regions, facilitating the integration of complete semantic information. Furthermore, because the visible segmentation network is susceptible to lighting conditions, we use a two-stage training strategy involving coarse-tuning and fine-tuning. This strategy aims to improve on one-stage training and achieve more accurate region delineation. Finally, qualitative and quantitative experiments conducted on the publicly available MFNet, RoadScene, and TNO datasets demonstrate the superiority of SRAFusion over state-of-the-art methods. Our code will be available at https://github.com/WenziCao/SRAFusion.
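The three-way region partition described in the abstract can be illustrated with a minimal sketch. This is not the authors' SRDB: the abstract does not state the decision rule, so the code below assumes that each segmentation network outputs a 2-D map of class ids, that class id 0 means "no semantic content", and that a pixel counts as bimodal whenever both maps mark it as foreground (regardless of whether the class ids agree). The function name `partition_regions` and the `Mask` type are hypothetical.

```python
from typing import List, Tuple

# 2-D grid of class ids; 0 is ASSUMED to mean "no high-level semantic content".
Mask = List[List[int]]


def partition_regions(ir_seg: Mask, vis_seg: Mask) -> Tuple[Mask, Mask, Mask]:
    """Return binary masks (bimodal, unimodal, none) for two segmentation maps.

    Illustrative rule only (an assumption, not the paper's SRDB):
      bimodal  -> both maps label the pixel as foreground
      unimodal -> exactly one map labels the pixel as foreground
      none     -> neither map labels the pixel as foreground
    """
    same_shape = len(ir_seg) == len(vis_seg) and all(
        len(a) == len(b) for a, b in zip(ir_seg, vis_seg)
    )
    if not same_shape:
        raise ValueError("segmentation maps must have the same shape")

    bimodal: Mask = []
    unimodal: Mask = []
    none: Mask = []
    for ir_row, vis_row in zip(ir_seg, vis_seg):
        ir_fg = [c != 0 for c in ir_row]
        vis_fg = [c != 0 for c in vis_row]
        bimodal.append([int(a and b) for a, b in zip(ir_fg, vis_fg)])
        unimodal.append([int(a != b) for a, b in zip(ir_fg, vis_fg)])
        none.append([int(not a and not b) for a, b in zip(ir_fg, vis_fg)])
    return bimodal, unimodal, none


# Toy 2x3 example: infrared and visible segmentation maps.
ir_map = [[0, 1, 1], [0, 0, 2]]
vis_map = [[0, 1, 0], [3, 0, 2]]
bimodal_mask, unimodal_mask, none_mask = partition_regions(ir_map, vis_map)
```

Under these assumptions the three masks are mutually exclusive and cover every pixel, so a region-specific loss could in principle be applied to each mask separately.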
Pages: 16