An interactively reinforced paradigm for joint infrared-visible image fusion and saliency object detection

被引:98
作者
Wang, Di [1 ,4 ]
Liu, Jinyuan [3 ]
Liu, Risheng [2 ,4 ]
Fan, Xin [2 ,4 ]
机构
[1] Dalian Univ Technol, Sch Software Technol, Dalian 116620, Peoples R China
[2] Dalian Univ Technol, DUT RU Int Sch Informat Sci & Engn, Dalian 116620, Peoples R China
[3] Dalian Univ Technol, Sch Mech Engn, Dalian 116023, Peoples R China
[4] Key Lab Ubiquitous Network & Serv Software Liaonin, Dalian 116620, Peoples R China
基金
中国国家自然科学基金;
关键词
Image fusion; Infrared and visible image; Multi-modal salient object detection; Interactively reinforced paradigm; Interactive loop learning strategy; MULTISCALE TRANSFORM; NETWORK; PERFORMANCE; EFFICIENT;
D O I
10.1016/j.inffus.2023.101828
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This research focuses on the discovery and localization of hidden objects in the wild and serves unmanned systems. Through empirical analysis, infrared and visible image fusion (IVIF) enables hard-to-find objects apparent, whereas multimodal salient object detection (SOD) accurately delineates the precise spatial location of objects within the picture. Their common characteristic of seeking complementary cues from different source images motivates us to explore the collaborative relationship between Fusion and Salient object detection tasks on infrared and visible images via an Interactively Reinforced multi-task paradigm for the first time, termed IRFS. To the seamless bridge of multimodal image fusion and SOD tasks, we specifically develop a Feature Screening-based Fusion subnetwork (FSFNet) to screen out interfering features from source images, thereby preserving saliency-related features. After generating the fused image through FSFNet, it is then fed into the subsequent Fusion-Guided Cross-Complementary SOD subnetwork (FC2Net) as the third modality to drive the precise prediction of the saliency map by leveraging the complementary information derived from the fused image. In addition, we develop an interactive loop learning strategy to achieve the mutual reinforcement of IVIF and SOD tasks with a shorter training period and fewer network parameters. Comprehensive experiment results demonstrate that the seamless bridge of IVIF and SOD mutually enhances their performance, and highlights their superiority. This code is available at https://github.com/wdhudiekou/IRFS.
引用
收藏
页数:13
相关论文
共 82 条
[1]  
Bavirisetti DP, 2017, 2017 20TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), P701
[2]   Anomaly Detection in Autonomous Driving: A Survey [J].
Bogdoll, Daniel ;
Nitsche, Maximilian ;
Zoellner, J. Marius .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, :4487-4498
[3]   Infrared and visible image fusion based on target-enhanced multiscale transform decomposition [J].
Chen, Jun ;
Li, Xuejiao ;
Luo, Linbo ;
Mei, Xiaoguang ;
Ma, Jiayi .
INFORMATION SCIENCES, 2020, 508 :64-78
[4]   HDRUNet: Single Image HDR Reconstruction with Denoising and Dequantization [J].
Chen, Xiangyu ;
Liu, Yihao ;
Zhang, Zhengwen ;
Qiao, Yu ;
Dong, Chao .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, :354-363
[5]   Region-based multimodal image fusion using ICA bases [J].
Cvejic, Nedeljko ;
Lewis, John ;
Bull, David ;
Canagarajah, Nishan .
2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, :1801-+
[6]  
Fu Y., 2021, ABS210713967 CORR
[7]   Infrared and visible image fusion with the use of multi-scale edge-preserving decomposition and guided image filter [J].
Gan, Wei ;
Wu, Xiaohong ;
Wu, Wei ;
Yang, Xiaomin ;
Ren, Chao ;
He, Xiaohai ;
Liu, Kai .
INFRARED PHYSICS & TECHNOLOGY, 2015, 72 :37-51
[8]   Unified Information Fusion Network for Multi-Modal RGB-D and RGB-T Salient Object Detection [J].
Gao, Wei ;
Liao, Guibiao ;
Ma, Siwei ;
Li, Ge ;
Liang, Yongsheng ;
Lin, Weisi .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) :2091-2106
[9]   Saliency Guided Image Detail Enhancement [J].
Ghosh, Sanjay ;
Gavaskar, Ruturaj G. ;
Chaudhury, Kunal N. .
2019 25TH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2019,
[10]   High-Level Task-Driven Single Image Deraining: Segmentation in Rainy Days [J].
Guo, Mengxi ;
Chen, Mingtao ;
Ma, Cong ;
Li, Yuan ;
Li, Xianfeng ;
Xie, Xiaodong .
NEURAL INFORMATION PROCESSING, ICONIP 2020, PT I, 2020, 12532 :350-362