Cross-modal collaborative propagation for RGB-T saliency detection

被引:2
作者
Yu, Xiaosheng [1 ]
Pang, Yu [2 ]
Chi, Jianning [1 ]
Qi, Qi [3 ]
机构
[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110819, Peoples R China
[2] Shenyang Univ Technol, Sch Artificial Intelligence, Shenyang 110870, Peoples R China
[3] Liaoning Prov Party Comm, Party Sch, Dept Decis Consulting, Shenyang 110004, Peoples R China
关键词
Saliency detection; Collaborative learning; Propagation mechanism; Deep features optimization; Multi-modal integration; IMAGE;
D O I
10.1007/s00371-023-03085-5
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recently, RGB-T saliency detection becomes gradually a hot topic due to the fact that RGB-T multi-modal data could overcome the limitation of conventional RGB data in some cases. However, existing RGB-T saliency detection methods usually fail to take both advantages of two modalities and cannot boost performance effectively. Therefore, we achieve RGB-T saliency detection via a novel method, namely cross-modal collaborative propagation (CMCP), which contains a novel saliency propagation mechanism and a novel cross-modal collaborative learning framework relied on the proposed propagation mechanism. More specifically, we firstly propose a novel saliency propagation method and then, respectively, regard two modalities as inputs to generate RGB-induced and thermal-induced propagation mechanisms. To bridge RGB-T modalities, a novel cross-modal collaborative learning framework between RGB-induced and thermal-induced propagation mechanisms is devised to optimize, respectively, two propagation results. In other words, two modalities constantly extract supervision information to help the opposite side to refine propagation result until attaining a stable state. Finally, we integrate two modalities-induced propagation results into a refined saliency map. We compare our model with the state-of-the-art RGB-T and RGB saliency detection algorithms on three benchmark datasets, and experimental results show that the proposed CMCP achieves the significant improvement.
引用
收藏
页码:4337 / 4354
页数:18
相关论文
共 48 条
[1]   SLIC Superpixels Compared to State-of-the-Art Superpixel Methods [J].
Achanta, Radhakrishna ;
Shaji, Appu ;
Smith, Kevin ;
Lucchi, Aurelien ;
Fua, Pascal ;
Suesstrunk, Sabine .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) :2274-2281
[2]  
Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596
[3]   Generative Adversarial Networks for Extreme Learned Image Compression [J].
Agustsson, Eirikur ;
Tschannen, Michael ;
Mentzer, Fabian ;
Timofte, Radu ;
Van Gool, Luc .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :221-231
[4]   Multi-complementary and unlab ele d learning for arbitrary losses and models [J].
Cao, Yuzhou ;
Liu, Shuqi ;
Xu, Yitian .
PATTERN RECOGNITION, 2022, 124
[5]   Discriminative saliency propagation with sink points [J].
Chen, Shuhan ;
Zheng, Ling ;
Hu, Xuelong ;
Zhou, Ping .
PATTERN RECOGNITION, 2016, 60 :2-12
[6]   Global Contrast based Salient Region Detection [J].
Cheng, Ming-Ming ;
Zhang, Guo-Xin ;
Mitra, Niloy J. ;
Huang, Xiaolei ;
Hu, Shi-Min .
2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, :409-416
[7]   Does Thermal Really Always Matter for RGB-T Salient Object Detection? [J].
Cong, Runmin ;
Zhang, Kepu ;
Zhang, Chen ;
Zheng, Feng ;
Zhao, Yao ;
Huang, Qingming ;
Kwong, Sam .
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 :6971-6982
[8]   Learning Discriminative Subspaces on Random Contrasts for Image Saliency Analysis [J].
Fang, Shu ;
Li, Jia ;
Tian, Yonghong ;
Huang, Tiejun ;
Chen, Xiaowu .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (05) :1095-1108
[9]   Attentive Feedback Network for Boundary-Aware Salient Object Detection [J].
Feng, Mengyang ;
Lu, Huchuan ;
Ding, Errui .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1623-1632
[10]   Uncertainty-Aware Unsupervised Domain Adaptation in Object Detection [J].
Guan, Dayan ;
Huang, Jiaxing ;
Xiao, Aoran ;
Lu, Shijian ;
Cao, Yanpeng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 :2502-2514