Hierarchical Two-stage modal fusion for Triple-modality salient object detection

被引：2

作者：

Wen, Hongwei ^{[1
,2
,3
]}

Song, Kechen ^{[1
,2
,3
]}

Huang, Liming ^{[1
,2
,3
]}

Wang, Han ^{[1
,2
,3
]}

Wang, Junyi ^{[4
]}

Yan, Yunhui ^{[1
,2
,3
]}

机构：

[1] Northeastern Univ, Sch Mech Engn & Automat, Shenyang 110819, Peoples R China

[2] Northeastern Univ, Natl Frontiers Sci Ctr Ind Intelligence & Syst Opt, Shenyang 110819, Peoples R China

[3] Northeastern Univ, Key Lab Data Analyt & Optimizat Smart Ind, Minist Educ, Shenyang, Peoples R China

[4] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110819, Peoples R China

来源：

MEASUREMENT | 2023年 / 218卷

基金：

中国国家自然科学基金;

关键词：

Triple-modality salient object detection; Two-stage fusion; Accurate location; Feature-level correlation; NETWORK;

D O I：

10.1016/j.measurement.2023.113180

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Salient object detection (SOD) is essential for home service robot grasping technology. Recently, studies have shown that using triple-modality (RGB-Depth-Thermal infrared, RGB-D-T) information can significantly improve the detection effect of salient object detection. The location accuracy and completeness of the salient object play a crucial role in the subsequent grasping process. This seriously affects the robot's judgment of the object position and grasping position. Therefore, the critical problem of triple-modality SOD technology in robot grasping is locating the salient object accurately and detecting the salient object completely. Consequently, we propose a triple-modality salient object detection method based on hierarchical two-stage modal fusion. In the first fusion stage, we use the triple-modal information rationally to locate the salient object accurately. Considering the properties of the different modal information, we use the depth information to supplement and improve the visible light and thermal infrared information through an accurate selection fusion module (ASFM). Then in the second stage, we use the feature correlation enhancement module (FCEM) to realize the correlation of the different modal salient features. FCEM can perfectly combine the different modal information and make the salient object more complete. Comparative and challenging experiments demonstrate that the proposed method outperforms 14 state-of-the-art methods on the VDT2048 dataset. The code is available at: https://github. com/VDT-2048/HTMF.

引用

页数：15

共 79 条

[71] Revisiting Feature Fusion for RGB-T Salient Object Detection [J].

Zhang, Qiang ;

Xiao, Tonglin ;

Huang, Nianchang ;

Zhang, Dingwen ;

Han, Jungong .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (05) :1804-1818

[72] RGB-T Salient Object Detection via Fusing Multi-Level CNN Features [J].

Zhang, Qiang ;

Huang, Nianchang ;

Yao, Lin ;

Zhang, Dingwen ;

Shan, Caifeng ;

Han, Jungong .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :3321-3335

[73] Depth Quality-Inspired Feature Manipulation for Efficient RGB-D Salient Object Detection [J].

Zhang, Wenbo ;

Ji, Ge-Peng ;

Wang, Zhuo ;

Fu, Keren ;

Zhao, Qijun .

PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, :731-740

[74] Signal Detection and Classification in Shared Spectrum: A Deep Learning Approach [J].

Zhang, Wenhan ;

Feng, Mingjie ;

Krunz, Marwan ;

Abyaneh, Amir Hossein Yazdani .

IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2021), 2021,

[75] RGB-D Salient Object Detection With Ubiquitous Target Awareness [J].

Zhao, Yifan ;

Zhao, Jiawei ;

Li, Jia ;

Chen, Xiaowu .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :7717-7731

[76] Specificity-preserving RGB-D Saliency Detection [J].

Zhou, Tao ;

Fu, Huazhu ;

Chen, Geng ;

Zhou, Yi ;

Fan, Deng-Ping ;

Shao, Ling .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :4661-4671

[77] APNet: Adversarial Learning Assistance and Perceived Importance Fusion Network for All-Day RGB-T Salient Object Detection [J].

Zhou, Wujie ;

Zhu, Yun ;

Lei, Jingsheng ;

Wan, Jian ;

Yu, Lu .

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (04) :957-968

[78] ECFFNet: Effective and Consistent Feature Fusion Network for RGB-T Salient Object Detection [J].

Zhou, Wujie ;

Guo, Qinling ;

Lei, Jingsheng ;

Yu, Lu ;

Hwang, Jenq-Neng .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) :1224-1235

[79]

Zhu Heqin., 2022, arXiv

← 1 2 3 4 5 6 7 8 →