RGB-D Visual Saliency Detection Algorithm Based on Information Guided and Multimodal Feature Fusion

被引：1

作者：

Xu, Lijuan ^{[1
]}

Xu, Xuemiao ^{[2
]}

机构：

[1] Guangzhou Huashang Coll, Sch Data Sci, Guangzhou 511300, Peoples R China

[2] South China Univ Technol, Sch Future Technol, Guangzhou 510006, Peoples R China

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Information analysis; multimodal features; RGB-D; Markov chain model; visual inspection; ablation experiment;

D O I：

10.1109/ACCESS.2023.3346970

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the development of scientific information technology and the popularization of electronic devices, images and videos have become very important forms of information expression and carriers in our current lives. Accelerating the mining of valuable information content from massive data has become a very important aspect of current computer vision research. The saliency object detection method, which is related to human visual attention, is gradually being applied in computer processing. However, in current color depth models, the association mining of data depth clues is still far from sufficient, and there is still significant room for improvement in image quality. Based on this, an improved color depth detection model is proposed for information guided and multi feature fusion, and an absorption Markov model is introduced to optimize the guidance of low-level, middle-level, and high-level saliency maps, grasping different feature information contents. Subsequently, the gradual guidance of the network is achieved from aspects such as feature encoding, multi-scale and multi attention models, and attention refinement mechanisms. The experimental analysis of the fusion model proposed in the study showed that the average classification improvement accuracy of the fusion model reached 5.23%, and its error value was less than 0.1. The effectiveness on all four quantitative indicators exceeded 92%. The system's detection response rate exceeded 93%, which is limited by the target object and results in a decrease in accuracy. This algorithm can provide reference value and means for target localization recognition and virtual scene detection.

引用

页码：268 / 280

页数：13

共 24 条

[1] [Anonymous], 2023, Int. J. Math. Oper. Res., V24, P29, DOI [10.1504/IJMOR.2023.128637, DOI 10.1504/IJMOR.2023.128637]
[2] [Anonymous], 2023, Int. J. Eng., V36, P1440, DOI [10.5829/IJE.2023.36.08B.04.[20]C, DOI 10.5829/IJE.2023.36.08B.04.[20]C]
[3] [Anonymous], 2023, Int. J. Speech Technol., V53, P7957, DOI [10.1007/s10489-022-03612-2, DOI 10.1007/S10489-022-03612-2]
[4] Human Vision Attention Mechanism-Inspired Temporal-Spatial Feature Pyramid for Video Saliency Detection
Chang, Qinyao
Zhu, Shiping
[J]. COGNITIVE COMPUTATION, 2023, 15 (03) : 856 - 868
[5] Modality-Induced Transfer-Fusion Network for RGB-D and RGB-T Salient Object Detection
Chen, Gang
Shao, Feng
Chai, Xiongli
Chen, Hangwei
Jiang, Qiuping
Meng, Xiangchao
Ho, Yo-Sung
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (04) : 1787 - 1801
[6] Chen Q, 2021, AAAI CONF ARTIF INTE, V35, P1063
[7] CFIDNet: cascaded feature interaction decoder for RGB-D salient object detection
Chen, Tianyou
Hu, Xiaoguang
Xiao, Jin
Zhang, Guofeng
Wang, Shaojie
[J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (10) : 7547 - 7563
[8] Unified Information Fusion Network for Multi-Modal RGB-D and RGB-T Salient Object Detection
Gao, Wei
Liao, Guibiao
Ma, Siwei
Li, Ge
Liang, Yongsheng
Lin, Weisi
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2091 - 2106
[9] Improved collaborative filtering personalized recommendation algorithm based on k-means clustering and weighted similarity on the reduced item space
Huang, Jiaquan
Jia, Zhen
Zuo, Peng
[J]. MATHEMATICAL MODELLING AND CONTROL, 2023, 3 (01): : 39 - 49
[10] MoADNet: Mobile Asymmetric Dual-Stream Networks for Real-Time and Lightweight RGB-D Salient Object Detection
Jin, Xiao
Yi, Kang
Xu, Jing
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7632 - 7645

← 1 2 3 →