RGB-D Visual Saliency Detection Algorithm Based on Information Guided and Multimodal Feature Fusion

Cited by: 1
Authors
Xu, Lijuan [1]
Xu, Xuemiao [2]
Affiliations
[1] Guangzhou Huashang College, School of Data Science, Guangzhou 511300, People's Republic of China
[2] South China University of Technology, School of Future Technology, Guangzhou 510006, People's Republic of China
Keywords
Information analysis; multimodal features; RGB-D; Markov chain model; visual inspection; ablation experiment
DOI
10.1109/ACCESS.2023.3346970
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the development of information technology and the spread of electronic devices, images and videos have become important carriers of information in daily life. Extracting valuable content from massive data quickly has become an important topic in computer vision research. Salient object detection, which models human visual attention, is increasingly applied in computer processing. However, current color-depth (RGB-D) models still make insufficient use of depth cues, and image quality still leaves significant room for improvement. To address this, an improved RGB-D detection model based on information guidance and multi-feature fusion is proposed. An absorbing Markov chain model is introduced to optimize the guidance provided by low-level, middle-level, and high-level saliency maps, so that different kinds of feature information are captured. The network is then guided progressively through feature encoding, multi-scale and multi-attention models, and an attention refinement mechanism. Experiments showed that the proposed fusion model improved classification accuracy by 5.23% on average, with an error value below 0.1. Its effectiveness exceeded 92% on all four quantitative indicators. The system's detection response rate exceeded 93%, although accuracy decreases in cases limited by the target object. The algorithm can serve as a reference for target localization and recognition and for virtual scene detection.
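The abstract names an absorbing Markov chain as the tool for guiding the saliency maps but does not give its formulation. The sketch below shows only the textbook quantity such models are usually built on: the expected number of steps a random walk spends among transient nodes (for example image regions) before it is absorbed (for example by boundary nodes). The function names, the toy three-node graph, and the min-max normalization are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def absorbed_time(w_transient, w_absorbing):
    """Expected steps before absorption for each transient node.

    w_transient: (t, t) non-negative affinities between transient nodes.
    w_absorbing: (t, r) non-negative affinities from transient to absorbing nodes.
    Both matrices are hypothetical inputs; how affinities are built is not shown.
    """
    w_transient = np.asarray(w_transient, dtype=float)
    w_absorbing = np.asarray(w_absorbing, dtype=float)
    # Row-normalize so each transient node's outgoing probabilities sum to 1.
    degree = w_transient.sum(axis=1) + w_absorbing.sum(axis=1)
    q = w_transient / degree[:, None]
    # Absorbed time y = (I - Q)^{-1} 1, solved without forming the inverse.
    n = q.shape[0]
    return np.linalg.solve(np.eye(n) - q, np.ones(n))


def normalize_to_saliency(y):
    """Min-max scale absorbed times to [0, 1] (an assumed convention)."""
    y = np.asarray(y, dtype=float)
    span = y.max() - y.min()
    return (y - y.min()) / span if span > 0 else np.zeros_like(y)


# Toy chain: node 0 touches the single absorbing node, node 2 is farthest away.
demo_w = [[0, 1, 0],
          [1, 0, 1],
          [0, 1, 0]]
demo_b = [[1], [0], [0]]
demo_y = absorbed_time(demo_w, demo_b)       # [5., 8., 9.]
demo_saliency = normalize_to_saliency(demo_y)  # [0., 0.75, 1.]
```

Nodes that are far from the absorbing set take longer to be absorbed and therefore score higher, which is why absorbed time is commonly read as a saliency cue when the absorbing nodes are placed on the image boundary.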
Pages: 268 - 280
Number of pages: 13
Related papers
24 in total
  • [1] [Anonymous], 2023, Int. J. Math. Oper. Res., V24, P29, DOI 10.1504/IJMOR.2023.128637
  • [2] [Anonymous], 2023, Int. J. Eng., V36, P1440, DOI 10.5829/IJE.2023.36.08B.04.[20]C
  • [3] [Anonymous], 2023, Appl. Intell., V53, P7957, DOI 10.1007/s10489-022-03612-2
  • [4] Human Vision Attention Mechanism-Inspired Temporal-Spatial Feature Pyramid for Video Saliency Detection
    Chang, Qinyao
    Zhu, Shiping
    [J]. COGNITIVE COMPUTATION, 2023, 15 (03) : 856 - 868
  • [5] Modality-Induced Transfer-Fusion Network for RGB-D and RGB-T Salient Object Detection
    Chen, Gang
    Shao, Feng
    Chai, Xiongli
    Chen, Hangwei
    Jiang, Qiuping
    Meng, Xiangchao
    Ho, Yo-Sung
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (04) : 1787 - 1801
  • [6] Chen Q, 2021, AAAI CONF ARTIF INTE, V35, P1063
  • [7] CFIDNet: cascaded feature interaction decoder for RGB-D salient object detection
    Chen, Tianyou
    Hu, Xiaoguang
    Xiao, Jin
    Zhang, Guofeng
    Wang, Shaojie
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (10) : 7547 - 7563
  • [8] Unified Information Fusion Network for Multi-Modal RGB-D and RGB-T Salient Object Detection
    Gao, Wei
    Liao, Guibiao
    Ma, Siwei
    Li, Ge
    Liang, Yongsheng
    Lin, Weisi
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2091 - 2106
  • [9] Improved collaborative filtering personalized recommendation algorithm based on k-means clustering and weighted similarity on the reduced item space
    Huang, Jiaquan
    Jia, Zhen
    Zuo, Peng
    [J]. MATHEMATICAL MODELLING AND CONTROL, 2023, 3 (01) : 39 - 49
  • [10] MoADNet: Mobile Asymmetric Dual-Stream Networks for Real-Time and Lightweight RGB-D Salient Object Detection
    Jin, Xiao
    Yi, Kang
    Xu, Jing
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7632 - 7645