Salient object detection in low-light RGB-T scene via spatial-frequency cues mining

被引:0
作者
Yue, Huihui [1 ]
Guo, Jichang [1 ]
Yin, Xiangjun [1 ]
Zhang, Yi [1 ]
Zheng, Sida [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
关键词
RGB-T salient object detection; Low-light scenes; Spatial-frequency mining; Multi-modality; Multi-domain; SEGMENTATION; NETWORK;
D O I
10.1016/j.neunet.2024.106406
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Low -light conditions pose significant challenges to vision tasks, such as salient object detection (SOD), due to insufficient photons. Light -insensitive RGB-T SOD models mitigate the above problems to some extent, but they are limited in performance as they only focus on spatial feature fusion while ignoring the frequency discrepancy. To this end, we propose an RGB-T SOD model by mining spatial -frequency cues, called SFMNet, for low -light scenes. Our SFMNet consists of spatial -frequency feature exploration (SFFE) modules and spatial -frequency feature interaction (SFFI) modules. To be specific, the SFFE module aims to separate spatial -frequency features and adaptively extract high and low -frequency features. Moreover, the SFFI module integrates cross -modality and cross -domain information to capture effective feature representations. By deploying both modules in a top -down pathway, our method generates high -quality saliency predictions. Furthermore, we construct the first low -light RGB-T SOD dataset as a benchmark for evaluating performance. Extensive experiments demonstrate that our SFMNet can achieve higher accuracy than the existing models for low -light scenes.
引用
收藏
页数:11
相关论文
共 47 条
  • [1] A novel hybrid approach for salient object detection using local and global saliency in frequency domain
    Arya, Rinki
    Singh, Navjot
    Agrawal, R. K.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (14) : 8267 - 8287
  • [2] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [3] Chen Q, 2021, AAAI CONF ARTIF INTE, V35, P1063
  • [4] Chen ZY, 2020, AAAI CONF ARTIF INTE, V34, P10599
  • [5] Time-Frequency Analysis, Denoising, Compression, Segmentation, and Classification of PCG Signals
    Chowdhury, Md Tanzil Hoque
    Poudel, Khem Narayan
    Hu, Yating
    [J]. IEEE ACCESS, 2020, 8 : 160882 - 160890
  • [6] Deng ZJ, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P684
  • [7] Rethinking RGB-D Salient Object Detection: Models, Data Sets, and Large-Scale Benchmarks
    Fan, Deng-Ping
    Lin, Zheng
    Zhang, Zhao
    Zhu, Menglong
    Cheng, Ming-Ming
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (05) : 2075 - 2089
  • [8] Dual Attention Network for Scene Segmentation
    Fu, Jun
    Liu, Jing
    Tian, Haijie
    Li, Yong
    Bao, Yongjun
    Fang, Zhiwei
    Lu, Hanqing
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149
  • [9] Salient object detection from low contrast images based on local contrast enhancing and non-local feature learning
    Guo, Tengda
    Xu, Xin
    [J]. VISUAL COMPUTER, 2021, 37 (08) : 2069 - 2081
  • [10] Haohan Wang, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Proceedings, P8681, DOI 10.1109/CVPR42600.2020.00871