Salient object detection in low-light RGB-T scene via spatial-frequency cues mining

Cited by: 0

Authors:

Yue, Huihui [1]

Guo, Jichang [1]

Yin, Xiangjun [1]

Zhang, Yi [1]

Zheng, Sida [1]

Affiliations:

[1] School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, People's Republic of China

Source:

Neural Networks | 2024 / Vol. 178

Keywords:

RGB-T salient object detection; Low-light scenes; Spatial-frequency mining; Multi-modality; Multi-domain; SEGMENTATION; NETWORK

DOI:

10.1016/j.neunet.2024.106406

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Low-light conditions pose significant challenges to vision tasks, such as salient object detection (SOD), due to insufficient photons. Light-insensitive RGB-T SOD models mitigate the above problems to some extent, but they are limited in performance as they only focus on spatial feature fusion while ignoring the frequency discrepancy. To this end, we propose an RGB-T SOD model by mining spatial-frequency cues, called SFMNet, for low-light scenes. Our SFMNet consists of spatial-frequency feature exploration (SFFE) modules and spatial-frequency feature interaction (SFFI) modules. To be specific, the SFFE module aims to separate spatial-frequency features and adaptively extract high and low-frequency features. Moreover, the SFFI module integrates cross-modality and cross-domain information to capture effective feature representations. By deploying both modules in a top-down pathway, our method generates high-quality saliency predictions. Furthermore, we construct the first low-light RGB-T SOD dataset as a benchmark for evaluating performance. Extensive experiments demonstrate that our SFMNet can achieve higher accuracy than the existing models for low-light scenes.
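The abstract does not say how the SFFE module separates frequency bands, so the following is only a rough illustration of the general idea of splitting a 2-D map into low- and high-frequency parts. It uses a fixed circular mask in the Fourier domain rather than the adaptive extraction the paper describes; the function name `split_frequency_bands` and the `cutoff` parameter are assumptions made for this sketch and are not taken from SFMNet.

```python
import numpy as np


def split_frequency_bands(feature, cutoff=0.25):
    """Split a 2-D array into low- and high-frequency components.

    `cutoff` is a hypothetical parameter: the radius of the low-pass
    disc in cycles per sample (the Nyquist limit is 0.5).
    """
    h, w = feature.shape
    spectrum = np.fft.fft2(feature)

    # Frequency coordinate of every FFT bin along each axis.
    fy = np.fft.fftfreq(h)[:, None]
    fx = np.fft.fftfreq(w)[None, :]

    # Bins inside the disc are "low", everything else is "high".
    low_mask = np.sqrt(fx ** 2 + fy ** 2) <= cutoff

    # The mask is symmetric in frequency, so both inverse transforms
    # are real up to rounding error; .real drops that residue.
    low = np.fft.ifft2(spectrum * low_mask).real
    high = np.fft.ifft2(spectrum * ~low_mask).real
    return low, high


# Stand-in for a single-channel image or feature map.
rng = np.random.default_rng(0)
image = rng.random((32, 32))
low, high = split_frequency_bands(image)
```

Because the two masks are complementary, `low + high` reconstructs the input, and the mean intensity (the zero-frequency term) ends up entirely in `low`. A learned model would typically replace the fixed disc with masks or filters that adapt to the input.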

Pages: 11
