Spatial Attention-Guided Light Field Salient Object Detection Network With Implicit Neural Representation

Times cited: 2

Authors:

Zheng, Xin [1]

Li, Zhengqu [1]

Liu, Deyang [1]

Zhou, Xiaofei [2]

Shan, Caifeng [3,4]

Affiliations:

[1] Anqing Normal Univ, Sch Comp & Informat, Anqing 246000, Peoples R China

[2] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310061, Peoples R China

[3] Shandong Univ Sci & Technol, Coll Elect Engn & Automat, Qingdao 266590, Peoples R China

[4] Nanjing Univ, Sch Intelligence Sci & Technol, Nanjing 210023, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024 / Vol. 34 / No. 12

Funding:

National Natural Science Foundation of China;

Keywords:

Task analysis; Feature extraction; Image restoration; Three-dimensional displays; Object detection; Light fields; Fuses; Light field; salient object detection; implicit neural representation; spatial attention; DEPTH ESTIMATION;

DOI:

10.1109/TCSVT.2024.3437685

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronics and communication technology];

Subject classification codes:

0808 ; 0809 ;

Abstract:

Recently, many Light Field Salient Object Detection (LF SOD) methods have been proposed. However, guaranteeing the integrality of the generated salient object map and recovering more of its high-frequency details remain challenging. To this end, we propose a spatial attention-guided LF SOD network with implicit neural representation to further improve LF SOD performance. We adopt an encoder-decoder structure for model construction. To ensure the completeness of the generated salient object map, a multi-modal and multi-scale feature fusion module is designed in the encoder part to refine the salient regions within the all-in-focus image and to aggregate the focal stack and the all-in-focus image in a spatial attention-guided manner. To recover more high-frequency details of the obtained salient object map, an implicit detail restoration module is proposed in the decoder part. By virtue of implicit neural representation, we convert the detail restoration problem into a functional mapping problem. By further integrating the self-attention mechanism, the derived saliency map can be depicted at a more refined level. Comprehensive experimental results demonstrate the superiority of the proposed method. Ablation studies and visual comparisons further validate that the proposed method can guarantee the integrality and recover more high-frequency detail information of the obtained saliency map. The code is publicly available at https://github.com/ldyorchid/LFSOD-Net.
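
The abstract states that implicit neural representation turns detail restoration into a functional mapping problem. The sketch below illustrates that general idea only and is not the authors' implicit detail restoration module (see the linked repository for that). It assumes a small perceptron that maps a low-resolution feature vector plus a continuous coordinate offset to a saliency value, so the same function can be sampled at any output resolution. All names (`cell_and_offset`, `mlp`, `decode`), the layer sizes, and the random weights and features are hypothetical.

```python
import math
import random

def cell_and_offset(coord, n):
    """Map a coordinate in [-1, 1] to the index of the low-res cell that
    contains it, and to the offset from that cell's centre."""
    idx = min(n - 1, max(0, int(math.floor((coord + 1) / 2 * n))))
    centre = (idx + 0.5) / n * 2 - 1
    return idx, coord - centre

def mlp(x, w1, b1, w2, b2):
    """Two-layer perceptron (ReLU hidden layer) with a sigmoid output."""
    hidden = [max(0.0, sum(xi * w for xi, w in zip(x, row)) + b)
              for row, b in zip(w1, b1)]
    logit = sum(h * w for h, w in zip(hidden, w2)) + b2
    return 1.0 / (1.0 + math.exp(-logit))

def decode(feat, out_h, out_w, params):
    """Sample the implicit function on an out_h x out_w grid.

    feat[y][x] is a list of C feature values from a (hypothetical) encoder.
    Each output pixel queries the nearest feature vector, appends its
    relative offset (dy, dx), and lets the perceptron predict saliency.
    """
    h, w = len(feat), len(feat[0])
    out = []
    for i in range(out_h):
        y = (i + 0.5) / out_h * 2 - 1          # pixel centre in [-1, 1]
        iy, dy = cell_and_offset(y, h)
        row = []
        for j in range(out_w):
            x = (j + 0.5) / out_w * 2 - 1
            ix, dx = cell_and_offset(x, w)
            row.append(mlp(feat[iy][ix] + [dy, dx], *params))
        out.append(row)
    return out

# Illustrative setup: random (untrained) weights and features.
random.seed(0)
C, HIDDEN = 4, 8
w1 = [[random.gauss(0, 0.5) for _ in range(C + 2)] for _ in range(HIDDEN)]
b1 = [0.0] * HIDDEN
w2 = [random.gauss(0, 0.5) for _ in range(HIDDEN)]
b2 = 0.0
params = (w1, b1, w2, b2)
feat = [[[random.gauss(0, 1) for _ in range(C)] for _ in range(3)]
        for _ in range(3)]

sal_small = decode(feat, 6, 6, params)    # 2x the feature resolution
sal_large = decode(feat, 12, 12, params)  # 4x, same function, denser sampling
```

Because the output is defined as a function of continuous coordinates, a finer map comes from denser sampling instead of a fixed upsampling layer. In a trained model the weights would be learned, and this sketch omits the self-attention step the abstract mentions.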


Pages: 12437-12449

Number of pages: 13

