Motion-Aware Memory Network for Fast Video Salient Object Detection

被引：5

作者：

Zhao, Xing ^{[1
]}

Liang, Haoran ^{[1
]}

Li, Peipei ^{[2
]}

Sun, Guodao ^{[1
]}

Zhao, Dongdong ^{[1
]}

Liang, Ronghua ^{[1
]}

He, Xiaofei ^{[1
]}

机构：

[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China

[2] Zhejiang Univ Technol, Coll Mech Engn, Hangzhou 310023, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2024年 / 33卷

关键词：

Video salient object detection; salient object detection; memory network; feature fusion; OPTIMIZATION;

D O I：

10.1109/TIP.2023.3348659

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Previous methods based on 3DCNN, convLSTM, or optical flow have achieved great success in video salient object detection (VSOD). However, these methods still suffer from high computational costs or poor quality of the generated saliency maps. To address this, we design a space-time memory (STM)-based network that employs a standard encoder-decoder architecture. During the encoding stage, we extract high-level temporal features from the current frame and its adjacent frames, which is more efficient and practical than methods reliant on optical flow. During the decoding stage, we introduce an effective fusion strategy for both spatial and temporal branches. The semantic information of the high-level features is used to improve the object details in the low-level features. Subsequently, spatiotemporal features are methodically derived step by step to reconstruct the saliency maps. Moreover, inspired by the boundary supervision prevalent in image salient object detection (ISOD), we design a motion-aware loss that predicts object boundary motion, and simultaneously perform multitask learning for VSOD and object motion prediction. This can further enhance the model's capability to accurately extract spatiotemporal features while maintaining object integrity. Extensive experiments on several datasets demonstrate the effectiveness of our method and can achieve state-of-the-art metrics on some datasets. Our proposed model does not require optical flow or additional preprocessing, and can reach an impressive inference speed of nearly 100 FPS.

引用

页码：709 / 721

页数：13

共 50 条

[11] Spatial context-aware network for salient object detection
Kong, Yuqiu
Feng, Mengyang
Li, Xin
Lu, Huchuan
Liu, Xiuping
Yin, Baocai
PATTERN RECOGNITION, 2021, 114
[12] Boosting Feature-Aware Network for Salient Object Detection
Zheng, Jianwei
Gu, Yubin
Feng, Yuchao
Xu, Jinshan
Zhang, Meiyu
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 14 - 26
[13] Flow driven attention network for video salient object detection
Zhou, Feng
Shuai, Hui
Liu, Qingshan
Guo, Guodong
IET IMAGE PROCESSING, 2020, 14 (06) : 997 - 1004
[14] DS-Net: Dynamic spatiotemporal network for video salient object detection
Liu, Jing
Wang, Jiaxiang
Wang, Weikang
Su, Yuting
DIGITAL SIGNAL PROCESSING, 2022, 130
[15] Multi-Stream Temporally Enhanced Network for Video Salient Object Detection
Xu, Dan
Ru, Jiale
Shi, Jinlong
CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (01): : 85 - 104
[16] Cross Complementary Fusion Network for Video Salient Object Detection
Wang, Ziyang
Li, Junxia
Pan, Zefeng
IEEE ACCESS, 2020, 8 : 201259 - 201270
[17] PSNet: Parallel Symmetric Network for Video Salient Object Detection
Cong, Runmin
Song, Weiyu
Lei, Jianjun
Yue, Guanghui
Zhao, Yao
Kwong, Sam
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 7 (02): : 402 - 414
[18] A novel spatiotemporal attention enhanced discriminative network for video salient object detection
Liu, Bing
Mu, Kezhou
Xu, Mingzhu
Wang, Fangyuan
Feng, Lei
APPLIED INTELLIGENCE, 2022, 52 (06) : 5922 - 5937
[19] A Novel Video Salient Object Detection Method via Semisupervised Motion Quality Perception
Chen, Chenglizhao
Song, Jia
Peng, Chong
Wang, Guodong
Fang, Yuming
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 2732 - 2745
[20] Parallax-Aware Network for Light Field Salient Object Detection
Yuan, Bo
Jiang, Yao
Fu, Keren
Zhao, Qijun
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 810 - 814

← 1 2 3 4 5 →