Pyramid Constrained Self-Attention Network for Fast Video Salient Object Detection

被引：0

作者：

Gu, Yuchao ^{[1
]}

Wang, Lijuan ^{[1
]}

Wang, Ziqin ^{[2
]}

Liu, Yun ^{[1
]}

Cheng, Ming-Ming ^{[1
]}

Lu, Shao-Ping ^{[1
]}

机构：

[1] Nankai Univ, CS, TKLNDST, Tianjin, Peoples R China

[2] Univ Sydney, Sydney, NSW, Australia

来源：

THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2020年 / 34卷

关键词：

SEGMENTATION; OPTIMIZATION;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Spatiotemporal information is essential for video salient object detection (VSOD) due to the highly attractive object motion for human's attention. Previous VSOD methods usually use Long Short-Term Memory (LSTM) or 3D ConvNet (C3D), which can only encode motion information through step-by-step propagation in the temporal domain. Recently, the non-local mechanism is proposed to capture long-range dependencies directly. However, it is not straightforward to apply the non-local mechanism into VSOD, because i) it fails to capture motion cues and tends to learn motion-independent global contexts; ii) its computation and memory costs are prohibitive for video dense prediction tasks such as VSOD. To address the above problems, we design a Constrained Self-Attention (CSA) operation to capture motion cues, based on the prior that objects always move in a continuous trajectory. We group a set of CSA operations in Pyramid structures (PCSA) to capture objects at various scales and speeds. Extensive experimental results demonstrate that our method outperforms previous state-of-the-art methods in both accuracy and speed (110 FPS on a single Titan Xp) on five challenge datasets. Our code is available at https://github.com/guyuchao/PyramidCSA.

引用

页码：10869 / 10876

页数：8

共 50 条

[1] Salient Object Detection Combining a Self-Attention Module and a Feature Pyramid Network
Ren, Guangyu
Dai, Tianhong
Barmpoutis, Panagiotis
Stathaki, Tania
ELECTRONICS, 2020, 9 (10) : 1 - 13
[2] Video Salient Object Detection Using Multi-Scale Self-Attention
Liu, Jiahao (jiahao.liu@akane.waseda.jp), 1600, Institute of Electrical and Electronics Engineers Inc.
[3] Salient Object Detection with Pyramid Attention and Salient Edges
Wang, Wenguan
Zhao, Shuyang
Shen, Jianbing
Hoi, Steven C. H.
Borji, Ali
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1448 - 1457
[4] Optical Flow Guided Pyramid Network for Video Salient Object Detection
Tang, Tinglong
Hua, Sheng
Sun, Shuifa
Wu, Yirong
Zhu, Yuqi
Yue, Chonghao
PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 723 - 728
[5] CAG-FPN: CHANNEL SELF-ATTENTION GUIDED FEATURE PYRAMID NETWORK FOR OBJECT DETECTION
Chang, Jie
Dai, Huhe
Zheng, Yuan
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024, : 9616 - 9620
[6] Flow driven attention network for video salient object detection
Zhou, Feng
Shuai, Hui
Liu, Qingshan
Guo, Guodong
IET IMAGE PROCESSING, 2020, 14 (06) : 997 - 1004
[7] Dual pyramid network for salient object detection
Xu, Xuemiao
Chen, Jiaxing
Zhang, Huaidong
Han, Guoqiang
NEUROCOMPUTING, 2020, 375 : 113 - 123
[8] A fast self-attention cascaded network for object detection in large scene remote sensing images
Hua, Xia
Wang, Xinqing
Rui, Ting
Zhang, Haitao
Wang, Dong
APPLIED SOFT COMPUTING, 2020, 94 (94)
[9] Annular Feature Pyramid Network for Salient Object Detection
Zheng, Tao
Li, Bo
Liu, Jiajia
2019 ELEVENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI 2019), 2019, : 1 - 6
[10] Cross-stage feature fusion and efficient self-attention for salient object detection
Xia, Xiaofeng
Ma, Yingdong
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104

← 1 2 3 4 5 →