Pyramid Constrained Self-Attention Network for Fast Video Salient Object Detection

被引:0
|
作者
Gu, Yuchao [1 ]
Wang, Lijuan [1 ]
Wang, Ziqin [2 ]
Liu, Yun [1 ]
Cheng, Ming-Ming [1 ]
Lu, Shao-Ping [1 ]
机构
[1] Nankai Univ, CS, TKLNDST, Tianjin, Peoples R China
[2] Univ Sydney, Sydney, NSW, Australia
来源
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2020年 / 34卷
关键词
SEGMENTATION; OPTIMIZATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spatiotemporal information is essential for video salient object detection (VSOD) due to the highly attractive object motion for human's attention. Previous VSOD methods usually use Long Short-Term Memory (LSTM) or 3D ConvNet (C3D), which can only encode motion information through step-by-step propagation in the temporal domain. Recently, the non-local mechanism is proposed to capture long-range dependencies directly. However, it is not straightforward to apply the non-local mechanism into VSOD, because i) it fails to capture motion cues and tends to learn motion-independent global contexts; ii) its computation and memory costs are prohibitive for video dense prediction tasks such as VSOD. To address the above problems, we design a Constrained Self-Attention (CSA) operation to capture motion cues, based on the prior that objects always move in a continuous trajectory. We group a set of CSA operations in Pyramid structures (PCSA) to capture objects at various scales and speeds. Extensive experimental results demonstrate that our method outperforms previous state-of-the-art methods in both accuracy and speed (110 FPS on a single Titan Xp) on five challenge datasets. Our code is available at https://github.com/guyuchao/PyramidCSA.
引用
收藏
页码:10869 / 10876
页数:8
相关论文
共 50 条
  • [1] Salient Object Detection Combining a Self-Attention Module and a Feature Pyramid Network
    Ren, Guangyu
    Dai, Tianhong
    Barmpoutis, Panagiotis
    Stathaki, Tania
    ELECTRONICS, 2020, 9 (10) : 1 - 13
  • [2] Video Salient Object Detection Using Multi-Scale Self-Attention
    Liu, Jiahao (jiahao.liu@akane.waseda.jp), 1600, Institute of Electrical and Electronics Engineers Inc.
  • [3] Salient Object Detection with Pyramid Attention and Salient Edges
    Wang, Wenguan
    Zhao, Shuyang
    Shen, Jianbing
    Hoi, Steven C. H.
    Borji, Ali
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1448 - 1457
  • [4] Optical Flow Guided Pyramid Network for Video Salient Object Detection
    Tang, Tinglong
    Hua, Sheng
    Sun, Shuifa
    Wu, Yirong
    Zhu, Yuqi
    Yue, Chonghao
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 723 - 728
  • [5] CAG-FPN: CHANNEL SELF-ATTENTION GUIDED FEATURE PYRAMID NETWORK FOR OBJECT DETECTION
    Chang, Jie
    Dai, Huhe
    Zheng, Yuan
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024, : 9616 - 9620
  • [6] Flow driven attention network for video salient object detection
    Zhou, Feng
    Shuai, Hui
    Liu, Qingshan
    Guo, Guodong
    IET IMAGE PROCESSING, 2020, 14 (06) : 997 - 1004
  • [7] Dual pyramid network for salient object detection
    Xu, Xuemiao
    Chen, Jiaxing
    Zhang, Huaidong
    Han, Guoqiang
    NEUROCOMPUTING, 2020, 375 : 113 - 123
  • [8] A fast self-attention cascaded network for object detection in large scene remote sensing images
    Hua, Xia
    Wang, Xinqing
    Rui, Ting
    Zhang, Haitao
    Wang, Dong
    APPLIED SOFT COMPUTING, 2020, 94 (94)
  • [9] Annular Feature Pyramid Network for Salient Object Detection
    Zheng, Tao
    Li, Bo
    Liu, Jiajia
    2019 ELEVENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI 2019), 2019, : 1 - 6
  • [10] Cross-stage feature fusion and efficient self-attention for salient object detection
    Xia, Xiaofeng
    Ma, Yingdong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104