Spatio-temporal segments attention for skeleton-based action recognition

被引:19
|
作者
Qiu, Helei [1 ]
Hou, Biao [1 ]
Ren, Bo [1 ]
Zhang, Xiaohua [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; Skeleton; Self-attention; Spatio-temporal joints; Feature aggregation; NETWORKS;
D O I
10.1016/j.neucom.2022.10.084
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Capturing the dependencies between joints is critical in skeleton-based action recognition. However, the existing methods cannot effectively capture the correlation of different joints between frames, which is very useful since different body parts (such as the arms and legs in "long jump") between adjacent frames move together. Focus on this issue, a novel spatio-temporal segments attention method is proposed. The skeleton sequence is divided into several segments, and several consecutive frames contained in each segment are encoded. And then an intra-segment self-attention module is proposed to capture the rela-tionship of different joints in consecutive frames. In addition, an inter-segment action attention module is introduced to capture the relationship between segments to enhance the ability to distinguish similar actions. Compared with the state-of-the-art methods, our method achieves better performance on two large-scale datasets. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:30 / 38
页数:9
相关论文
共 50 条
  • [31] Skeleton-based action recognition via spatial and temporal transformer networks
    Plizzari, Chiara
    Cannici, Marco
    Matteucci, Matteo
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 208 (208-209)
  • [32] Sequence Segmentation Attention Network for Skeleton-Based Action Recognition
    Zhang, Yujie
    Cai, Haibin
    ELECTRONICS, 2023, 12 (07)
  • [33] View transform graph attention recurrent networks for skeleton-based action recognition
    Qingqing Huang
    Fengyu Zhou
    Runze Qin
    Yang zhao
    Signal, Image and Video Processing, 2021, 15 : 599 - 606
  • [34] View transform graph attention recurrent networks for skeleton-based action recognition
    Huang, Qingqing
    Zhou, Fengyu
    Qin, Runze
    Zhao, Yang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (03) : 599 - 606
  • [35] Human Action Recognition Algorithm Based on Spatio-Temporal Interactive Attention Model
    Pan Na
    Jiang Min
    Kong Jun
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (18)
  • [36] Skeleton Action Recognition Based on Spatio-temporal Feature Enhanced Graph Convolutional Network
    Cao, Yi
    Wu, Weiguan
    Li, Ping
    Xia, Yu
    Gao, Qingyuan
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (08) : 3022 - 3031
  • [37] Motion Complement and Temporal Multifocusing for Skeleton-Based Action Recognition
    Wu, Cong
    Wu, Xiao-Jun
    Xu, Tianyang
    Shen, Zhongwei
    Kittler, Josef
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 34 - 45
  • [38] Unified Spatio-Temporal Attention Networks for Action Recognition in Videos
    Li, Dong
    Yao, Ting
    Duan, Ling-Yu
    Mei, Tao
    Rui, Yong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (02) : 416 - 428
  • [39] Long-term Spatio-temporal Contrastive Learning framework for Skeleton Action Recognition
    Rustogi, Anshul
    Mukherjee, Snehasis
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [40] Focal and Global Spatial-Temporal Transformer for Skeleton-Based Action Recognition
    Gao, Zhimin
    Wang, Peitao
    Lv, Pei
    Jiang, Xiaoheng
    Liu, Qidong
    Wang, Pichao
    Xu, Mingliang
    Li, Wanqing
    COMPUTER VISION - ACCV 2022, PT IV, 2023, 13844 : 155 - 171