Lightweight skeleton-based action recognition model based on global-local feature extraction and fusion

被引:0
|
作者
Deng, Zhe [1 ]
Wang, Yulin [1 ]
Wei, Xing [2 ,3 ,4 ]
Yang, Fan [2 ,5 ]
Zhao, Chong [4 ]
Lu, Yang [2 ]
机构
[1] Hefei Univ Technol, Sch Mech Engn, Hefei 230009, Anhui, Peoples R China
[2] Hefei Univ Technol, Sch Comp & Informat, Hefei 230601, Anhui, Peoples R China
[3] Hefei Univ Technol, Intelligent Mfg Technol Res Inst, Hefei 230051, Anhui, Peoples R China
[4] Minist Educ, Engn Res Ctr Safety Crit Ind Measurement & Control, Hefei 230009, Anhui, Peoples R China
[5] Anhui Mine IOT & Secur Monitoring Technol Key Lab, Hefei 230088, Anhui, Peoples R China
关键词
Graph convolutional neural networks; Local feature extraction; Feature fusion; Shift graph convolution; GRAPH; NETWORKS; GCN;
D O I
10.1007/s13042-024-02347-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Skeleton-based action recognition has become a research hotspot in the field of computer vision because of its lightweight and strong anti-interference. However, there are disadvantages such as single feature extraction, limited expression ability, and low recognition accuracy. To solve these problems, we propose a lightweight Skeleton-based action recognition model based on global-local feature extraction and fusion (GLF-GCN). GLF-GCN includes a Feature extraction of non-connected nodes Module (Global-GCN), a Feature extraction of adjacent nodes Module (Local-GCN), and a Dynamic Fusion module. More specifically, Global-GCN combines one-dimensional convolution and shift operations to capture spatio-temporal dependencies across global nodes, using shift operations as a replacement for spatio-temporal graph convolution to reduce computational complexity. Meanwhile, Local-GCN captures temporal and spatial local information from first-order neighboring nodes. On this basis, Dynamic Fusion integrates global information based on joint hierarchy and local information based on body parts to discern the varying dependency relationships among different body parts and joints, improving the model's ability to interpret different skeleton action sequences. The experimental results on single stream and multi-stream data show that the proposed model has higher accuracy, which attains the state-of-the-art performance.
引用
收藏
页码:1477 / 1488
页数:12
相关论文
共 50 条
  • [41] Glimpse and focus: Global and local-scale graph convolution network for skeleton-based action recognition
    Gao, Xuehao
    Du, Shaoyi
    Yang, Yang
    NEURAL NETWORKS, 2023, 167 : 551 - 558
  • [42] Local and global self-attention enhanced graph convolutional network for skeleton-based action recognition
    Wu, Zhize
    Ding, Yue
    Wan, Long
    Li, Teng
    Nian, Fudong
    PATTERN RECOGNITION, 2025, 159
  • [43] A Lightweight Architecture Attentional Shift Graph Convolutional Network for Skeleton-Based Action Recognition
    Li, Xianshan
    Kang, Jingwen
    Yang, Yang
    Zhao, Fengda
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2023, 18 (03)
  • [44] Spatio-temporal stacking model for skeleton-based action recognition
    Yufeng Zhong
    Qiuyan Yan
    Applied Intelligence, 2022, 52 : 12116 - 12130
  • [45] Fast Temporal Graph Convolutional Model for Skeleton-Based Action Recognition
    Nan, Mihai
    Florea, Adina Magda
    SENSORS, 2022, 22 (19)
  • [46] Lightweight channel-topology based adaptive graph convolutional network for skeleton-based action recognition
    Wang K.
    Deng H.
    Zhu Q.
    Neurocomputing, 2023, 560
  • [47] A Cuboid CNN Model With an Attention Mechanism for Skeleton-Based Action Recognition
    Zhu, Kaijun
    Wang, Ruxin
    Zhao, Qingsong
    Cheng, Jun
    Tao, Dapeng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (11) : 2977 - 2989
  • [48] Global Co-occurrence Feature Learning and Active Coordinate System Conversion for Skeleton-based Action Recognition
    Li, Sheng
    Jiang, Tingting
    Huang, Tiejun
    Tian, Yonghong
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 575 - 583
  • [49] BODY PART LEVEL ATTENTION MODEL FOR SKELETON-BASED ACTION RECOGNITION
    Zhang, Han
    Song, Yonghong
    Zhang, Yuanlin
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 4297 - 4302
  • [50] Spatio-temporal stacking model for skeleton-based action recognition
    Zhong, Yufeng
    Yan, Qiuyan
    APPLIED INTELLIGENCE, 2022, 52 (11) : 12116 - 12130