Research on Human Upper Limb Action Recognition Method Based on Multimodal Heterogeneous Spatial Temporal Graph Network

Cited by: 0
Authors
Ci, Zelin [1]
Ren, Huizhao [1]
Liu, Jinming [1]
Xie, Songyun [2]
Wang, Wendong [1,3]
Affiliations
[1] Northwestern Polytech Univ, Sch Mech Engn, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Elect Informat Coll, Xian 710072, Peoples R China
[3] Sanhang Civil Mil Integrat Innovat Res Inst, Dongguan 523429, Peoples R China
Source
INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT X | 2025 / Vol. 15210
Keywords
Action Recognition; Graph Neural Network; Temporal Features; Heterogeneous Graph;
DOI
10.1007/978-981-96-0786-0_23
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Graph convolutional networks are increasingly used in human action recognition because they handle spatial topological relations well. Like human action in general, upper limb action has both spatial and temporal features. Graph convolutional networks, however, do not extract temporal features well enough to couple spatial and temporal information. This paper proposes a multimodal heterogeneous spatial-temporal network (MHST-GCN) model based on multimodal information. First, the model introduces a temporal graph based on a hybrid sparsity strategy, which captures both local and global temporal features in sequences of human upper limb actions while keeping computation efficient. Then, a heterogeneous graph model is proposed to fuse information from the two modalities and improve the robustness of the model. Finally, extensive experiments are conducted on two datasets: the standard NTU-RGB+D dataset and a self-collected upper limb action dataset. The experimental results demonstrate the effectiveness of the proposed method.
Pages: 304-318
Page count: 15
Related papers
50 in total
  • [31] HUMAN ACTION RECOGNITION VIA SPATIAL AND TEMPORAL METHODS
    Eroglu, Hulusi
    Gokce, C. Onur
    Ilk, H. Gokhan
    2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 104 - 107
  • [32] Pyramidal Graph Convolutional Network for Skeleton-Based Human Action Recognition
    Li, Fanjia
    Zhu, Aichun
    Liu, Zhongyu
    Huo, Yu
    Xu, Yonggang
    Hua, Gang
    IEEE SENSORS JOURNAL, 2021, 21 (14) : 16183 - 16191
  • [33] Spatial-Temporal Interleaved Network for Efficient Action Recognition
    Jiang, Shengqin
    Zhang, Haokui
    Qi, Yuankai
    Liu, Qingshan
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2025, 21 (01) : 178 - 187
  • [34] Multiple temporal scale aggregation graph convolutional network for skeleton-based action recognition
    Li, Xuanfeng
    Lu, Jian
    Zhou, Jian
    Liu, Wei
    Zhang, Kaibing
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 110
  • [35] Spatial-temporal saliency action mask attention network for action recognition
    Jiang, Min
    Pan, Na
    Kong, Jun
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 71
  • [36] Skeleton Action Recognition Based on Spatio-temporal Feature Enhanced Graph Convolutional Network
    Cao, Yi
    Wu, Weiguan
    Li, Ping
    Xia, Yu
    Gao, Qingyuan
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (08) : 3022 - 3031
  • [37] Extracting hierarchical spatial and temporal features for human action recognition
    Keting Zhang
    Liqing Zhang
    Multimedia Tools and Applications, 2018, 77 : 16053 - 16068
  • [38] Human Action Recognition Using Spatial and Temporal Sequences Alignment
    Li, Yandi
    Zhao, Zhihao
    SECOND INTERNATIONAL CONFERENCE ON OPTICS AND IMAGE PROCESSING (ICOIP 2022), 2022, 12328
  • [39] Extracting hierarchical spatial and temporal features for human action recognition
    Zhang, Keting
    Zhang, Liqing
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (13) : 16053 - 16068
  • [40] Recognition method of basketball players’ shooting action based on graph convolution neural network
    Xu J.
    International Journal of Reasoning-based Intelligent Systems, 2022, 14 (04) : 227 - 232