Research on Human Upper Limb Action Recognition Method Based on Multimodal Heterogeneous Spatial Temporal Graph Network

Cited by: 0
Authors
Ci, Zelin [1]
Ren, Huizhao [1]
Liu, Jinming [1]
Xie, Songyun [2]
Wang, Wendong [1, 3]
Affiliations
[1] Northwestern Polytech Univ, Sch Mech Engn, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Elect Informat Coll, Xian 710072, Peoples R China
[3] Sanhang Civil Mil Integrat Innovat Res Inst, Dongguan 523429, Peoples R China
Source
INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT X | 2025, Vol. 15210
Keywords
Action Recognition; Graph Neural Network; Temporal Features; Heterogeneous Graph
DOI
10.1007/978-981-96-0786-0_23
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Graph convolutional neural networks are increasingly used in human action recognition because they handle spatial topological relations well. Like human actions in general, upper limb actions contain both spatial and temporal features. However, graph convolutional networks do not extract temporal features well enough to couple spatial and temporal features. This paper proposes a multimodal heterogeneous spatial-temporal network (MHST-GCN) model based on multimodal information. First, the model introduces a temporal graph based on a hybrid sparsity strategy, which captures local and global temporal features in sequences of human upper limb actions while keeping computation efficient. Then, a heterogeneous graph model is proposed to fuse information from the two modalities and improve the robustness of the model. Finally, extensive experiments are conducted on two datasets: NTU-RGB+D, a standard benchmark, and a self-collected upper limb action dataset. The experimental results demonstrate the effectiveness of the proposed method.
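The abstract does not specify how the hybrid sparsity strategy is built. As a rough illustration only, the sketch below shows one common way to build a sparse temporal graph that mixes local and global links: each frame connects to its neighbors within a small window, plus frames whose distance is a power of two. The function names, the `local_window` parameter, and the power-of-two rule are all assumptions made for this example, not the paper's method.

```python
def hybrid_sparse_temporal_adjacency(num_frames, local_window=2):
    """Build a symmetric 0/1 adjacency matrix over frames (hypothetical sketch).

    Local links: frames at most `local_window` steps apart (includes self-loops).
    Global links: frames whose temporal gap is a power of two (assumed rule).
    """
    adj = [[0] * num_frames for _ in range(num_frames)]
    for i in range(num_frames):
        for j in range(num_frames):
            gap = abs(i - j)
            is_local = gap <= local_window
            # gap & (gap - 1) == 0 is true exactly when a positive gap is a power of two
            is_global = gap > 0 and (gap & (gap - 1)) == 0
            if is_local or is_global:
                adj[i][j] = 1
    return adj


def density(adj):
    """Fraction of frame pairs that are connected."""
    n = len(adj)
    return sum(map(sum, adj)) / (n * n)


adj = hybrid_sparse_temporal_adjacency(num_frames=16, local_window=2)
```

Under these assumptions each frame keeps roughly `2 * local_window + 2 * log2(num_frames)` links rather than `num_frames`, which is where the efficiency of a sparse temporal graph would come from.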
Pages: 304 - 318
Page count: 15
Related papers
50 in total
  • [41] Multi-Branch Spatial-Temporal Network for Action Recognition
    Wang, Yingying
    Li, Wei
    Tao, Ran
    IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (10) : 1556 - 1560
  • [42] Recurrent Spatial-Temporal Attention Network for Action Recognition in Videos
    Du, Wenbin
    Wang, Yali
    Qiao, Yu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (03) : 1347 - 1360
  • [43] Key Frame Selection for Temporal Graph Optimization of Skeleton-Based Action Recognition
    Hou, Jingyi
    Su, Lei
    Zhao, Yan
    APPLIED SCIENCES-BASEL, 2024, 14 (21):
  • [44] Spatial-temporal interaction learning based two-stream network for action recognition
    Liu, Tianyu
    Ma, Yujun
    Yang, Wenhan
    Ji, Wanting
    Wang, Ruili
    Jiang, Ping
    INFORMATION SCIENCES, 2022, 606 : 864 - 876
  • [45] Skeleton-based action recognition with multi-stream, multi-scale dilated spatial-temporal graph convolution network
    Zhang, Haiping
    Liu, Xu
    Yu, Dongjin
    Guan, Liming
    Wang, Dongjing
    Ma, Conghao
    Hu, Zepeng
    APPLIED INTELLIGENCE, 2023, 53 (14) : 17629 - 17643
  • [46] Skeleton-based action recognition with multi-stream, multi-scale dilated spatial-temporal graph convolution network
    Haiping Zhang
    Xu Liu
    Dongjin Yu
    Liming Guan
    Dongjing Wang
    Conghao Ma
    Zepeng Hu
    Applied Intelligence, 2023, 53 : 17629 - 17643
  • [47] Attention module-based spatial-temporal graph convolutional networks for skeleton-based action recognition
    Kong, Yinghui
    Li, Li
    Zhang, Ke
    Ni, Qiang
    Han, Jungong
    JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (04)
  • [48] Research on gas turbine health assessment method based on physical prior knowledge and spatial-temporal graph neural network
    Cheng, Kanru
    Zhang, Kunyu
    Wang, Yuzhang
    Yang, Chaoran
    Li, Jiao
    Wang, Yueheng
    APPLIED ENERGY, 2024, 367
  • [49] Skeleton Action Recognition Based on Multi-Stream Spatial Attention Graph Convolutional SRU Network
    Zhao J.-N.
    She Q.-S.
    Meng M.
    Chen Y.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (07) : 1579 - 1585
  • [50] Attention-Based Generative Graph Convolutional Network for Skeleton-Based Human Action Recognition
    Yang, Kai
    Ding, Xiaolu
    Chen, Wai
    ICVIP 2019: PROCEEDINGS OF 2019 3RD INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, 2019 : 1 - 6