Research on Human Upper Limb Action Recognition Method Based on Multimodal Heterogeneous Spatial Temporal Graph Network

Cited by: 0

Authors:

Ci, Zelin [1]

Ren, Huizhao [1]

Liu, Jinming [1]

Xie, Songyun [2]

Wang, Wendong [1,3]

Affiliations:

[1] Northwestern Polytech Univ, Sch Mech Engn, Xian 710072, Peoples R China

[2] Northwestern Polytech Univ, Elect Informat Coll, Xian 710072, Peoples R China

[3] Sanhang Civil Mil Integrat Innovat Res Inst, Dongguan 523429, Peoples R China

Source:

INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT X | 2025, Vol. 15210

Keywords:

Action Recognition; Graph Neural Network; Temporal Features; Heterogeneous Graph

DOI:

10.1007/978-981-96-0786-0_23

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Graph convolutional neural networks are increasingly used in human action recognition because they handle spatial topological relations well. Like human actions in general, upper limb actions contain both spatial and temporal features. However, graph convolutional networks do not extract temporal features well enough to couple them with spatial features. This paper proposes a multimodal heterogeneous spatial-temporal graph network (MHST-GCN) model that draws on multimodal information. First, the model introduces a temporal graph based on a hybrid sparsity strategy, which captures local and global temporal features in sequences of human upper limb actions while keeping computation efficient. Then, a heterogeneous graph model is proposed to fuse information from the two modalities and make the model more robust. Finally, extensive experiments are conducted on two datasets: the standard NTU-RGB+D dataset and a self-collected upper limb action dataset. The experimental results demonstrate the effectiveness of the proposed method.

Pages: 304-318

Page count: 15
