Skeleton-based human action recognition by fusing attention based three-stream convolutional neural network and SVM

被引:2
作者
Ren, Fang [1 ]
Tang, Chao [1 ]
Tong, Anyang [1 ]
Wang, Wenjian [2 ]
机构
[1] Hefei Univ, Sch Artificial Intelligence & Big Data, Hefei, Peoples R China
[2] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan, Peoples R China
关键词
Skeleton-based human action recognition; Convolutional neural network; Attention mechanism; Support vector machine; Spatial-temporal feature; RECOMMENDATION SYSTEM; VISION;
D O I
10.1007/s11042-023-15334-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work proposes a method, aiming the 3D skeleton sequence, for the human action recognition by fusing the attention-based three-stream convolutional neural network and support vector machine. The traditional action recognition methods primarily employ RGB video as input. However, RGB video has issues with respect to large data volume, low semanticity, and ease of making the model interfered by irrelevant information such as the background. The efficient and advanced human action information contained in the 3D skeleton sequence facilitates human behavior recognition. First, the information of 3D coordinates, temporal-difference information, and spatial-difference information of joints are extracted from the raw skeleton data, and the above information is input into the respective convolutional neural networks for pre-training. Then, the pre-trained network model extracts the feature containing the spatial-temporal information. Finally, the mixed feature vectors are input into the support vector machine for training and classification. Under the X-View and X-Sub benchmarks, the accuracy on the open dataset NTU RGB+D is 92.6% and 86.7% respectively, demonstrating that the method proposed for incorporating multistream feature learning, feature fusing, and hybrid model can improve the recognition accuracy.
引用
收藏
页码:6273 / 6295
页数:23
相关论文
共 50 条
[21]   Multi-Scale Adaptive Aggregate Graph Convolutional Network for Skeleton-Based Action Recognition [J].
Zheng, Zhiyun ;
Wang, Yizhou ;
Zhang, Xingjin ;
Wang, Junfeng .
APPLIED SCIENCES-BASEL, 2022, 12 (03)
[22]   Part-Wise Adaptive Topology Graph Convolutional Network for Skeleton-Based Action Recognition [J].
Wang, Jiale ;
Zou, Lian ;
Fan, Cien ;
Chi, Ruan .
ELECTRONICS, 2023, 12 (09)
[23]   Lightweight Multiscale Spatio-Temporal Graph Convolutional Network for Skeleton-Based Action Recognition [J].
Zheng, Zhiyun ;
Yuan, Qilong ;
Zhang, Huaizhu ;
Wang, Yizhou ;
Wang, Junfeng .
BIG DATA MINING AND ANALYTICS, 2025, 8 (02) :310-325
[24]   Multi-Term Attention Networks for Skeleton-Based Action Recognition [J].
Diao, Xiaolei ;
Li, Xiaoqiang ;
Huang, Chen .
APPLIED SCIENCES-BASEL, 2020, 10 (15)
[25]   Traffic Sign Recognition Based on SVM And Convolutional Neural Network [J].
Tong Guofeng ;
Chen Huairong ;
Li Yong ;
Zheng Kai .
PROCEEDINGS OF THE 2017 12TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2017, :2066-2071
[26]   BODY PART LEVEL ATTENTION MODEL FOR SKELETON-BASED ACTION RECOGNITION [J].
Zhang, Han ;
Song, Yonghong ;
Zhang, Yuanlin .
2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, :4297-4302
[27]   A Cuboid CNN Model With an Attention Mechanism for Skeleton-Based Action Recognition [J].
Zhu, Kaijun ;
Wang, Ruxin ;
Zhao, Qingsong ;
Cheng, Jun ;
Tao, Dapeng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (11) :2977-2989
[28]   Sequential inter-hop graph convolution neural network (SIhGCN) for skeleton-based human action recognition [J].
Setiawan, Feri ;
Yahya, Bernardo Nugroho ;
Chun, Seok-Ju ;
Lee, Seok-Lyong .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 195
[29]   Pose-Guided Graph Convolutional Networks for Skeleton-Based Action Recognition [J].
Chen, Han ;
Jiang, Yifan ;
Ko, Hanseok .
IEEE ACCESS, 2022, 10 :111725-111731
[30]   Multi-Modality Adaptive Feature Fusion Graph Convolutional Network for Skeleton-Based Action Recognition [J].
Zhang, Haiping ;
Zhang, Xinhao ;
Yu, Dongjin ;
Guan, Liming ;
Wang, Dongjing ;
Zhou, Fuxing ;
Zhang, Wanjun .
SENSORS, 2023, 23 (12)