Skeleton-based human action recognition by fusing attention based three-stream convolutional neural network and SVM

被引:2
作者
Ren, Fang [1 ]
Tang, Chao [1 ]
Tong, Anyang [1 ]
Wang, Wenjian [2 ]
机构
[1] Hefei Univ, Sch Artificial Intelligence & Big Data, Hefei, Peoples R China
[2] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan, Peoples R China
关键词
Skeleton-based human action recognition; Convolutional neural network; Attention mechanism; Support vector machine; Spatial-temporal feature; RECOMMENDATION SYSTEM; VISION;
D O I
10.1007/s11042-023-15334-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work proposes a method, aiming the 3D skeleton sequence, for the human action recognition by fusing the attention-based three-stream convolutional neural network and support vector machine. The traditional action recognition methods primarily employ RGB video as input. However, RGB video has issues with respect to large data volume, low semanticity, and ease of making the model interfered by irrelevant information such as the background. The efficient and advanced human action information contained in the 3D skeleton sequence facilitates human behavior recognition. First, the information of 3D coordinates, temporal-difference information, and spatial-difference information of joints are extracted from the raw skeleton data, and the above information is input into the respective convolutional neural networks for pre-training. Then, the pre-trained network model extracts the feature containing the spatial-temporal information. Finally, the mixed feature vectors are input into the support vector machine for training and classification. Under the X-View and X-Sub benchmarks, the accuracy on the open dataset NTU RGB+D is 92.6% and 86.7% respectively, demonstrating that the method proposed for incorporating multistream feature learning, feature fusing, and hybrid model can improve the recognition accuracy.
引用
收藏
页码:6273 / 6295
页数:23
相关论文
共 50 条
[31]   Enhanced decoupling graph convolution network for skeleton-based action recognition [J].
Gu, Yue ;
Yu, Qiang ;
Xue, Wanli .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (29) :73289-73304
[32]   Joint Spatiotemporal Collaborative Relationship Network for Skeleton-Based Action Recognition [J].
Lu, Hao ;
Wang, Tingwei .
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT I, 2023, 14086 :775-786
[33]   Improved semantic-guided network for skeleton-based action recognition [J].
Mansouri, Amine ;
Bakir, Toufik ;
Elzaar, Abdellah .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104
[34]   Multi-scale and attention enhanced graph convolution network for skeleton-based violence action recognition [J].
Yang, Huaigang ;
Ren, Ziliang ;
Yuan, Huaqiang ;
Wei, Wenhong ;
Zhang, Qieshi ;
Zhang, Zhaolong .
FRONTIERS IN NEUROROBOTICS, 2022, 16
[35]   Human Action Recognition Fusing Two-Stream Networks and SVM [J].
Tong A. ;
Tang C. ;
Wang W. .
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (09) :863-870
[36]   Algorithm for Skeleton Action Recognition by Integrating Attention Mechanism and Convolutional Neural Networks [J].
Liu, Jianhua .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (08) :604-613
[37]   Skeleton-Based Human Action Recognition with Spatial and Temporal Attention-Enhanced Graph Convolution Networks [J].
Xu, Fen ;
Shi, Pengfei ;
Zhang, Xiaoping .
JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2024, 28 (06) :1367-1379
[38]   Whole and Part Adaptive Fusion Graph Convolutional Networks for Skeleton-Based Action Recognition [J].
Zuo, Qi ;
Zou, Lian ;
Fan, Cien ;
Li, Dongqian ;
Jiang, Hao ;
Liu, Yifeng .
SENSORS, 2020, 20 (24) :1-20
[39]   Multi-Branch Spatial-Temporal Attention Graph Convolution Network for Skeleton-based Action Recognition [J].
Wang, Daoshuai ;
Li, Dewei ;
Guan, Yaonan ;
Wang, Gang ;
Shao, Haibin .
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, :6487-6492
[40]   Target Recognition of Robot Based on Attention Mechanism and Convolutional Neural Network [J].
Li, Hexi ;
Li, Jihua .
PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, :2578-2584