Skeleton-based human action recognition by fusing attention based three-stream convolutional neural network and SVM

被引：2

作者：

Ren, Fang ^{[1
]}

Tang, Chao ^{[1
]}

Tong, Anyang ^{[1
]}

Wang, Wenjian ^{[2
]}

机构：

[1] Hefei Univ, Sch Artificial Intelligence & Big Data, Hefei, Peoples R China

[2] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2023年 / 83卷 / 2期

关键词：

Skeleton-based human action recognition; Convolutional neural network; Attention mechanism; Support vector machine; Spatial-temporal feature; RECOMMENDATION SYSTEM; VISION;

D O I：

10.1007/s11042-023-15334-9

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This work proposes a method, aiming the 3D skeleton sequence, for the human action recognition by fusing the attention-based three-stream convolutional neural network and support vector machine. The traditional action recognition methods primarily employ RGB video as input. However, RGB video has issues with respect to large data volume, low semanticity, and ease of making the model interfered by irrelevant information such as the background. The efficient and advanced human action information contained in the 3D skeleton sequence facilitates human behavior recognition. First, the information of 3D coordinates, temporal-difference information, and spatial-difference information of joints are extracted from the raw skeleton data, and the above information is input into the respective convolutional neural networks for pre-training. Then, the pre-trained network model extracts the feature containing the spatial-temporal information. Finally, the mixed feature vectors are input into the support vector machine for training and classification. Under the X-View and X-Sub benchmarks, the accuracy on the open dataset NTU RGB+D is 92.6% and 86.7% respectively, demonstrating that the method proposed for incorporating multistream feature learning, feature fusing, and hybrid model can improve the recognition accuracy.

引用

页码：6273 / 6295

页数：23

共 50 条

[41] Structure and Sequencing Preserving Representations for Skeleton-based Action Recognition Relying on Attention Mechanisms [J].

Rouali, Mohamed Lamine ;

Boulahia, Said Yacine ;

Amamra, Abdenour .

JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2023, 95 (08) :1003-1019

[42] Tifar-net: three-stream inception former-based action recognition network for infrared videos [J].

Imran, Javed ;

Rajput, Amitesh Singh ;

Vashisht, Rohit .

SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (02)

[43] Structure and Sequencing Preserving Representations for Skeleton-based Action Recognition Relying on Attention Mechanisms [J].

Mohamed Lamine Rouali ;

Said Yacine Boulahia ;

Abdenour Amamra .

Journal of Signal Processing Systems, 2023, 95 :1003-1019

[44] Skeleton-based action recognition with extreme learning machines [J].

Chen, Xi ;

Koskela, Markus .

NEUROCOMPUTING, 2015, 149 :387-396

[45] Lightweight Graph Convolutional Network For Efficient Skeleton Based Action Recognition [J].

Zhang, Yimeng ;

Yang, Yang ;

Gao, Xuehao .

2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,

[46] AGMS-GCN: Attention-guided multi-scale graph convolutional networks for skeleton-based action recognition [J].

Kilic, Ugur ;

Karadag, Ozge Oztimur ;

Ozyer, Gulsah Tumuklu .

KNOWLEDGE-BASED SYSTEMS, 2025, 311

[47] A Systematic Literature Review of Optimization Methods in Skeleton-Based Human Action Recognition [J].

Chung, Jen-Li ;

Ong, Lee-Yeng ;

Leow, Meng-Chew .

IEEE ACCESS, 2025, 13 :116713-116728

[48] Deep Neural Networks Using Capsule Networks and Skeleton-Based Attentions for Action Recognition [J].

Ha, Manh-Hung ;

Chen, Oscal Tzyh-Chiang .

IEEE ACCESS, 2021, 9 :6164-6178

[49] A 3D graph convolutional networks model for 2D skeleton-based human action recognition [J].

Weng, Libo ;

Lou, Weidong ;

Shen, Xin ;

Gao, Fei .

IET IMAGE PROCESSING, 2023, 17 (03) :773-783

[50] Three-stream network and improved attention mechanism-based blind image quality assessment [J].

Zheng, Jing ;

Cui, Ziguan ;

Gan, Zongliang ;

Tang, Guijin ;

Liu, Feng .

JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (01)

← 1 2 3 4 5 →