3D CNN for Human Action Recognition

Cited by: 4

Authors:

Boualia, Sameh Neili ^{[1,2]}

Ben Amara, Najoua Essoukri ^{[2]}

Affiliations:

[1] Univ Tunis El Manar, Natl Engn Sch Tunis, Tunis 1002, Tunisia

[2] Univ Sousse, Ecole Natl Ingn Sousse, LATIS Lab Adv Technol & Intelligent Syst, Sousse 4023, Tunisia

Source:

2021 18TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD) | 2021

Keywords:

Human Action Recognition; Deep Learning; 3D CNN;

DOI:

10.1109/SSD52085.2021.9429429

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Recognizing different human actions from still images or videos is an important research area in the computer vision and artificial intelligence domains. It represents a key step for a wide range of applications, including human-computer interaction, ambient assisted living, intelligent driving and video surveillance. However, despite the many research works devoted to it, there are still many challenges ahead, including the large variations in human body shape, clothing and viewpoint, and the acquisition conditions (illumination variations, occlusions, etc.). With the emergence of new deep learning techniques, many approaches have recently been proposed for Human Action Recognition (HAR). Compared with conventional machine learning methods, deep learning techniques have a more powerful learning ability. The most widespread deep learning approach is the Convolutional Neural Network (CNN, or ConvNet). It has shown remarkable achievements due to its precision and robustness. As a branch of neural networks, the 3D CNN is a relatively new technique in the field of deep learning. In this paper, we propose a HAR approach based on a 3D CNN model. We apply the developed model to recognize human actions in the KTH and J-HMDB datasets, and we achieve state-of-the-art performance in comparison to baseline methods.

Citation

Pages: 276-282

Page count: 7
