3D CNN for Human Action Recognition

被引:4
作者
Boualia, Sameh Neili [1 ,2 ]
Ben Amara, Najoua Essoukri [2 ]
机构
[1] Univ Tunis El Manar, Natl Engn Sch Tunis, Tunis 1002, Tunisia
[2] Univ Sousse, Ecole Natl Ingn Sousse, LATIS Lab Adv Technol & Intelligent Syst, Sousse 4023, Tunisia
来源
2021 18TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD) | 2021年
关键词
Human Action Recognition; Deep Learning; 3D CNN;
D O I
10.1109/SSD52085.2021.9429429
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recognizing different human actions from still images or videos is an important research area in the computer vision and artificial intelligence domains. It represents a key step for a wide range of applications including: human-computer interaction, ambient assisted living, intelligent driving and video surveillance. However, unless the many research works being involved, there are still many challenges ahead including: the high changes in human body shapes, clothing and viewpoint changes and the conditions of system acquisition (illumination variations, occlusions, etc). With the emergence of new deep learning techniques, many approaches are recently proposed for Human Action Recognition (HAR). Compared with conventional machine learning methods, deep learning techniques have more powerful learning ability. The most wide-spread deep learning approach is the Convolutional Neural Network (CNN/ConvNets). It has shown remarkable achievements due to its precision and robustness. As a branch of neural network, 3D CNN is a relatively new technique in the field of deep learning. In this paper, we propose a HAR approach based on a 3D CNN modet We apply the developed model to recognize human actions of KTH and J-HMDB datasets, and we achieve state of the art performance in comparison to baseline methods.
引用
收藏
页码:276 / 282
页数:7
相关论文
共 41 条
  • [1] Hand Gesture Recognition for Sign Language Using 3DCNN
    Al-Hammadi, Muneer
    Muhammad, Ghulam
    Abdul, Wadood
    Alsulaiman, Mansour
    Bencherif, Mohamed A.
    Mekhtiche, Mohamed Amine
    [J]. IEEE ACCESS, 2020, 8 : 79491 - 79509
  • [2] [Anonymous], 2016, PROC INT C DIGIT IMA
  • [3] Buitinck L., 2013, ECML PKDD WORKSHOP L, P108
  • [4] P-CNN: Pose-based CNN Features for Action Recognition
    Cheron, Guilhem
    Laptev, Ivan
    Schmid, Cordelia
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3218 - 3226
  • [5] Dollar P., 2005, Proceedings. 2nd Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (VS-PETS) (IEEE Cat. No. 05EX1178), P65
  • [6] Learning Spatiotemporal Features with 3D Convolutional Networks
    Du Tran
    Bourdev, Lubomir
    Fergus, Rob
    Torresani, Lorenzo
    Paluri, Manohar
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4489 - 4497
  • [7] Elhayek A, 2015, PROC CVPR IEEE, P3810, DOI 10.1109/CVPR.2015.7299005
  • [8] Fan XC, 2015, Arxiv, DOI arXiv:1504.07159
  • [9] Gkioxari G, 2014, Arxiv, DOI arXiv:1406.5212
  • [10] Gómez-Cuba F, 2020, IEEE ICC