Pose-based Human Activity Recognition: a review

被引:0
作者
Boualia, Sameh Neili [1 ,2 ]
Ben Amara, Najoua Essoukri [1 ]
机构
[1] Univ Sousse, Ecole Natl Ingenieurs Sousse, LATIS Lab Adv Technol & Intelligent Syst, Sousse 4023, Tunisia
[2] Univ Tunis El Manar, Ecole Natl Ingenieurs Tunis, Tunis 1002, Tunisia
来源
2019 15TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC) | 2019年
关键词
Human Pose Estimation; ConvNets; Deep Learning; Human Activity Recognition;
D O I
10.1109/iwcmc.2019.8766694
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper serves as a survey and empirical evaluation of the state-of-the-art in activity recognition methods using still RGB images and/or videos. Understanding human activities from videos or still images is a challenging task in computer vision domain. Identifying the action or activity being accomplished automatically and then recognizing it represents the prime goal of an intelligent video system. Human Activity Recognition arises in various application domains varying from human computer interfaces, health care monitoring to surveillance and security. Despite the ongoing efforts in the domain, these tasks remained unsolved in unconstrained environments and face many challenges such as occlusions, variations in clothing and background clutter. Recently, numerous deep learning algorithms have been proposed to solve traditional artificial intelligence problems. They have shown great advances, in particular for pose estimation task since they can extract appropriate features while jointly performing discrimination. In this paper, we provide a detailed review of recent and state-of-the-art research advances in the field of human activity recognition. We propose a categorization of human activity methodologies and discuss their advantages and limitations. In particular, we divide feature representation methods into global, local and body modeling. Then, human activity classification approaches are arranged into three categories, which reflect how they model human activities: template-based, generative and discriminative. Moreover, we provide a comprehensive analysis of pose-based human activity recognition where both conventional and deep learning-based human pose estimation approaches are reported. Finally, we discuss the open-challenges in this field and endeavor to provide possible solutions.
引用
收藏
页码:1468 / 1475
页数:8
相关论文
共 80 条
[31]   Cross-view human action recognition from depth maps using spectral graph sequences [J].
Kerola, Tommi ;
Inoue, Nakamasa ;
Shinoda, Koichi .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 154 :108-126
[32]   Application on Integration Technology of Visualized Hierarchical Information [J].
Li, Weibo ;
He, Yang .
2010 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION (PACIIA2010), VOL I, 2010, :9-12
[33]   Rain Streak Removal Using Layer Priors [J].
Li, Yu ;
Tan, Robby T. ;
Guo, Xiaojie ;
Lu, Jiangbo ;
Brown, Michael S. .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2736-2744
[34]  
Lifshitz Ita, 2016, EUR C COMP VIS
[35]   Sparse composition of body poses and atomic actions for human activity recognition in RGB-D videos [J].
Lillo, Ivan ;
Niebles, Juan Carlos ;
Soto, Alvaro .
IMAGE AND VISION COMPUTING, 2017, 59 :63-75
[36]   Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition [J].
Liu, Jun ;
Shahroudy, Amir ;
Xu, Dong ;
Wang, Gang .
COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 :816-833
[37]   Hybrid human detection and recognition in surveillance [J].
Liu, Qiang ;
Zhang, Wei ;
Li, Hongliang ;
Ngan, King Ngi .
NEUROCOMPUTING, 2016, 194 :10-23
[38]  
Luo J., 2013, P IEEE INT C COMP VI
[39]  
Malik, 2014, ARXIV14065212
[40]  
Maninis Kevis-Kokitsi, 2018, P IEEE C COMP VIS PA