Pose-based Human Activity Recognition: a review

Cited by: 0
Authors
Boualia, Sameh Neili [1, 2]
Ben Amara, Najoua Essoukri [1]
Affiliations
[1] Univ Sousse, Ecole Natl Ingenieurs Sousse, LATIS Lab Adv Technol & Intelligent Syst, Sousse 4023, Tunisia
[2] Univ Tunis El Manar, Ecole Natl Ingenieurs Tunis, Tunis 1002, Tunisia
Source
2019 15TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC) | 2019
Keywords
Human Pose Estimation; ConvNets; Deep Learning; Human Activity Recognition;
DOI
10.1109/iwcmc.2019.8766694
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
This paper serves as a survey and empirical evaluation of the state of the art in activity recognition methods using still RGB images and/or videos. Understanding human activities from videos or still images is a challenging task in the computer vision domain. Automatically identifying and recognizing the action or activity being performed is the prime goal of an intelligent video system. Human Activity Recognition arises in various application domains, ranging from human-computer interfaces and health care monitoring to surveillance and security. Despite ongoing efforts in the domain, these tasks remain unsolved in unconstrained environments and face many challenges, such as occlusions, variations in clothing and background clutter. Recently, numerous deep learning algorithms have been proposed to solve traditional artificial intelligence problems. They have shown great advances, in particular for the pose estimation task, since they can extract appropriate features while jointly performing discrimination. In this paper, we provide a detailed review of recent and state-of-the-art research advances in the field of human activity recognition. We propose a categorization of human activity methodologies and discuss their advantages and limitations. In particular, we divide feature representation methods into global, local and body modeling. Then, human activity classification approaches are arranged into three categories, which reflect how they model human activities: template-based, generative and discriminative. Moreover, we provide a comprehensive analysis of pose-based human activity recognition in which both conventional and deep learning-based human pose estimation approaches are reported. Finally, we discuss the open challenges in this field and endeavor to provide possible solutions.
Pages: 1468-1475
Number of pages: 8