Understanding holistic human pose using class-specific convolutional neural network

被引:2
|
作者
Shamsafar, Faranak [1 ]
Ebrahimnezhad, Hossein [1 ]
机构
[1] Sahand Univ Technol, Comp Vis Res Lab, Elect Engn Fac, Tabriz, Iran
关键词
Human pose estimation; Holistic pose; RGB images; Unconstrained conditions; Deep learning; Convolutional neural network;
D O I
10.1007/s11042-018-5617-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a method to capture human pose from individual real-world RGB images using a deep learning technique. The current works on estimating human pose by deep learning are designed in a detection or a regression framework, and in a part-based manner. As a new perspective, we introduce a classification scheme for this problem, which reasons the pose holistically. To the best of our knowledge, this is the first work for holistic human pose classification task that owes its feasibility to the great power of convolutional neural networks in feature learning. After training a convolutional neural network to classify the input image to one of the KeyPoses, the final pose is computed as a linear combination of several KeyPoses. In this new holistic classification attitude, the vast and high degree of freedom human pose space is divided into a finite number of subspaces and the convolutional neural network shows promising results in learning the features of each subspace. Empirical results (PCP and PCK rates) demonstrate that the proposed scheme is successfully able to understand human pose (i.e., predict a valid, true and coarse pose) in real-world unconstrained images with challenges like severe occlusion, high articulation, low quality and cluttered background. Furthermore, using the proposed method, the need for defining a complex model (such as appearance model or joints pairwise relations) is relieved. We have also verified a potential application of our proposed method in semantic image retrieval based on human pose.
引用
收藏
页码:23193 / 23225
页数:33
相关论文
共 50 条
  • [41] Human action interpretation using convolutional neural network: a survey
    Malik, Zainab
    Bin Shapiai, Mohd Ibrahim
    MACHINE VISION AND APPLICATIONS, 2022, 33 (03)
  • [42] Task-specific word identification from short texts using a convolutional neural network
    Yuan, Shuhan
    Wu, Xintao
    Xiang, Yang
    INTELLIGENT DATA ANALYSIS, 2018, 22 (03) : 533 - 550
  • [43] Understanding the Relationship Between Image Quality and Convolutional Neural Network Performance
    Bergstrom, Austin C.
    Messinger, David W.
    PATTERN RECOGNITION AND TRACKING XXXIII, 2022, 12101
  • [44] Simultaneous Space Object Recognition and Pose Estimation by Convolutional Neural Network
    Afshar, Roya
    Chu, Zhongyi
    Lu, Shuai
    PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB2020), 2020, : 490 - 495
  • [45] Automatic human identification from panoramic dental radiographs using the convolutional neural network
    Fan, Fei
    Ke, Wenchi
    Wu, Wei
    Tian, Xuemei
    Lyu, Tu
    Liu, Yuanyuan
    Liao, Peixi
    Dai, Xinhua
    Chen, Hu
    Deng, Zhenhua
    FORENSIC SCIENCE INTERNATIONAL, 2020, 314
  • [46] Total Recall: Understanding Traffic Signs using Deep Convolutional Neural Network
    Saha, Sourajit
    Kamran, Sharif Amit
    Sabbir, Ali Shihab
    2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
  • [47] 3D Human Pose Machine with a ToF Sensor using Pre-trained Convolutional Neural Networks
    Kim, Jong-Sung
    Kwon, Seung-Joon
    2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 1018 - 1020
  • [48] GLUENet: Ultrasound Elastography Using Convolutional Neural Network
    Kibria, Md Golam
    Rivaz, Hassan
    SIMULATION, IMAGE PROCESSING, AND ULTRASOUND SYSTEMS FOR ASSISTED DIAGNOSIS AND NAVIGATION, 2018, 11042 : 21 - 28
  • [49] Image Compressed Sensing Using Convolutional Neural Network
    Shi, Wuzhen
    Jiang, Feng
    Liu, Shaohui
    Zhao, Debin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 375 - 388
  • [50] Pavement Crack Detection using Convolutional Neural Network
    Nhung Thi Hong Nguyen
    Thanh Ha Le
    Perry, Stuart
    Thi Thuy Nguyen
    PROCEEDINGS OF THE NINTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2018), 2018, : 251 - 256