Quantification of Occlusion Handling Capability of a 3D Human Pose Estimation Framework

Cited by: 11
Authors
Ghafoor, Mehwish [1]
Mahmood, Arif [1]
Affiliations
[1] Informat Technol Univ, Dept Comp Sci, Lahore 54600, Pakistan
Keywords
Action classification; human pose estimation; occlusion aware networks; occlusion handling quantification; temporal dilated CNN
DOI
10.1109/TMM.2022.3158068
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
3D human pose estimation from monocular images is an important yet challenging task. Existing 3D pose estimation methods perform well under normal conditions; however, their performance may degrade under occlusion. Some occlusion-aware methods have recently been proposed, but the occlusion handling capability of these networks has not yet been thoroughly investigated. In this work, we propose an occlusion-guided 3D human pose estimation framework and quantify its occlusion handling capability using different protocols. The proposed method estimates more accurate 3D human poses from 2D skeletons with missing joints as input. Missing joints are handled by introducing occlusion guidance, which provides extra information about the absence or presence of each joint. Temporal information is also exploited to better estimate the missing joints. To quantify the occlusion handling capability of the proposed method, a large number of experiments are performed on three publicly available datasets in various settings, including random missing joints, fixed body parts missing, and complete frames missing, using the mean per joint position error (MPJPE) criterion. In addition, the quality of the predicted 3D poses is evaluated using action classification performance as a criterion. 3D poses estimated by the proposed method achieve significantly improved action recognition performance in the presence of missing joints. Our experiments demonstrate the effectiveness of the proposed framework both for handling missing joints and for quantifying the occlusion handling capability of deep neural networks.
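To make the evaluation criterion and the "random missing joints" setting concrete, here is a minimal Python sketch. It is not the authors' implementation. The function names, the zero fill for missing joints, and the per-joint presence flag layout `(x, y, flag)` are all assumptions chosen for illustration; the abstract states only that occlusion guidance conveys whether a joint is present or absent.

```python
import math
import random


def mpjpe(pred, target):
    """Mean per joint position error: mean Euclidean distance over joints."""
    if len(pred) != len(target) or not pred:
        raise ValueError("pred and target must be non-empty and equally long")
    return sum(math.dist(p, t) for p, t in zip(pred, target)) / len(pred)


def mask_random_joints(skeleton_2d, num_missing, rng):
    """Simulate occlusion by dropping `num_missing` randomly chosen joints.

    Returns one (x, y, flag) tuple per joint, where flag is 1.0 for a
    present joint and 0.0 for a missing one. The zero fill and the flag
    layout are hypothetical, not taken from the paper.
    """
    missing = set(rng.sample(range(len(skeleton_2d)), num_missing))
    return [
        (0.0, 0.0, 0.0) if j in missing else (x, y, 1.0)
        for j, (x, y) in enumerate(skeleton_2d)
    ]


# Toy 2D skeleton with four joints; two are dropped at random.
skeleton = [(0.1, 0.2), (0.3, 0.4), (0.5, 0.6), (0.7, 0.8)]
guided = mask_random_joints(skeleton, 2, random.Random(0))

# Toy 3D poses: the first joint is off by a 3-4-5 triangle, the second is exact.
ground_truth = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
predicted = [(0.0, 3.0, 4.0), (1.0, 0.0, 0.0)]
error = mpjpe(predicted, ground_truth)
```

In this toy case the per-joint distances are 5 and 0, so `error` is 2.5. The fixed-body-part and complete-frame settings named in the abstract could be sketched the same way by choosing the missing indices deterministically instead of sampling them.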
Pages: 3311-3318
Number of pages: 8