Quantification of Occlusion Handling Capability of a 3D Human Pose Estimation Framework

被引：11

作者：

Ghafoor, Mehwish ^{[1
]}

Mahmood, Arif ^{[1
]}

机构：

[1] Informat Technol Univ, Dept Comp Sci, Lahore 54600, Pakistan

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2023年 / 25卷

关键词：

Action classification; human pose estimation; occlusion aware networks; occlusion handling quantification; temporal dilated CNN;

D O I：

10.1109/TMM.2022.3158068

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

3D human pose estimation using monocular images is an important yet challenging task. Existing 3D pose detection methods exhibit excellent performance under normal conditions however their performance may degrade due to occlusion. Recently some occlusion aware methods have also been proposed, however, the occlusion handling capability of these networks has not yet been thoroughly investigated. In the current work, we propose an occlusion-guided 3D human pose estimation framework and quantify its occlusion handling capability by using different protocols. The proposed method estimates more accurate 3D human poses using 2D skeletons with missing joints as input. Missing joints are handled by introducing occlusion guidance that provides extra information about the absence or presence of a joint. Temporal information has also been exploited to better estimate the missing joints. A large number of experiments are performed for the quantification of occlusion handling capability of the proposed method on three publicly available datasets in various settings including random missing joints, fixed body parts missing, and complete frames missing, using mean per joint position error criterion. In addition to that, the quality of the predicted 3D poses is also evaluated using action classification performance as a criterion. 3D poses estimated by the proposed method achieved significantly improved action recognition performance in the presence of missing joints. Our experiments demonstrate the effectiveness of the proposed framework for handling the missing joints as well as quantification of the occlusion handling capability of the deep neural networks.

引用

页码：3311 / 3318

页数：8

共 33 条

[1] 2D Pose-Based Real-Time Human Action Recognition With Occlusion-Handling
Angelini, Federico
Fu, Zeyu
Long, Yang
Shao, Ling
Naqvi, Syed Mohsen
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (06) : 1433 - 1446
[2] Pose-Guided Tracking-by-Detection: Robust Multi-Person Pose Tracking
Bao, Qian
Liu, Wu
Cheng, Yuhao
Zhou, Boyan
Mei, Tao
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 161 - 175
[3] OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields
Cao, Zhe
Hidalgo, Gines
Simon, Tomas
Wei, Shih-En
Sheikh, Yaser
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) : 172 - 186
[4] Unsupervised 3D Pose Estimation with Geometric Self-Supervision
Chen, Ching-Hang
Tyagi, Ambrish
Agrawal, Amit
Drover, Dylan
Rohith, M., V
Stojanov, Stefan
Rehg, James M.
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5707 - 5717
[5] Anatomy-Aware 3D Human Pose Estimation With Bone-Based Pose Decomposition
Chen, Tianlang
Fang, Chen
Shen, Xiaohui
Zhu, Yiheng
Chen, Zhili
Luo, Jiebo
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (01) : 198 - 209
[6] Cheng Y., 2021, P AAAI C ART INT
[7] Cheng Y, 2020, AAAI CONF ARTIF INTE, V34, P10631
[8] Occlusion-Aware Networks for 3D Human Pose Estimation in Video
Cheng, Yu
Yang, Bo
Wang, Bo
Yan, Wending
Tan, Robby T.
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 723 - 732
[9] Das Srijan, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12354), P72, DOI 10.1007/978-3-030-58545-7_5
[10] Fang HS, 2018, AAAI CONF ARTIF INTE, P6821

← 1 2 3 4 →