Probabilistic Monocular 3D Human Pose Estimation with Normalizing Flows

被引:41
|
作者
Wehrbein, Tom [1 ]
Rudolph, Marco [1 ]
Rosenhahn, Bodo [1 ]
Wandt, Bastian [2 ]
机构
[1] Leibniz Univ Hannover, Hannover, Germany
[2] Univ British Columbia, Vancouver, BC, Canada
关键词
D O I
10.1109/ICCV48922.2021.01101
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D human pose estimation from monocular images is a highly ill-posed problem due to depth ambiguities and occlusions. Nonetheless, most existing works ignore these ambiguities and only estimate a single solution. In contrast, we generate a diverse set of hypotheses that represents the full posterior distribution of feasible 3D poses. To this end, we propose a normalizing flow based method that exploits the deterministic 3D-to-2D mapping to solve the ambiguous inverse 2D-to-3D problem. Additionally, uncertain detections and occlusions are effectively modeled by incorporating uncertainty information of the 2D detector as condition. Further keys to success are a learned 3D pose prior and a generalization of the best-of-M loss. We evaluate our approach on the two benchmark datasets Human3.6M and MPI-INF-3DHP, outperforming all comparable methods in most metrics. The implementation is available on GitHub(1).
引用
收藏
页码:11179 / 11188
页数:10
相关论文
共 50 条
  • [41] Monocular 3D Human Pose Estimation In The Wild Using Improved CNN Supervision
    Mehta, Dushyant
    Rhodin, Helge
    Casas, Dan
    Fua, Pascal
    Sotnychenko, Oleksandr
    Xu, Weipeng
    Theobalt, Christian
    PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, : 506 - 516
  • [42] Learning with privileged stereo knowledge for monocular absolute 3D human pose estimation
    Bian, Cunling
    Lu, Weigang
    Feng, Wei
    Wang, Song
    PATTERN RECOGNITION LETTERS, 2025, 189 : 143 - 149
  • [43] Frame-Padded Multiscale Transformer for Monocular 3D Human Pose Estimation
    Zhong, Yuanhong
    Yang, Guangxia
    Zhong, Daidi
    Yang, Xun
    Wang, Shanshan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6191 - 6201
  • [44] Monocular Image 3D Human Pose Estimation under Self-Occlusion
    Radwan, Ibrahim
    Dhall, Abhinav
    Goecke, Roland
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1888 - 1895
  • [45] Joint Optimization of the 3D Model and 6D Pose for Monocular Pose Estimation
    Guo, Liangchao
    Chen, Lin
    Wang, Qiufu
    Zhang, Zhuo
    Sun, Xiaoliang
    DRONES, 2024, 8 (11)
  • [46] Multi-View Pose Generator Based on Deep Learning for Monocular 3D Human Pose Estimation
    Sun, Jun
    Wang, Mantao
    Zhao, Xin
    Zhang, Dejun
    SYMMETRY-BASEL, 2020, 12 (07):
  • [47] Ray3D: ray-based 3D human pose estimation for monocular absolute 3D localization
    Zhan, Yu
    Li, Fenghai
    Weng, Renliang
    Choi, Wongun
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13106 - 13115
  • [48] Recovering 3D human pose from monocular images
    Agarwal, A
    Triggs, B
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (01) : 44 - 58
  • [49] A Review of 3D Pose Estimation from a Monocular Image Sequence
    Shan Gan-lin
    Ji Bing
    Zhou Yun-feng
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 1946 - 1950
  • [50] 3D Face pose estimation and tracking from a monocular camera
    Ji, Q
    IMAGE AND VISION COMPUTING, 2002, 20 (07) : 499 - 511