Probabilistic Monocular 3D Human Pose Estimation with Normalizing Flows

被引:41
|
作者
Wehrbein, Tom [1 ]
Rudolph, Marco [1 ]
Rosenhahn, Bodo [1 ]
Wandt, Bastian [2 ]
机构
[1] Leibniz Univ Hannover, Hannover, Germany
[2] Univ British Columbia, Vancouver, BC, Canada
关键词
D O I
10.1109/ICCV48922.2021.01101
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D human pose estimation from monocular images is a highly ill-posed problem due to depth ambiguities and occlusions. Nonetheless, most existing works ignore these ambiguities and only estimate a single solution. In contrast, we generate a diverse set of hypotheses that represents the full posterior distribution of feasible 3D poses. To this end, we propose a normalizing flow based method that exploits the deterministic 3D-to-2D mapping to solve the ambiguous inverse 2D-to-3D problem. Additionally, uncertain detections and occlusions are effectively modeled by incorporating uncertainty information of the 2D detector as condition. Further keys to success are a learned 3D pose prior and a generalization of the best-of-M loss. We evaluate our approach on the two benchmark datasets Human3.6M and MPI-INF-3DHP, outperforming all comparable methods in most metrics. The implementation is available on GitHub(1).
引用
收藏
页码:11179 / 11188
页数:10
相关论文
共 50 条
  • [31] Augmented Reality with Human Body Interaction Based on Monocular 3D Pose Estimation
    Lin, Huei-Yung
    Chen, Ting-Wen
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PT I, 2010, 6474 : 321 - 331
  • [32] Multi-Person 3D Human Pose Estimation from Monocular Images
    Dabral, Rishabh
    Gundavarapu, Nitesh B.
    Mitra, Rahul
    Sharma, Abhishek
    Ramakrishnan, Ganesh
    Jain, Arjun
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 405 - 414
  • [33] Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video
    Zhou, Xiaowei
    Zhu, Menglong
    Leonardos, Spyridon
    Derpanis, Konstantinos G.
    Daniilidis, Kostas
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4966 - 4975
  • [34] Towards Alleviating the Modeling Ambiguity of Unsupervised Monocular 3D Human Pose Estimation
    Yu, Zhenbo
    Ni, Bingbing
    Xu, Jingwei
    Wang, Junjie
    Zhao, Chenglong
    Zhang, Wenjun
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8631 - 8640
  • [35] Uncertainty-Aware 3D Human Pose Estimation from Monocular Video
    Zhang, Jinlu
    Chen, Yujin
    Tu, Zhigang
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5102 - 5113
  • [36] CanonPose: Self-Supervised Monocular 3D Human Pose Estimation in the Wild
    Wandt, Bastian
    Rudolph, Marco
    Zell, Petrissa
    Rhodin, Helge
    Rosenhahn, Bodo
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13289 - 13299
  • [37] MonoEye: Monocular Fisheye Camera-based 3D Human Pose Estimation
    Hwang, Dong-Hyun
    Aso, Kohei
    Koike, Hideki
    2019 26TH IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES (VR), 2019, : 988 - 989
  • [38] A monocular 3D human pose estimation approach for virtual character skeleton retargeting
    Yang A.
    Liu G.
    Naeem W.
    Wu D.
    Zhou Y.
    Chen L.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (07) : 9563 - 9574
  • [39] Modeling vs. Learning Approaches for Monocular 3D Human Pose Estimation
    Gong, Wenjuan
    Brauer, Juergen
    Arens, Michael
    Gonzalez, Jordi
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [40] Cascaded Deep Monocular 3D Human Pose Estimation with Evolutionary Training Data
    Li, Shichao
    Ke, Lei
    Pratama, Kevin
    Tai, Yu-Wing
    Tang, Chi-Keung
    Cheng, Kwang-Ting
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6172 - 6182