In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations

被引：93

作者：

Habibie, Ikhsanul ^{[1
]}

Xu, Weipeng ^{[1
]}

Mehta, Dushyant ^{[1
]}

Pons-Moll, Gerard ^{[1
]}

Theobalt, Christian ^{[1
]}

机构：

[1] Max Planck Inst Informat, Saarland Informat Campus, Saarbrucken, Germany

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年

关键词：

D O I：

10.1109/CVPR.2019.01116

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Convolutional Neural Network based approaches for monocular 3D human pose estimation usually require a large amount of training images with 3D pose annotations. While it is feasible to provide 2D joint annotations for large corpora of in-the-wild images with humans, providing accurate 3D annotations to such in-the-wild corpora is hardly feasible in practice. Most existing 3D labelled data sets are either synthetically created or feature in-studio images. 3D pose estimation algorithms trained on such data often have limited ability to generalize to real world scene diversity. We therefore propose a new deep learning based method for monocular 3D human pose estimation that shows high accuracy and generalizes better to in-the-wild scenes. It has a network architecture that comprises a new disentangled hidden space encoding of explicit 2D and 3D features, and uses supervision by a new learned projection model from predicted 3D pose. Our algorithm can be jointly trained on image data with 3D labels and image data with only 2D labels. It achieves state-of-the-art accuracy on challenging in-the-wild data.

引用

页码：10897 / 10906

页数：10

共 44 条

[1] 2D Human Pose Estimation: New Benchmark and State of the Art Analysis [J].

Andriluka, Mykhaylo ;

Pishchulin, Leonid ;

Gehler, Peter ;

Schiele, Bernt .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3686-3693

[2]

[Anonymous], P 2014 IEEE C COMP V

[3] Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image [J].

Bogo, Federica ;

Kanazawa, Angjoo ;

Lassner, Christoph ;

Gehler, Peter ;

Romero, Javier ;

Black, Michael J. .

COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 :561-578

[4] 3D Human Pose Estimation via Deep Learning from 2D annotations [J].

Brau, Ernesto ;

Jiang, Hao .

PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, :582-591

[5] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].

Cao, Zhe ;

Simon, Tomas ;

Wei, Shih-En ;

Sheikh, Yaser .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310

[6] 3D Human Pose Estimation=2D Pose Estimation plus Matching [J].

Chen, Ching-Hang ;

Ramanan, Deva .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5759-5767

[7]

Dabral Rishabh, 2018, LEARNING 3D HUMAN PO, V2, P4

[8]

Elhayek A, 2015, PROC CVPR IEEE, P3810, DOI 10.1109/CVPR.2015.7299005

[9]

He K., 2016, IEEE C COMPUT VIS PA, DOI [10.1007/978-3-319-46493-0_38, DOI 10.1007/978-3-319-46493-0_38, DOI 10.1109/CVPR.2016.90]

[10]

Huang S, 2016, TOUR ESSENT, V3, P1

← 1 2 3 4 5 →