Self-Supervised Learning of 3D Human Pose using Multi-view Geometry

Cited by: 206
Authors
Kocabas, Muhammed [1]
Karagoz, Salih [1]
Akbas, Emre [1]
Affiliations
[1] Middle East Tech Univ, Dept Comp Engn, Ankara, Turkey
Source
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019
DOI
10.1109/CVPR.2019.00117
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Training accurate 3D human pose estimators requires a large amount of 3D ground-truth data, which is costly to collect. Various weakly or self-supervised pose estimation methods have been proposed due to the lack of 3D data. Nevertheless, these methods, in addition to 2D ground-truth poses, require either additional supervision in various forms (e.g., unpaired 3D ground-truth data, a small subset of labels) or the camera parameters in multi-view settings. To address these problems, we present EpipolarPose, a self-supervised learning method for 3D human pose estimation that does not need any 3D ground-truth data or camera extrinsics. During training, EpipolarPose estimates 2D poses from multi-view images and then uses epipolar geometry to obtain a 3D pose and camera geometry, which are subsequently used to train a 3D pose estimator. We demonstrate the effectiveness of our approach on standard benchmark datasets (i.e., Human3.6M and MPI-INF-3DHP), where we set the new state of the art among weakly/self-supervised methods. Furthermore, we propose a new performance measure, Pose Structure Score (PSS), a scale-invariant, structure-aware measure that evaluates the structural plausibility of a pose with respect to its ground truth. Code and pretrained models are available at http://github.com/mkocabas/Epipolarpose
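The step in which 2D poses from several views are lifted to a 3D pose can be illustrated with standard two-view linear (DLT) triangulation. The sketch below is not the authors' implementation: it assumes the two 3x4 projection matrices are already known, whereas the abstract states that the method works without camera extrinsics. The function names `triangulate_joints` and `project`, the toy camera setup, and the joint coordinates are all hypothetical and chosen only for illustration.

```python
import numpy as np


def triangulate_joints(P1, P2, joints1, joints2):
    """Linear (DLT) triangulation of matched 2D joints seen in two views.

    P1, P2           : (3, 4) camera projection matrices (assumed known here).
    joints1, joints2 : (J, 2) pixel coordinates of the same J joints.
    Returns a (J, 3) array of 3D joint positions.
    """
    points = []
    for (u1, v1), (u2, v2) in zip(joints1, joints2):
        # Each view contributes two linear constraints on the homogeneous point X.
        A = np.stack([
            u1 * P1[2] - P1[0],
            v1 * P1[2] - P1[1],
            u2 * P2[2] - P2[0],
            v2 * P2[2] - P2[1],
        ])
        # The solution is the right singular vector with the smallest singular value.
        _, _, vt = np.linalg.svd(A)
        X = vt[-1]
        points.append(X[:3] / X[3])
    return np.array(points)


def project(P, X):
    """Project (J, 3) points with a (3, 4) projection matrix to (J, 2) pixels."""
    Xh = np.hstack([X, np.ones((len(X), 1))])
    x = Xh @ P.T
    return x[:, :2] / x[:, 2:3]


# Toy two-camera setup (hypothetical intrinsics and relative pose).
K = np.array([[1000.0, 0.0, 500.0],
              [0.0, 1000.0, 500.0],
              [0.0, 0.0, 1.0]])
theta = np.deg2rad(30.0)
R = np.array([[np.cos(theta), 0.0, np.sin(theta)],
              [0.0, 1.0, 0.0],
              [-np.sin(theta), 0.0, np.cos(theta)]])
t = np.array([[-1.0], [0.0], [0.5]])
P1 = K @ np.hstack([np.eye(3), np.zeros((3, 1))])
P2 = K @ np.hstack([R, t])

# Three made-up joints in front of both cameras.
joints_3d = np.array([[0.0, 0.0, 4.0],
                      [0.2, -0.5, 4.1],
                      [-0.3, 0.6, 3.9]])

# Simulate noise-free 2D detections in each view, then recover the 3D joints.
recovered = triangulate_joints(P1, P2, project(P1, joints_3d), project(P2, joints_3d))
```

With noise-free 2D joints, `recovered` matches `joints_3d` up to numerical precision. With real 2D detections the result is only a least-squares estimate, and the camera geometry would itself have to be estimated from the 2D correspondences.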
Pages: 1077-1086
Page count: 10