Learnable Triangulation of Human Pose

被引:288
作者
Iskakov, Karim [1 ]
Burkov, Egor [1 ,2 ]
Lempitsky, Victor [1 ,2 ]
Malkov, Yury [1 ]
机构
[1] Samsung AI Ctr, Moscow, Russia
[2] Skolkovo Inst Sci & Technol, Moscow, Russia
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
关键词
D O I
10.1109/ICCV.2019.00781
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present two novel solutions for multi-view 3D human pose estimation based on new learnable triangulation methods that combine 3D information from multiple 2D views. The first (baseline) solution is a basic differentiable algebraic triangulation with an addition of confidence weights estimated from the input images. The second solution is based on a novel method of volumetric aggregation from intermediate 2D backbone feature maps. The aggregated volume is then refined via 3D convolutions that produce final 3D joint heatmaps and allow implicit modelling a human pose prior. Crucially, both approaches are end-to-end differentiable, which allows us to directly optimize the target metric. We demonstrate transferability of the solutions across datasets and considerably improve the multiview state of the art on the Human3.6M dataset. Video demonstration, annotations and additional materials will be posted on our project page(1).
引用
收藏
页码:7717 / 7726
页数:10
相关论文
共 27 条
[1]  
[Anonymous], 2017, IEEE T PATTERN ANAL
[2]  
Hartley R., 2003, MULTIPLE VIEW GEOMET, DOI 10.1016/S0143-8166(01)00145-2
[3]   Exploiting Temporal Information for 3D Human Pose Estimation [J].
Hossain, Mir Rayat Imtiaz ;
Little, James J. .
COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 :69-86
[4]   Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments [J].
Ionescu, Catalin ;
Papava, Dragos ;
Olaru, Vlad ;
Sminchisescu, Cristian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (07) :1325-1339
[5]  
Jaderberg M., 2015, Neural Inf. Process. Syst., V28, P2017, DOI DOI 10.48550/ARXIV.1506.02025
[6]  
Kadkhodamohammadi Abdolrahim, 2018, GEN APPROACH MULTIVI
[7]  
Kar Abhishek, 2017, NIPS
[8]   Multi-view Body Part Recognition with Random Forests [J].
Kazemi, Vahid ;
Burenius, Magnus ;
Azizpour, Hossein ;
Sullivan, Josephine .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
[9]   Self-Supervised Learning of 3D Human Pose using Multi-view Geometry [J].
Kocabas, Muhammed ;
Karagoz, Salih ;
Akbas, Emre .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1077-1086
[10]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755