VoxelPose: Towards Multi-camera 3D Human Pose Estimation in Wild Environment

被引:125
作者
Tu, Hanyue [1 ,2 ]
Wang, Chunyu [1 ]
Zeng, Wenjun [1 ]
机构
[1] Microsoft Res Asia, Beijing, Peoples R China
[2] Univ Sci & Technol China, Hefei, Peoples R China
来源
COMPUTER VISION - ECCV 2020, PT I | 2020年 / 12346卷
关键词
3D human pose estimation;
D O I
10.1007/978-3-030-58452-8_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present VoxelPose to estimate 3D poses of multiple people from multiple camera views. In contrast to the previous efforts which require to establish cross-view correspondence based on noisy and incomplete 2D pose estimates, VoxelPose directly operates in the 3D space therefore avoids making incorrect decisions in each camera view. To achieve this goal, features in all camera views are aggregated in the 3D voxel space and fed into Cuboid Proposal Network (CPN) to localize all people. Then we propose Pose Regression Network (PRN) to estimate a detailed 3D pose for each proposal. The approach is robust to occlusion which occurs frequently in practice. Without bells and whistles, it outperforms the previous methods on several public datasets.
引用
收藏
页码:197 / 212
页数:16
相关论文
共 41 条
[1]   Multi-view Pictorial Structures for 3D Human Pose Estimation [J].
Amin, Sikandar ;
Andriluka, Mykhaylo ;
Rohrbach, Marcus ;
Schiele, Bernt .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
[2]  
[Anonymous], 2003, Multiple View Geometry in Computer Vision
[3]   3D Pictorial Structures Revisited: Multiple Human Pose Estimation [J].
Belagiannis, Vasileios ;
Amin, Sikandar ;
Andriluka, Mykhaylo ;
Schiele, Bernt ;
Navab, Nassir ;
Ilic, Slobodan .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (10) :1929-1942
[4]   Multiple Human Pose Estimation with Temporally Consistent 3D Pictorial Structures [J].
Belagiannis, Vasileios ;
Wang, Xinchao ;
Schiele, Bernt ;
Fua, Pascal ;
Ilic, Slobodan ;
Navab, Nassir .
COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 :742-754
[5]   3D Pictorial Structures for Multiple Human Pose Estimation [J].
Belagiannis, Vasileios ;
Amin, Sikandar ;
Andriluka, Mykhaylo ;
Schiele, Bernt ;
Navab, Nassir ;
Ilic, Slobodan .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1669-1676
[6]   Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image [J].
Bogo, Federica ;
Kanazawa, Angjoo ;
Lassner, Christoph ;
Gehler, Peter ;
Romero, Javier ;
Black, Michael J. .
COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 :561-578
[7]  
Bridgeman Lewis, 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Proceedings, P2487, DOI 10.1109/CVPRW.2019.00304
[8]   Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].
Cao, Zhe ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310
[9]   Optimizing Network Structure for 3D Human Pose Estimation [J].
Ci, Hai ;
Wang, Chunyu ;
Ma, Xiaoxuan ;
Wang, Yizhou .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :2262-2271
[10]   Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views [J].
Dong, Junting ;
Jiang, Wen ;
Huang, Qixing ;
Bao, Hujun ;
Zhou, Xiaowei .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7784-7793