MVHM: A Large-Scale Multi-View Hand Mesh Benchmark for Accurate 3D Hand Pose Estimation

被引:19
作者
Chen, Liangjian [1 ]
Lin, Shih-Yao [2 ]
Xie, Yusheng [3 ]
Lin, Yen-Yu [4 ]
Xie, Xiaohui [1 ]
机构
[1] Univ Calif Irvine, Irvine, CA USA
[2] Tencent Amer, Palo Alto, CA USA
[3] Amazon, Palo Alto, CA 94301 USA
[4] Natl Chiao Tung Univ, Hsinchu, Taiwan
来源
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021) | 2021年
关键词
D O I
10.1109/WACV48630.2021.00088
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimating 3D hand poses from a single RGB image is challenging because depth ambiguity leads the problem illposed. Training hand pose estimators with 3D hand mesh annotations and multi-view images often results in significant performance gains. However, existing multi-view datasets are relatively small with hand joints annotated by off-the-shelf trackers or automated through model predictions, both of which may be inaccurate and can introduce biases. Collecting a large-scale multi-view 3D hand pose images with accurate mesh and joint annotations is valuable but strenuous. In this paper, we design a spin match algorithm that enables a rigid mesh model matching with any target mesh ground truth. Based on the match algorithm, we propose an efficient pipeline to generate a largescale multi-view hand mesh (MVHM) dataset with accurate 3D hand mesh and joint labels. We further present a multi-view hand pose estimation approach to verify that training a hand pose estimator with our generated dataset greatly enhances the performance. Experimental results show that our approach achieves the performance of 0.990 in AUC(20-50) on the MHP dataset compared to the previous state-of-the-art of 0.939 on this dataset. Our datasset is available at https://github.com/Kuzphi/MVHM.
引用
收藏
页码:836 / 845
页数:10
相关论文
共 49 条
[1]  
[Anonymous], 2017, ICIP
[2]  
[Anonymous], 2013, 2 INT C LEARN REPR I
[3]  
Anwar Shamama, 2019, NANOELECTRONICS CIRC
[4]   Pushing the Envelope for RGB-based Dense 3D Hand Pose Estimation via Neural Rendering [J].
Baek, Seungryul ;
Kim, Kwang In ;
Kim, Tae-Kyun .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1067-1076
[5]   3D Hand Shape and Pose from Images in the Wild [J].
Boukhayma, Adnane ;
de Bem, Rodrigo ;
Torr, Philip H. S. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :10835-10844
[6]   Weakly-Supervised 3D Hand Pose Estimation from Monocular RGB Images [J].
Cai, Yujun ;
Ge, Liuhao ;
Cai, Jianfei ;
Yuan, Junsong .
COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 :678-694
[7]   Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].
Cao, Zhe ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310
[8]   Simultaneous Segmentation and Recognition: Towards more accurate Ego Gesture Recognition [J].
Chalasani, Tejo ;
Smolic, Aljosa .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :4367-4375
[9]  
Chen L., 2019, BMVC
[10]  
Chen L., 2020, WACV