MVHM: A Large-Scale Multi-View Hand Mesh Benchmark for Accurate 3D Hand Pose Estimation

被引：19

作者：

Chen, Liangjian ^{[1
]}

Lin, Shih-Yao ^{[2
]}

Xie, Yusheng ^{[3
]}

Lin, Yen-Yu ^{[4
]}

Xie, Xiaohui ^{[1
]}

机构：

[1] Univ Calif Irvine, Irvine, CA USA

[2] Tencent Amer, Palo Alto, CA USA

[3] Amazon, Palo Alto, CA 94301 USA

[4] Natl Chiao Tung Univ, Hsinchu, Taiwan

来源：

2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021) | 2021年

关键词：

D O I：

10.1109/WACV48630.2021.00088

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Estimating 3D hand poses from a single RGB image is challenging because depth ambiguity leads the problem illposed. Training hand pose estimators with 3D hand mesh annotations and multi-view images often results in significant performance gains. However, existing multi-view datasets are relatively small with hand joints annotated by off-the-shelf trackers or automated through model predictions, both of which may be inaccurate and can introduce biases. Collecting a large-scale multi-view 3D hand pose images with accurate mesh and joint annotations is valuable but strenuous. In this paper, we design a spin match algorithm that enables a rigid mesh model matching with any target mesh ground truth. Based on the match algorithm, we propose an efficient pipeline to generate a largescale multi-view hand mesh (MVHM) dataset with accurate 3D hand mesh and joint labels. We further present a multi-view hand pose estimation approach to verify that training a hand pose estimator with our generated dataset greatly enhances the performance. Experimental results show that our approach achieves the performance of 0.990 in AUC(20-50) on the MHP dataset compared to the previous state-of-the-art of 0.939 on this dataset. Our datasset is available at https://github.com/Kuzphi/MVHM.

引用

页码：836 / 845

页数：10

共 49 条

[1]

[Anonymous], 2017, ICIP

[2]

[Anonymous], 2013, 2 INT C LEARN REPR I

[3]

Anwar Shamama, 2019, NANOELECTRONICS CIRC

[4] Pushing the Envelope for RGB-based Dense 3D Hand Pose Estimation via Neural Rendering [J].

Baek, Seungryul ;

Kim, Kwang In ;

Kim, Tae-Kyun .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1067-1076

[5] 3D Hand Shape and Pose from Images in the Wild [J].

Boukhayma, Adnane ;

de Bem, Rodrigo ;

Torr, Philip H. S. .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :10835-10844

[6] Weakly-Supervised 3D Hand Pose Estimation from Monocular RGB Images [J].

Cai, Yujun ;

Ge, Liuhao ;

Cai, Jianfei ;

Yuan, Junsong .

COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 :678-694

[7] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].

Cao, Zhe ;

Simon, Tomas ;

Wei, Shih-En ;

Sheikh, Yaser .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310

[8] Simultaneous Segmentation and Recognition: Towards more accurate Ego Gesture Recognition [J].

Chalasani, Tejo ;

Smolic, Aljosa .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :4367-4375

[9]

Chen L., 2019, BMVC

[10]

Chen L., 2020, WACV

← 1 2 3 4 5 →