Graph-Based CNNs With Self-Supervised Module for 3D Hand Pose Estimation From Monocular RGB

Cited by: 27
Authors
Guo, Shaoxiang [1]
Rigall, Eric [1]
Qi, Lin [1]
Dong, Xinghui [2]
Li, Haiyan [1]
Dong, Junyu [1]
Affiliations
[1] Ocean Univ China, Dept Informat Sci & Technol, Qingdao 266100, Peoples R China
[2] Univ Manchester, Ctr Imaging Sci, Manchester M13 9PT, Lancs, England
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
Three-dimensional displays; Pose estimation; Two dimensional displays; Feature extraction; Cameras; Convolutional neural networks; Solid modeling; Computer vision; hand pose estimation; graph CNNs; self-supervision;
DOI
10.1109/TCSVT.2020.3004453
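Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract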
Hand pose estimation in 3D space from a single RGB image is a highly challenging problem due to geometric self-ambiguities, diverse textures and viewpoints, and self-occlusions. Existing work shows that a network structure with multi-scale resolution subnets fused in parallel can more effectively improve the spatial accuracy of 2D pose estimation. Nevertheless, the features extracted by traditional convolutional neural networks cannot efficiently express the unique topological structure of hand key points, which are discrete yet correlated. Some applications of hand pose estimation based on traditional convolutional neural networks have demonstrated that the structural similarity between a graph and the hand key points can improve the accuracy of 3D hand pose regression. In this paper, we design and implement an end-to-end network for predicting 3D hand pose from a single RGB image. We first extract multiple feature maps at different resolutions and fuse them in parallel, and then build a graph-based convolutional neural network (GCN) module to predict the initial 3D hand key points. Next, we use 2D spatial relationships and 3D geometric knowledge to build a self-supervised module that eliminates domain gaps between 2D and 3D space. Finally, the final 3D hand pose is calculated by averaging the 3D hand poses output by the GCN and by the self-supervised module. We evaluate the proposed method on two challenging benchmark datasets for 3D hand pose estimation. Experimental results show that the proposed method is effective and achieves state-of-the-art performance on these benchmark datasets.
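The two graph-related steps named in the abstract (a graph convolution over hand key points, and the final averaging of two 3D pose estimates) can be illustrated with a minimal sketch. This is not the authors' implementation. The 21-key-point layout (wrist plus five four-joint fingers), the row-normalized adjacency, and all function names (`hand_edges`, `normalized_adjacency`, `graph_conv`, `average_poses`) are assumptions made for illustration; the record does not specify the skeleton, the normalization, or the layer design.

```python
# Illustrative sketch only: a plain graph-convolution step over an ASSUMED
# 21-key-point hand skeleton, plus the pose averaging described in the abstract.
# Names, skeleton layout, and normalization are hypothetical, not from the paper.

NUM_JOINTS = 21  # assumed: index 0 is the wrist, then 5 fingers x 4 joints


def hand_edges():
    """Return the bone list of the assumed hand skeleton as (parent, child) pairs."""
    edges = []
    for finger in range(5):
        base = 1 + 4 * finger
        edges.append((0, base))  # wrist to the first joint of this finger
        for j in range(base, base + 3):
            edges.append((j, j + 1))  # consecutive joints along the finger
    return edges


def normalized_adjacency(edges, n=NUM_JOINTS):
    """Build a symmetric adjacency with self-loops, then normalize each row to sum to 1."""
    adj = [[0.0] * n for _ in range(n)]
    for i in range(n):
        adj[i][i] = 1.0
    for a, b in edges:
        adj[a][b] = 1.0
        adj[b][a] = 1.0
    return [[v / sum(row) for v in row] for row in adj]


def graph_conv(features, adj, weight):
    """One linear graph-convolution step: out = adj @ features @ weight."""
    n, d_in, d_out = len(features), len(weight), len(weight[0])
    # Aggregate each joint's features with those of its skeleton neighbours.
    agg = [[sum(adj[i][k] * features[k][c] for k in range(n)) for c in range(d_in)]
           for i in range(n)]
    # Apply the shared weight matrix to every joint.
    return [[sum(agg[i][c] * weight[c][o] for c in range(d_in)) for o in range(d_out)]
            for i in range(n)]


def average_poses(pose_a, pose_b):
    """Average two 3D poses joint by joint, as the abstract describes for the final output."""
    return [[(a + b) / 2.0 for a, b in zip(ja, jb)] for ja, jb in zip(pose_a, pose_b)]
```

In this sketch, `pose_a` would stand for the GCN module's prediction and `pose_b` for the self-supervised module's prediction, each given as a list of 21 `[x, y, z]` coordinates.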
Pages: 1514-1525
Page count: 12