Graph-Based CNNs With Self-Supervised Module for 3D Hand Pose Estimation From Monocular RGB

被引:27
|
作者
Guo, Shaoxiang [1 ]
Rigall, Eric [1 ]
Qi, Lin [1 ]
Dong, Xinghui [2 ]
Li, Haiyan [1 ]
Dong, Junyu [1 ]
机构
[1] Ocean Univ China, Dept Informat Sci & Technol, Qingdao 266100, Peoples R China
[2] Univ Manchester, Ctr Imaging Sci, Manchester M13 9PT, Lancs, England
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Three-dimensional displays; Pose estimation; Two dimensional displays; Feature extraction; Cameras; Convolutional neural networks; Solid modeling; Computer vision; hand pose estimation; graph CNNs; self-supervision;
D O I
10.1109/TCSVT.2020.3004453
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Hand pose estimation in 3D space from a single RGB image is a highly challenging problem due to self-geometric ambiguities, diverse texture, viewpoints, and self-occlusions. Existing work proves that a network structure with multi-scale resolution subnets, fused in parallel can more effectively shows the spatial accuracy of 2D pose estimation. Nevertheless, the features extracted by traditional convolutional neural networks cannot efficiently express the unique topological structure of hand key points based on discrete and correlated properties. Some applications of hand pose estimation based on traditional convolutional neural networks have demonstrated that the structural similarity between the graph and hand key points can improve the accuracy of the 3D hand pose regression. In this paper, we design and implement an end-to-end network for predicting 3D hand pose from a single RGB image. We first extract multiple feature maps from different resolutions and make parallel feature fusion, and then model a graph-based convolutional neural network module to predict the initial 3D hand key points. Next, we use 2D spatial relationships and 3D geometric knowledge to build a self-supervised module to eliminate domain gaps between 2D and 3D space. Finally, the final 3D hand pose is calculated by averaging the 3D hand poses from the GCN output and the self-supervised module output. We evaluate the proposed method on two challenging benchmark datasets for 3D hand pose estimation. Experimental results show the effectiveness of our proposed method that achieves state-of-the-art performance on the benchmark datasets.
引用
收藏
页码:1514 / 1525
页数:12
相关论文
共 50 条
  • [41] Uncertainty-Aware Adaptation for Self-Supervised 3D Human Pose Estimation
    Kundu, Jogendra Nath
    Seth, Siddharth
    Pradyumna, Y. M.
    Jampani, Varun
    Chakraborty, Anirban
    Babu, R. Venkatesh
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 20416 - 20427
  • [42] Exploring self-supervised learning techniques for hand pose estimation
    Dahiya, Aneesh
    Spurr, Adrian
    Hilliges, Otmar
    NEURIPS 2020 WORKSHOP ON PRE-REGISTRATION IN MACHINE LEARNING, VOL 148, 2020, 148 : 255 - 271
  • [43] 3D Human Pose Machines with Self-Supervised Learning
    Wang, Keze
    Lin, Liang
    Jiang, Chenhan
    Qian, Chen
    Wei, Pengxu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (05) : 1069 - 1082
  • [44] SEMI-SUPERVISED LEARNING OF MONOCULAR 3D HAND POSE ESTIMATION FROM MULTI-VIEW IMAGES
    Mueller, Markus
    Poier, Georg
    Possegger, Horst
    Bischof, Horst
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1104 - 1108
  • [45] Occlusion-Aware Self-Supervised Monocular 6D Object Pose Estimation
    Wang, Gu
    Manhardt, Fabian
    Liu, Xingyu
    Ji, Xiangyang
    Tombari, Federico
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1788 - 1803
  • [46] Monocular 3D hand pose estimation based on high-resolution network
    Shengling Li
    Wanjuan Su
    Guansheng Luo
    Jinshan Tian
    Yifei Han
    Liman Liu
    Wenbing Tao
    Advances in Continuous and Discrete Models, 2025 (1):
  • [47] Cascaded Hierarchical CNN for RGB-Based 3D Hand Pose Estimation
    Dai, Shiming
    Liu, Wei
    Yang, Wenji
    Fan, Lili
    Zhang, Jihao
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [48] 3D SCENEFLOWNET: SELF-SUPERVISED 3D SCENE FLOWESTIMATION BASED ON GRAPH CNN
    Lu, Yawen
    Zhu, Yuhao
    Lu, Guoyu
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3647 - 3651
  • [49] Hand PointNet-based 3D Hand Pose Estimation in Egocentric RGB-D Images
    Le, Van-Hung
    Hoang, Van-Nam
    Vu, Hai
    Le, Thi-Lan
    Tran, Thanh-Hai
    Vu, Viet-Vu
    PROCEEDINGS OF 202013TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC 2020), 2020, : 215 - 220
  • [50] 3D interacting hand pose and shape estimation from a single RGB image
    Gao, Chengying
    Yang, Yujia
    Li, Wensheng
    NEUROCOMPUTING, 2022, 474 : 25 - 36