Graph-Based CNNs With Self-Supervised Module for 3D Hand Pose Estimation From Monocular RGB

Cited by: 27
Authors
Guo, Shaoxiang [1]
Rigall, Eric [1]
Qi, Lin [1]
Dong, Xinghui [2]
Li, Haiyan [1]
Dong, Junyu [1]
Affiliations
[1] Ocean Univ China, Dept Informat Sci & Technol, Qingdao 266100, Peoples R China
[2] Univ Manchester, Ctr Imaging Sci, Manchester M13 9PT, Lancs, England
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
Three-dimensional displays; Pose estimation; Two dimensional displays; Feature extraction; Cameras; Convolutional neural networks; Solid modeling; Computer vision; hand pose estimation; graph CNNs; self-supervision;
DOI
10.1109/TCSVT.2020.3004453
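Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809
Abstract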
Hand pose estimation in 3D space from a single RGB image is a highly challenging problem due to self-geometric ambiguities, diverse textures, varying viewpoints, and self-occlusions. Existing work shows that a network structure with multi-resolution subnets fused in parallel can more effectively improve the spatial accuracy of 2D pose estimation. Nevertheless, the features extracted by traditional convolutional neural networks cannot efficiently express the unique topological structure of hand key points, which are discrete yet correlated. Some applications of hand pose estimation based on traditional convolutional neural networks have demonstrated that the structural similarity between a graph and the hand key points can improve the accuracy of 3D hand pose regression. In this paper, we design and implement an end-to-end network for predicting 3D hand pose from a single RGB image. We first extract multiple feature maps at different resolutions and fuse them in parallel, and then build a graph-based convolutional neural network (GCN) module to predict the initial 3D hand key points. Next, we use 2D spatial relationships and 3D geometric knowledge to build a self-supervised module that eliminates domain gaps between 2D and 3D space. Finally, the 3D hand pose is calculated by averaging the 3D hand poses from the GCN output and the self-supervised module output. We evaluate the proposed method on two challenging benchmark datasets for 3D hand pose estimation. Experimental results show that the proposed method achieves state-of-the-art performance on these benchmark datasets.
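
To make the last two steps of the abstract concrete, here is a minimal NumPy sketch of one graph-convolution layer over a hand skeleton, followed by the averaging of two 3D pose estimates. It is not the authors' implementation. The 21-joint layout (wrist plus four joints per finger), the symmetric adjacency normalization, the ReLU, and all function names (`build_hand_edges`, `normalized_adjacency`, `graph_conv`, `fuse_poses`) are assumptions made for illustration; the abstract only states that a graph-based module predicts initial key points and that the final pose is the average of the two module outputs.

```python
import numpy as np

NUM_JOINTS = 21  # assumption: wrist + 5 fingers x 4 joints


def build_hand_edges():
    """Hypothetical skeleton: joint 0 is the wrist, each finger is a 4-joint chain."""
    edges = []
    for finger in range(5):
        base = 1 + 4 * finger
        edges.append((0, base))  # wrist to first joint of the finger
        edges.extend((base + i, base + i + 1) for i in range(3))
    return edges


def normalized_adjacency(num_joints, edges):
    """Adjacency with self-loops, scaled as D^{-1/2} (A + I) D^{-1/2} (assumed scheme)."""
    a = np.eye(num_joints)
    for i, j in edges:
        a[i, j] = a[j, i] = 1.0
    d_inv_sqrt = 1.0 / np.sqrt(a.sum(axis=1))
    return a * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def graph_conv(x, a_hat, w):
    """One graph-convolution layer: mix features along skeleton edges, project, ReLU."""
    return np.maximum(a_hat @ x @ w, 0.0)


def fuse_poses(pose_gcn, pose_self_supervised):
    """Element-wise average of two (num_joints, 3) pose estimates."""
    return 0.5 * (pose_gcn + pose_self_supervised)


rng = np.random.default_rng(0)
a_hat = normalized_adjacency(NUM_JOINTS, build_hand_edges())
features = rng.normal(size=(NUM_JOINTS, 16))   # placeholder per-joint features
weights = rng.normal(size=(16, 8))             # placeholder layer weights
hidden = graph_conv(features, a_hat, weights)  # shape (21, 8)

pose_a = rng.normal(size=(NUM_JOINTS, 3))      # stand-in for the GCN output
pose_b = rng.normal(size=(NUM_JOINTS, 3))      # stand-in for the self-supervised output
final_pose = fuse_poses(pose_a, pose_b)        # shape (21, 3)
```

The random arrays stand in for learned features and predictions; in a trained network they would come from the multi-resolution backbone and the two prediction modules.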
Pages: 1514-1525
Page count: 12