A hybrid network for estimating 3D interacting hand pose from a single RGB image

Authors
Bao, Wenxia [1]
Gao, Qiuyue [1]
Yang, Xianjun [2]
Affiliations
[1] Anhui Univ, Sch Elect & Informat Engn, Hefei 230601, Anhui, Peoples R China
[2] Chinese Acad Sci, Hefei Inst Phys Sci, Hefei 230031, Anhui, Peoples R China
Keywords
3D hand pose estimation; Interacting Hand; Hybrid network; End to end network; TEXT; RECOGNITION; KHATT;
DOI
10.1007/s11760-024-03043-1
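Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809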
Abstract
The estimation of 3D interacting hand pose from a single RGB image is a challenging problem. The hands tend to occlude each other and are self-similar in two-handed interactions. In this study, a simple, accurate end-to-end framework called HybridPoseNet is proposed for estimating 3D interactive hand pose. The hybrid network employs an encoder-decoder architecture. More specifically, the feature encoder is a hybrid structure that combines a convolutional neural network (CNN) with a transformer to accomplish the feature encoding of hand information. An ordinary CNN is employed to extract the local detailed features of a given image, and a vision transformer is used to capture the long-distance spatial interactions between the cross-positional feature vectors. Moreover, the 3D pose decoder is based on left and right network branches, which are fused via a feature enhancement module (FEM). The FEM helps reduce the ambiguity in appearance caused by the self-similarity of the hands. The decoder elevates the 2D pose to the 3D pose by estimating two depth components. The ablation experiments demonstrate the effectiveness of each module in the network. In addition, comprehensive experiments on the InterHand2.6M dataset show that the proposed method outperforms previous state-of-the-art methods for estimating interactive hand pose.
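The final decoding step mentioned in the abstract, lifting a 2D pose to 3D from estimated depth, can be illustrated with a small sketch. This is not the authors' implementation. The abstract does not say what the two depth components are, so the sketch assumes they are a per-joint depth relative to a root joint and an absolute root depth, and it assumes a pinhole camera with known intrinsics. The function name `lift_to_3d` and all parameter names are hypothetical.

```python
from typing import List, Tuple

Point2D = Tuple[float, float]
Point3D = Tuple[float, float, float]


def lift_to_3d(
    joints_2d: List[Point2D],
    rel_depths: List[float],
    root_depth: float,
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[Point3D]:
    """Back-project 2D joints to camera-space 3D points.

    Assumptions (not taken from the paper):
      - rel_depths[i] is the depth of joint i relative to the root joint,
      - root_depth is the absolute depth of the root joint,
      - (fx, fy, cx, cy) are pinhole camera intrinsics in pixels.
    """
    if len(joints_2d) != len(rel_depths):
        raise ValueError("joints_2d and rel_depths must have the same length")

    points: List[Point3D] = []
    for (u, v), dz in zip(joints_2d, rel_depths):
        z = root_depth + dz          # absolute depth of this joint
        x = (u - cx) * z / fx        # inverse pinhole projection, x axis
        y = (v - cy) * z / fy        # inverse pinhole projection, y axis
        points.append((x, y, z))
    return points


# Two joints: the root at the principal point, and one joint 50 px to the right
# that lies 50 depth units farther from the camera than the root.
pose_3d = lift_to_3d(
    joints_2d=[(100.0, 100.0), (150.0, 100.0)],
    rel_depths=[0.0, 50.0],
    root_depth=500.0,
    fx=500.0, fy=500.0, cx=100.0, cy=100.0,
)
```

Splitting depth into a relative and an absolute part is a common choice in single-image pose estimation because relative depth is easier to infer from appearance than metric distance. Whether HybridPoseNet uses exactly this split is not stated in the abstract.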
Pages: 3801-3814
Number of pages: 14
Related papers
50 in total
  • [1] A hybrid network for estimating 3D interacting hand pose from a single RGB image
    Wenxia Bao
    Qiuyue Gao
    Xianjun Yang
    Signal, Image and Video Processing, 2024, 18 : 3801 - 3814
  • [2] Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments
    Xu, Chi
    Zhou, Jun
    Cai, Wendi
    Jiang, Yunkai
    Li, Yongbo
    Liu, Yi
    SENSORS, 2020, 20 (21) : 1 - 22
  • [3] Multiple-Hand 2D Pose Estimation From a Monocular RGB Image
    Mishra, Purnendu
    Sarawadekar, Kishor
    IEEE ACCESS, 2024, 12 : 40722 - 40735
  • [4] 3D Hand Pose Estimation Using Synthetic Data and Weakly Labeled RGB Images
    Cai, Yujun
    Ge, Liuhao
    Cai, Jianfei
    Thalmann, Nadia Magnenat
    Yuan, Junsong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (11) : 3739 - 3753
  • [5] An Improved Approach for 3D Hand Pose Estimation Based on a Single Depth Image and Haar Random Forest
    Kim, Wonggi
    Chun, Junchul
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2015, 9 (08) : 3136 - 3150
  • [6] End-to-end weakly-supervised single-stage multiple 3D hand mesh reconstruction from a single RGB image
    Ren, Jinwei
    Zhu, Jianke
    Zhang, Jialiang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 232
  • [7] Single Image 3D Object Detection and Pose Estimation for Grasping
    Zhu, Menglong
    Derpanis, Konstantinos G.
    Yang, Yinfei
    Brahmbhatt, Samarth
    Zhang, Mabel
    Phillips, Cody
    Lecce, Matthieu
    Daniilidis, Kostas
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014 : 3936 - 3943
  • [8] PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points
    Huang, Siyuan
    Chen, Yixin
    Yuan, Tao
    Qi, Siyuan
    Zhu, Yixin
    Zhu, Song-Chun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [9] 3D-Scene-Former: 3D scene generation from a single RGB image using Transformers
    Chatterjee, Jit
    Vega, Maria Torres
    VISUAL COMPUTER, 2025, 41 (04) : 2875 - 2889
  • [10] Learning a deep network with spherical part model for 3D hand pose estimation
    Chen, Tzu-Yang
    Ting, Pai-Wen
    Wu, Min-Yu
    Fu, Li-Chen
    PATTERN RECOGNITION, 2018, 80 : 1 - 20