FingerPoseNet: A finger-level multitask learning network with residual feature sharing for 3D hand pose estimation

Authors
Tewolde, Tekie Tsegay [1]
Manjotho, Ali Asghar [1]
Sarker, Prodip Kumar [1,3]
Niu, Zhendong [1,2]
Affiliations
[1] Beijing Inst Technol, Sch Comp Sci Technol, Beijing 100081, Peoples R China
[2] Minist Educ, Engn Res Ctr Integrat & Applicat Digital Learning, Beijing, Peoples R China
[3] Begum Rokeya Univ, Dept Comp Sci & Engn, Rangpur, Bangladesh
Funding
National Natural Science Foundation of China
Keywords
Hand pose estimation; Information sharing; Multitask learning; Virtual reality; User behavior modeling; REGRESSION;
DOI
10.1016/j.neunet.2025.107315
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Hand pose estimation approaches commonly rely on shared hand feature maps to regress the 3D locations of all hand joints. Consequently, they struggle to enhance finger-level features, which are invaluable in capturing joint-to-finger associations and articulations. To address this limitation, we propose a finger-level multitask learning network with residual feature sharing, named FingerPoseNet, for accurate 3D hand pose estimation from a depth image. FingerPoseNet comprises three stages: (a) a shared base feature map extraction backbone based on pre-trained ResNet-50; (b) a finger-level multitask learning stage that extracts and enhances feature maps for each finger and the palm; and (c) a multitask fusion layer for consolidating the estimation results obtained by each subtask. We exploit multitask learning by decoupling the hand pose estimation task into six subtasks, one for each finger and one for the palm. Each subtask is responsible for subtask-specific feature extraction, enhancement, and 3D keypoint regression. To enhance subtask-specific features, we propose a residual feature-sharing approach scaled up to mine supplementary information from all subtasks. Experiments performed on five challenging public hand pose datasets (ICVL, NYU, MSRA, Hands-2019-Task1, and HO3D-v3) demonstrate significant improvements in accuracy compared with state-of-the-art approaches.
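As a rough illustration of the data flow the abstract describes (six subtasks, residual sharing between them, then fusion), the following toy sketch uses plain Python lists in place of feature maps. It is not the authors' implementation. The 21-joint layout, the `residual_share` rule (own features plus a scaled mean of the other subtasks' features), the `alpha` weight, and the `fuse` helper are all assumptions made for this example; the abstract does not specify them.

```python
# Illustrative only: the joint layout, the sharing rule, and all names below
# are assumptions for this sketch, not details taken from the paper.

SUBTASKS = ["palm", "thumb", "index", "middle", "ring", "pinky"]

# Hypothetical 21-joint layout: 1 palm joint plus 4 joints per finger.
JOINT_INDICES = {
    "palm": [0],
    "thumb": [1, 2, 3, 4],
    "index": [5, 6, 7, 8],
    "middle": [9, 10, 11, 12],
    "ring": [13, 14, 15, 16],
    "pinky": [17, 18, 19, 20],
}


def residual_share(features, alpha=0.5):
    """Return each subtask's features plus alpha times the mean of the others'.

    This is one assumed form of residual sharing: the subtask's own features
    are kept (the residual path) and information from the others is added.
    """
    shared = {}
    for name, own in features.items():
        others = [vec for other, vec in features.items() if other != name]
        mean_others = [sum(vals) / len(others) for vals in zip(*others)]
        shared[name] = [o + alpha * m for o, m in zip(own, mean_others)]
    return shared


def fuse(per_subtask_joints, num_joints=21):
    """Scatter each subtask's (x, y, z) predictions into one full-hand pose."""
    pose = [None] * num_joints
    for name, joints in per_subtask_joints.items():
        for idx, xyz in zip(JOINT_INDICES[name], joints):
            pose[idx] = tuple(xyz)
    return pose


# Toy inputs: 2-dimensional "features" and dummy keypoints per subtask.
features = {name: [float(i), 1.0] for i, name in enumerate(SUBTASKS)}
enhanced = residual_share(features)

predictions = {
    name: [(float(j), 0.0, 0.0) for j in JOINT_INDICES[name]]
    for name in SUBTASKS
}
pose = fuse(predictions)
```

In a real network the per-subtask vectors would be feature maps produced from the shared backbone and the keypoints would come from learned regression heads; the sketch only shows how six per-subtask outputs could be enhanced with information from one another and then merged into a single pose.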
Pages: 14