FingerPoseNet: A finger-level multitask learning network with residual feature sharing for 3D hand pose estimation

被引:0
|
作者
Tewolde, Tekie Tsegay [1 ]
Manjotho, Ali Asghar [1 ]
Sarker, Prodip Kumar [1 ,3 ]
Niu, Zhendong [1 ,2 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci Technol, Beijing 100081, Peoples R China
[2] Minist Educ, Engn Res Ctr Integrat & Applicat Digital Learning, Beijing, Peoples R China
[3] Begum Rokeya Univ, Dept Comp Sci & Engn, Rangpur, Bangladesh
基金
中国国家自然科学基金;
关键词
Hand pose estimation; Information sharing; Multitask learning; Virtual reality; User behavior modeling; REGRESSION;
D O I
10.1016/j.neunet.2025.107315
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hand pose estimation approaches commonly rely on shared hand feature maps to regress the 3D locations of all hand joints. Subsequently, they struggle to enhance finger-level features which are invaluable in capturing joint-to-finger associations and articulations. To address this limitation, we propose a finger-level multitask learning network with residual feature sharing, named FingerPoseNet, for accurate 3D hand pose estimation from a depth image. FingerPoseNet comprises three stages: (a) a shared base feature map extraction backbone based on pre-trained ResNet-50; (b) a finger-level multitask learning stage that extracts and enhances feature maps for each finger and the palm; and (c) a multitask fusion layer for consolidating the estimation results obtained by each subtask. We exploit multitask learning by decoupling the hand pose estimation task into six subtasks dedicated to each finger and palm. Each subtask is responsible for subtask-specific feature extraction, enhancement, and 3D keypoint regression. To enhance subtask-specific features, we propose a residual feature- sharing approach scaled up to mine supplementary information from all subtasks. Experiments performed on five challenging public hand pose datasets, including ICVL, NYU, MSRA, Hands-2019-Task1, and HO3D-v3 demonstrate significant improvements in accuracy compared with state-of-the-art approaches.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Region ensemble network: Towards good practices for deep 3D hand pose estimation
    Wang, Guijin
    Chen, Xinghao
    Guo, Hengkai
    Zhang, Cairong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 55 : 404 - 414
  • [22] Coot optimization based Enhanced Global Pyramid Network for 3D hand pose estimation
    Malavath, Pallavi
    Devarakonda, Nagaraju
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (04):
  • [23] 3D hand pose estimation using RGBD images and hybrid deep learning networks
    Mofarreh-Bonab, Mohammad
    Seyedarabi, Hadi
    Mozaffari Tazehkand, Behzad
    Kasaei, Shohreh
    VISUAL COMPUTER, 2022, 38 (06) : 2023 - 2032
  • [24] A Comprehensive Study on Deep Learning-Based 3D Hand Pose Estimation Methods
    Chatzis, Theocharis
    Stergioulas, Andreas
    Konstantinidis, Dimitrios
    Dimitropoulos, Kosmas
    Daras, Petros
    APPLIED SCIENCES-BASEL, 2020, 10 (19): : 1 - 27
  • [25] SO-HandNet: Self-Organizing Network for 3D Hand Pose Estimation with Semi-supervised Learning
    Chen, Yujin
    Tu, Zhigang
    Ge, Liuhao
    Zhang, Dejun
    Chen, Ruizhi
    Yuan, Junsong
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6960 - 6969
  • [26] Multi-virtual View Scoring Network for 3D Hand Pose Estimation from a Single Depth Image
    Tian, Yimeng
    Li, Chen
    Tian, Lihua
    ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023, 2024, 1998 : 147 - 164
  • [27] A Normalization Strategy for Weakly Supervised 3D Hand Pose Estimation
    Guo, Zizhao
    Li, Jinkai
    Tan, Jiyong
    APPLIED SCIENCES-BASEL, 2024, 14 (09):
  • [28] mmHand: 3D Hand Pose Estimation Leveraging mmWave Signals
    Kong, Hao
    Lyu, Haoxin
    Yu, Jiadi
    Kong, Linghe
    Yang, Junlin
    Ren, Yanzhi
    Liu, Hongbo
    Chen, Yi-Chao
    2024 IEEE 44TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS 2024, 2024, : 1062 - 1073
  • [29] Three-Dimensional Human Hand Pose Estimation Based on Finger-Point Reinforcement and Multi-Level Feature Fusion
    Zhang Kaiyi
    Hong Ru
    Gai Shaoyan
    Da Feipeng
    ACTA OPTICA SINICA, 2022, 42 (19)
  • [30] Mobile robot control using 3D hand pose estimation
    Hoshino, Kiyoshi
    Kasahara, Takuya
    Igo, Naoki
    Tomida, Motomasa
    Tanimoto, Takanobu
    Mukai, Toshimitsu
    Brossard, Gilles
    Kotani, Hajime
    TENTH INTERNATIONAL CONFERENCE ON QUALITY CONTROL BY ARTIFICIAL VISION, 2011, 8000