FingerPoseNet: A finger-level multitask learning network with residual feature sharing for 3D hand pose estimation

被引:0
|
作者
Tewolde, Tekie Tsegay [1 ]
Manjotho, Ali Asghar [1 ]
Sarker, Prodip Kumar [1 ,3 ]
Niu, Zhendong [1 ,2 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci Technol, Beijing 100081, Peoples R China
[2] Minist Educ, Engn Res Ctr Integrat & Applicat Digital Learning, Beijing, Peoples R China
[3] Begum Rokeya Univ, Dept Comp Sci & Engn, Rangpur, Bangladesh
基金
中国国家自然科学基金;
关键词
Hand pose estimation; Information sharing; Multitask learning; Virtual reality; User behavior modeling; REGRESSION;
D O I
10.1016/j.neunet.2025.107315
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hand pose estimation approaches commonly rely on shared hand feature maps to regress the 3D locations of all hand joints. Subsequently, they struggle to enhance finger-level features which are invaluable in capturing joint-to-finger associations and articulations. To address this limitation, we propose a finger-level multitask learning network with residual feature sharing, named FingerPoseNet, for accurate 3D hand pose estimation from a depth image. FingerPoseNet comprises three stages: (a) a shared base feature map extraction backbone based on pre-trained ResNet-50; (b) a finger-level multitask learning stage that extracts and enhances feature maps for each finger and the palm; and (c) a multitask fusion layer for consolidating the estimation results obtained by each subtask. We exploit multitask learning by decoupling the hand pose estimation task into six subtasks dedicated to each finger and palm. Each subtask is responsible for subtask-specific feature extraction, enhancement, and 3D keypoint regression. To enhance subtask-specific features, we propose a residual feature- sharing approach scaled up to mine supplementary information from all subtasks. Experiments performed on five challenging public hand pose datasets, including ICVL, NYU, MSRA, Hands-2019-Task1, and HO3D-v3 demonstrate significant improvements in accuracy compared with state-of-the-art approaches.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] QMGR-Net: quaternion multi-graph reasoning network for 3D hand pose estimation
    Ni, Haomin
    Xie, Shengli
    Xu, Pingping
    Fang, Xiaozhao
    Sun, Weijun
    Fang, Ribo
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (12) : 4029 - 4045
  • [42] Learning dynamic relationship between joints for 3D hand pose estimation from single depth map
    Xing, Huiqin
    Yang, Jianyu
    Xiao, Yang
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 92
  • [43] An enhanced self-attention and A2J approach for 3D hand pose estimation
    Ng, Mei-Ying
    Chng, Chin-Boon
    Koh, Wai-Kin
    Chui, Chee-Kong
    Chua, Matthew Chin-Heng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 41661 - 41676
  • [44] CLIP-Hand3D: Exploiting 3D Hand Pose Estimation via Context-Aware Prompting
    Guo, Shaoxiang
    Cai, Qing
    Qi, Lin
    Dong, Junyu
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4896 - 4907
  • [45] 3D Hand Pose Estimation Based on Double Branches with Multi Scale Attention
    Ma S.-L.
    Li J.-H.
    Kong D.-H.
    Wang L.-C.
    Wang S.-F.
    Yin B.-C.
    Jisuanji Xuebao/Chinese Journal of Computers, 2023, 46 (07): : 1383 - 1395
  • [46] An end-to-end framework for unconstrained monocular 3D hand pose estimation
    Sharma, Sanjeev
    Huang, Shaoli
    PATTERN RECOGNITION, 2021, 115
  • [47] SEMI-SUPERVISED LEARNING OF MONOCULAR 3D HAND POSE ESTIMATION FROM MULTI-VIEW IMAGES
    Mueller, Markus
    Poier, Georg
    Possegger, Horst
    Bischof, Horst
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1104 - 1108
  • [48] Cascading CNNs with S-DQN: A Parameter-Parsimonious Strategy for 3D Hand Pose Estimation
    Chen, Mingqi
    Li, Shaodong
    Shuang, Feng
    Luo, Kai
    MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 358 - 369
  • [49] HMTNet: 3D Hand Pose Estimation From Single Depth Image Based on Hand Morphological Topology
    Zhou, Weiguo
    Jiang, Xin
    Chen, Chen
    Mei, Sijia
    Liu, Yun-Hui
    IEEE SENSORS JOURNAL, 2020, 20 (11) : 6004 - 6011
  • [50] Dynamic Hand Gesture Recognition Based on 3D Hand Pose Estimation for Human-Robot Interaction
    Gao, Qing
    Chen, Yongquan
    Ju, Zhaojie
    Liang, Yi
    IEEE SENSORS JOURNAL, 2022, 22 (18) : 17421 - 17430