FingerPoseNet: A finger-level multitask learning network with residual feature sharing for 3D hand pose estimation

Authors
Tewolde, Tekie Tsegay [1]
Manjotho, Ali Asghar [1]
Sarker, Prodip Kumar [1,3]
Niu, Zhendong [1,2]
Affiliations
[1] Beijing Inst Technol, Sch Comp Sci Technol, Beijing 100081, Peoples R China
[2] Minist Educ, Engn Res Ctr Integrat & Applicat Digital Learning, Beijing, Peoples R China
[3] Begum Rokeya Univ, Dept Comp Sci & Engn, Rangpur, Bangladesh
Funding
National Natural Science Foundation of China
Keywords
Hand pose estimation; Information sharing; Multitask learning; Virtual reality; User behavior modeling; Regression
DOI
10.1016/j.neunet.2025.107315
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Hand pose estimation approaches commonly rely on shared hand feature maps to regress the 3D locations of all hand joints. Consequently, they struggle to enhance finger-level features, which are invaluable in capturing joint-to-finger associations and articulations. To address this limitation, we propose a finger-level multitask learning network with residual feature sharing, named FingerPoseNet, for accurate 3D hand pose estimation from a depth image. FingerPoseNet comprises three stages: (a) a shared base feature map extraction backbone based on pre-trained ResNet-50; (b) a finger-level multitask learning stage that extracts and enhances feature maps for each finger and the palm; and (c) a multitask fusion layer for consolidating the estimation results obtained by each subtask. We exploit multitask learning by decoupling the hand pose estimation task into six subtasks, one for each finger and one for the palm. Each subtask is responsible for subtask-specific feature extraction, enhancement, and 3D keypoint regression. To enhance subtask-specific features, we propose a residual feature-sharing approach scaled up to mine supplementary information from all subtasks. Experiments performed on five challenging public hand pose datasets (ICVL, NYU, MSRA, Hands-2019-Task1, and HO3D-v3) demonstrate significant improvements in accuracy compared with state-of-the-art approaches.
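The decoupling into six subtasks and the final consolidation step can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the joint layout, the sharing operator, or the form of the fusion layer. The 21-joint layout in `SUBTASK_JOINTS`, the "add the mean of the other subtasks' features" rule in `share_residual`, and the index-based merge in `fuse_subtasks` are all hypothetical choices made only to show the data flow.

```python
# Hypothetical joint layout (an assumption, not taken from the paper):
# 21 joints, index 0 is the wrist handled by the palm subtask,
# followed by four joints per finger.
SUBTASK_JOINTS = {
    "palm": [0],
    "thumb": [1, 2, 3, 4],
    "index": [5, 6, 7, 8],
    "middle": [9, 10, 11, 12],
    "ring": [13, 14, 15, 16],
    "pinky": [17, 18, 19, 20],
}


def share_residual(features):
    """Add to each subtask's feature vector the mean of the other subtasks' vectors.

    One illustrative reading of "residual feature sharing"; the paper's
    actual operator is not described in the abstract.
    """
    names = list(features)
    dim = len(features[names[0]])
    enhanced = {}
    for name in names:
        others = [features[n] for n in names if n != name]
        mean_others = [sum(vec[k] for vec in others) / len(others) for k in range(dim)]
        # Residual form: the subtask keeps its own features and adds the shared term.
        enhanced[name] = [features[name][k] + mean_others[k] for k in range(dim)]
    return enhanced


def fuse_subtasks(predictions):
    """Place each subtask's (x, y, z) predictions into one full-hand pose list."""
    num_joints = sum(len(ids) for ids in SUBTASK_JOINTS.values())
    pose = [None] * num_joints
    for name, joint_ids in SUBTASK_JOINTS.items():
        for joint_id, xyz in zip(joint_ids, predictions[name]):
            pose[joint_id] = tuple(xyz)
    return pose
```

In the paper the fusion stage is described as a layer, so it is presumably learned; the merge above only shows how per-subtask outputs map back to a single pose.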
Pages: 14