CrossFuNet: RGB and Depth Cross-Fusion Network for Hand Pose Estimation

Cited by: 5
Authors
Sun, Xiaojing [1]
Wang, Bin [1]
Huang, Longxiang [2]
Zhang, Qian [1]
Zhu, Sulei [1]
Ma, Yan [1]
Affiliations
[1] Shanghai Normal Univ, Coll Informat Mech & Elect Engn, Shanghai 200234, Peoples R China
[2] Shenzhen Guangjian Technol Co Ltd, Shanghai 200135, Peoples R China
Keywords
hand pose estimation; convolutional neural network; RGBD fusion; 3D hand
DOI
10.3390/s21186095
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
Despite recent successes in hand pose estimation from RGB images or depth maps, inherent challenges remain. RGB-based methods suffer from heavy self-occlusions and depth ambiguity. Depth sensors depend heavily on distance and can only be used indoors, which limits the practical application of depth-based methods. These challenges have inspired us to combine the two modalities so that each offsets the shortcomings of the other. In this paper, we propose CrossFuNet, a novel RGB and depth information fusion network to improve the accuracy of 3D hand pose estimation. Specifically, the RGB image and the paired depth map are fed into two separate subnetworks. The resulting feature maps are combined in a fusion module, for which we propose a completely new approach to merging the information from the two modalities. The 3D key-points are then regressed from heatmaps using the common method. We validate our model on two public datasets, and the results show that it outperforms state-of-the-art methods.
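The abstract says that key-points are regressed from heatmaps but does not say how the heatmaps are decoded. As a rough illustration only, the sketch below decodes a single 2D heatmap into sub-pixel coordinates with a soft-argmax, which is one widely used decoding step. The function name `soft_argmax_2d` and the choice of soft-argmax are assumptions made for this example, not details taken from the paper, and the depth coordinate and the fusion module are not modelled.

```python
import math


def soft_argmax_2d(heatmap):
    """Decode one heatmap (a list of rows of scores) into expected (x, y).

    Hypothetical helper for illustration; not the paper's decoder.
    """
    # Subtract the peak score before exponentiating for numerical stability.
    peak = max(max(row) for row in heatmap)
    weights = [[math.exp(v - peak) for v in row] for row in heatmap]
    total = sum(sum(row) for row in weights)

    # Expected column index (x) and row index (y) under the softmax weights.
    x = sum(w * col for row in weights for col, w in enumerate(row)) / total
    y = sum(w * r for r, row in enumerate(weights) for w in row) / total
    return x, y


# Usage: an 8x8 heatmap with one sharp peak at row 3, column 5.
heatmap = [[-50.0] * 8 for _ in range(8)]
heatmap[3][5] = 50.0
x, y = soft_argmax_2d(heatmap)
```

With a sharply peaked heatmap the result is close to the peak location. With a flatter heatmap the estimate is pulled toward the centre of mass of the scores, which is the usual trade-off of this kind of decoding.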
Pages: 17