HandGCAT: Occlusion-Robust 3D Hand Mesh Reconstruction from Monocular Images

被引:1
作者
Wang, Shuaibing [1 ,2 ]
Wang, Shunli [1 ,2 ]
Yang, Dingkang [1 ,2 ]
Li, Mingcheng [1 ,2 ]
Qian, Ziyun [1 ,2 ]
Su, Liuzhen [1 ,2 ]
Zhang, Lihua [1 ,2 ,3 ,4 ]
机构
[1] Fudan Univ, Acad Engn & Technol, Shanghai, Peoples R China
[2] IPASS, Inst Meta Med, Shanghai, Peoples R China
[3] Jilin Prov Key Lab Intelligence Sci & Engn, Changchun, Peoples R China
[4] AI & Unmanned Syst Engn Res Ctr Jilin Prov, Changchun, Peoples R China
来源
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME | 2023年
基金
国家重点研发计划;
关键词
3D hand mesh reconstruction; hand-object occlusion; computer vision;
D O I
10.1109/ICME55011.2023.00425
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a robust and accurate method for reconstructing 3D hand mesh from monocular images. This is a very challenging problem, as hands are often severely occluded by objects. Previous works often have disregarded 2D hand pose information, which contains hand prior knowledge that is strongly correlated with occluded regions. Thus, in this work, we propose a novel 3D hand mesh reconstruction network HandGCAT, that can fully exploit hand prior as compensation information to enhance occluded region features. Specifically, we designed the Knowledge-Guided Graph Convolution (KGC) module and the Cross-Attention Transformer (CAT) module. KGC extracts hand prior information from 2D hand pose by graph convolution. CAT fuses hand prior into occluded regions by considering their high correlation. Extensive experiments on popular datasets with challenging hand-object occlusions, such as HO3D v2, HO3D v3, and DexYCB demonstrate that our HandGCAT reaches state-of-the-art performance. The code is available at https://github.com/heartStrive/HandGCAT.
引用
收藏
页码:2495 / 2500
页数:6
相关论文
共 39 条
  • [1] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
    Cao, Zhe
    Simon, Tomas
    Wei, Shih-En
    Sheikh, Yaser
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1302 - 1310
  • [2] DexYCB: A Benchmark for Capturing Hand Grasping of Objects
    Chao, Yu-Wei
    Yang, Wei
    Xiang, Yu
    Molchanov, Pavlo
    Handa, Ankur
    Tremblay, Jonathan
    Narang, Yashraj S.
    Van Wyk, Karl
    Iqbal, Umar
    Birchfield, Stan
    Kautz, Jan
    Fox, Dieter
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9040 - 9049
  • [3] Chen HX, 2023, Arxiv, DOI arXiv:2302.11611
  • [4] I2UV-HandNet: Image-to-UV Prediction Network for Accurate and High-fidelity 3D Hand Mesh Modeling
    Chen, Ping
    Chen, Yujin
    Yang, Dong
    Wu, Fangyin
    Li, Qin
    Xia, Qingpei
    Tan, Yong
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12909 - 12918
  • [5] Chen X., 2022, P IEEE CVF C COMP VI
  • [6] Cheng Y, 2020, AAAI CONF ARTIF INTE, V34, P10631
  • [7] Choi H., 2020, ECCV 2020, P769, DOI DOI 10.1007/978-3-030-58571-6_45
  • [8] Defferrard M, 2016, ADV NEUR IN, V29
  • [9] RMPE: Regional Multi-Person Pose Estimation
    Fang, Hao-Shu
    Xie, Shuqin
    Tai, Yu-Wing
    Lu, Cewu
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2353 - 2362
  • [10] 3D Hand Shape and Pose Estimation from a Single RGB Image
    Ge, Liuhao
    Ren, Zhou
    Li, Yuncheng
    Xue, Zehao
    Wang, Yingying
    Cai, Jianfei
    Yuan, Junsong
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10825 - 10834