Exploiting Spatial-temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks

被引:329
作者
Cai, Yujun [1 ]
Ge, Liuhao [1 ]
Liu, Jun [1 ]
Cai, Jianfei [1 ,2 ]
Cham, Tat-Jen [1 ]
Yuan, Junsong [3 ]
Thalmann, Nadia Magnenat [1 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] Monash Univ, Clayton, Vic, Australia
[3] SUNY Buffalo, Buffalo, NY USA
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
基金
新加坡国家研究基金会;
关键词
NEURAL-NETWORKS;
D O I
10.1109/ICCV.2019.00236
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite great progress in 3D pose estimation from single-view images or videos, it remains a challenging task due to the substantial depth ambiguity and severe self-occlusions. Motivated by the effectiveness of incorporating spatial dependencies and temporal consistencies to alleviate these issues, we propose a novel graph-based method to tackle the problem of 3D human body and 3D hand pose estimation from a short sequence of 2D joint detections. Particularly, domain knowledge about the human hand (body) configurations is explicitly incorporated into the graph convolutional operations to meet the specific demand of the 3D pose estimation. Furthermore, we introduce a local-to-global network architecture, which is capable of learning multi-scale features for the graph-based representations. We evaluate the proposed method on challenging benchmark datasets for both 3D hand pose estimation and 3D body pose estimation. Experimental results show that our method achieves state-of-the-art performance on both tasks.
引用
收藏
页码:2272 / 2281
页数:10
相关论文
共 54 条
  • [51] Ying Wu, 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision, P606, DOI 10.1109/ICCV.1999.791280
  • [52] Zhang JW, 2017, IEEE IMAGE PROC, P982, DOI 10.1109/ICIP.2017.8296428
  • [53] Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach
    Zhou, Xingyi
    Huang, Qixing
    Sun, Xiao
    Xue, Xiangyang
    Wei, Yichen
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 398 - 407
  • [54] Learning to Estimate 3D Hand Pose from Single RGB Images
    Zimmermann, Christian
    Brox, Thomas
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4913 - 4921