Optimizing Network Structure for 3D Human Pose Estimation

Cited by: 189
Authors
Ci, Hai [1]
Wang, Chunyu [2]
Ma, Xiaoxuan [1]
Wang, Yizhou [1, 3, 4]
Affiliations
[1] Peking Univ, Comp Sci Dept, Beijing, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
[3] Deepwise AI Lab, Beijing, Peoples R China
[4] Peng Cheng Lab, Shenzhen, Peoples R China
Source
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019
DOI
10.1109/ICCV.2019.00235
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A human pose is naturally represented as a graph where the joints are the nodes and the bones are the edges, so it is natural to apply a Graph Convolutional Network (GCN) to estimate 3D poses from 2D poses. In this work, we propose a generic formulation of which both GCN and Fully Connected Network (FCN) are special cases. From this formulation, we discover that GCN has limited representation power when used for estimating 3D poses. We overcome this limitation by introducing the Locally Connected Network (LCN), which is naturally implemented by the generic formulation and notably improves the representation capability over GCN. In addition, since every joint is only connected to a few joints in its neighborhood, LCN has strong generalization power. Experiments on public datasets show that it (1) outperforms the state of the art; (2) is less data hungry than alternative models; and (3) generalizes well to unseen actions and datasets.
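The relationship between the three network types can be made concrete with a small sketch. The code below is not the paper's formulation, which this record does not reproduce. It is one plausible reading of the abstract, assuming a layer of the form y_i = sum_j S[i][j] * W[i][j] x_j, where S is a joint-to-joint structure mask. Under that assumption, a GCN-like layer uses a local mask with one weight matrix shared by every joint pair, an FCN-like layer uses a dense mask with unshared weights, and an LCN-like layer uses a local mask with unshared weights. The 5-joint chain skeleton, the helper names, and the omission of bias, normalization, and nonlinearity are all illustrative choices.

```python
import random

def matvec(W, x):
    """Multiply a d_out x d_in matrix (list of rows) by a length-d_in vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def generic_layer(X, S, W):
    """Assumed generic layer: y_i = sum_j S[i][j] * W[i][j] @ x_j.

    X: J input feature vectors; S: J x J structure mask;
    W: J x J grid of d_out x d_in weight matrices.
    """
    J, d_out = len(X), len(W[0][0])
    Y = []
    for i in range(J):
        y = [0.0] * d_out
        for j in range(J):
            if S[i][j]:
                contrib = matvec(W[i][j], X[j])
                y = [a + S[i][j] * b for a, b in zip(y, contrib)]
        Y.append(y)
    return Y

def k_hop_mask(edges, J, k):
    """Mask with S[i][j] = 1 when joint j is within k bones of joint i."""
    S = [[1 if i == j else 0 for j in range(J)] for i in range(J)]
    for _ in range(k):
        nxt = [row[:] for row in S]
        for i in range(J):
            for a, b in edges:
                if S[i][a]:
                    nxt[i][b] = 1
                if S[i][b]:
                    nxt[i][a] = 1
        S = nxt
    return S

def random_matrix(rng, d_out, d_in):
    return [[rng.uniform(-1, 1) for _ in range(d_in)] for _ in range(d_out)]

def count_free_parameters(S, d_out, d_in, shared):
    """One weight matrix if shared, otherwise one per nonzero mask entry."""
    links = sum(1 for row in S for s in row if s)
    return d_out * d_in * (1 if shared else links)

# Hypothetical toy skeleton: a chain of 5 joints, 0-1-2-3-4.
J, d_in, d_out = 5, 2, 3
edges = [(0, 1), (1, 2), (2, 3), (3, 4)]
rng = random.Random(0)
X = [[rng.uniform(-1, 1) for _ in range(d_in)] for _ in range(J)]

S_local = k_hop_mask(edges, J, k=1)      # each joint sees itself and its direct neighbors
S_full = [[1] * J for _ in range(J)]     # every joint sees every joint

shared_W = random_matrix(rng, d_out, d_in)
W_shared = [[shared_W for _ in range(J)] for _ in range(J)]
W_free = [[random_matrix(rng, d_out, d_in) for _ in range(J)] for _ in range(J)]

Y_gcn = generic_layer(X, S_local, W_shared)  # local mask, shared weights
Y_lcn = generic_layer(X, S_local, W_free)    # local mask, unshared weights
Y_fcn = generic_layer(X, S_full, W_free)     # dense mask, unshared weights

params = {
    "gcn": count_free_parameters(S_local, d_out, d_in, shared=True),
    "lcn": count_free_parameters(S_local, d_out, d_in, shared=False),
    "fcn": count_free_parameters(S_full, d_out, d_in, shared=False),
}
print(params)
```

In this toy setting the LCN-like layer sits between the other two in free parameters: more than the shared-weight variant, fewer than the dense one. The output at joint 0 also depends only on joints 0 and 1, which is the locality property the abstract associates with generalization.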
Pages: 2262-2271
Page count: 10