3D Human Pose Estimation With Spatial Structure Information

被引:3
作者
Huang, Xiaoshan [1 ,2 ]
Huang, Jun [2 ]
Tang, Zengming [2 ]
机构
[1] Univ Chinese Acad Sci, Sch Microelect, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Shanghai Adv Res Inst, Shanghai 201210, Peoples R China
关键词
Three-dimensional displays; Two dimensional displays; Pose estimation; Estimation; Solid modeling; Elbow; Convolution; 3D human poses; graph convolutional networks; adversarial learning; geometric priors; gradient vanish; in-the-wild scenes; NETWORK;
D O I
10.1109/ACCESS.2021.3062426
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Estimating 3D human poses from 2D poses is a challenging problem due to joints self-occlusion, weak generalization, and inherent ambiguity of recovering depth. Actually, there exists spatial structure dependence on human body key points which can be used to alleviate the problem of joints self-occlusion. Therefore, we represent human pose as a directed graph and propose a network implemented with graph convolution to predict 3D poses from the given 2D poses. In the digraph, we determine the connection weight of each edge according to the error distribution of joints estimation. This makes our model robust to noise. By optimizing coarse 3D estimation and adversarial learning, our algorithm can successfully improve the accuracy of estimation and relieve the ambiguity of mapping. Through testing on Human 3.6M and MPI-INF-3DHP datasets, we achieve excellent quantitative performance. More importantly, our algorithm also has a superior generalization to outdoor dataset MPII by the pre-training process.
引用
收藏
页码:35947 / 35956
页数:10
相关论文
共 46 条
[1]  
Akhter I, 2015, PROC CVPR IEEE, P1446, DOI 10.1109/CVPR.2015.7298751
[2]   2D Human Pose Estimation: New Benchmark and State of the Art Analysis [J].
Andriluka, Mykhaylo ;
Pishchulin, Leonid ;
Gehler, Peter ;
Schiele, Bernt .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3686-3693
[3]  
[Anonymous], 2012, Lecture Notes in Computer Science
[4]  
[Anonymous], 2017, P 34 INT C MACH LEAR
[5]  
[Anonymous], 2017, ARXIV171006513
[6]  
Biswas S., 2019, IEEE IJCNN, P1
[7]  
Chang J. Y., 2019, ARXIV191012029
[8]   Unsupervised 3D Pose Estimation with Geometric Self-Supervision [J].
Chen, Ching-Hang ;
Tyagi, Ambrish ;
Agrawal, Amit ;
Drover, Dylan ;
Rohith, M., V ;
Stojanov, Stefan ;
Rehg, James M. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5707-5717
[9]   Cascaded Pyramid Network for Multi-Person Pose Estimation [J].
Chen, Yilun ;
Wang, Zhicheng ;
Peng, Yuxiang ;
Zhang, Zhiqiang ;
Yu, Gang ;
Sun, Jian .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7103-7112
[10]   Optimizing Network Structure for 3D Human Pose Estimation [J].
Ci, Hai ;
Wang, Chunyu ;
Ma, Xiaoxuan ;
Wang, Yizhou .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :2262-2271