3D Human Pose Estimation With Spatial Structure Information

Citations: 3
Authors
Huang, Xiaoshan [1, 2]
Huang, Jun [2]
Tang, Zengming [2]
Affiliations
[1] Univ Chinese Acad Sci, Sch Microelect, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Shanghai Adv Res Inst, Shanghai 201210, Peoples R China
Keywords
Three-dimensional displays; Two dimensional displays; Pose estimation; Estimation; Solid modeling; Elbow; Convolution; 3D human poses; graph convolutional networks; adversarial learning; geometric priors; gradient vanish; in-the-wild scenes; NETWORK;
DOI
10.1109/ACCESS.2021.3062426
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Estimating 3D human poses from 2D poses is challenging because of joint self-occlusion, weak generalization, and the inherent ambiguity of recovering depth. The key points of the human body are spatially dependent on one another, and this structural dependence can be used to alleviate joint self-occlusion. We therefore represent the human pose as a directed graph and propose a graph convolutional network that predicts 3D poses from given 2D poses. In the digraph, the connection weight of each edge is determined by the error distribution of joint estimation, which makes the model robust to noise. By optimizing a coarse 3D estimate and using adversarial learning, the algorithm improves estimation accuracy and reduces the ambiguity of the 2D-to-3D mapping. On the Human3.6M and MPI-INF-3DHP datasets, we achieve excellent quantitative performance. More importantly, with pre-training, the algorithm also generalizes well to the outdoor MPII dataset.
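The abstract describes graph convolution over a directed skeleton graph whose edges carry connection weights. The following is a minimal sketch of that general idea, not the authors' network: the five-joint skeleton, the edge weights, the row normalization, and the 2-to-3 linear map are all hypothetical placeholders. In the paper the weights are derived from the error distribution of joint estimation, which this record does not specify.

```python
# Illustrative sketch only: one graph-convolution step over a weighted
# directed skeleton graph. Joint names, edges, and weights are hypothetical.

JOINTS = ["hip", "spine", "neck", "shoulder", "elbow"]

# Directed edges (source -> target) with assumed connection weights.
EDGES = {
    (0, 1): 1.0,
    (1, 2): 0.9,
    (2, 3): 0.8,
    (3, 4): 0.5,
}


def build_adjacency(num_joints, edges):
    """Return a row-normalized adjacency matrix with self-loops.

    Row i holds the weights of the joints that send information to joint i.
    """
    adj = [[0.0] * num_joints for _ in range(num_joints)]
    for i in range(num_joints):
        adj[i][i] = 1.0  # self-loop so each joint keeps its own features
    for (src, dst), weight in edges.items():
        adj[dst][src] = weight
    for row in adj:
        total = sum(row)
        for j in range(len(row)):
            row[j] /= total
    return adj


def matmul(a, b):
    """Plain-Python matrix product of two lists of lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def graph_conv(features, adj, weight):
    """One layer: aggregate incoming neighbor features, then apply a linear map."""
    return matmul(matmul(adj, features), weight)


# 2D input pose (x, y) per joint; values are made up.
pose_2d = [[0.0, 0.0], [0.0, 0.5], [0.0, 1.0], [0.3, 1.0], [0.5, 0.7]]
# A 2 -> 3 linear map standing in for learned parameters.
W = [[1.0, 0.0, 0.5], [0.0, 1.0, 0.5]]

adj = build_adjacency(len(JOINTS), EDGES)
pose_3d = graph_conv(pose_2d, adj, W)  # one 3-vector per joint
```

Because the graph is directed, each joint aggregates only from itself and its incoming neighbor, so a small weight on an edge limits how much a noisy source joint influences its target.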
Pages: 35947-35956
Page count: 10