Deep 3D Pose Dictionary: 3D Human Pose Estimation from Single RGB Image Using Deep Convolutional Neural Network

Cited by: 0
Authors
Elbasiony, Reda [1, 2, 4]
Gomaa, Walid [1, 3]
Ogata, Tetsuya [4]
Affiliations
[1] Egypt Japan Univ Sci & Technol, Cyber Phys Syst Lab, New Borg El Arab, Egypt
[2] Tanta Univ, Fac Engn, Tanta, Egypt
[3] Alexandria Univ, Fac Engn, Alexandria, Egypt
[4] Waseda Univ, Grad Sch Fundamental Sci & Engn, Tokyo, Japan
Source
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III | 2018 / Vol. 11141
Keywords
3D pose estimation; CNN; Deep learning; Human3.6m;
DOI
10.1007/978-3-030-01424-7_31
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this work, we propose a new approach to 3D human pose estimation from a single monocular RGB image, based on a deep convolutional neural network (CNN). The method reduces the huge search space of continuous-valued 3D human poses by discretizing and approximating these poses as a set of discrete key-poses. The key-poses form a more restricted search space and can be treated as candidate classes of 3D human poses. A classifier is then trained on a set of 3D key-poses and their corresponding RGB images, yielding a model that predicts the 3D pose class of an input monocular RGB image. We use a deep CNN as the classifier because deep CNNs have proven to be the most accurate technique for RGB image classification. Our approach achieves good accuracy, comparable to that of state-of-the-art methods.
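The discretization step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the key-poses are obtained, so this example assumes plain k-means clustering over flattened 3D joint coordinates as a stand-in. The function names (`build_pose_dictionary`, `nearest_key_pose`), the toy data, and all parameters are hypothetical and are not taken from the paper. The CNN classifier itself is omitted; the sketch only shows how continuous poses could become class labels and how a predicted class maps back to a 3D pose.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two flattened poses."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest_key_pose(pose, key_poses):
    """Index of the key-pose closest to `pose`; this index is the class label."""
    return min(range(len(key_poses)),
               key=lambda k: squared_distance(pose, key_poses[k]))


def build_pose_dictionary(poses, num_key_poses, iterations=20, seed=0):
    """Approximate continuous poses by `num_key_poses` key-poses.

    Assumption: plain k-means is used here for illustration only; the
    abstract does not name the discretization method.
    """
    rng = random.Random(seed)
    # Initialise key-poses from randomly chosen training poses.
    key_poses = [list(p) for p in rng.sample(poses, num_key_poses)]
    for _ in range(iterations):
        labels = [nearest_key_pose(p, key_poses) for p in poses]
        for k in range(num_key_poses):
            members = [p for p, label in zip(poses, labels) if label == k]
            if members:  # keep the old key-pose if its cluster is empty
                key_poses[k] = [sum(c) / len(members) for c in zip(*members)]
    # Final labels, consistent with the final key-poses.
    labels = [nearest_key_pose(p, key_poses) for p in poses]
    return key_poses, labels


# Toy data: 2 joints x 3 coordinates per pose, in two well-separated groups.
toy_poses = ([[0.1 * i] * 6 for i in range(4)]
             + [[10.0 + 0.1 * i] * 6 for i in range(4)])
key_poses, labels = build_pose_dictionary(toy_poses, num_key_poses=2)

# At inference time a classifier would output a class index for an image;
# the estimated 3D pose is then the key-pose stored under that index.
predicted_class = labels[-1]  # stand-in for a classifier prediction
estimated_pose = key_poses[predicted_class]
print(labels, estimated_pose)
```

In this framing, the labels returned by `build_pose_dictionary` would serve as training targets for the image classifier, and the number of key-poses trades off classification difficulty against the approximation error introduced by discretization.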
Pages: 310-320
Page count: 11