Deep 3D Pose Dictionary: 3D Human Pose Estimation from Single RGB Image Using Deep Convolutional Neural Network

被引：0

作者：

Elbasiony, Reda ^{[1
,2
,4
]}

Gomaa, Walid ^{[1
,3
]}

Ogata, Tetsuya ^{[4
]}

机构：

[1] Egypt Japan Univ Sci & Technol, Cyber Phys Syst Lab, New Borg El Arab, Egypt

[2] Tanta Univ, Fac Engn, Tanta, Egypt

[3] Alexandria Univ, Fac Engn, Alexandria, Egypt

[4] Waseda Univ, Grad Sch Fundamental Sci & Engn, Tokyo, Japan

来源：

ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III | 2018年 / 11141卷

关键词：

3D pose estimation; CNN; Deep learning; Human3.6m;

D O I：

10.1007/978-3-030-01424-7_31

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work, we propose a new approach for 3D human pose estimation from a single monocular RGB image based on a deep convolutional neural network (CNN). The proposed method depends on reducing the huge search space of the continuous-valued 3D human poses by discretizing and approximating these continuous poses into many discrete key-poses. These key-poses constitute more restricted search space and then can be considered as multiple-class candidates of 3D human poses. Thus, a suitable classification technique is trained using a set of 3D key-poses and their corresponding RGB images to build a model to predict the 3D pose class of an input monocular RGB image. We use deep CNN as a suitable classifier because it is proven to be the most accurate technique for RGB image classification. Our approach is proven to achieve good accuracy which is comparable to the state-of-the-art methods.

引用

页码：310 / 320

页数：11

共 19 条

[1]

[Anonymous], P BRIT MACH VIS C BM

[2]

[Anonymous], 1987, CLUSTERING MEANS MED

[3] 3D Human Pose Estimation=2D Pose Estimation plus Matching [J].

Chen, Ching-Hang ;

Ramanan, Deva .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5759-5767

[4] Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments [J].

Ionescu, Catalin ;

Papava, Dragos ;

Olaru, Vlad ;

Sminchisescu, Cristian .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (07) :1325-1339

[5] Caffe: Convolutional Architecture for Fast Feature Embedding [J].

Jia, Yangqing ;

Shelhamer, Evan ;

Donahue, Jeff ;

Karayev, Sergey ;

Long, Jonathan ;

Girshick, Ross ;

Guadarrama, Sergio ;

Darrell, Trevor .

PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :675-678

[6]

King DB, 2015, ACS SYM SER, V1214, P1

[7] ImageNet Classification with Deep Convolutional Neural Networks [J].

Krizhevsky, Alex ;

Sutskever, Ilya ;

Hinton, Geoffrey E. .

COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90

[8] Maximum-Margin Structured Learning with Deep Networks for 3D Human Pose Estimation [J].

Li, Sijin ;

Zhang, Weichen ;

Chan, Antoni B. .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2848-2856

[9] 3D Human Pose Estimation from Monocular Images with Deep Convolutional Neural Network [J].

Li, Sijin ;

Chan, Antoni B. .

COMPUTER VISION - ACCV 2014, PT II, 2015, 9004 :332-347

[10] 3D Human Pose Estimation from a Single Image via Distance Matrix Regression [J].

Moreno-Noguer, Francesc .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1561-1570

← 1 2 →