Recurrent Human Pose Estimation

被引：155

作者：

Belagiannis, Vasileios ^{[1
]}

Zisserman, Andrew ^{[1
]}

机构：

[1] Univ Oxford, Dept Engn Sci, Visual Geometry Grp, Oxford, England

来源：

2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017) | 2017年

基金：

英国工程与自然科学研究理事会;

关键词：

D O I：

10.1109/FG.2017.64

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a ConvNet model for predicting 2D human body poses in an image. The model regresses a heatmap representation for each body keypoint, and is able to learn and represent both the part appearances and the context of the part configuration. We make the following three contributions: (i) an architecture combining a feed forward module with a recurrent module, where the recurrent module can be run iteratively to improve the performance; (ii) the model can be trained end-to-end and from scratch, with auxiliary losses incorporated to improve performance; (iii) we investigate whether keypoint visibility can also be predicted. The model is evaluated on two benchmark datasets. The result is a simple architecture that achieves performance on par with the state of the art, but without the complexity of a graphical model stage (or layers).

引用

页码：468 / 475

页数：8

共 44 条

[1]

Andriluka M., 2009, P CVPR

[2] 2D Human Pose Estimation: New Benchmark and State of the Art Analysis [J].

Andriluka, Mykhaylo ;

Pishchulin, Leonid ;

Gehler, Peter ;

Schiele, Bernt .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3686-3693

[3]

[Anonymous], 2015, P ACMM

[4]

[Anonymous], 2015, CVPR

[5]

[Anonymous], 2015, P ICCV

[6]

[Anonymous], 2015, INT C MACHINE LEARN

[7]

[Anonymous], 2011, P CVPR

[8]

Belagiannis V., 2015, P ICCV

[9]

Belagiannis V., 2014, P CVPR

[10] Large-Scale Machine Learning with Stochastic Gradient Descent [J].

Bottou, Leon .

COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, :177-186

← 1 2 3 4 5 →