Monocular 3-D Gait Tracking in Surveillance Scenes

Cited by: 14

Authors:

Rogez, Gregory [1]

Rihan, Jonathan [2]

Guerrero, Jose J. [1]

Orrite, Carlos [1]

Affiliations:

[1] Univ Zaragoza, Aragon Inst Engn Res I3A, Zaragoza 50017, Spain

[2] Oxford Brookes Univ, Dept Comp, Oxford OX33 1HX, England

Source:

IEEE TRANSACTIONS ON CYBERNETICS | 2014 / Vol. 44 / No. 6

Keywords:

3-D body pose; monocular gait tracking; particle filtering; video surveillance; view invariance; HUMAN MOTION; GENERATIVE MODELS; POSE; RECOGNITION; FRAMEWORK; PEOPLE; CAPTURE;

DOI:

10.1109/TCYB.2013.2275731

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Gait recognition can potentially provide a noninvasive and effective biometric authentication from a distance. However, the performance of gait recognition systems will suffer in real surveillance scenarios with multiple interacting individuals and where the camera is usually placed at a significant angle and distance from the floor. We present a methodology for view-invariant monocular 3-D human pose tracking in man-made environments in which we assume that observed people move on a known ground plane. First, we model 3-D body poses and camera viewpoints with a low dimensional manifold and learn a generative model of the silhouette from this manifold to a reduced set of training views. During the online stage, 3-D body poses are tracked using recursive Bayesian sampling conducted jointly over the scene's ground plane and the pose-viewpoint manifold. For each sample, the homography that relates the corresponding training plane to the image points is calculated using the dominant 3-D directions of the scene, the sampled location on the ground plane and the sampled camera view. Each regressed silhouette shape is projected using this homographic transformation and is matched in the image to estimate its likelihood. Our framework is able to track 3-D human walking poses in a 3-D environment exploring only a 4-D state space with success. In our experimental evaluation, we demonstrate the significant improvements of the homographic alignment over a commonly used similarity transformation and provide quantitative pose tracking results for the monocular sequences with a high perspective effect from the CAVIAR dataset.
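The online stage described in the abstract (sampling over a 4-D state, projecting a shape through a homography, and weighting by image likelihood) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `apply_homography` and `particle_filter_step`, the random-walk dynamics, the state layout `(x, y, pose, view)`, and the toy likelihood are all assumptions made for illustration. In the paper, the homography is derived from the scene's dominant 3-D directions and the sampled state, and the likelihood comes from silhouette matching; neither is reproduced here.

```python
import math
import random


def apply_homography(H, points):
    """Map 2-D points through a 3x3 homography H (nested lists, row-major)."""
    mapped = []
    for x, y in points:
        u = H[0][0] * x + H[0][1] * y + H[0][2]
        v = H[1][0] * x + H[1][1] * y + H[1][2]
        w = H[2][0] * x + H[2][1] * y + H[2][2]
        mapped.append((u / w, v / w))  # divide by the homogeneous coordinate
    return mapped


def particle_filter_step(particles, likelihood, noise_std, rng):
    """One predict / weight / resample step over a 4-D state.

    Assumed state layout (hypothetical): (x, y, pose, view), i.e. a
    ground-plane location plus two pose-viewpoint manifold coordinates.
    """
    # Predict: random-walk diffusion (an assumption, not the paper's dynamics).
    predicted = [
        tuple(s + rng.gauss(0.0, sd) for s, sd in zip(p, noise_std))
        for p in particles
    ]
    # Weight: score each hypothesis with the caller-supplied likelihood.
    weights = [likelihood(p) for p in predicted]
    if sum(weights) <= 0.0:
        weights = [1.0] * len(predicted)  # fall back to uniform weights
    # Resample: draw particles in proportion to their weights.
    return rng.choices(predicted, weights=weights, k=len(predicted))


if __name__ == "__main__":
    rng = random.Random(0)
    # A toy perspective homography; a real one would depend on the sample.
    H = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.5, 1.0]]
    print(apply_homography(H, [(2.0, 4.0)]))

    # Toy likelihood peaked at x = 1 (stand-in for silhouette matching).
    def toy_likelihood(state):
        return math.exp(-((state[0] - 1.0) ** 2))

    particles = [(0.0, 0.0, 0.0, 0.0)] * 100
    particles = particle_filter_step(
        particles, toy_likelihood, (0.1, 0.1, 0.05, 0.05), rng
    )
    print(len(particles))
```

Unlike a similarity transformation (scale, rotation, translation), the homography has a non-trivial third row, so the projected shape changes with perspective; that difference is what the abstract credits for the improved alignment.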


Pages: 894-909

Page count: 16

