Loose-limbed People: Estimating 3D Human Pose and Motion Using Non-parametric Belief Propagation

被引:101
作者
Sigal, Leonid [1 ]
Isard, Michael [2 ]
Haussecker, Horst [3 ]
Black, Michael J. [4 ]
机构
[1] Disney Res, Pittsburgh, PA 15213 USA
[2] Microsoft Res Silicon Valley, Mountain View, CA 94043 USA
[3] Intel Labs, Santa Clara, CA 95054 USA
[4] Max Planck Inst Intelligent Syst, D-72076 Tubingen, Germany
基金
美国国家科学基金会;
关键词
Articulated pose estimation; Articulated tracking; Human pose estimation; Human motion tracking; Non-parametric belief propagation; TRACKING; REPRESENTATION; RECOGNITION; CAPTURE; MODELS;
D O I
10.1007/s11263-011-0493-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We formulate the problem of 3D human pose estimation and tracking as one of inference in a graphical model. Unlike traditional kinematic tree representations, our model of the body is a collection of loosely-connected body-parts. In particular, we model the body using an undirected graphical model in which nodes correspond to parts and edges to kinematic, penetration, and temporal constraints imposed by the joints and the world. These constraints are encoded using pair-wise statistical distributions, that are learned from motion-capture training data. Human pose and motion estimation is formulated as inference in this graphical model and is solved using Particle Message Passing (PaMPas). PaMPas is a form of non-parametric belief propagation that uses a variation of particle filtering that can be applied over a general graphical model with loops. The loose-limbed model and decentralized graph structure allow us to incorporate information from "bottom-up" visual cues, such as limb and head detectors, into the inference process. These detectors enable automatic initialization and aid recovery from transient tracking failures. We illustrate the method by automatically tracking people in multi-view imagery using a set of calibrated cameras and present quantitative evaluation using the HumanEva dataset.
引用
收藏
页码:15 / 48
页数:34
相关论文
共 102 条
  • [1] Recovering 3D human pose from monocular images
    Agarwal, A
    Triggs, B
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (01) : 44 - 58
  • [2] AlacCormick J., 2000, PROCEEDING ECCV, V2, P3, DOI DOI 10.1007/3-540-45053-X1
  • [3] Andriluka M, 2009, PROC CVPR IEEE, P1014, DOI 10.1109/CVPRW.2009.5206754
  • [4] [Anonymous], 2001, Sequential Monte Carlo methods in practice
  • [5] [Anonymous], 1990, Computer Graphics: Principles and Practice
  • [6] [Anonymous], 2010, IEEE COMP SOC C COMP
  • [7] Balan A. O., 2005, Proceedings. 2nd Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (VS-PETS) (IEEE Cat. No. 05EX1178), P349
  • [8] Banerjee A, 2005, J MACH LEARN RES, V6, P1345
  • [9] A Study of Parts-Based Object Class Detection Using Complete Graphs
    Bergtholdt, Martin
    Kappes, Joerg
    Schmidt, Stefan
    Schnoerr, Christoph
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 87 (1-2) : 93 - 117
  • [10] Bhatia S., 2004, IEEE WORKSH ART NONR