A Self-Training Approach for Visual Tracking and Recognition of Complex Human Activity Patterns

被引:22
作者
Bandouch, Jan [1 ]
Jenkins, Odest Chadwicke [2 ]
Beetz, Michael [1 ]
机构
[1] Tech Univ Munich, Intelligent Autonomous Syst Grp, D-85748 Garching, Germany
[2] Brown Univ, Dept Comp Sci, Providence, RI 02912 USA
关键词
Markerless human motion capture; Probabilistic state estimation; Self-trained models of human motion; Activity recognition; HUMAN MOTION; HUMAN MOVEMENT; CAPTURE; PEOPLE;
D O I
10.1007/s11263-012-0522-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatically observing and understanding human activities is one of the big challenges in computer vision research. Among the potential fields of application are areas such as robotics, human computer interaction or medical research. In this article we present our work on unintrusive observation and interpretation of human activities for the precise recognition of human fullbody motions. The presented system requires no more than three cameras and is capable of tracking a large spectrum of motions in a wide variety of scenarios. This includes scenarios where the subject is partially occluded, where it manipulates objects as part of its activities, or where it interacts with the environment or other humans. Our system is self-training, . it is capable of learning models of human motion over time. These are used both to improve the prediction of human dynamics and to provide the basis for the recognition and interpretation of observed activities. The accuracy and robustness obtained by our system is the combined result of several contributions. By taking an anthropometric human model and optimizing it towards use in a probabilistic tracking framework we obtain a detailed biomechanical representation of human shape, posture and motion. Furthermore, we introduce a sophisticated hierarchical sampling strategy for tracking that is embedded in a probabilistic framework and outperforms state-of-the-art Bayesian methods. We then show how to track complex manipulation activities in everyday environments using a combination of learned human appearance models and implicit environment models. Finally, we discuss a locally consistent representation of human motion that we use as a basis for learning environment- and task-specific motion models. All methods presented in this article have been subject to extensive experimental evaluation on today's benchmarks and several challenging sequences ranging from athletic exercises to ergonomic case studies to everyday manipulation tasks in a kitchen environment.
引用
收藏
页码:166 / 189
页数:24
相关论文
共 65 条
[1]   Recovering 3D human pose from monocular images [J].
Agarwal, A ;
Triggs, B .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (01) :44-58
[2]   SCAPE: Shape Completion and Animation of People [J].
Anguelov, D ;
Srinivasan, P ;
Koller, D ;
Thrun, S ;
Rodgers, J ;
Davis, J .
ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (03) :408-416
[3]  
Anguelov D., 2004, 20 C UNC ART INT AUA
[4]  
[Anonymous], 2006, HUMANEVA SYNCHRONIZ
[5]  
[Anonymous], IEEE INT WORKSH TRAC
[6]   A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking [J].
Arulampalam, MS ;
Maskell, S ;
Gordon, N ;
Clapp, T .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2002, 50 (02) :174-188
[7]  
Balan A., 2008, EUR C COMP VIS ECCV
[8]  
Bandouch J., 2008, 19 BRIT MACH VIS C B
[9]  
BEETZ M, 2008, IEEE 17 INT S ROB HU
[10]   The recognition of human movement using temporal templates [J].
Bobick, AF ;
Davis, JW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (03) :257-267