A Self-Training Approach for Visual Tracking and Recognition of Complex Human Activity Patterns

被引:0
作者
Jan Bandouch
Odest Chadwicke Jenkins
Michael Beetz
机构
[1] Technische Universität München,Intelligent Autonomous Systems Group
[2] Brown University,Department of Computer Science
来源
International Journal of Computer Vision | 2012年 / 99卷
关键词
Markerless human motion capture; Probabilistic state estimation; Self-trained models of human motion; Activity recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Automatically observing and understanding human activities is one of the big challenges in computer vision research. Among the potential fields of application are areas such as robotics, human computer interaction or medical research. In this article we present our work on unintrusive observation and interpretation of human activities for the precise recognition of human fullbody motions. The presented system requires no more than three cameras and is capable of tracking a large spectrum of motions in a wide variety of scenarios. This includes scenarios where the subject is partially occluded, where it manipulates objects as part of its activities, or where it interacts with the environment or other humans. Our system is self-training, i.e. it is capable of learning models of human motion over time. These are used both to improve the prediction of human dynamics and to provide the basis for the recognition and interpretation of observed activities. The accuracy and robustness obtained by our system is the combined result of several contributions. By taking an anthropometric human model and optimizing it towards use in a probabilistic tracking framework we obtain a detailed biomechanical representation of human shape, posture and motion. Furthermore, we introduce a sophisticated hierarchical sampling strategy for tracking that is embedded in a probabilistic framework and outperforms state-of-the-art Bayesian methods. We then show how to track complex manipulation activities in everyday environments using a combination of learned human appearance models and implicit environment models. Finally, we discuss a locally consistent representation of human motion that we use as a basis for learning environment- and task-specific motion models. All methods presented in this article have been subject to extensive experimental evaluation on today’s benchmarks and several challenging sequences ranging from athletic exercises to ergonomic case studies to everyday manipulation tasks in a kitchen environment.
引用
收藏
页码:166 / 189
页数:23
相关论文
共 81 条
[1]  
Agarwal A.(2006)Recovering 3D human pose from monocular images IEEE Transactions on Pattern Analysis and Machine Intelligence 28 44-58
[2]  
Triggs B.(2005)Scape: shape completion and animation of people ACM Transactions on Graphics 24 408-416
[3]  
Anguelov D.(2002)A tutorial on particle filters for on-line non-linear/non-Gaussian Bayesian tracking IEEE Transactions on Signal Processing 50 174-188
[4]  
Srinivasan P.(2001)The recognition of human movement using temporal templates IEEE Transactions on Pattern Analysis and Machine Intelligence 23 257-267
[5]  
Koller D.(2007)Smart particle filtering for high-dimensional tracking Computer Vision and Image Understanding 106 116-129
[6]  
Thrun S.(2004)Twist based acquisition and tracking of animal and human kinematics International Journal of Computer Vision 56 179-194
[7]  
Rodgers J.(2005)Articulated body motion capture by stochastic search International Journal of Computer Vision 61 185-205
[8]  
Davis J.(2005)Pictorial structures for object recognition International Journal of Computer Vision 61 55-79
[9]  
Arulampalam S.(2010)Optimization and filtering for human motion capture International Journal of Computer Vision 87 75-92
[10]  
Maskell S.(1999)The visual analysis of human movement: a survey Computer Vision and Image Understanding 73 82-98