Exploiting Spatio-Temporal Scene Structure for Wide-Area Activity Analysis in Unconstrained Environments

被引:5
作者
Nayak, Nandita M. [1 ]
Zhu, Yingying [2 ]
Roy-Chowdhury, Amit K. [2 ]
机构
[1] Univ Calif Riverside, Dept Comp Sci, Riverside, CA 92521 USA
[2] Univ Calif Riverside, Dept Elect Engn, Riverside, CA 92521 USA
基金
美国国家科学基金会;
关键词
Context-aware activity recognition; Markov random field; wide-area activity analysis; RECOGNITION; TRACKING;
D O I
10.1109/TIFS.2013.2277669
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Surveillance videos in unconstrained environments typically consist of long duration sequences of activities which occur at different spatio-temporal locations and can involve multiple people acting simultaneously. Often, the activities have contextual relationships with one another. Although context has been studied in the past for the purpose of activity recognition, the use of context in recognition of activities in such challenging environments is relatively unexplored. In this paper, we propose a novel method for capturing the spatio-temporal context between activities in a Markov random field. The structure of the MRF is improvised upon during test time and not predefined, unlike many approaches that model the contextual relationships between activities. Given a collection of videos and a set of weak classifiers for individual activities, the spatio-temporal relationships between activities are represented as probabilistic edge weights in the MRF. This model provides a generic representation for an activity sequence that can extend to any number of objects and interactions in a video. We show that the recognition of activities in a video can be posed as an inference problem on the graph. We conduct experiments on the publicly available UCLA office dataset and the VIRAT dataset, to demonstrate the improvement in recognition accuracy using our proposed model as opposed to recognition using state-of-the-art features on individual activity regions.
引用
收藏
页码:1610 / 1619
页数:10
相关论文
共 27 条
[1]   Human Activity Analysis: A Review [J].
Aggarwal, J. K. ;
Ryoo, M. S. .
ACM COMPUTING SURVEYS, 2011, 43 (03)
[2]  
[Anonymous], P INT C COMP VIS BAR
[3]  
[Anonymous], IEEE I CONF COMP VIS
[4]  
[Anonymous], P EUR C COMP VIS F 3
[5]  
Brendel W, 2011, IEEE I CONF COMP VIS, P778, DOI 10.1109/ICCV.2011.6126316
[6]  
Chaudhry R, 2009, PROC CVPR IEEE, P1932, DOI 10.1109/CVPRW.2009.5206821
[7]  
Gupta A, 2009, PROC CVPR IEEE, P2012, DOI 10.1109/CVPRW.2009.5206492
[8]   What's going on? Discovering Spatio-Temporal Dependencies in Dynamic Scenes [J].
Kuettel, Daniel ;
Breitenstein, Michael D. ;
Van Gool, Luc ;
Ferrari, Vittorio .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :1951-1958
[9]  
Lan T, 2012, PROC CVPR IEEE, P1354, DOI 10.1109/CVPR.2012.6247821
[10]   Discriminative Latent Models for Recognizing Contextual Group Activities [J].
Lan, Tian ;
Wang, Yang ;
Yang, Weilong ;
Robinovitch, Stephen N. ;
Mori, Greg .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (08) :1549-1562