Recognizing Activities via Bag of Words for Attribute Dynamics

被引:18
作者
Li, Weixin [1 ]
Yu, Qian [2 ]
Sawhney, Harpreet [2 ]
Vasconcelos, Nuno [1 ]
机构
[1] Univ Calif San Diego, La Jolla, CA 92093 USA
[2] SRI Int Sarnoff, Princeton, NJ 08540 USA
来源
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2013年
关键词
D O I
10.1109/CVPR.2013.334
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we propose a novel video representation for activity recognition that models video dynamics with attributes of activities. A video sequence is decomposed into short-term segments, which are characterized by the dynamics of their attributes. These segments are modeled by a dictionary of attribute dynamics templates, which are implemented by a recently introduced generative model, the binary dynamic system (BDS). We propose methods for learning a dictionary of BDSs from a training corpus, and for quantizing attribute sequences extracted from videos into these BDS codewords. This procedure produces a representation of the video as a histogram of BDS codewords, which is denoted the bag-of-words for attribute dynamics (BoWAD). An extensive experimental evaluation reveals that this representation outperforms other state-of-the-art approaches in temporal structure modeling for complex activity recognition.
引用
收藏
页码:2587 / 2594
页数:8
相关论文
共 28 条
[21]   Holistic Context Models for Visual Recognition [J].
Rasiwasia, Nikhil ;
Vasconcelos, Nuno .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (05) :902-917
[22]  
Ravichandran A., 2012, IEEE TPAMI, P1
[23]  
Rohrbach M, 2012, LECT NOTES COMPUT SC, V7572, P144, DOI 10.1007/978-3-642-33718-5_11
[24]  
Sadanand S, 2012, PROC CVPR IEEE, P1234, DOI 10.1109/CVPR.2012.6247806
[25]  
Schein A.I., 2003, AISTATS
[26]  
Tang Kevin., 2012, CVPR
[27]  
Vishwanathan S., 2006, IJCV
[28]   Local features and kernels for classification of texture and object categories: A comprehensive study [J].
Zhang, J. ;
Marszalek, M. ;
Lazebnik, S. ;
Schmid, C. .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 73 (02) :213-238