Recognizing Activities via Bag of Words for Attribute Dynamics

被引:18
作者
Li, Weixin [1 ]
Yu, Qian [2 ]
Sawhney, Harpreet [2 ]
Vasconcelos, Nuno [1 ]
机构
[1] Univ Calif San Diego, La Jolla, CA 92093 USA
[2] SRI Int Sarnoff, Princeton, NJ 08540 USA
来源
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2013年
关键词
D O I
10.1109/CVPR.2013.334
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we propose a novel video representation for activity recognition that models video dynamics with attributes of activities. A video sequence is decomposed into short-term segments, which are characterized by the dynamics of their attributes. These segments are modeled by a dictionary of attribute dynamics templates, which are implemented by a recently introduced generative model, the binary dynamic system (BDS). We propose methods for learning a dictionary of BDSs from a training corpus, and for quantizing attribute sequences extracted from videos into these BDS codewords. This procedure produces a representation of the video as a histogram of BDS codewords, which is denoted the bag-of-words for attribute dynamics (BoWAD). An extensive experimental evaluation reveals that this representation outperforms other state-of-the-art approaches in temporal structure modeling for complex activity recognition.
引用
收藏
页码:2587 / 2594
页数:8
相关论文
共 28 条
[1]  
[Anonymous], 2008, BRIT MACHINE VISION
[2]  
[Anonymous], CVPR
[3]  
[Anonymous], 2009, BMVC
[4]  
Chan A.B., 2007, CVPR
[5]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[6]  
Chaudhry Rizwan., 2009, CVPR
[7]   Dynamic textures [J].
Doretto, G ;
Chiuso, A ;
Wu, YN ;
Soatto, S .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2003, 51 (02) :91-109
[8]  
Fu Y., 2012, ECCV
[9]  
Gaidon A., 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P3201, DOI 10.1109/CVPR.2011.5995646
[10]   Actions as space-time shapes [J].
Gorelick, Lena ;
Blank, Moshe ;
Shechtman, Eli ;
Irani, Michal ;
Basri, Ronen .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (12) :2247-2253