Recognizing Activities via Bag of Words for Attribute Dynamics

被引：18

作者：

Li, Weixin ^{[1
]}

Yu, Qian ^{[2
]}

Sawhney, Harpreet ^{[2
]}

Vasconcelos, Nuno ^{[1
]}

机构：

[1] Univ Calif San Diego, La Jolla, CA 92093 USA

[2] SRI Int Sarnoff, Princeton, NJ 08540 USA

来源：

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2013年

关键词：

D O I：

10.1109/CVPR.2013.334

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work, we propose a novel video representation for activity recognition that models video dynamics with attributes of activities. A video sequence is decomposed into short-term segments, which are characterized by the dynamics of their attributes. These segments are modeled by a dictionary of attribute dynamics templates, which are implemented by a recently introduced generative model, the binary dynamic system (BDS). We propose methods for learning a dictionary of BDSs from a training corpus, and for quantizing attribute sequences extracted from videos into these BDS codewords. This procedure produces a representation of the video as a histogram of BDS codewords, which is denoted the bag-of-words for attribute dynamics (BoWAD). An extensive experimental evaluation reveals that this representation outperforms other state-of-the-art approaches in temporal structure modeling for complex activity recognition.

引用

页码：2587 / 2594

页数：8

共 28 条

[21] Holistic Context Models for Visual Recognition [J].

Rasiwasia, Nikhil ;

Vasconcelos, Nuno .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (05) :902-917

[22]

Ravichandran A., 2012, IEEE TPAMI, P1

[23]

Rohrbach M, 2012, LECT NOTES COMPUT SC, V7572, P144, DOI 10.1007/978-3-642-33718-5_11

[24]

Sadanand S, 2012, PROC CVPR IEEE, P1234, DOI 10.1109/CVPR.2012.6247806

[25]

Schein A.I., 2003, AISTATS

[26]

Tang Kevin., 2012, CVPR

[27]

Vishwanathan S., 2006, IJCV

[28] Local features and kernels for classification of texture and object categories: A comprehensive study [J].

Zhang, J. ;

Marszalek, M. ;

Lazebnik, S. ;

Schmid, C. .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 73 (02) :213-238

← 1 2 3 →