Augmenting bag-of-words: a robust contextual representation of spatiotemporal interest points for action recognition

被引：16

作者：

Li, Yang ^{[1
]}

Ye, Junyong ^{[1
]}

Wang, Tongqing ^{[1
]}

Huang, Shijian ^{[1
]}

机构：

[1] Chongqing Univ, Minist Educ, Key Lab Optoelect Technol & Syst, Chongqing 630044, Peoples R China

来源：

VISUAL COMPUTER | 2015年 / 31卷 / 10期

关键词：

Action recognition; Contextual features; Cumulative probability histogram; Sparse coding; APPEARANCE; FEATURES;

D O I：

10.1007/s00371-014-1020-8

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Although traditional bag-of-words model, together with local spatiotemporal features, has shown promising results for human action recognition, it ignores all structural information of features, which carries important information of motion structures in videos. Recent methods usually characterize the relationship of quantized spatiotemporal features to overcome this drawback. However, the propagation of quantization error leads to an unreliable representation. To alleviate the propagation of quantization error, we present a coding method, which considers not only the spatial similarity but also the reconstruction ability of visual words after giving a probabilistic interpretation of coding coefficients. Based on our coding method, a new type of feature called cumulative probability histogram is proposed to robustly characterize contextual structural information around interest points, which are extracted from multi-layered contexts and assumed to be complementary to local spatiotemporal features. The proposed method is verified on four benchmark datasets. Experiment results show that our method can achieve better performance than previous methods in action recognition.

引用

页码：1383 / 1394

页数：12

共 41 条

[1] Motion history image: its variants and applications [J].

Ahad, Md. Atiqur Rahman ;

Tan, J. K. ;

Kim, H. ;

Ishikawa, S. .

MACHINE VISION AND APPLICATIONS, 2012, 23 (02) :255-281

[2]

[Anonymous], P WMVC

[3]

[Anonymous], 2009, P BRIT MACH VIS C

[4] Contextual Statistics of Space-Time Ordered Features for Human Action Recognition [J].

Bilinski, Piotr ;

Bremond, Francois .

2012 IEEE NINTH INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL-BASED SURVEILLANCE (AVSS), 2012, :228-233

[5] Fusing appearance and distribution information of interest points for action recognition [J].

Bregonzio, Matteo ;

Xiang, Tao ;

Gong, Shaogang .

PATTERN RECOGNITION, 2012, 45 (03) :1220-1234

[6]

Bregonzio M, 2009, PROC CVPR IEEE, P1948, DOI 10.1109/CVPRW.2009.5206779

[7]

Choi J., 2008, ACM ICMR, P291

[8]

Dollar P., 2005, Proceedings. 2nd Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (VS-PETS) (IEEE Cat. No. 05EX1178), P65

[9] Actions as space-time shapes [J].

Gorelick, Lena ;

Blank, Moshe ;

Shechtman, Eli ;

Irani, Michal ;

Basri, Ronen .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (12) :2247-2253

[10]

Heng Wang, 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P3169, DOI 10.1109/CVPR.2011.5995407

← 1 2 3 4 5 →