Supervised Learning and Codebook Optimization for Bag-of-Words Models

被引:28
作者
Jiu, Mingyuan [1 ]
Wolf, Christian [1 ]
Garcia, Christophe [1 ]
Baskurt, Atilla [1 ]
机构
[1] Univ Lyon, CNRS, INSA Lyon, LIRIS,UMR5205, F-69621 Villeurbanne, France
关键词
Bag-of-words models; Supervised learning; Neural networks; Action recognition; EVENT DETECTION; FEATURES;
D O I
10.1007/s12559-012-9137-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a novel approach for supervised codebook learning and optimization for bag-of-words models. This type of models is frequently used in visual recognition tasks like object class recognition or human action recognition. An entity is represented as a histogram of codewords, which are traditionally clustered with unsupervised methods like k-means or random forests and then classified in a supervised way. We propose a new supervised method for joint codebook creation and class learning, which learns the cluster centers of the codebook in a goal-directed way using the class labels of the training set. As a result, the codebook is highly correlated to the recognition problem, leading to a more discriminative codebook. We propose two different learning algorithms, one based on error backpropagation and the other based on cluster label reassignment. We apply the proposed method to human action recognition from video sequences and evaluate it on the KTH data set, reporting very promising results. The proposed technique allows us to improve the discriminative power of an unsupervised learned codebook or to keep the discriminative power while decreasing the size of the learned codebook, thus decreasing the computational complexity due to the nearest neighbor search.
引用
收藏
页码:409 / 419
页数:11
相关论文
共 51 条
[1]   Silhouette-based gesture and action recognition via modeling trajectories on Riemannian shape manifolds [J].
Abdelkader, Mohamed F. ;
Abd-Almageed, Wael ;
Srivastava, Anuj ;
Chellappa, Rama .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2011, 115 (03) :439-455
[2]  
Anh-Phuong Ta, 2010, Proceedings of the 2010 20th International Conference on Pattern Recognition (ICPR 2010), P3224, DOI 10.1109/ICPR.2010.788
[3]  
[Anonymous], 2008, CVPR
[4]  
[Anonymous], INT C ADV VID SIGN B
[5]  
[Anonymous], ACM COMPUT IN PRESS
[6]  
[Anonymous], 2004, P 2004WORKSHOP STAT
[7]  
[Anonymous], 2011, INT WORKSH HUM BEH U
[8]  
[Anonymous], 2010, ECCV
[9]  
[Anonymous], NEURAL NETWORKS PATT
[10]  
[Anonymous], ACM T INTELL SYST TE