Improved Bags-of-Words Algorithm for Scene Recognition

Cited by: 4

Authors:

Liu Gang ^{[1]}

Wang Xiaochi ^{[2]}

Affiliations:

[1] Wenzhou Vocat Coll Sci & Technol Wenzhou, Dept Informat Technol, Wenzhou, Peoples R China

[2] Zhejiang Univ City Coll, Sch Informat & Elect Engn, Hangzhou, Peoples R China

Source:

INTERNATIONAL CONFERENCE ON APPLIED PHYSICS AND INDUSTRIAL ENGINEERING 2012, PT B | 2012 / Vol. 24

Keywords:

scene recognition; bags-of-words (BoW); GMM; soft assignment;

DOI:

10.1016/j.phpro.2012.02.188

Chinese Library Classification (CLC) number:

T [Industrial Technology];

Discipline classification code:

08 ;

Abstract:

This paper proposes a new bags-of-words (BoW)-based algorithm for scene/place recognition. Existing scene recognition methods built on the BoW framework usually represent each cluster obtained by k-means with a single codeword, and most of them make a hard assignment of each feature to one codeword when constructing the BoW histogram. Representing each cluster by a single codeword is a crude approximation, since different clusters usually have different means and covariances. A codeword based only on the mean discards the covariance information and biases the hard assignment. To address this, the paper proposes an effective BoW-based technique for scene recognition. It first uses the k-means algorithm to group the feature vectors into a certain number of clusters, together with an occurrence matrix. A Gaussian mixture model (GMM) is then used to model the distribution of each cluster, and each GMM serves as a new "codeword" of the codebook. Finally, a new soft BoW histogram is built to represent each image through the soft assignment of the image features to each GMM. A support vector machine (SVM) is trained on these BoW histograms. Experimental results on the 15-category dataset show that the proposed BoW-based approach is effective for scene/place recognition. (C) 2011 Published by Elsevier B.V. Selection and/or peer-review under responsibility of ICAPIE Organization Committee.
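
The pipeline outlined in the abstract (k-means clustering, a Gaussian model per cluster, soft assignment into a histogram) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes one diagonal-covariance Gaussian per cluster as a simplified stand-in for the per-cluster GMM, and it uses synthetic vectors in place of real local image descriptors. It does not model the occurrence matrix mentioned in the abstract, and it leaves out the SVM training step. All function and variable names (`kmeans`, `fit_cluster_gaussians`, `soft_bow_histogram`) are hypothetical.

```python
import numpy as np

def kmeans(X, k, n_iter=20, seed=0):
    """Plain Lloyd's k-means; returns (centers, labels)."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(n_iter):
        # Squared distance from every point to every center, shape (n, k).
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members) > 0:  # keep the old center if a cluster empties
                centers[j] = members.mean(axis=0)
    # Final assignment against the final centers.
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return centers, d2.argmin(axis=1)

def fit_cluster_gaussians(X, labels, k, eps=1e-6):
    """Assumption: one diagonal Gaussian per cluster (weight, mean, variance)."""
    n, d = X.shape
    weights = np.zeros(k)
    means = np.zeros((k, d))
    variances = np.ones((k, d))
    for j in range(k):
        members = X[labels == j]
        if len(members) == 0:
            continue  # empty cluster keeps weight 0
        weights[j] = len(members) / n
        means[j] = members.mean(axis=0)
        variances[j] = members.var(axis=0) + eps  # eps avoids zero variance
    return weights, means, variances

def soft_bow_histogram(F, weights, means, variances):
    """Soft BoW histogram: sum of posterior responsibilities over codewords."""
    # Log density of each feature under each codeword Gaussian, shape (m, k).
    diff2 = (F[:, None, :] - means[None, :, :]) ** 2
    log_pdf = -0.5 * (diff2 / variances[None] + np.log(2 * np.pi * variances[None])).sum(axis=2)
    with np.errstate(divide="ignore"):  # log(0) for empty clusters is fine here
        log_post = log_pdf + np.log(weights)[None, :]
    log_post -= log_post.max(axis=1, keepdims=True)  # numerical stability
    post = np.exp(log_post)
    post /= post.sum(axis=1, keepdims=True)  # each feature's weights sum to 1
    hist = post.sum(axis=0)
    return hist / hist.sum()

# Synthetic stand-in for descriptors pooled from training images.
rng = np.random.default_rng(0)
train = np.vstack([rng.normal(0.0, 0.5, (200, 8)), rng.normal(4.0, 1.5, (200, 8))])
centers, labels = kmeans(train, k=2)
weights, means, variances = fit_cluster_gaussians(train, labels, k=2)

# Synthetic stand-in for the descriptors of one image.
image_features = rng.normal(0.0, 0.5, (30, 8))
hist = soft_bow_histogram(image_features, weights, means, variances)
```

In this sketch each feature contributes fractionally to every codeword according to its posterior probability, so the cluster variances influence the histogram, whereas a hard assignment to the nearest mean would ignore them. Histograms of this kind would then be passed to a classifier such as an SVM.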

References

Pages: 1255-1261

Page count: 7

11 entries in total

[1] Csurka G., 2004, WORKSH STAT LEARN CO, V1, P1, DOI 10.1234/12345678
[2] Hays J, 2007, ACM T GRAPHIC, V26, DOI [10.1145/1276377.1276382, 10.1145/1239451.1239455]
[3] Lazebnik S., 2006, P IEEE COMPUTER SOC, P2169, DOI 10.1109/CVPR.2006.68
[4] Lin CF, Wang SD. Fuzzy support vector machines [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (02): 464-471
[5] Liu J, 2007, IEEE CONF WIREL MOB
[6] Lowe DG. Distinctive image features from scale-invariant keypoints [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02): 91-110
[7] Perronnin F., 2006, P EUR C COMP VIS GRA
[8] Vailaya A, Figueiredo MAT, Jain AK, Zhang HJ. Image classification for content-based indexing [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2001, 10 (01): 117-130
[9] Yamauchi B, 1997, J ROBOTIC SYST, V14, P107, DOI 10.1002/(SICI)1097-4563(199702)14:2<107::AID-ROB5>3.0.CO;2-W
