A new scene classification method based on spatial pyramid matching model

被引:0
作者
Dong, Baoyu [1 ,2 ]
Ren, Guang [1 ]
机构
[1] Marine Engineering College, Dalian Maritime University, Dalian
[2] Institute of Electric and Information, Dalian Jiaotong University, Dalian
来源
Journal of Information and Computational Science | 2015年 / 12卷 / 03期
关键词
BOVW; RILBP; Scene classification; Spatial pyramid; SVM;
D O I
10.12733/jics20104480
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Scene classification is an appealing and challenging problem in image processing and machine vision. Recently, Bag-of-visual-words (BOVW) method using pyramid matching scheme has shown remarkable performance for scene classification. But this method deriving from local keypoints does not contain texture features which are rich in scene images. To further improves the classification accuracy, this paper presents a new method combining Rotation Invariant Local Binary Patterns (RILBP) texture features and BOVW model in spatial pyramid matching framework. First, scene image is subdivided at different resolutions for constructing a spatial pyramid. Then based on scale invariant feature transform descriptor and K-means clustering, Pyramid Histogram of visual Words (PHOW) is extracted. And RILBP texture feature is extracted using the mean of a 3*3 neighborhood as threshold. Last we construct a composite kernel of spatial pyramid matching. We regard the keypoint features and texture features as two independent feature channels, and combine them to realize scene classification using one-against-rest SVMs with the composite kernel. Experiments results on the three different scene datasets show that our method is effective. ©, 2015, Binary Information Press. All right reserved.
引用
收藏
页码:1073 / 1080
页数:7
相关论文
共 10 条
[1]  
Zhou L., Zhou Z., Hu D., Scene classification using a multi-resolution bag-of-features model, Pattern Recognition, 46, pp. 424-433, (2013)
[2]  
Meng X., Wang Z., Wu L., Building global image features for scene recognition, Pattern Recognition, 45, pp. 373-380, (2012)
[3]  
Qin J., Yung N.H.C., Scene categorization via contextual visual words, Pattern Recognition, 43, pp. 1874-1888, (2010)
[4]  
Elfiky N.M., Gonzalez J., Xavier Roca F., Compact and adaptive spatial pyramids for scene recognition, Image and Vision Computing, 30, pp. 492-500, (2012)
[5]  
Cakir F., Gudukbay U., Ulusoy F., Nearest-neighbor based metric functions for indoor scene recognition, Computer Vision and Image Understanding, 115, pp. 1483-1492, (2011)
[6]  
Lazebnik S., Schmid C., Ponce J., Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2169-2178, (2006)
[7]  
Liu L., Fieguth P., Clausi D., Kuang G., Sorted random projections for robust rotation-invariant texture classification, Pattern Recognition, 45, pp. 2405-2418, (2012)
[8]  
Ojala T., Pietikainen M., Maeenpaa T., Multiresolution gray-scale and rotation invariant texture classification with local binary patterns, IEEE Transactions on Pattern Analysis and Machine Intelligence, 24, 7, pp. 971-987, (2002)
[9]  
Lowe D.G., Distinctive image features from scale-invariant keypoints, International Journal of Computer Vision, 60, 2, pp. 91-110, (2004)
[10]  
Qian X., Hua X., Chen P., Ke L., PLBP: An effective local binary patterns texture descriptor with pyramid representation, Pattern Recognition, 44, pp. 2502-2515, (2011)