Biologically Inspired Feature Manifold for Scene Classification

被引:169
作者
Song, Dongjin [1 ]
Tao, Dacheng [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
关键词
Biologically inspired feature; dimensionality reduction; image retrieval; manifold learning; scene classification; TEXTURE CLASSIFICATION; RETRIEVAL;
D O I
10.1109/TIP.2009.2032939
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Biologically inspired feature (BIF) and its variations have been demonstrated to be effective and efficient for scene classification. It is unreasonable to measure the dissimilarity between two BIFs based on their Euclidean distance. This is because BIFs are extrinsically very high dimensional and intrinsically low dimensional, i.e., BIFs are sampled from a low-dimensional manifold and embedded in a high-dimensional space. Therefore, it is essential to find the intrinsic structure of a set of BIFs, obtain a suitable mapping to implement the dimensionality reduction, and measure the dissimilarity between two BIFs in the low-dimensional space based on their Euclidean distance. In this paper, we study the manifold constructed by a set of BIFs utilized for scene classification, form a new dimensionality reduction algorithm by preserving both the geometry of intra BIFs and the discriminative information inter BIFs termed Discriminative and Geometry Preserving Projections (DGPP), and construct a new framework for scene classification. In this framework, we represent an image based on a new BIF, which combines the intensity channel, the color channel, and the C1 unit of a color image; then we project the high-dimensional BIF to a low-dimensional space based on DGPP; and, finally, we conduct the classification based on the multiclass support vector machine (SVM). Thorough empirical studies based on the USC scene dataset demonstrate that the proposed framework improves the classification rates around 100% relatively and the training speed 60 times for different sites in comparing with previous gist proposed by Siagian and Itti in 2007.
引用
收藏
页码:174 / 184
页数:11
相关论文
共 48 条
[1]  
[Anonymous], 1997, Proceedings of the 4th ACM International Conference on Multimedia, MULTIMEDIA 1996, DOI DOI 10.1145/244130.244148
[2]  
Belkin M, 2002, ADV NEUR IN, V14, P585
[3]  
BOSE B, 2003, IEEE INT WORKSH VIS
[4]  
Boutell Matthew., 2002, Review of the State of the Art in Semantic Scene Classification'
[5]  
Cai D., 2007, AAAI, P528
[6]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[7]   Texture analysis and classification with tree-structured wavelet transform [J].
Chang, Tianhorng ;
Kuo, C. -C. Jay .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1993, 2 (04) :429-441
[8]   TWO-DIMENSIONAL SPECTRAL-ANALYSIS OF CORTICAL RECEPTIVE-FIELD PROFILES [J].
DAUGMAN, JG .
VISION RESEARCH, 1980, 20 (10) :847-856
[9]   Applications of video-content analysis and retrieval [J].
Dimitrova, N ;
Zhang, HJ ;
Shahraray, B ;
Sezan, I ;
Huang, T ;
Zakhor, A .
IEEE MULTIMEDIA, 2002, 9 (03) :42-55
[10]   The parahippocampal place area: Recognition, navigation, or encoding? [J].
Epstein, R ;
Harris, A ;
Stanley, D ;
Kanwisher, N .
NEURON, 1999, 23 (01) :115-125