Integrated visual vocabulary in latent Dirichlet allocation-based scene classification for IKONOS image

Cited by: 29
Authors
Kusumaningrum, Retno [1]
Wei, Hong [2]
Manurung, Ruli [3]
Murni, Aniati [3]
Affiliations
[1] Diponegoro Univ, Dept Informat, Semarang 50275, Indonesia
[2] Univ Reading, Sch Syst Engn, Reading RG6 6AY, Berks, England
[3] Univ Indonesia, Fac Comp Sci, Depok 16424, Indonesia
Keywords
latent Dirichlet allocation; scene classification; integrated visual vocabulary; bag of visual words; IKONOS
DOI
10.1117/1.JRS.8.083690
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Scene classification based on latent Dirichlet allocation (LDA) is a more general modeling method known as a bag of visual words, in which the construction of a visual vocabulary is a crucial quantization process to ensure the success of the classification. A framework is developed using the following new aspects: Gaussian mixture clustering for the quantization process; the use of an integrated visual vocabulary (IVV), which is built as the union of all centroids obtained from the separate quantization process of each class; and the use of several features, including edge orientation histogram, CIELab color moments, and gray-level co-occurrence matrix (GLCM). The experiments are conducted on IKONOS images with six semantic classes (tree, grassland, residential, commercial/industrial, road, and water). The results show that the use of an IVV increases the overall accuracy (OA) by 11 to 12% and 6% when it is implemented on the selected and all features, respectively. The selected features of CIELab color moments and GLCM provide a better OA than the implementation over CIELab color moments or GLCM individually. The latter increases the OA by only approximately 2 to 3%. Moreover, the results show that the OA of LDA outperforms the OA of C4.5 and naive Bayes tree by approximately 20%. © 2014 Society of Photo-Optical Instrumentation Engineers (SPIE)
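The integrated visual vocabulary described in the abstract (cluster each class separately, then take the union of all centroids) can be illustrated with a minimal sketch. It is not the authors' implementation: plain k-means is swapped in for the Gaussian mixture clustering to keep the example dependency-free, the two-dimensional toy features stand in for the real descriptors, and every function name (`kmeans`, `nearest`, `integrated_vocabulary`, `bag_of_visual_words`) is hypothetical.

```python
import random

def nearest(p, centroids):
    """Index of the centroid closest to p (squared Euclidean distance)."""
    return min(range(len(centroids)),
               key=lambda i: sum((a - b) ** 2 for a, b in zip(p, centroids[i])))

def kmeans(points, k, iters=20, seed=0):
    """Plain k-means on equal-length tuples (stand-in for GMM clustering)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(p, centroids)].append(p)
        # Recompute each centroid as its cluster mean; keep the old
        # centroid when a cluster ends up empty.
        centroids = [
            tuple(sum(col) / len(c) for col in zip(*c)) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids

def integrated_vocabulary(features_by_class, k_per_class):
    """Quantize each class separately, then take the union of all centroids."""
    vocab = []
    for label in sorted(features_by_class):
        vocab.extend(kmeans(features_by_class[label], k_per_class))
    return vocab

def bag_of_visual_words(features, vocab):
    """Count how many features fall on each visual word of the vocabulary."""
    hist = [0] * len(vocab)
    for f in features:
        hist[nearest(f, vocab)] += 1
    return hist

# Toy data (made up for illustration): two classes, well separated.
features_by_class = {
    "water": [(0.0, 0.1), (0.1, 0.0), (0.2, 0.1), (0.1, 0.2)],
    "tree": [(5.0, 5.1), (5.1, 5.0), (4.9, 5.2), (5.2, 4.9)],
}
vocab = integrated_vocabulary(features_by_class, k_per_class=2)
hist = bag_of_visual_words([(0.05, 0.05), (5.0, 5.0), (5.1, 5.1)], vocab)
print(len(vocab), hist)
```

The resulting histogram is the bag-of-visual-words "document" that a topic model such as LDA would consume; with a single global clustering instead, the vocabulary would not be guaranteed to contain words drawn from every class.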
Pages: 17
Related papers
21 in total
[1]   Fusing integrated visual vocabularies-based bag of visual words and weighted colour moments on spatial pyramid layout for natural scene image classification [J].
Alqasrawi, Yousef ;
Neagu, Daniel ;
Cowling, Peter I. .
SIGNAL IMAGE AND VIDEO PROCESSING, 2013, 7 (04) :759-775
[2]  
[Anonymous], 2008, PARAMETER ESTIMATION
[3]  
[Anonymous], 2014, C4.5: Programs for Machine Learning
[4]  
[Anonymous], 2006, Advances in Neural Information Processing Systems
[5]  
[Anonymous], 1995, STORAGE RETRIEVAL IM, DOI 10.1117/12.205308
[6]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[7]   Which is the best way to organize/classify images by content? [J].
Bosch, Anna ;
Munoz, Xavier ;
Marti, Robert .
IMAGE AND VISION COMPUTING, 2007, 25 (06) :778-791
[8]  
Bosch A, 2006, LECT NOTES COMPUT SC, V3954, P517
[9]  
Bouman C.A., 2005, CLUSTER UNSUPERVISED
[10]  
Wang C, 2009, PROC CVPR IEEE, P1903, DOI [10.1109/CVPR.2009.5206800, 10.1109/CVPRW.2009.5206800]