Biterm topic model-based land use classification of moderate-resolution remote sensing images

被引:0
作者
Shao H. [1 ]
Li Y. [2 ,3 ]
Ding Y. [2 ,3 ]
Liu F. [1 ]
机构
[1] College of Geomatics Science and Technology, Nanjing Tech University, Nanjing
[2] Key Laboratory of VGE Ministry of Education, Nanjing Normal University, Nanjing
[3] Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing
来源
Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering | 2016年 / 32卷 / 22期
关键词
Biterm topic model; Land use; Models; Moderate resolution; Probabilistic topic model; Remote sensing; Remote sensing image classification;
D O I
10.11975/j.issn.1002-6819.2016.22.036
中图分类号
学科分类号
摘要
Land Use/Land Cover type automatic interpretation based on remote sensing data is one of the key problems in many relevant fields. Although a large number of image classification algorithms have been developed, most of them can hardly meet the application requirements. Probabilistic topic models, represented by Latent Dirichlet Allocation (LDA) model, have showed a great success in the field of natural language processing and image processing, which can be used to effectively overcome the gap between low-level features and high-level semantic. In recent years it have also been introduced into remote sensing image analysis field, while most of the researches focused on the analysis of high-resolution remote sensing images. Nonetheless, the moderate-resolution remote sensing data is one of the main sources in Land Use/Land Cover type automatic interpretation. The study analyzed the problem faced by traditional probabilistic topic models in reduced resolution remote sensing image analyzing, and pointed out that low segmentation scale made the image objects small and contained fewer pixels. In fact the objects, which are regarded as image documents in current work, are sparse in moderate resolution remote sensing image. The scarcity led to poor stability when using the standard LDA model to infer the semantic of short documents. So Biterm Topic Model (BTM) showed the ability of inferring the semantic of sparse documents. BTM learns topics by directly modeling the generation of word co-occurrence patterns in the corpus, making the inference effective with the rich corpus-level information. By segmenting the remote sensing image into two scales and regarding the image objects at two levels as short documents and visual words respectively, BTM was introduced to the classification of moderate resolution remote sensing image. The co-occurrence of words denoted as biterm in a document were modeled in BTM extracted by setting a short context refers to a small, fixed-size window over a term sequence. However, the sequence pattern of image visual words is different from that in text. While the spatial relationship is the most important relationship among the visual words, the spatial neighborhood visual words can express the law of image of a certain land use/land cover type and is more aligned with the principle of the humans' observation process. So, it was proposed to use space adjacent visual word pairs as the observations in BTM called S-BTM to reduce the quantity of observation objects. Similar to LDA, it was intractable to exactly solve the parameters, Gibbs sampling was used to infer the topic of visual words. Advance Land Observing Satellite (ALOS) images were used in the experiment, whose spatial resolution was 10 m with 4 bands. LDA, BTM and S-BTM were compared in the classification of land use types. Both BTM and SBTM had higher classification accuracy than LDA. BTM and S-BTM reached the highest accuracy respectively when the visual dictionary size was 480 and 400. When the visual dictionary size was fixed at 400, S-BTM was more effective than BTM at different topic size and both reached the highest accuracy with 20 topics. S-BTM used 33562 biterms to infer the image documents' topic while the number in BTM was 167 455, which showed that S-BTM needed less computation. At last, when topic size was fixed at 20, both overall classification accuracy and Kappa's Coefficient showed that S-BTM achieved better results than LDA and BTM. © 2016, Editorial Department of the Transactions of the Chinese Society of Agricultural Engineering. All right reserved.
引用
收藏
页码:259 / 265
页数:6
相关论文
共 26 条
[1]  
Chen J., Chen J., Liao A., Et al., Concepts and key techniques for 30 m global land cover mapping, Acta Geodaetica et Cartographica Sinica, 43, 6, pp. 551-557, (2014)
[2]  
Chen Y., Xu H., Chen Y., Et al., Analysis of land function classification and transformation in county based on remote sensing image, Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 32, 13, pp. 263-272, (2016)
[3]  
Jia K., Li Q., Tian Y., Et al., A review of classification methods of remote sensing imagery, Spectroscopy and Spectral Analysis, 31, 10, pp. 2618-2623, (2011)
[4]  
Blei D.M., Ng A.Y., Jordan M.I., Latent dirichlet allocation, J. Mach. Learn. Res., 3, 4-5, pp. 993-1022, (2003)
[5]  
Mcauliffe J.D., Blei D.M., Supervised topic models, Advances in Neural Information Processing Systems, pp. 121-128, (2008)
[6]  
Ramage D., Hall D., Nallapati R., Et al., Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora, Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pp. 248-256, (2009)
[7]  
Zhu J., Ahmed A., Xing E.P., MedLDA: maximum margin supervised topic models, Journal of Machine Learning Research, 13, pp. 2237-2278, (2012)
[8]  
Cheng X.Q., Yan X.H., Lan Y.Y., Et al., BTM: topic modeling over short texts, IEEE Transactions on Knowledge and Data Engineering, 26, 12, pp. 2928-2941, (2014)
[9]  
Li F.F., Perona P., A Bayesian hierarchical model for learning natural scene categories, IEEE Conference on Computer Vision and Pattern Recognition, pp. 524-531, (2005)
[10]  
Rasiwasia N., Vasconcelos N., Latent dirichlet allocation models for image classification, IEEE Transactions on Pattern Analysis and Machine Intelligence, 35, 11, pp. 2665-2679, (2013)