Using convolutional features and a sparse autoencoder for land-use scene classification

被引:144
作者
Othmana, Esam [1 ]
Bazi, Yakoub [1 ]
Alajlan, Naif [1 ]
Alhichri, Haikel [1 ]
Melgani, Farid [2 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Riyadh 11543, Saudi Arabia
[2] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy
关键词
NEURAL-NETWORKS; OBJECT DETECTION; DEEP; IMAGES; DOMAIN;
D O I
10.1080/01431161.2016.1171928
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
In this article, we propose a novel approach based on convolutional features and sparse autoencoder (AE) for scene-level land-use (LU) classification. This approach starts by generating an initial feature representation of the scenes under analysis from a deep convolutional neural network (CNN) pre-learned on a large amount of labelled data from an auxiliary domain. Then these convolutional features are fed as input to a sparse AE for learning a new suitable representation in an unsupervised manner. After this pre-training phase, we propose two different scenarios for building the classification system. In the first scenario, we add a softmax layer on the top of the AE encoding layer and then fine-tune the resulting network in a supervised manner using the target training images available at hand. Then we classify the test images based on the posterior probabilities provided by the softmax layer. In the second scenario, we view the classification problem from a reconstruction perspective. To this end we train several class-specific AEs (i.e. one AE per class) and then classify the test images based on the reconstruction error. Experimental results conducted on the University of California (UC) Merced and Banja-Luka LU public data sets confirm the superiority of the proposed approach compared to state-of-the-art methods.
引用
收藏
页码:2149 / 2167
页数:19
相关论文
共 41 条
[1]  
[Anonymous], 2007, P ACM MM
[2]   Subset based deep learning for RGB-D object recognition [J].
Bai, Jing ;
Wu, Yan ;
Zhang, Junming ;
Chen, Fuqiang .
NEUROCOMPUTING, 2015, 165 :280-292
[3]   Efficient Training of Convolutional Deep Belief Networks in the Frequency Domain for Application to High-Resolution 2D and 3D Images [J].
Brosch, Tom ;
Tam, Roger .
NEURAL COMPUTATION, 2015, 27 (01) :211-227
[4]  
Chatfield K., 2014, P BMVC NOTT SEPT
[5]   Pyramid of Spatial Relatons for Scene-Level Land Use Classification [J].
Chen, Shizhi ;
Tian, YingLi .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (04) :1947-1957
[6]   Vehicle Detection in Satellite Images by Hybrid Deep Convolutional Neural Networks [J].
Chen, Xueyun ;
Xiang, Shiming ;
Liu, Cheng-Lin ;
Pan, Chun-Hong .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2014, 11 (10) :1797-1801
[7]   Spectral-Spatial Classification of Hyperspectral Data Based on Deep Belief Network [J].
Chen, Yushi ;
Zhao, Xing ;
Jia, Xiuping .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2015, 8 (06) :2381-2392
[8]   Deep Learning-Based Classification of Hyperspectral Data [J].
Chen, Yushi ;
Lin, Zhouhan ;
Zhao, Xing ;
Wang, Gang ;
Gu, Yanfeng .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (06) :2094-2107
[9]   Effective and Efficient Midlevel Visual Elements-Oriented Land-Use Classification Using VHR Remote Sensing Images [J].
Cheng, Gong ;
Han, Junwei ;
Guo, Lei ;
Liu, Zhenbao ;
Bu, Shuhui ;
Ren, Jinchang .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (08) :4238-4249
[10]   Multi-class geospatial object detection and geographic image classification based on collection of part detectors [J].
Cheng, Gong ;
Han, Junwei ;
Zhou, Peicheng ;
Guo, Lei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 98 :119-132