Unsupervised Deep Feature Extraction for Remote Sensing Image Classification

被引:589
作者
Romero, Adriana [1 ]
Gatta, Carlo [2 ]
Camps-Valls, Gustau [3 ]
机构
[1] Univ Barcelona, Dept Appl Math & Anal, E-08007 Barcelona, Spain
[2] Univ Autonoma Barcelona, Comp Vis Ctr, Barcelona 01873, Spain
[3] Univ Valencia, Image Proc Lab, Valencia 46980, Spain
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2016年 / 54卷 / 03期
关键词
Aerial image classification; classification; deep convolutional networks; deep learning; feature extraction; hyperspectral (HS) image; multispectral (MS) images; segmentation; sparse features learning; very high resolution (VHR); NEURAL-NETWORKS; CLOUD; ENVIRONMENT; ALGORITHM;
D O I
10.1109/TGRS.2015.2478379
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
This paper introduces the use of single-layer and deep convolutional networks for remote sensing data analysis. Direct application to multi-and hyperspectral imagery of supervised (shallow or deep) convolutional networks is very challenging given the high input data dimensionality and the relatively small amount of available labeled data. Therefore, we propose the use of greedy layerwise unsupervised pretraining coupled with a highly efficient algorithm for unsupervised learning of sparse features. The algorithm is rooted on sparse representations and enforces both population and lifetime sparsity of the extracted features, simultaneously. We successfully illustrate the expressive power of the extracted representations in several scenarios: classification of aerial scenes, as well as land-use classification in very high resolution or land-cover classification from multi-and hyperspectral images. The proposed algorithmclearly outperforms standard principal component analysis (PCA) and its kernel counterpart (kPCA), as well as current state-of-the-art algorithms of aerial classification, while being extremely computationally efficient at learning representations of data. Results show that single-layer convolutional networks can extract powerful discriminative features only when the receptive field accounts for neighboring pixels and are preferred when the classification requires high resolution and detailed results. However, deep architectures significantly outperform single-layer variants, capturing increasing levels of abstraction and complexity throughout the feature hierarchy.
引用
收藏
页码:1349 / 1362
页数:14
相关论文
共 67 条
[1]  
[Anonymous], 2010, P ADV NEUR INF PROC
[2]  
[Anonymous], 2002, Principal components analysis
[3]  
[Anonymous], P IEEE 1 WHISPERS AU
[4]  
[Anonymous], 2009, Kernel methods for remote sensing data analysis
[5]  
[Anonymous], 2006, Proc. Adv. Neural Inf. Process. Syst.
[6]  
[Anonymous], 2009, ICML
[7]  
[Anonymous], P WHISPERS LAUS SWIT
[8]  
[Anonymous], 2011, P 28 INT C MACHINE L
[9]  
[Anonymous], 2011, Advances in Neural Information Processing Systems
[10]   Compressive Coded Aperture Spectral Imaging [J].
Arce, Gonzalo R. ;
Brady, David J. ;
Carin, Lawrence ;
Arguello, Henry ;
Kittle, David S. .
IEEE SIGNAL PROCESSING MAGAZINE, 2014, 31 (01) :105-115