Unsupervised Deep Feature Extraction for Remote Sensing Image Classification

被引：589

作者：

Romero, Adriana ^{[1
]}

Gatta, Carlo ^{[2
]}

Camps-Valls, Gustau ^{[3
]}

机构：

[1] Univ Barcelona, Dept Appl Math & Anal, E-08007 Barcelona, Spain

[2] Univ Autonoma Barcelona, Comp Vis Ctr, Barcelona 01873, Spain

[3] Univ Valencia, Image Proc Lab, Valencia 46980, Spain

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2016年 / 54卷 / 03期

关键词：

Aerial image classification; classification; deep convolutional networks; deep learning; feature extraction; hyperspectral (HS) image; multispectral (MS) images; segmentation; sparse features learning; very high resolution (VHR); NEURAL-NETWORKS; CLOUD; ENVIRONMENT; ALGORITHM;

D O I：

10.1109/TGRS.2015.2478379

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

This paper introduces the use of single-layer and deep convolutional networks for remote sensing data analysis. Direct application to multi-and hyperspectral imagery of supervised (shallow or deep) convolutional networks is very challenging given the high input data dimensionality and the relatively small amount of available labeled data. Therefore, we propose the use of greedy layerwise unsupervised pretraining coupled with a highly efficient algorithm for unsupervised learning of sparse features. The algorithm is rooted on sparse representations and enforces both population and lifetime sparsity of the extracted features, simultaneously. We successfully illustrate the expressive power of the extracted representations in several scenarios: classification of aerial scenes, as well as land-use classification in very high resolution or land-cover classification from multi-and hyperspectral images. The proposed algorithmclearly outperforms standard principal component analysis (PCA) and its kernel counterpart (kPCA), as well as current state-of-the-art algorithms of aerial classification, while being extremely computationally efficient at learning representations of data. Results show that single-layer convolutional networks can extract powerful discriminative features only when the receptive field accounts for neighboring pixels and are preferred when the classification requires high resolution and detailed results. However, deep architectures significantly outperform single-layer variants, capturing increasing levels of abstraction and complexity throughout the feature hierarchy.

引用

页码：1349 / 1362

页数：14

共 67 条

[1]

[Anonymous], 2010, P ADV NEUR INF PROC

[2]

[Anonymous], 2002, Principal components analysis

[3]

[Anonymous], P IEEE 1 WHISPERS AU

[4]

[Anonymous], 2009, Kernel methods for remote sensing data analysis

[5]

[Anonymous], 2006, Proc. Adv. Neural Inf. Process. Syst.

[6]

[Anonymous], 2009, ICML

[7]

[Anonymous], P WHISPERS LAUS SWIT

[8]

[Anonymous], 2011, P 28 INT C MACHINE L

[9]

[Anonymous], 2011, Advances in Neural Information Processing Systems

[10] Compressive Coded Aperture Spectral Imaging [J].

Arce, Gonzalo R. ;

Brady, David J. ;

Carin, Lawrence ;

Arguello, Henry ;

Kittle, David S. .

IEEE SIGNAL PROCESSING MAGAZINE, 2014, 31 (01) :105-115

← 1 2 3 4 5 6 7 →