Meta-analysis of deep neural networks in remote sensing: A comparative study of mono-temporal classification to support vector machines

Cited by: 54
Authors
Heydari, Shahriar S. [1]
Mountrakis, Giorgos [1]
Affiliations
[1] SUNY Coll Environm Sci & Forestry, Dept Environm Resources Engn, 1 Forestry Dr, Syracuse, NY 13210 USA
Keywords
Deep learning; Classification; Convolutional neural network; Deep belief network; Stacked auto encoder; Support vector machine; SPECTRAL-SPATIAL CLASSIFICATION; LAND-COVER CLASSIFICATION; SCENE CLASSIFICATION; SATELLITE IMAGES; HYPERSPECTRAL IMAGES; REPRESENTATIONS; EXTRACTION; CNN; SEGMENTATION; INFORMATION;
DOI
10.1016/j.isprsjprs.2019.04.016
Chinese Library Classification number
P9 [Physical Geography];
Discipline classification code
0705; 070501;
Abstract
Deep learning methods have recently found widespread adoption for remote sensing tasks, particularly in image or pixel classification. Their flexibility and versatility have enabled researchers to propose many different designs to process remote sensing data in all spectral, spatial, and temporal dimensions. In most of the reported cases they surpass their non-deep rivals in overall classification accuracy. However, implementation details vary considerably from case to case, and a systematic quantitative comparison to non-deep classifiers does not exist. In this paper, we look at the major research papers that have studied deep learning image classifiers in recent years and undertake a meta-analysis of their performance compared to the most used non-deep rival, Support Vector Machine (SVM) classifiers. We focus on mono-temporal classification because time-series image classification did not offer sufficient samples. Our work covered 103 manuscripts and included 92 cases that supported direct accuracy comparisons between deep learners and SVMs. Our general findings are the following: (i) Deep networks perform better than non-deep spectral SVM implementations, with Convolutional Neural Networks (CNNs) performing better than other deep learners. This advantage, however, diminishes when feeding SVM with richer features extracted from data (e.g. spatial filters). (ii) Transfer learning and fine-tuning on pre-trained CNNs offer promising results over spectral or enhanced SVM; however, these pre-trained networks are currently limited to RGB input data and therefore lack applicability to multi/hyperspectral data. (iii) There is no strong relationship between network complexity and accuracy gains over SVM; small to medium networks perform similarly to more complex networks. (iv) Contrary to popular belief, there are numerous cases of high deep network performance with training proportions of 10% or less.
Our study also indicates that the new generation of classifiers is often saturating existing benchmark datasets, with accuracies surpassing 99%. There is a clear need for new benchmark dataset collections with diverse spectral, spatial, and temporal resolutions and coverage that will enable us to study how designs generalize, challenge these new classifiers, and further advance remote sensing science. Our community could also benefit from a coordinated effort to create a large pre-trained network specifically designed for remote sensing images that users could later fine-tune and adjust to their study specifics.
Pages: 192-210
Page count: 19