Meta-analysis of deep neural networks in remote sensing: A comparative study of mono-temporal classification to support vector machines

被引:54
作者
Heydari, Shahriar S. [1 ]
Mountrakis, Giorgos [1 ]
机构
[1] SUNY Coll Environm Sci & Forestry, Dept Environm Resources Engn, 1 Forestry Dr, Syracuse, NY 13210 USA
关键词
Deep learning; Classification; Convolutional neural network; Deep belief network; Stacked auto encoder; Support vector machine; SPECTRAL-SPATIAL CLASSIFICATION; LAND-COVER CLASSIFICATION; SCENE CLASSIFICATION; SATELLITE IMAGES; HYPERSPECTRAL IMAGES; REPRESENTATIONS; EXTRACTION; CNN; SEGMENTATION; INFORMATION;
D O I
10.1016/j.isprsjprs.2019.04.016
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Deep learning methods have recently found widespread adoption for remote sensing tasks, particularly in image or pixel classification. Their flexibility and versatility has enabled researchers to propose many different designs to process remote sensing data in all spectral, spatial, and temporal dimensions. In most of the reported cases they surpass their non-deep rivals in overall classification accuracy. However, there is considerable diversity in implementation details in each case and a systematic quantitative comparison to non-deep classifiers does not exist. In this paper, we look at the major research papers that have studied deep learning image classifiers in recent years and undertake a meta-analysis on their performance compared to the most used non-deep rival, Support Vector Machine (SVM) classifiers. We focus on mono-temporal classification as the time-series image classification did not offer sufficient samples. Our work covered 103 manuscripts and included 92 cases that supported direct accuracy comparisons between deep learners and SVMs. Our general findings are the following: (i) Deep networks have better performance than non-deep spectral SVM implementations, with Convolutional Neural Networks (CNNs) performing better than other deep learners. This advantage, however, diminishes when feeding SVM with richer features extracted from data (e.g. spatial filters). (ii) Transfer learning and fine-tuning on pre-trained CNNs are offering promising results over spectral or enhanced SVM, however these pre-trained networks are currently limited to RGB input data, therefore currently lack applicability in multi/hyperspectral data. (iii) There is no strong relationship between network complexity and accuracy gains over SVM; small to medium networks perform similarly to more complex networks. (iv) Contrary to the popular belief, there are numerous cases of high deep networks performance with training proportions of 10% or less. Our study also indicates that the new generation of classifiers is often overperforming existing benchmark datasets, with accuracies surpassing 99%. There is a clear need for new benchmark dataset collections with diverse spectral, spatial and temporal resolutions and coverage that will enable us to study the design generalizations, challenge these new classifiers, and further advance remote sensing science. Our community could also benefit from a coordinated effort to create a large pre-trained network specifically designed for remote sensing images that users could later fine-tune and adjust to their study specifics.
引用
收藏
页码:192 / 210
页数:19
相关论文
共 135 条
  • [1] [Anonymous], 2018, REMOTE SENS-BASEL, DOI DOI 10.3390/RS10050779
  • [2] [Anonymous], 2017, J ADV TRANSPORT, DOI DOI 10.1155/2017/8608032
  • [3] [Anonymous], 2015, ARXIV150606579CS
  • [4] [Anonymous], 2013, ARXIV13112901CS
  • [5] [Anonymous], 2017, P IEEE, DOI DOI 10.1109/JPROC.2017.2675998
  • [6] Deep Learning With Attribute Profiles for Hyperspectral Image Classification
    Aptoula, Erchan
    Ozdemir, Murat Can
    Yanikoglu, Berrin
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (12) : 1970 - 1974
  • [7] Semantic Segmentation of Earth Observation Data Using Multimodal and Multi-scale Deep Networks
    Audebert, Nicolas
    Le Saux, Bertrand
    Lefevre, Sebastien
    [J]. COMPUTER VISION - ACCV 2016, PT I, 2017, 10111 : 180 - 196
  • [8] Supervised remote sensing image segmentation using boosted convolutional neural networks
    Basaeed, Essa
    Bhaskar, Harish
    Al-Mualla, Mohammed
    [J]. KNOWLEDGE-BASED SYSTEMS, 2016, 99 : 19 - 27
  • [9] DeepSat - A Learning framework for Satellite Imagery
    Basu, Saikat
    Ganguly, Sangram
    Mukhopadhyay, Supratik
    DiBiano, Robert
    Karki, Manohar
    Nemani, Ramakrishna
    [J]. 23RD ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2015), 2015,
  • [10] 3-D Deep Learning Approach for Remote Sensing Image Classification
    Ben Hamida, Amina
    Benoit, Alexandre
    Lambert, Patrick
    Ben Amar, Chokri
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (08): : 4420 - 4434