Scalability analysis of typical remote sensing data classification methods: A case of remote sensing image scene

被引:0
作者
Zhao L. [1 ,2 ]
Tang P. [1 ]
机构
[1] Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences, Beijing
[2] University of Chinese Academy of Sciences, Beijing
来源
Yaogan Xuebao/Journal of Remote Sensing | 2016年 / 20卷 / 02期
关键词
Classifier; Data classification; Remote sensing image; Scalability; Scene classification;
D O I
10.11834/jrs.20164279
中图分类号
学科分类号
摘要
The classification of remote sensing data plays an important role in all stages of remote sensing data processing and analysis. With the increase in the volume of remote sensing data, new problems concerning remote sensing big data classification tasks arise. Currently, the commonly used classifiers are usually designed for simple tasks to provide satisfactory results. However, for the processing of large volumes of remote sensing data, the scalability of classification efficiency and precision should be further investigated. Therefore, this study emphasizes on the comparisons of the scalability of typical remote sensing data classification methods to achieve this goal. Method: This study takes remote sensing image scene classification as an example and selects four well-known classification methods for comparison, namely, K Nearest Neighbor (KNN), Random Forest (RF), Support Vector Machine (SVM), and Sparse Representation-based Classifier (SRC), to conduct scalability analysis. The comparisons are conducted in terms of parameter sensitivity, effect of training sample data volume on classifier performance, effect of testing sample data volume on classifier performance, and effect of feature dimension on classifier performance. Results: The experimental results are as below: (1) The classifiers of KNN, RF, and L0-SRC are less parameter-sensitive than the classifiers of RBF-SVM, Linear-SVM, and L1-SRC. (2) In cases where the samples to be classified are fixed, all the classifiers tend to increase with the increase in the number of training samples. The SRC-type classification methods have the highest accuracy, followed by the SVM-type classification methods, the RF, and the KNN classifiers. In terms of overall classification time, the results show that the methods can be ranked as below: L0-SRC > L1-SRC > RF > RBF-SVM/Linear-SVM > KNN/L0-SRC-Batch. (3) In cases where the training samples are fixed, the classification accuracies of all the classifiers are seldom affected by the number of samples to be classified, which may be due to the learning abilities of all the different classifiers. (4) The feature dimension affects the efficiency and accuracy of different classifiers, in which SRC and KNN can obtain satisfactory results without high feature dimensions. SVM is tolerant to high feature dimensions and has a good learning ability with such high feature dimensions. By contrast, RF is insensitive to the increase in feature dimensions, and higher feature dimensions do not contribute much to the improvement of classification performance. Under such circumstances, the RBFSVM exhibits the best performance, followed by the L1-SRC classifier, the Linear-SVM classifier, and the RF and L0-SRC/L0-SRC-Batch classifiers. In terms of overall classification time, the classifiers of L1-SRC and L0-SRC are the most time-consuming, whereas the other classifiers have relatively higher efficiency. Conclusion: Different classification methods have different advantages and disadvantages. In the tasks of classifying a large volume of remote sensing data, the selection of classifiers should be balanced and based on their characteristics and practical applications. Generally, a classifier that is less parameter-sensitive and less time-consuming during classification and obtains more accurate classification results is preferable. © 2016, Science Press. All right reserved.
引用
收藏
页码:157 / 171
页数:14
相关论文
共 25 条
  • [1] Breiman L., Random forests, Machine Learning, 45, 1, pp. 5-32, (2001)
  • [2] Breiman L., Random forests, Machine Learning, 45, 1, pp. 5-32, (2001)
  • [3] Chen S.S.B., Donoho D.L., Saunders M.A., Atomic decomposition by basis pursuit, SIAM Journal on Scientific Computing, 20, 1, pp. 33-61, (1998)
  • [4] Chen Y., Nasrabadi N.M., Tran T.D., Hyperspectral image classification using dictionary-based sparse representation, IEEE Transactions on Geoscience and Remote Sensing, 49, 10, pp. 3973-3985, (2011)
  • [5] Du P.J., Xia J.S., Zhang W., Tan K., Liu Y., Liu S.C., Multiple classifier system for remote sensing image classification: A review, Sensors, 12, 4, pp. 4764-4792, (2012)
  • [6] Franco-Lopez H., Ek A.R., Bauer M.E., Estimation and mapping of forest stand density, volume, and cover type using the knearest neighbors method, Remote Sensing of Environment, 77, 3, pp. 251-274, (2001)
  • [7] Friedl M.A., Brodley C.E., Decision tree classification of land cover from remotely sensed data, Remote Sensing of Environment, 61, 3, pp. 399-409, (1997)
  • [8] Hsu C.W., Lin C.J., A comparison of methods for multiclass support vector machine, IEEE Transactions on Neural Networks, 13, 2, pp. 415-425, (2002)
  • [9] Li C.C., Wang J., Wang L., Hu L.Y., Gong P., Comparison of classification algorithms and training sample sizes in urban land classification with Landsat thematic mapper imagery, Remote Sensing, 6, 2, pp. 964-983, (2014)
  • [10] Lu D., Weng Q., A survey of image classification methods and techniques for improving classification performance, International Journal of Remote Sensing, 28, 5, pp. 823-870, (2007)