Advances of Four Machine Learning Methods for Spatial Data Handling: a Review

Cited by: 139
Authors
Du, Peijun [1 ,2 ,3 ]
Bai, Xuyu [1 ,2 ,3 ]
Tan, Kun [4 ]
Xue, Zhaohui [5 ]
Samat, Alim [6 ]
Xia, Junshi [7 ]
Li, Erzhu [8 ]
Su, Hongjun [5 ]
Liu, Wei [8 ]
Affiliations
[1] Nanjing Univ, Sch Geog & Ocean Sci, Nanjing 210023, Peoples R China
[2] Key Lab Land Satellite Remote Sensing Applicat, Minist Nat Resources China, Nanjing 210023, Peoples R China
[3] Jiangsu Ctr Collaborat Innovat Geog, Informat Resource Dev & Applicat, Nanjing 210023, Peoples R China
[4] East China Normal Univ, Minist Educ, Key Lab Geog Informat Sci, Shanghai 200241, Peoples R China
[5] Hohai Univ, Sch Earth Sci & Engn, Nanjing 211100, Peoples R China
[6] Chinese Acad Sci, Xinjiang Inst Ecol & Geog, State Key Lab Desert & Oasis Ecol, Urumqi 830011, Peoples R China
[7] RIKEN Ctr Adv Intelligence Project, Tokyo 1030027, Japan
[8] Jiangsu Normal Univ, Sch Geog, Geomat & Planning, Xuzhou 221116, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Machine learning; Remote sensing image classification; Spatial interpolation; Support vector machine; Ensemble learning; Deep learning; Semi-supervised learning; Active learning; HYPERSPECTRAL IMAGE CLASSIFICATION; SUPPORT VECTOR MACHINES; CONVOLUTIONAL NEURAL-NETWORKS; REMOTE-SENSING IMAGES; SCENE CLASSIFICATION; FEATURE-SELECTION; FEATURE-EXTRACTION; ENSEMBLE; SVM; INFORMATION;
DOI
10.1007/s41651-020-00048-5
Chinese Library Classification number
X [Environmental science, safety science];
Discipline classification code
08 ; 0830 ;
Abstract
Most machine learning tasks can be categorized as classification or regression problems. Classification and regression models are commonly used to extract useful geographic information from observed or measured spatial data, in tasks such as land cover classification, spatial interpolation, and quantitative parameter retrieval. This paper reviews the progress of four advanced machine learning methods for spatial data handling: support vector machine (SVM)-based kernel learning, semi-supervised and active learning, ensemble learning, and deep learning. These four methods are representative because they improve learning performance from different perspectives: feature space transformation and the decision function (SVM), optimized use of samples (semi-supervised and active learning), and enhanced learning models and capabilities (ensemble learning and deep learning). The three key elements of machine-learning-based spatial data handling that these four methods can improve are learning algorithms, training samples, and input features. To apply machine learning methods to spatial data handling successfully, a four-level strategy is suggested: experimenting with and evaluating applicability, extending the algorithms by embedding spatial properties, optimizing the parameters for better performance, and enhancing the algorithm by multiple means. First, the advances of SVM are reviewed to demonstrate the merits of novel machine learning methods for spatial data, moving from direct use and comparison with traditional classifiers to targeted improvements that address multiclass problems, optimize SVM parameters, and use spatial and spectral features. Semi-supervised learning and active learning methods are then reviewed as ways to deal with insufficient labeled samples, showing the potential of learning from small training sets. Furthermore, considering the poor generalization capacity and instability of machine learning algorithms, ensemble learning is introduced to integrate the advantages of multiple learners and to enhance the generalization capacity. The typical research lines, including the combination of multiple classifiers, advanced ensemble classifiers, and spatial interpolation, are presented. Finally, deep learning, one of the most popular branches of machine learning, is reviewed with specific examples for scene classification and urban structural type recognition from high-resolution remote sensing images. The review concludes that machine learning methods are very effective for spatial data handling and have wide application potential in the big data era.
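The abstract's "combination of multiple classifiers" can be illustrated with a minimal majority-voting sketch. This is not taken from the paper: the function name `majority_vote`, the three classifier names, and the land cover labels are all hypothetical, chosen only to show how per-pixel predictions from several learners might be fused into one label.

```python
from collections import Counter


def majority_vote(predictions):
    """Fuse label predictions from several classifiers by majority vote.

    predictions: list of lists, one inner list per classifier, each holding
    one label per sample (all inner lists must have the same length).
    Returns one fused label per sample.
    """
    if not predictions:
        raise ValueError("need at least one classifier")
    n_samples = len(predictions[0])
    if any(len(p) != n_samples for p in predictions):
        raise ValueError("all classifiers must label the same samples")

    fused = []
    # zip(*predictions) yields, for each sample, the labels assigned by
    # every classifier in turn.
    for votes in zip(*predictions):
        label, _count = Counter(votes).most_common(1)[0]
        fused.append(label)
    return fused


# Hypothetical predictions for four pixels from three base classifiers.
svm_labels = ["water", "forest", "urban", "urban"]
rf_labels = ["water", "forest", "forest", "urban"]
knn_labels = ["water", "crop", "forest", "urban"]

fused_labels = majority_vote([svm_labels, rf_labels, knn_labels])
print(fused_labels)
```

Plain voting treats every base classifier as equally reliable; weighting votes by per-classifier accuracy is a common refinement, but it is not shown here.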
Pages: 25