An Ensemble Learning Approach for Land Use/Land Cover Classification of Arid Regions for Climate Simulation: A Case Study of Xinjiang, Northwest China

被引:19
作者
Du, Haoyang [1 ]
Li, Manchun [2 ,3 ,4 ,5 ]
Xu, Yunyun [1 ]
Zhou, Chen [1 ]
机构
[1] Nanjing Univ, Sch Geog & Ocean Sci, Dept Geog Informat Sci, Nanjing 210023, Peoples R China
[2] Nanjing Univ, Sch Geog & Ocean Sci, Jiangsu Prov Key Lab Geog Informat Technol, Nanjing 210023, Peoples R China
[3] Nanjing Univ, Dept Geog Informat Sci, Nanjing 210023, Peoples R China
[4] Nanjing Univ, Collaborat Innovat Ctr South Sea Studies, Nanjing 210093, Peoples R China
[5] Nanjing Univ, Collaborat Innovat Ctr Novel Software Technol & In, Nanjing 210023, Peoples R China
基金
中国国家自然科学基金;
关键词
Ensemble learning; Classification algorithms; Remote sensing; Support vector machines; Machine learning algorithms; Climate change; Radio frequency; Arid areas; China; ensemble learning; land use; land cover classification; machine learning; SENSING IMAGE CLASSIFICATION; IGBP DISCOVER; SYSTEM; ACCURACY; DATABASE; MODIS; RASTERIZATION; INFORMATION; ALGORITHMS; CONVERSION;
D O I
10.1109/JSTARS.2023.3247624
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Accurate classifications of land use/land cover (LULC) in arid regions are vital for analyzing changes in climate. We propose an ensemble learning approach for improving LULC classification accuracy in Xinjiang, northwest China. First, multisource geographical datasets were applied, and the study area was divided into Northern Xinjiang, Tianshan, and Southern Xinjiang. Second, five machine learning algorithms-k-nearest neighbor, support vector machine (SVM), random forest (RF), artificial neural network (ANN), and C4.5-were chosen to develop different ensemble learning strategies according to the climatic and topographic characteristics of each subregion. Third, stratified random sampling was used to obtain training samples and optimal parameters for each machine learning algorithm. Lastly, each derived approach was applied across Xinjiang, and subregion performance was evaluated. The results showed that the LULC classification accuracy achieved across Xinjiang via the proposed ensemble learning approach was improved by >= 6.85% compared with individual machine learning algorithms. By specific subregion, the accuracies for Northern Xinjiang, Tianshan, and Southern Xinjiang increased by >= 6.70%, 5.87%, and 6.86%, respectively. Moreover, the ensemble learning strategy combining four machine learning algorithms (i.e., SVM, RF, ANN, and C4.5) was superior across Xinjiang and Tianshan; whereas, the three-algorithm (i.e., SVM, RF, and ANN) strategy worked best for the Northern and Southern Xinjiang. The innovation of this study is to develop a novel ensemble learning approach to divide Xinjiang into different subregions, accurately classify land cover, and generate a new land cover product for simulating climate change in Xinjiang.
引用
收藏
页码:2413 / 2426
页数:14
相关论文
共 86 条
[1]   Land cover and land use classification performance of machine learning algorithms in a boreal landscape using Sentinel-2 data [J].
Abdi, Abdulhakim Mohamed .
GISCIENCE & REMOTE SENSING, 2020, 57 (01) :1-20
[2]   Performance evaluation of artificial neural networks for natural terrain classification [J].
Akwensi, Perpetual Hope ;
Brantson, Eric Thompson ;
Niipele, Johanna Ngula ;
Ziggah, Yao Yevenyo .
APPLIED GEOMATICS, 2021, 13 (03) :453-465
[3]   GLC2000:: a new approach to global land cover mapping from Earth observation data [J].
Bartholomé, E ;
Belward, AS .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2005, 26 (09) :1959-1977
[4]   Random forest in remote sensing: A review of applications and future directions [J].
Belgiu, Mariana ;
Dragut, Lucian .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 114 :24-31
[5]   Effects of geographic information system vector-raster-vector data conversion on landscape indices [J].
Bettinger, P ;
Bradshaw, GA ;
Weaver, GW .
CANADIAN JOURNAL OF FOREST RESEARCH, 1996, 26 (08) :1416-1425
[6]   Less Is More: A Comprehensive Framework for the Number of Components of Ensemble Classifiers [J].
Bonab, Hamed ;
Can, Fazli .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (09) :2735-2745
[7]   A Theoretical Framework on the Ideal Number of Classifiers for Online Ensembles in Data Streams [J].
Bonab, Hamed R. ;
Can, Fazli .
CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, :2053-2056
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]   Land-cover change in the Caucasus Mountains since 1987 based on the topographic correction of multi-temporal Landsat composites [J].
Buchner, Johanna ;
Yin, He ;
Frantz, David ;
Kuemmerle, Tobias ;
Askerov, Elshad ;
Bakuradze, Tamar ;
Bleyhl, Benjamin ;
Elizbarashvili, Nodar ;
Komarova, Anna ;
Lewinska, Katarzyna E. ;
Rizayeva, Afag ;
Sayadyan, Hovik ;
Tan, Bin ;
Tepanosyan, Garegin ;
Zazanashvili, Nugzar ;
Radeloff, Volker C. .
REMOTE SENSING OF ENVIRONMENT, 2020, 248
[10]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167