Land cover classification in hilly and mountainous areas using multi-source data and Stacking-SHAP technique

被引:0
作者
Zhou Y. [1 ,2 ]
Chen H. [1 ]
Liu H. [1 ,2 ]
机构
[1] College of Resources and Environment, Southwest University, Chongqing
[2] Chongqing Key Laboratory of Digital Agriculture, Chongqing
来源
Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering | 2022年 / 38卷 / 23期
关键词
hilly and mountainous areas; land cover classification; multi-source data; remote sensing; SHAP technique; Stacking algorithm;
D O I
10.11975/j.issn.1002-6819.2022.23.023
中图分类号
学科分类号
摘要
An accurate classification of land cover can greatly contribute to the basic dataset for regional ecological protection and environmental management. Remote sensing (RS) images are commonly used as the main data source for the extraction of land cover at present. However, there is a complex landscape, broken distribution of ground objects, frequent cloud cover, as well as serious radiometric distortion in the hilly and mountainous areas. Thus, it is difficult to accurately gain the distribution information of ground objects only by satellite images. Fortunately, the collaborative application of multi-source heterogeneous data can be expected to bridge the deficiency of a single data source, in order to accumulate more valuable information for the separability of ground objects. Great prospects can be realized to extract the land cover in areas with the complex surface landscape. In addition, the stacking algorithm with advanced machine learning can present superior and robust predictive performance in recent classification tasks. Therefore, the purpose of the current study is to explore the effectiveness of the multi-source heterogeneous data and stacking algorithm on land cover classification in hilly and mountainous areas. The study area was taken as the Qian Jiang District in Chongqing Province of China. Specifically, the various feature variables were extracted from the multi-source heterogeneous data, including the Sentinel-1/2 images, Digital Elevation Model (DEM), soil and climate data. Boruta method and Variance Inflation Factor (VIF) were applied to eliminate the redundant feature for the simple statistics. Then, five schemes with different inputs were created using the subset of the optimized variables, including the purely RS variables, RS variables plus climate factors, RS variables plus terrain parameters, RS variables plus soil parameters, and all variables. A stacking algorithm was also used to construct the classification model for the impacts of different types of variables on the classification accuracy of land cover. Meanwhile, the best classification using the stacking algorithm was compared with the Support Vector Machine (SVM), Random Forest (RF), and extreme gradient boosting (XGBoost). Additionally, a novel shapley addictive explanation (SHAP) was introduced to quantify the importance of variables in the model. The results showed that the overall accuracy, Kappa coefficient, and F1-score were significantly improved after the introduction of the climate, soil, and terrain variables. By contrast, the lowest classification accuracy of land cover was found in the model only using remote sensing variables. Among them, the soil variables contributed the most improvement, followed by the terrain, and climate variables. The classification accuracy of agricultural land types (dry farmland, paddy field, and orchard) was greater than that of the rest. The best classification accuracy was achieved in the experimental scheme with all feature variables, indicating an overall accuracy of 96.61%, Kappa of 0.96, and F1-score of 94.81%. The classification accuracy of the improved was higher than that of the SVM, RF, and XGBoost under the same variables. The SHAP technique can be expected to quantify and evaluate the global importance of each variable, indicating that the traditional vegetation and water spectral indicators were the most important feature variables. Besides, the local contribution of each variable for each land cover type can provide more value to optimize the parameters for the extraction of object information in hilly and mountainous areas. This finding can offer technical support and theoretical reference for land cover mapping in complex landscape areas. © 2022 Chinese Society of Agricultural Engineering. All rights reserved.
引用
收藏
页码:213 / 222
页数:9
相关论文
共 34 条
[1]  
Verde N, Kokkoris I, Georgiadis C, Et al., National scale land cover classification for ecosystem services mapping and assessment using multitemporal copernicus EO data and google earth engine, Remote Sensing, 12, 20, (2020)
[2]  
Liu H, Gong P, Wang J, Et al., Production of global daily seamless data cubes and quantification of global land cover change from 1985 to 2020 - iMap World 1.0[J], Remote Sensing of Environment, 258, (2021)
[3]  
He Yun, Huang Chong, Li He, Et al., Land-cover classification of random forest based on Sentinel-2A image feature optimization, Resources Science, 41, 5, pp. 992-1001, (2019)
[4]  
Hou Mengjing, Yin Jianpeng, Ge Jing, Et al., Land cover remote sensing classification method of alpine wetland region based on random forest algorithm, Transactions of the Chinese Society for Agricultural Machinery, 51, 7, pp. 220-227, (2020)
[5]  
Wang Lijuan, Kong Yuru, Yang Xiaodong, Et al., Classification of land use in farming areas based on feature optimization random forest algorithm, Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 36, 4, pp. 244-250, (2020)
[6]  
Ning Xiaogang, Chang Wentao, Wang Hao, Et al., Extraction of marsh wetland in Heilongjiang Basin based on GEE and multi-source remote sensing data, National Remote Sensing Bulletin, 26, 2, pp. 386-396, (2022)
[7]  
Kpienbaareh D, Sun X, Wang J, Et al., Crop type and land cover mapping in northern Malawi using the integration of Sentinel-1, Sentinel-2, and Planetscope satellite data, Remote Sensing, 13, 4, (2021)
[8]  
Yao Jinxi, Wang Lang, Li Jianzhong, Et al., Multi-source remote sensing and multi-feature combination ground object classification in Nuomuhong areas,Qinghai Province of China, Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 38, 3, pp. 247-256, (2022)
[9]  
Zhai Pengfei, Li Shihua, Hu Yueming, Object-oriented land cover change detection combining optical and radar remote sensing data, Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 37, 23, pp. 216-224, (2021)
[10]  
Wang Y, Feng C, Duc H, Et al., C. Feng, H. Vu Duc Integrating multi sensor remote sensing data for land use/cover mapping in a tropical mountainous area in Northern Thailand, Geographical Research, 50, 3, pp. 320-331, (2012)