Evaluating the impact of sampling designs on the performance of machine learning techniques for land use land cover classification using Sentinel-2 data

被引:2
作者
Rawat, Shivam [1 ]
Saini, Rashmi [1 ]
机构
[1] GB Pant Inst Engn & Technol, Dept Comp Sci, Pauri Garhwal 246194, India
关键词
Remote sensing; land use land cover; stratified random sampling; machine learning; support vector machine; random forest; k nearest neighbours; ACCURACY ASSESSMENT; SELECTION; SIZE;
D O I
10.1080/01431161.2023.2290994
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
In today's world, by integrating remote sensing technology and modern state-of-the-art machine learning techniques, obtaining Land Use Land Cover (LULC) maps has become easier in comparison to traditional manual methods. The performance of a Machine Learning classifier is influenced by various factors. The objective of this study is to evaluate the impact of sampling design in rough complex terrain located in the Northern Himalayan region in Uttarakhand state, India, where reference data is often limited due to the geographical characteristics of the study area. Three sampling design strategies have been incorporated in this study, namely, stratified random sampling with a proportional number of samples (SRS)proportional, stratified random sampling with an equal number of samples (SRS)equivalent and stratified systematic sampling with an equal number of samples with a minimum distance of 10 m between the consecutive samples (SSS)D = 10 m for the LULC classification. In this study, Sentinel-2 data of 10 m spatial resolution for the study area of Dehradun district, Uttarakhand, India, has been selected. The following conclusions can be drawn from the results of this study (i) (SRS)proportional achieved the highest Overall Accuracy (OA) among all the three sampling techniques. The OA and kappa score (ka) using (SRS)proportional are OA = 90.25 and ka = 0.874 by Random Forest, OA = 88.84 and ka = 0.856 by Support Vector Machine and k Nearest Neighbours (kNN) obtained OA = 87.72 and ka = 0.842, respectively. (ii) It was found that in the case of (SRS)proportional, the majority classes like the deciduous forest, evergreen forest and cropland achieved higher recall and precision values in comparison to those obtained from the other two sampling strategies, i.e. (SRS)equivalent and (SSS)D = 10 m. (iii) The results showed that while switching from (SRS)proportional to (SRS)equivalent or from (SRS)proportional to (SSS)D = 10 m, there was a slight reduction in the precision and recall values for the majority classes and a slight increase for a few of the minority classes.
引用
收藏
页码:7889 / 7908
页数:20
相关论文
共 50 条
[31]   Comparing machine learning techniques for aquatic vegetation classification using Sentinel-2 data [J].
Piaser, Erika ;
Villa, Paolo .
2022 IEEE 21ST MEDITERRANEAN ELECTROTECHNICAL CONFERENCE (IEEE MELECON 2022), 2022, :465-470
[32]   Deep Learning in the Mapping of Agricultural Land Use Using Sentinel-2 Satellite Data [J].
Singh, Gurwinder ;
Singh, Sartajvir ;
Sethi, Ganesh ;
Sood, Vishakha .
GEOGRAPHIES, 2022, 2 (04) :691-700
[33]   Improving Urban Land Cover Classification with Combined Use of Sentinel-2 and Sentinel-1 Imagery [J].
Hu, Bin ;
Xu, Yongyang ;
Huang, Xiao ;
Cheng, Qimin ;
Ding, Qing ;
Bai, Linze ;
Li, Yan .
ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (08)
[34]   Global land use / land cover with Sentinel 2 and deep learning [J].
Karra, Krishna ;
Kontgis, Caitlin ;
Statman-Weil, Zoe ;
Mazzariello, Joseph C. ;
Mathis, Mark ;
Brumby, Steven P. .
2021 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM IGARSS, 2021, :4704-4707
[35]   Data-driven approach for land surface temperature retrieval with machine learning and sentinel-2 data [J].
Zegaar, Aymen ;
Telli, Abdelmoutia ;
Ounoki, Samira ;
Shahabi, Himan ;
Rueda, Francisco .
REMOTE SENSING APPLICATIONS-SOCIETY AND ENVIRONMENT, 2024, 36
[36]   Land use and land cover classification using machine learning algorithms in google earth engine [J].
Arpitha M ;
S A Ahmed ;
Harishnaika N .
Earth Science Informatics, 2023, 16 :3057-3073
[37]   DEEP LEARNING BASED PATCH-WISE LAND COVER LAND USE CLASSIFICATION: A NEW SMALL BENCHMARK SENTINEL-2 IMAGE DATASET [J].
Alp, Gulsah ;
Sertel, Elif .
2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, :3179-3182
[38]   Assessing Machine Learning Algorithms for Land Use and Land Cover Classification in Morocco Using Google Earth Engine [J].
Ouchra, Hafsa ;
Belangour, Abdessamad ;
Erraissi, Allae ;
Banane, Mouad .
IMAGE ANALYSIS AND PROCESSING - ICIAP 2023 WORKSHOPS, PT I, 2024, 14365 :395-405
[39]   Evaluating Landsat-8, Landsat-9 and Sentinel-2 imageries in land use and land cover (LULC) classification in a heterogeneous urban area [J].
Jombo, Simbarashe ;
Adelabu, Samuel .
GEOJOURNAL, 2023, 88 (Suppl 1) :377-399
[40]   Spatial and semantic effects of LUCAS samples on fully automated land use/land cover classification in high-resolution Sentinel-2 data [J].
Weigand, Matthias ;
Staab, Jeroen ;
Wurm, Michael ;
Taubenboeck, Hannes .
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2020, 88