Evaluating the impact of sampling designs on the performance of machine learning techniques for land use land cover classification using Sentinel-2 data

被引:2
作者
Rawat, Shivam [1 ]
Saini, Rashmi [1 ]
机构
[1] GB Pant Inst Engn & Technol, Dept Comp Sci, Pauri Garhwal 246194, India
关键词
Remote sensing; land use land cover; stratified random sampling; machine learning; support vector machine; random forest; k nearest neighbours; ACCURACY ASSESSMENT; SELECTION; SIZE;
D O I
10.1080/01431161.2023.2290994
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
In today's world, by integrating remote sensing technology and modern state-of-the-art machine learning techniques, obtaining Land Use Land Cover (LULC) maps has become easier in comparison to traditional manual methods. The performance of a Machine Learning classifier is influenced by various factors. The objective of this study is to evaluate the impact of sampling design in rough complex terrain located in the Northern Himalayan region in Uttarakhand state, India, where reference data is often limited due to the geographical characteristics of the study area. Three sampling design strategies have been incorporated in this study, namely, stratified random sampling with a proportional number of samples (SRS)proportional, stratified random sampling with an equal number of samples (SRS)equivalent and stratified systematic sampling with an equal number of samples with a minimum distance of 10 m between the consecutive samples (SSS)D = 10 m for the LULC classification. In this study, Sentinel-2 data of 10 m spatial resolution for the study area of Dehradun district, Uttarakhand, India, has been selected. The following conclusions can be drawn from the results of this study (i) (SRS)proportional achieved the highest Overall Accuracy (OA) among all the three sampling techniques. The OA and kappa score (ka) using (SRS)proportional are OA = 90.25 and ka = 0.874 by Random Forest, OA = 88.84 and ka = 0.856 by Support Vector Machine and k Nearest Neighbours (kNN) obtained OA = 87.72 and ka = 0.842, respectively. (ii) It was found that in the case of (SRS)proportional, the majority classes like the deciduous forest, evergreen forest and cropland achieved higher recall and precision values in comparison to those obtained from the other two sampling strategies, i.e. (SRS)equivalent and (SSS)D = 10 m. (iii) The results showed that while switching from (SRS)proportional to (SRS)equivalent or from (SRS)proportional to (SSS)D = 10 m, there was a slight reduction in the precision and recall values for the majority classes and a slight increase for a few of the minority classes.
引用
收藏
页码:7889 / 7908
页数:20
相关论文
共 50 条
[41]   INFLUENCE OF SAMPLE SIZE IN LAND COVER CLASSIFICATION ACCURACY USING RANDOM FOREST AND SENTINEL-2 DATA IN PORTUGAL [J].
Moraes, Daniel ;
Benevides, Pedro ;
Costa, Hugo ;
Moreira, Francisco D. ;
Caetano, Mario .
2021 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM IGARSS, 2021, :4232-4235
[42]   Integration of Sentinel-1 and Sentinel-2 Data with the G-SMOTE Technique for Boosting Land Cover Classification Accuracy [J].
Ebrahimy, Hamid ;
Naboureh, Amin ;
Feizizadeh, Bakhtiar ;
Aryal, Jagannath ;
Ghorbanzadeh, Omid .
APPLIED SCIENCES-BASEL, 2021, 11 (21)
[43]   Assessment of Machine Learning Algorithms for Land Cover Classification Using Remotely Sensed Data [J].
Park, Jeongmook ;
Lee, Yongkyu ;
Lee, Jungsoo .
SENSORS AND MATERIALS, 2021, 33 (11) :3885-3902
[44]   Land cover mapping in Latvia using hyperspectral airborne and simulated Sentinel-2 data [J].
Jakovels, Dainis ;
Filipovs, Jevgenijs ;
Brauns, Agris ;
Taskovs, Juris ;
Erins, Gatis .
FOURTH INTERNATIONAL CONFERENCE ON REMOTE SENSING AND GEOINFORMATION OF THE ENVIRONMENT (RSCY2016), 2016, 9688
[45]   Performance evaluation of MLE, RF and SVM classification algorithms for watershed scale land use/land cover mapping using sentinel 2 bands [J].
Rana, Vikas Kumar ;
Suryanarayana, Tallavajhala Maruthi Venkata .
REMOTE SENSING APPLICATIONS-SOCIETY AND ENVIRONMENT, 2020, 19
[46]   Addressing the impact of land use land cover changes on land surface temperature using machine learning algorithms [J].
Ullah, Sajid ;
Qiao, Xiuchen ;
Abbas, Mohsin .
SCIENTIFIC REPORTS, 2024, 14 (01)
[47]   Automated Production of a Land Cover/Use Map of Europe Based on Sentinel-2 Imagery [J].
Malinowski, Radek ;
Lewinski, Stanislaw ;
Rybicki, Marcin ;
Gromny, Ewa ;
Jenerowicz, Malgorzata ;
Krupinski, Michal ;
Nowakowski, Artur ;
Wojtkowski, Cezary ;
Krupinski, Marcin ;
Kraetzschmar, Elke ;
Schauer, Peter .
REMOTE SENSING, 2020, 12 (21) :1-27
[48]   PlanetScope, Sentinel-2, and Sentinel-1 Data Integration for Object-Based Land Cover Classification in Google Earth Engine [J].
Vizzari, Marco .
REMOTE SENSING, 2022, 14 (11)
[49]   Tree Species Classification Using Hyperion and Sentinel-2 Data with Machine Learning in South Korea and China [J].
Lim, Joongbin ;
Kim, Kyoung-Min ;
Jin, Ri .
ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2019, 8 (03)
[50]   Comparing Pan-sharpened Landsat-9 and Sentinel-2 for Land-Use Classification Using Machine Learning Classifiers [J].
Bouslihim, Yassine ;
Kharrou, Mohamed Hakim ;
Miftah, Abdelhalim ;
Attou, Taha ;
Bouchaou, Lhoussaine ;
Chehbouni, Abdelghani .
JOURNAL OF GEOVISUALIZATION AND SPATIAL ANALYSIS, 2022, 6 (02)