Evaluating the impact of sampling designs on the performance of machine learning techniques for land use land cover classification using Sentinel-2 data

被引:2
作者
Rawat, Shivam [1 ]
Saini, Rashmi [1 ]
机构
[1] GB Pant Inst Engn & Technol, Dept Comp Sci, Pauri Garhwal 246194, India
关键词
Remote sensing; land use land cover; stratified random sampling; machine learning; support vector machine; random forest; k nearest neighbours; ACCURACY ASSESSMENT; SELECTION; SIZE;
D O I
10.1080/01431161.2023.2290994
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
In today's world, by integrating remote sensing technology and modern state-of-the-art machine learning techniques, obtaining Land Use Land Cover (LULC) maps has become easier in comparison to traditional manual methods. The performance of a Machine Learning classifier is influenced by various factors. The objective of this study is to evaluate the impact of sampling design in rough complex terrain located in the Northern Himalayan region in Uttarakhand state, India, where reference data is often limited due to the geographical characteristics of the study area. Three sampling design strategies have been incorporated in this study, namely, stratified random sampling with a proportional number of samples (SRS)proportional, stratified random sampling with an equal number of samples (SRS)equivalent and stratified systematic sampling with an equal number of samples with a minimum distance of 10 m between the consecutive samples (SSS)D = 10 m for the LULC classification. In this study, Sentinel-2 data of 10 m spatial resolution for the study area of Dehradun district, Uttarakhand, India, has been selected. The following conclusions can be drawn from the results of this study (i) (SRS)proportional achieved the highest Overall Accuracy (OA) among all the three sampling techniques. The OA and kappa score (ka) using (SRS)proportional are OA = 90.25 and ka = 0.874 by Random Forest, OA = 88.84 and ka = 0.856 by Support Vector Machine and k Nearest Neighbours (kNN) obtained OA = 87.72 and ka = 0.842, respectively. (ii) It was found that in the case of (SRS)proportional, the majority classes like the deciduous forest, evergreen forest and cropland achieved higher recall and precision values in comparison to those obtained from the other two sampling strategies, i.e. (SRS)equivalent and (SSS)D = 10 m. (iii) The results showed that while switching from (SRS)proportional to (SRS)equivalent or from (SRS)proportional to (SSS)D = 10 m, there was a slight reduction in the precision and recall values for the majority classes and a slight increase for a few of the minority classes.
引用
收藏
页码:7889 / 7908
页数:20
相关论文
共 50 条
[21]   Can a Hierarchical Classification of Sentinel-2 Data Improve Land Cover Mapping? [J].
Wasniewski, Adam ;
Hoscilo, Agata ;
Chmielewska, Milena .
REMOTE SENSING, 2022, 14 (04)
[22]   Fusion of sentinel-1 SAR and sentinel-2 MSI data for accurate Urban land use-land cover classification in Gondar City, Ethiopia [J].
Dagne, Shimelis Sishah ;
Hirpha, Hurgesa Hundera ;
Tekoye, Addisu Teshome ;
Dessie, Yeshambel Barko ;
Endeshaw, Adane Addis .
ENVIRONMENTAL SYSTEMS RESEARCH, 2023, 12 (01)
[23]   Comparison of Three Machine Learning Algorithms Using Google Earth Engine for Land Use Land Cover Classification [J].
Zhao, Zhewen ;
Islam, Fakhrul ;
Waseem, Liaqat Ali ;
Tariq, Aqil ;
Nawaz, Muhammad ;
Ul Islam, Ijaz ;
Bibi, Tehmina ;
Rehman, Nazir Ur ;
Ahmad, Waqar ;
Aslam, Rana Waqar ;
Raza, Danish ;
Hatamleh, Wesam Atef .
RANGELAND ECOLOGY & MANAGEMENT, 2024, 92 :129-137
[24]   Uncertainty Analysis of Object-Based Land-Cover Classification Using Sentinel-2 Time-Series Data [J].
Ma, Lei ;
Schmitt, Michael ;
Zhu, Xiaoxiang .
REMOTE SENSING, 2020, 12 (22) :1-17
[25]   The impact of selection of reference samples and DEM on the accuracy of land cover classification based on Sentinel-2 data [J].
Wasniewski, Adam ;
Hoscilo, Agata ;
Aune-Lundberg, Linda .
REMOTE SENSING APPLICATIONS-SOCIETY AND ENVIRONMENT, 2023, 32
[26]   UNET NEURAL NETWORK IN AGRICULTURAL LAND COVER CLASSIFICATION USING SENTINEL-2 [J].
Kramarczyk, P. ;
Hejmanowska, B. .
2ND GEOBENCH WORKSHOP ON EVALUATION AND BENCHMARKING OF SENSORS, SYSTEMS AND GEOSPATIAL DATA IN PHOTOGRAMMETRY AND REMOTE SENSING, VOL. 48-1, 2023, :85-90
[27]   ASSESSMENT OF CLASSIFICATION ACCURACIES OF SENTINEL-2 AND LANDSAT-8 DATA FOR LAND COVER/USE MAPPING [J].
Topaloglu, Raziye Hale ;
Sertel, Elif ;
Musaoglu, Nebiye .
XXIII ISPRS CONGRESS, COMMISSION VIII, 2016, 41 (B8) :1055-1059
[28]   Combining Sentinel-1 and Sentinel-2 data for improved land use and land cover mapping of monsoon regions [J].
Steinhausen, Max J. ;
Wagner, Paul D. ;
Narasimhan, Balaji ;
Waske, Bjoern .
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2018, 73 :595-604
[29]   The 2017 Land Use/Land Cover Map of Catalonia based on Sentinel-2 images and auxiliary data [J].
Gonzalez-Guerrero, O. ;
Pons, X. .
REVISTA DE TELEDETECCION, 2020, (55) :81-92
[30]   Image Classification and Land Cover Mapping Using Sentinel-2 Imagery: Optimization of SVM Parameters [J].
Yousefi, Saleh ;
Mirzaee, Somayeh ;
Almohamad, Hussein ;
Al Dughairi, Ahmed Abdullah ;
Gomez, Christopher ;
Siamian, Narges ;
Alrasheedi, Mona ;
Abdo, Hazem Ghassan .
LAND, 2022, 11 (07)