Evaluating the impact of sampling designs on the performance of machine learning techniques for land use land cover classification using Sentinel-2 data

Times cited: 1
Authors
Rawat, Shivam [1]
Saini, Rashmi [1]
Affiliations
[1] GB Pant Inst Engn & Technol, Dept Comp Sci, Pauri Garhwal 246194, India
Keywords
Remote sensing; land use land cover; stratified random sampling; machine learning; support vector machine; random forest; k nearest neighbours; accuracy assessment; selection; size
DOI
10.1080/01431161.2023.2290994
Chinese Library Classification (CLC) number
TP7 [Remote sensing technology]
Discipline classification codes
081102; 0816; 081602; 083002; 1404
Abstract
Integrating remote sensing with modern machine learning techniques has made Land Use Land Cover (LULC) mapping easier than traditional manual methods. The performance of a machine learning classifier depends on several factors, including the sampling design. This study evaluates the impact of sampling design on LULC classification in rough, complex terrain in the Northern Himalayan region of Uttarakhand state, India, where reference data are often limited by the geography of the study area. Three sampling designs are compared: stratified random sampling with a proportional number of samples, (SRS)_proportional; stratified random sampling with an equal number of samples, (SRS)_equivalent; and stratified systematic sampling with an equal number of samples and a minimum distance of 10 m between consecutive samples, (SSS)_D=10m. Sentinel-2 data at 10 m spatial resolution covering Dehradun district, Uttarakhand, India, were used. The results support the following conclusions. (i) (SRS)_proportional achieved the highest Overall Accuracy (OA) of the three sampling designs. With (SRS)_proportional, Random Forest obtained OA = 90.25 and kappa score (ka) = 0.874, Support Vector Machine obtained OA = 88.84 and ka = 0.856, and k Nearest Neighbours (kNN) obtained OA = 87.72 and ka = 0.842. (ii) With (SRS)_proportional, majority classes such as deciduous forest, evergreen forest and cropland achieved higher recall and precision than with the other two designs, (SRS)_equivalent and (SSS)_D=10m. (iii) Switching from (SRS)_proportional to either (SRS)_equivalent or (SSS)_D=10m slightly reduced precision and recall for the majority classes and slightly increased them for a few of the minority classes.
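The difference between the proportional and equal stratified random sampling designs named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the class labels, the class sizes and the budget of 100 samples are all assumptions made for illustration, and the systematic design with a minimum sample distance is not covered.

```python
import random
from collections import Counter, defaultdict


def allocate(class_counts, total, scheme):
    """Return how many samples to draw from each class (stratum)."""
    n_pixels = sum(class_counts.values())
    if scheme == "proportional":
        # Each class receives a share of the budget matching its share of pixels.
        wanted = {c: total * k // n_pixels for c, k in class_counts.items()}
    elif scheme == "equal":
        # Every class receives the same share, regardless of its size.
        share = total // len(class_counts)
        wanted = {c: share for c in class_counts}
    else:
        raise ValueError(f"unknown scheme: {scheme}")
    # Never request more samples than a class actually contains.
    return {c: min(n, class_counts[c]) for c, n in wanted.items()}


def stratified_random_sample(labels, total, scheme, seed=0):
    """Draw pixel indices at random within each class, without replacement."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)
    sizes = {c: len(ix) for c, ix in by_class.items()}
    quota = allocate(sizes, total, scheme)
    picked = []
    for c, ix in by_class.items():
        picked.extend(rng.sample(ix, quota[c]))
    return picked


# Hypothetical, imbalanced label map: two majority classes and one minority class.
labels = ["forest"] * 700 + ["cropland"] * 250 + ["water"] * 50

for scheme in ("proportional", "equal"):
    chosen = stratified_random_sample(labels, total=100, scheme=scheme)
    print(scheme, dict(Counter(labels[i] for i in chosen)))
```

With these hypothetical class sizes, the proportional scheme draws 70, 25 and 5 samples from the three classes, while the equal scheme draws 33 from each. The majority classes therefore lose training samples and the minority class gains them when moving from proportional to equal allocation, which is the direction of the precision and recall shift reported in conclusion (iii).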
Pages: 7889-7908
Number of pages: 20