Evaluating the Effect of Training Data Size and Composition on the Accuracy of Smallholder Irrigated Agriculture Mapping in Mozambique Using Remote Sensing and Machine Learning Algorithms

Cited by: 8
Authors
Weitkamp, Timon [1, 2]
Karimi, Poolad [3 ]
Affiliations
[1] Resilience BV, NL-6703 AA Wageningen, Netherlands
[2] Wageningen Univ & Res, Water Resource Management WRM Dept, NL-6708 PB Wageningen, Netherlands
[3] IHE Delft, NL-2611 AX Delft, Netherlands
Keywords
irrigated agriculture; training data; sub-Saharan Africa; machine-learning algorithms; class imbalance; RANDOM FOREST; IMAGE CLASSIFICATION; AREA;
DOI
10.3390/rs15123017
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Mapping smallholder irrigated agriculture in sub-Saharan Africa using remote sensing techniques is challenging due to its small and scattered areas and heterogeneous cropping practices. A study was conducted to examine the impact of sample size and composition on the accuracy of classifying irrigated agriculture in Mozambique's Manica and Gaza provinces using three algorithms: random forest (RF), support vector machine (SVM), and artificial neural network (ANN). Four scenarios were considered, and the results showed that smaller datasets can achieve high and sufficient accuracies, regardless of their composition. However, the user and producer accuracies of irrigated agriculture do increase when the algorithms are trained with larger datasets. The study also found that the composition of the training data is important, with too few or too many samples of the "irrigated agriculture" class decreasing overall accuracy. The algorithms' robustness depends on the training data's composition, with RF and SVM showing a smaller decrease and spread in accuracies than ANN. The study concludes that the training data size and composition are more important for classification than the algorithms used. RF and SVM are more suitable for the task as they are more robust, or less sensitive to outliers, than the ANN. Overall, the study provides valuable insights into mapping smallholder irrigated agriculture in sub-Saharan Africa using remote sensing techniques.
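The abstract reports overall accuracy together with user and producer accuracies for the "irrigated agriculture" class. As a minimal sketch of how these three metrics are conventionally derived from a confusion matrix, the example below uses a two-class matrix whose counts are hypothetical and not taken from the study; the function name `accuracy_metrics` and the class labels are likewise assumptions made for illustration.

```python
def accuracy_metrics(matrix, labels):
    """Compute overall, producer's and user's accuracy.

    matrix[i][j] is the number of validation samples whose reference
    class is labels[i] and whose mapped (predicted) class is labels[j].
    """
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[k][k] for k in range(len(labels)))
    metrics = {"overall": correct / total}
    for k, label in enumerate(labels):
        reference_total = sum(matrix[k])              # row sum: all reference samples of the class
        mapped_total = sum(row[k] for row in matrix)  # column sum: all samples mapped as the class
        metrics[label] = {
            # Producer's accuracy: share of reference samples mapped correctly (recall).
            "producer": matrix[k][k] / reference_total if reference_total else float("nan"),
            # User's accuracy: share of mapped samples that are correct (precision).
            "user": matrix[k][k] / mapped_total if mapped_total else float("nan"),
        }
    return metrics


# Hypothetical counts for illustration only (not results from the paper).
labels = ["irrigated", "other"]
matrix = [
    [40, 10],   # reference: irrigated
    [20, 130],  # reference: other
]
result = accuracy_metrics(matrix, labels)
print(result)
```

With these made-up counts the overall accuracy is 0.85 while the user's accuracy of the irrigated class is only about 0.67, which illustrates why a high overall accuracy can coexist with weaker per-class accuracies for a minority class.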
Pages: 23
Related papers
50 in total
  • [41] Machine Learning (ML)-Based Copper Mineralization Prospectivity Mapping (MPM) Using Mining Geochemistry Method and Remote Sensing Satellite Data
    Abedini, Mahnaz
    Ziaii, Mansour
    Timkin, Timofey
    Pour, Amin Beiranvand
    REMOTE SENSING, 2023, 15 (15)
  • [42] Integrating Active and Passive Remote Sensing Data for Mapping Soil Salinity Using Machine Learning and Feature Selection Approaches in Arid Regions
    Mohamed, Sayed A.
    Metwaly, Mohamed M.
    Metwalli, Mohamed R.
    AbdelRahman, Mohamed A. E.
    Badreldin, Nasem
    REMOTE SENSING, 2023, 15 (07)
  • [43] Banana Mapping in Heterogenous Smallholder Farming Systems Using High-Resolution Remote Sensing Imagery and Machine Learning Models with Implications for Banana Bunchy Top Disease Surveillance
    Alabi, Tunrayo R.
    Adewopo, Julius
    Duke, Ojo Patrick
    Kumar, P. Lava
    REMOTE SENSING, 2022, 14 (20)
  • [44] Machine Learning Algorithms for Automatic Lithological Mapping Using Remote Sensing Data: A Case Study from Souk Arbaa Sahel, Sidi Ifni Inlier, Western Anti-Atlas, Morocco
    Bachri, Imane
    Hakdaoui, Mustapha
    Raji, Mohammed
    Teodoro, Ana Claudia
    Benbouziane, Abdelmajid
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2019, 8 (06)
  • [45] Smaller is better? Unduly nice accuracy assessments in roof detection using remote sensing data with machine learning and k-fold cross-validation
    Abriha, David
    Srivastava, Prashant K.
    Szabo, Szilard
    HELIYON, 2023, 9 (03)
  • [46] Approach for generating high accuracy machine learning model for high resolution geochemical map completion using remote sensing data - Case study of Arizona, USA
    Huang, Chenhui
    Shibuya, Akinobu
    EARTH RESOURCES AND ENVIRONMENTAL REMOTE SENSING/GIS APPLICATIONS X, 2019, 11156
  • [47] Fuzzy Similarity Analysis of Effective Training Samples to Improve Machine Learning Estimations of Water Quality Parameters Using Sentinel-2 Remote Sensing Data
    Dehkordi, Alireza Taheri
    Zoej, Mohammad Javad Valadan
    Mehran, Ali
    Jafari, Mohsen
    Chegoonian, Amir Masoud
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 5121 - 5136
  • [48] Estimating Above-Ground Biomass of the Regional Forest Landscape of Northern Western Ghats Using Machine Learning Algorithms and Multi-sensor Remote Sensing Data
    Faseela V. Sainuddin
    Guljar Malek
    Ankur Rajwadi
    Padamnabhi S. Nagar
    Smitha V. Asok
    C. Sudhakar Reddy
    Journal of the Indian Society of Remote Sensing, 2024, 52 : 885 - 902
  • [49] Estimating Above-Ground Biomass of the Regional Forest Landscape of Northern Western Ghats Using Machine Learning Algorithms and Multi-sensor Remote Sensing Data
    Sainuddin, Faseela V.
    Malek, Guljar
    Rajwadi, Ankur
    Nagar, Padamnabhi S.
    Asok, Smitha V.
    Reddy, C. Sudhakar
    JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2024, 52 (04) : 885 - 902