Using machine learning to generate an open-access cropland map from satellite images time series in the Indian Himalayan region

被引:0
|
作者
Li, Danya [1 ,2 ]
Gajardo, Joaquin [1 ]
Volpi, Michele [3 ,4 ]
Defraeye, Thijs [1 ]
机构
[1] Empa, Swiss Fed Labs Mat Sci & Technol, Lab Biomimet Membranes & Text, Lerchenfeldstr 5, CH-9014 St Gallen, Switzerland
[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[3] Swiss Fed Inst Technol, Swiss Data Sci Ctr, Zurich, Switzerland
[4] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
关键词
Cropland mapping; Smallholders; Remote sensing; High-altitude region; Random forest; Feature engineering; Google earth engine; Sentinel-2; EXTENT;
D O I
10.1016/j.rsase.2023.101057
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Crop maps are crucial for agricultural monitoring and food management and can additionally support domain-specific applications, such as setting cold supply chain infrastructure in developing countries. Machine learning (ML) models, combined with freely-available satellite imagery, can be used to produce cost-effective and high spatial-resolution crop maps. However, accessing ground truth data for supervised learning is especially challenging in developing countries due to factors such as smallholding and fragmented geography, which often results in a lack of crop type maps or even reliable cropland maps. Our area of interest for this study lies in Himachal Pradesh, India, where we aim at producing an open-access binary cropland map at 10-m resolution for the Kullu, Shimla, and Mandi districts. To this end, we developed an ML pipeline that relies on Sen-tinel-2 satellite images time series. We investigated two pixel-based supervised classifiers, sup-port vector machines (SVM) and random forest (RF), which are used to classify per-pixel time series for binary cropland mapping. The ground truth data used for training, validation and testing was manually annotated from a combination of field survey reference points and visual interpretation of very high resolution (VHR) imagery. We trained and validated the models via spatial cross-validation to account for local spatial autocorrelation and improve the generalization capability of the model. We tested the model on hold out test sets of each district, achieving an aver-age accuracy for the RF (our best model) of 87%. We noticed NIR band at the early and late stage of the apple harvest season (main crop in the region) to be of critical importance for the model. Finally, we used this model to generate a cropland map for three districts of Himachal Pradesh, spanning 14,600 km2, which improves the resolution and quality of existing public maps, and made the code open-source.
引用
收藏
页数:13
相关论文
共 28 条
  • [1] Google Earth Engine, Open-Access Satellite Data, and Machine Learning in Support of Large-Area Probabilistic Wetland Mapping
    Hird, Jennifer N.
    DeLancey, Evan R.
    McDermid, Gregory J.
    Kariyeva, Jahan
    REMOTE SENSING, 2017, 9 (12)
  • [2] Coffee-Yield Estimation Using High-Resolution Time-Series Satellite Images and Machine Learning
    Martello, Mauricio
    Molin, Jose Paulo
    Wei, Marcelo Chan Fu
    Canal Filho, Ricardo
    Nicoletti, Joao Vitor Moreira
    AGRIENGINEERING, 2022, 4 (04): : 888 - 902
  • [3] Estimating population density using open-access satellite images and geographic information system: Case of Al Ain city, UAE
    Yagoub, M. M.
    Tesfaldet, Yacob T.
    AlSumaiti, Tareefa
    Al Hosani, Naeema
    Elmubarak, Marwan G.
    REMOTE SENSING APPLICATIONS-SOCIETY AND ENVIRONMENT, 2024, 33
  • [4] Agricultural cropland extent and areas of South Asia derived using Landsat satellite 30-m time-series big-data using random forest machine learning algorithms on the Google Earth Engine cloud
    Gumma, Murali Krishna
    Thenkabail, Prasad S.
    Teluguntla, Pardhasaradhi G.
    Oliphant, Adam
    Xiong, Jun
    Giri, Chandra
    Pyla, Vineetha
    Dixit, Sreenath
    Whitbread, Anthony M.
    GISCIENCE & REMOTE SENSING, 2020, 57 (03) : 302 - 322
  • [5] Estimating wheat yields in Australia using climate records, satellite image time series and machine learning methods
    Kamir, Elisa
    Waldner, Francois
    Hochman, Zvi
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 160 : 124 - 135
  • [6] Multitemporal time series analysis using machine learning models for ground deformation in the Erhai region, China
    Yahui Guo
    Shunqiang Hu
    Wenxiang Wu
    Yuyi Wang
    J. Senthilnath
    Environmental Monitoring and Assessment, 2020, 192 (7)
  • [7] Multitemporal time series analysis using machine learning models for ground deformation in the Erhai region, China
    Guo, Yahui
    Hu, Shunqiang
    Wu, Wenxiang
    Wang, Yuyi
    Senthilnath, J.
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2020, 192 (07)
  • [8] Land use land cover mapping and snow cover detection in Himalayan region using machine learning and multispectral Sentinel-2 satellite imagery
    Saini R.
    Singh S.
    International Journal of Information Technology, 2024, 16 (2) : 675 - 686
  • [9] A multimodality test outperforms three machine learning classifiers for identifying and mapping paddocks using time series satellite imagery
    O'Hara, Rob
    Zimmermann, Jesko
    Green, Stuart
    GEOCARTO INTERNATIONAL, 2022, 37 (25) : 9748 - 9766
  • [10] Long-Term Satellite Image Time-Series for Land Use/Land Cover Change Detection Using Refined Open Source Data in a Rural Region
    Viana, Claudia M.
    Girao, Ines
    Rocha, Jorge
    REMOTE SENSING, 2019, 11 (09)