Feasibility of machine learning-based rice yield prediction in India at the district level using climate reanalysis and remote sensing data

被引:5
作者
De Clercq, Djavan [1 ]
Mahdi, Adam [1 ]
机构
[1] Univ Oxford, Oxford, England
关键词
Rice; Yield prediction; Machine learning; Climate reanalysis; Remote sensing; CROP YIELD; SATELLITE DATA; MODEL; DIFFUSION; HEALTH;
D O I
10.1016/j.agsy.2024.104099
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
CONTEXT: Yield forecasting, the science of predicting agricultural productivity before the crop harvest occurs, helps a wide range of stakeholders make better decisions around agricultural planning. OBJECTIVE: This study aims to investigate whether machine learning-based yield prediction models can capably predict Kharif season rice yields at the district level in India several months before the rice harvest takes place. METHODOLOGY: The methodology involved training 19 machine learning models such as CatBoost, LightGBM, Orthogonal Matching Pursuit, and Extremely Randomized Trees on 20 years of climate, satellite, and rice yield data across 247 of India's rice-producing districts. In addition to model-building, a dynamic dashboard was built understand how the reliability of rice yield predictions varies across district. RESULTS AND CONCLUSIONS: The results of the proof-of-concept machine learning pipeline demonstrated that rice yields can be predicted with a reasonable degree of accuracy, with out-of-sample R2, MAE, and MAPE performance of up to 0.82, 0.29, and 0.16 respectively. This performance outperformed test set performance reported in related literature on rice yield modelling in other contexts and countries. In addition, SHAP value analysis was conducted to infer both the importance and directional impact of the climate and remote sensing variables included in the model. Important features driving rice yields included temperature, soil water volume, and leaf area index. In particular, higher temperatures in August correlate with increased rice yields, particularly when the leaf area index in August is also high. Building on the results, a proof-of-concept dashboard was developed to allow users to easily explore which districts may experience a rise or fall in yield relative to the previous year. The dashboard show that the model may perform better in some regions than in others. For instance, the absolute percentage error for predicted versus actual yields ranged from an average of 7.1 % in districts in Uttarakhand to an average of 14.7 % in Uttar Pradesh. SIGNIFICANCE: This study underscores the potential for policymakers to consider scaling and operationalizing machine learning approaches to rice yield prediction in the context of agricultural early warning systems to deliver timely crop yield forecasts on a rolling basis throughout the season, thereby equipping agricultural decision-makers with the ability to make informed choices on irrigation scheduling, fertilizer application, and harvest planning to optimize crop output and resource use.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Remote Sensing Based Yield Estimation of Rice (Oryza Sativa L.) Using Gradient Boosted Regression in India
    Arumugam, Ponraj
    Chemura, Abel
    Schauberger, Bernhard
    Gornott, Christoph
    REMOTE SENSING, 2021, 13 (12)
  • [32] Modern computational approaches for rice yield prediction: A systematic review of statistical and machine learning-based methods
    De Clercq, Djavan
    Mahdi, Adam
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2025, 231
  • [33] Improving Wheat Yield Prediction with Multi-Source Remote Sensing Data and Machine Learning in Arid Regions
    Raza, Aamir
    Shahid, Muhammad Adnan
    Zaman, Muhammad
    Miao, Yuxin
    Huang, Yanbo
    Safdar, Muhammad
    Maqbool, Sheraz
    Muhammad, Nalain E.
    REMOTE SENSING, 2025, 17 (05)
  • [34] Groundwater Withdrawal Prediction Using Integrated Multitemporal Remote Sensing Data Sets and Machine Learning
    Majumdar, S.
    Smith, R.
    Butler, J. J.
    Lakshmi, V
    WATER RESOURCES RESEARCH, 2020, 56 (11)
  • [35] Utilization of synthetic minority oversampling technique for improving potato yield prediction using remote sensing data and machine learning algorithms with small sample size of yield data
    Ebrahimy, Hamid
    Wang, Yi
    Zhang, Zhou
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 201 : 12 - 25
  • [36] Probabilistic machine learning-based phytoplankton abundance using hyperspectral remote sensing
    Kwon, Do Hyuck
    Ahn, Jung Min
    Pyo, Jong Cheol
    Lee, Jiye
    Abbas, Ather
    Park, Sanghyun
    Kim, Kyunghyun
    Lee, Hyuk
    Cho, Kyung Hwa
    GISCIENCE & REMOTE SENSING, 2025, 62 (01)
  • [37] County-level corn yield prediction using supervised machine learning
    Khan, Shahid Nawaz
    Khan, Abid Nawaz
    Tariq, Aqil
    Lu, Linlin
    Malik, Naeem Abbas
    Umair, Muhammad
    Hatamleh, Wesam Atef
    Zawaideh, Farah Hanna
    EUROPEAN JOURNAL OF REMOTE SENSING, 2023, 56 (01)
  • [38] UAV Remote Sensing for High-Throughput Phenotyping and for Yield Prediction of Miscanthus by Machine Learning Techniques
    Impollonia, Giorgio
    Croci, Michele
    Ferrarini, Andrea
    Brook, Jason
    Martani, Enrico
    Blandinieres, Henri
    Marcone, Andrea
    Awty-Carroll, Danny
    Ashman, Chris
    Kam, Jason
    Kiesel, Andreas
    Trindade, Luisa M.
    Boschetti, Mirco
    Clifton-Brown, John
    Amaducci, Stefano
    REMOTE SENSING, 2022, 14 (12)
  • [39] A Multiple Instance Dictionary Learning Approach for Corn Yield Prediction From Remote Sensing Data
    Huang, Risheng
    Chen, Shuhan
    Li, Xiaorun
    Cao, Zeyu
    IEEE SENSORS JOURNAL, 2024, 24 (24) : 41702 - 41716
  • [40] Accurate Wheat Yield Prediction Using Machine Learning and Climate-NDVI Data Fusion
    Ashfaq, Muhammad
    Khan, Imran
    Alzahrani, Abdulrahman
    Tariq, Muhammad Usman
    Khan, Humera
    Ghani, Anwar
    IEEE ACCESS, 2024, 12 : 40947 - 40961