Feasibility of machine learning-based rice yield prediction in India at the district level using climate reanalysis and remote sensing data

被引:5
作者
De Clercq, Djavan [1 ]
Mahdi, Adam [1 ]
机构
[1] Univ Oxford, Oxford, England
关键词
Rice; Yield prediction; Machine learning; Climate reanalysis; Remote sensing; CROP YIELD; SATELLITE DATA; MODEL; DIFFUSION; HEALTH;
D O I
10.1016/j.agsy.2024.104099
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
CONTEXT: Yield forecasting, the science of predicting agricultural productivity before the crop harvest occurs, helps a wide range of stakeholders make better decisions around agricultural planning. OBJECTIVE: This study aims to investigate whether machine learning-based yield prediction models can capably predict Kharif season rice yields at the district level in India several months before the rice harvest takes place. METHODOLOGY: The methodology involved training 19 machine learning models such as CatBoost, LightGBM, Orthogonal Matching Pursuit, and Extremely Randomized Trees on 20 years of climate, satellite, and rice yield data across 247 of India's rice-producing districts. In addition to model-building, a dynamic dashboard was built understand how the reliability of rice yield predictions varies across district. RESULTS AND CONCLUSIONS: The results of the proof-of-concept machine learning pipeline demonstrated that rice yields can be predicted with a reasonable degree of accuracy, with out-of-sample R2, MAE, and MAPE performance of up to 0.82, 0.29, and 0.16 respectively. This performance outperformed test set performance reported in related literature on rice yield modelling in other contexts and countries. In addition, SHAP value analysis was conducted to infer both the importance and directional impact of the climate and remote sensing variables included in the model. Important features driving rice yields included temperature, soil water volume, and leaf area index. In particular, higher temperatures in August correlate with increased rice yields, particularly when the leaf area index in August is also high. Building on the results, a proof-of-concept dashboard was developed to allow users to easily explore which districts may experience a rise or fall in yield relative to the previous year. The dashboard show that the model may perform better in some regions than in others. For instance, the absolute percentage error for predicted versus actual yields ranged from an average of 7.1 % in districts in Uttarakhand to an average of 14.7 % in Uttar Pradesh. SIGNIFICANCE: This study underscores the potential for policymakers to consider scaling and operationalizing machine learning approaches to rice yield prediction in the context of agricultural early warning systems to deliver timely crop yield forecasts on a rolling basis throughout the season, thereby equipping agricultural decision-makers with the ability to make informed choices on irrigation scheduling, fertilizer application, and harvest planning to optimize crop output and resource use.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Applicability of machine learning techniques in predicting wheat yield based on remote sensing and climate data in Pakistan, South Asia
    Arshad, Sana
    Kazmi, Jamil Hasan
    Javed, Muhammad Gohar
    Mohammed, Safwan
    EUROPEAN JOURNAL OF AGRONOMY, 2023, 147
  • [22] Machine and deep learning-based prediction of potential geothermal areas in Hangjiahu Plain by integrating remote sensing data and GIS
    Wang, Yuhan
    Zhang, Xuan
    Qian, Junfeng
    Li, Xiang
    Liu, Yangui
    Wu, Wenyuan
    Lu, Zhe
    Xie, Bin
    ENERGY, 2025, 315
  • [23] Sweet corn yield prediction using machine learning models and field-level data
    Dhaliwal, Daljeet S.
    Williams, Martin M.
    PRECISION AGRICULTURE, 2024, 25 (01) : 51 - 64
  • [24] Integrating Remote Sensing and Weather Variables for Mango Yield Prediction Using a Machine Learning Approach
    Torgbor, Benjamin Adjah
    Rahman, Muhammad Moshiur
    Brinkhoff, James
    Sinha, Priyakant
    Robson, Andrew
    REMOTE SENSING, 2023, 15 (12)
  • [25] Developing automated machine learning approach for fast and robust crop yield prediction using a fusion of remote sensing, soil, and weather dataset
    Kheir, Ahmed M. S.
    Govind, Ajit
    Nangia, Vinay
    Devkota, Mina
    Elnashar, Abdelrazek
    Omar, Mohie El Din
    Feike, Til
    ENVIRONMENTAL RESEARCH COMMUNICATIONS, 2024, 6 (04):
  • [26] Machine Learning-Based Wetland Vulnerability Assessment in the Sindh Province Ramsar Site Using Remote Sensing Data
    Aslam, Rana Waqar
    Shu, Hong
    Naz, Iram
    Quddoos, Abdul
    Yaseen, Andaleeb
    Gulshad, Khansa
    Alarifi, Saad S.
    REMOTE SENSING, 2024, 16 (05)
  • [27] Downscaling Administrative-Level Crop Yield Statistics to 1 km Grids Using Multisource Remote Sensing Data and Ensemble Machine Learning
    Pei, Jie
    Zou, Yaopeng
    Liu, Yibo
    He, Yinan
    Tan, Shaofeng
    Wang, Tianxing
    Huang, Jianxi
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 14437 - 14453
  • [28] Machine learning-based prediction of agricultural drought using global climatic indices for the Palakkad district in India
    Das K., Saranya
    Chithra, N. R.
    THEORETICAL AND APPLIED CLIMATOLOGY, 2024, 155 (6) : 4357 - 4369
  • [29] Corn yield prediction in site-specific management zones using proximal soil sensing, remote sensing, and machine learning approach
    Bantchina, Bere Benjamin
    Qaswar, Muhammad
    Arslan, Selcuk
    Ulusoy, Yahya
    Gundogdu, Kemal Sulhi
    Tekin, Yucel
    Mouazen, Abdul Mounem
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 225
  • [30] Leveraging Remote Sensing Data for Yield Prediction with Deep Transfer Learning
    Huber, Florian
    Inderka, Alvin
    Steinhage, Volker
    SENSORS, 2024, 24 (03)