Using machine learning for crop yield prediction in the past or the future

被引:27
|
作者
Morales, Alejandro [1 ]
Villalobos, Francisco J. [2 ,3 ]
机构
[1] Wageningen Univ & Res, Ctr Crop Syst Anal, Plant Sci Grp, Wageningen, Netherlands
[2] Consejo Super Invest Cient IAS CSIC, Inst Agr Sostenible, Cordoba, Spain
[3] Univ Cordoba, Dept Agron, ETSIAM, Cordoba, Spain
来源
FRONTIERS IN PLANT SCIENCE | 2023年 / 14卷
关键词
machine learning; crop simulation model; wheat; sunflower; DSSAT; neural network; ARTIFICIAL NEURAL-NETWORKS; WHEAT; MODEL;
D O I
10.3389/fpls.2023.1128388
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
The use of ML in agronomy has been increasing exponentially since the start of the century, including data-driven predictions of crop yields from farm-level information on soil, climate and management. However, little is known about the effect of data partitioning schemes on the actual performance of the models, in special when they are built for yield forecast. In this study, we explore the effect of the choice of predictive algorithm, amount of data, and data partitioning strategies on predictive performance, using synthetic datasets from biophysical crop models. We simulated sunflower and wheat data using OilcropSun and Ceres-Wheat from DSSAT for the period 2001-2020 in 5 areas of Spain. Simulations were performed in farms differing in soil depth and management. The data set of farm simulated yields was analyzed with different algorithms (regularized linear models, random forest, artificial neural networks) as a function of seasonal weather, management, and soil. The analysis was performed with Keras for neural networks and R packages for all other algorithms. Data partitioning for training and testing was performed with ordered data (i.e., older data for training, newest data for testing) in order to compare the different algorithms in their ability to predict yields in the future by extrapolating from past data. The Random Forest algorithm had a better performance (Root Mean Square Error 35-38%) than artificial neural networks (37-141%) and regularized linear models (64-65%) and was easier to execute. However, even the best models showed a limited advantage over the predictions of a sensible baseline (average yield of the farm in the training set) which showed RMSE of 42%. Errors in seasonal weather forecasting were not taken into account, so real-world performance is expected to be even closer to the baseline. Application of AI algorithms for yield prediction should always include a comparison with the best guess to evaluate if the additional cost of data required for the model compensates for the increase in predictive power. Random partitioning of data for training and validation should be avoided in models for yield forecasting. Crop models validated for the region and cultivars of interest may be used before actual data collection to establish the potential advantage as illustrated in this study.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Crop yield prediction with environmental and chemical variables using optimized ensemble predictive model in machine learning
    Krishnadoss, Nivethitha
    Ramasamy, Lokesh Kumar
    ENVIRONMENTAL RESEARCH COMMUNICATIONS, 2024, 6 (10):
  • [42] Statistical and machine learning models for location-specific crop yield prediction using weather indices
    Ajith, S.
    Debnath, Manoj Kanti
    Karthik, R.
    INTERNATIONAL JOURNAL OF BIOMETEOROLOGY, 2024, 68 (12) : 2453 - 2475
  • [43] Crop yield prediction: two-tiered machine learning model approach
    Shidnal S.
    Latte M.V.
    Kapoor A.
    International Journal of Information Technology, 2021, 13 (5) : 1983 - 1991
  • [44] Statistical and machine learning methods for crop yield prediction in the context of precision agriculture
    Burdett, Hannah
    Wellen, Christopher
    PRECISION AGRICULTURE, 2022, 23 (05) : 1553 - 1574
  • [45] Knowledge informed hybrid machine learning in agricultural yield prediction
    von Bloh, Malte
    Lobell, David
    Asseng, Senthold
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 227
  • [46] Crop Pests Prediction Method using Regression and Machine Learning Technology: Survey
    Kim, Yun Hwan
    Yoo, Seong Joon
    Gu, Yeong Hyeon
    Lim, Jin Hee
    Han, Dongil
    Baik, Sung Wook
    2013 INTERNATIONAL CONFERENCE ON FUTURE SOFTWARE ENGINEERING AND MULTIMEDIA ENGINEERING (ICFM 2013), 2014, 6 : 52 - 56
  • [47] Agricultural Analysis and Crop Yield Prediction of Habiganj using Multispectral Bands of Satellite Imagery with Machine Learning
    Shahrin, Fariha
    Zahin, Labiba
    Rahman, Ramisa
    Hossain, A. S. M. Jahir
    Kaf, Abdulla Hil
    Azad, A. K. M. Abdul Malek
    PROCEEDINGS OF 2020 11TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (ICECE), 2020, : 21 - 24
  • [48] Prediction of Teff Yield Using a Machine Learning Approach
    Mulatu, Adugna Necho
    Tamir, Eneyachew
    ARTIFICIAL INTELLIGENCE AND DIGITALIZATION FOR SUSTAINABLE DEVELOPMENT, ICAST 2022, 2023, 455 : 159 - 176
  • [49] Crop yield prediction via explainable AI and interpretable machine learning: Dangers of black box models for evaluating climate change impacts on crop yield
    Hu, Tongxi
    Zhang, Xuesong
    Bohrer, Gil
    Liu, Yanlan
    Zhou, Yuyu
    Martin, Jay
    Li, Yang
    Zhao, Kaiguang
    AGRICULTURAL AND FOREST METEOROLOGY, 2023, 336
  • [50] Crop Selection Method to Maximize Crop Yield Rate using Machine Learning Technique
    Kumar, Rakesh
    Singh, M. P.
    Kumar, Prabhat
    Singh, J. P.
    2015 INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES AND MANAGEMENT FOR COMPUTING, COMMUNICATION, CONTROLS, ENERGY AND MATERIALS (ICSTM), 2015, : 138 - 145