From data to harvest: Leveraging ensemble machine learning for enhanced crop yield predictions across Canada amidst climate change

Cited by: 7
Authors
Gharakhanlou, Navid Mahdizadeh [1]
Perez, Liliana [1]
Affiliations
[1] Univ Montreal, Lab Geosimulat Environm LEDGE, Dept Geog, 1375 Ave Therese Lavoie Roux, Montreal, PQ H2V 0B3, Canada
Keywords
Climate change scenarios; Bagging and boosting approaches; Honeybee impact; Canada's agricultural landscape; Pollinator-dependent crops; Geographical information systems (GIS); PRECISION AGRICULTURE; NEURAL-NETWORKS; MODEL; CORN; CLASSIFICATION; REDUCTION; IMPACT; SOIL;
DOI
10.1016/j.scitotenv.2024.175764
Chinese Library Classification number
X [Environmental science, safety science];
Discipline classification code
08 ; 0830 ;
Abstract
Accurate crop yield predictions are crucial for farmers and policymakers. Despite the widespread use of ensemble machine learning (ML) models in computer science, their application in crop yield prediction remains relatively underexplored. This study, conducted in Canada, assesses the potential of five ensemble ML models for predicting crop yields: Adaptive Boosting (AdaBoost), Gradient Boosting Machine (GBM), XGBoost, LightGBM, and Random Forest (RF). The models were chosen for their ability to manage complex datasets and their strong performance potential. The study integrated various factors, including climate variables, satellite-derived vegetation indices, soil characteristics, and honeybee census data. Data preparation comprised two main steps. First, climate variables were interpolated and averaged over croplands in ArcGIS Pro, vegetation indices and soil characteristics were averaged in the same way, and honeybee census data were incorporated. Second, the data were organized in Python into a structured format for model input. Model accuracy was assessed using Root Mean Squared Error (RMSE), R-squared, and Mean Absolute Error (MAE). XGBoost emerged as the most accurate model, with the lowest MAE (68.70 for canola and 39.47 for soybeans), the lowest RMSE (119.48 for canola and 102.39 for soybeans), and the highest R-squared values (0.95 for canola and 0.96 for soybeans) on the test dataset. The study also assessed crop yields under various climate change scenarios, finding minimal variation across the scenarios but significant negative impacts on canola and soybean yields across Canada. Honeybee colonies were identified as the most influential factor on crop yields, contributing 52.34 % to canola and 57.18 % to soybean yields. This research provides detailed crop yield maps of canola and soybeans at the Census Consolidated Subdivisions (CCS) level across Canada's agricultural landscape, offering valuable forecasts for localized decision-making. Additionally, it offers a proactive strategy for climate change preparedness, helping farmers and stakeholders optimise resource allocation and manage risks effectively.
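The three accuracy measures named in the abstract (RMSE, MAE, and R-squared) can be illustrated with a minimal Python sketch. This is not the authors' code: the function name `regression_metrics` and the yield values are hypothetical, chosen only to show how the measures are computed from paired observed and predicted values.

```python
import math


def regression_metrics(observed, predicted):
    """Return (rmse, mae, r2) for paired observed and predicted values."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(observed)
    errors = [o - p for o, p in zip(observed, predicted)]
    ss_res = sum(e * e for e in errors)            # residual sum of squares
    rmse = math.sqrt(ss_res / n)                   # root mean squared error
    mae = sum(abs(e) for e in errors) / n          # mean absolute error
    mean_obs = sum(observed) / n
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    if ss_tot == 0:
        raise ValueError("R-squared is undefined when observed values are constant")
    r2 = 1.0 - ss_res / ss_tot                     # coefficient of determination
    return rmse, mae, r2


# Hypothetical yields for illustration only; not data from the study.
observed = [2000.0, 2200.0, 1800.0, 2500.0]
predicted = [2100.0, 2150.0, 1750.0, 2400.0]
rmse, mae, r2 = regression_metrics(observed, predicted)
print(f"RMSE={rmse:.2f}  MAE={mae:.2f}  R2={r2:.3f}")
```

Lower RMSE and MAE and a higher R-squared indicate a closer fit, which is the sense in which the abstract ranks XGBoost as most accurate. RMSE penalizes large errors more heavily than MAE, so reporting both gives a view of typical error as well as sensitivity to outliers.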
Pages: 15