Optimizing machine learning for agricultural productivity: A novel approach with RScv and remote sensing data over Europe

被引:11
作者
Asadollah, Seyed Babak Haji Seyed [1 ,2 ]
Jodar-Abellan, Antonio [3 ]
Pardo, Miguel Angel [1 ]
机构
[1] Univ Alicante, Dept Civil Engn, Alicante 03690, Spain
[2] SUNY, Coll Environm Sci & Forestry, Dept Environm Resources Engn, 1 Forestry Dr, Syracuse, NY 13210 USA
[3] Ctr Appl Soil Sci & Biol Segura, Spanish Natl Res Council CEBAS CSIC, Soil & Water Conservat Res Grp, POB 164, Murcia 30100, Spain
关键词
Crop yield; Remote sensing; Machine learning; Randomized search; Agricultural prediction; CROP YIELD; LASSO ANALYSIS; RANDOM FOREST; REGRESSION; STEPWISE; SUBSET; MODELS;
D O I
10.1016/j.agsy.2024.103955
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
CONTEXT: Accurate estimating of crop yield is crucial for developing effective global food security strategies which can lead to reduce of hunger and more sustainable development. However, predicting crop yields is a complex task as it requires frequent monitoring of many weather and socio-economic factors over an extended period. Satellite remote sensing products have become a reliable source for climate-based variables. They are easier to obtain and provide detailed spatial and temporal coverage. OBJECTIVE: The aim of this study is to assess the effectiveness of implement a novel optimization algorithm, called Randomized Search cross validation (RScv), on various machine learning algorithms and measure the prediction accuracy enhancement. METHODS: Annual yields of four crops (Barley, Oats, Rye, and Wheat) were predicted across 20 European countries for 20 years (2000-2019). Two NASA missions, namely GPCP and GLDAS satellites, provided us with climate- and soil-based input variables. Those variables were employed as the input of four ensemble Machine Learning (ML) algorithms (Ada-Boost (AB), Gradient Boost (GB), Random Forest (RF) and Extra Tree (ET)) which are faster and more adoptable compare to classic AI algorithms. RESULTS AND CONCLUSIONS: Main results show that applying RScv improves the prediction ability of all ML models over the four crops. In particular, the RScv-AB reaches the overall highest accuracy for predicting yields (R2 max = 0.9). Spatial evaluation of predicting errors depicts that the proposed models were more shifted toward underestimation. An uncertainty analysis was also carried out which shows that applying ML algorithms creates higher and lowers uncertainty in Barley and Wheat respectively. SIGNIFICANCE: Considering the robustness of the optimised ML models and the global coverage of remote sensing data, our current methodology demonstrates great transferability and can be applied in other regions across the globe with higher temporal extents. In addition, this tool could be beneficial to decision makers in various sectors to improve the water allocations, deal with climate change effects and keep sustainable agricultural development.
引用
收藏
页数:15
相关论文
共 89 条
  • [41] On the connection between large-scale atmospheric circulation and winter GPCP precipitation over the Mediterranean region for the period 1980-2017
    Kotsias, G.
    Lolis, C. J.
    Hatzianastassiou, N.
    Levizzani, V
    Bartzokas, A.
    [J]. ATMOSPHERIC RESEARCH, 2020, 233
  • [42] Artificial intelligence for classification and regression tree based feature selection method for network intrusion detection system in various telecommunication technologies
    Kumar, Neeraj
    Kumar, Upendra
    [J]. COMPUTATIONAL INTELLIGENCE, 2024, 40 (01)
  • [43] Lencastre P., 2023, Phys. D: Nonlinear Phenomena, V453
  • [44] Combining remote sensing-derived management zones and an auto-calibrated crop simulation model to determine optimal nitrogen fertilizer rates
    Leo, Stephen
    Migliorati, Massimiliano De Antoni
    Nguyen, Trung H.
    Grace, Peter R.
    [J]. AGRICULTURAL SYSTEMS, 2023, 205
  • [45] Crop yield forecasting and associated optimum lead time analysis based on multi-source environmental data across China
    Li, Linchao
    Wang, Bin
    Feng, Puyu
    Wang, Huanhuan
    He, Qinsi
    Wang, Yakai
    Liu, De Li
    Li, Yi
    He, Jianqiang
    Feng, Hao
    Yang, Guijun
    Yu, Qiang
    [J]. AGRICULTURAL AND FOREST METEOROLOGY, 2021, 308
  • [46] Estimation of Battery State of Health Using Probabilistic Neural Network
    Lin, Ho-Ta
    Liang, Tsorng-Juu
    Chen, Shih-Ming
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2013, 9 (02) : 679 - 685
  • [47] Artificial intelligence: a survey on evolution, models, applications and future trends
    Lu, Yang
    [J]. JOURNAL OF MANAGEMENT ANALYTICS, 2019, 6 (01) : 1 - 29
  • [48] Crop yield estimation based on assimilation of crop models and remote sensing data: A systematic evaluation
    Luo, Li
    Sun, Shikun
    Xue, Jing
    Gao, Zihan
    Zhao, Jinfeng
    Yin, Yali
    Gao, Fei
    Luan, Xiaobo
    [J]. AGRICULTURAL SYSTEMS, 2023, 210
  • [49] Exploring applicability of artificial intelligence and multivariate linear regression model for prediction of trihalomethanes in drinking water
    Mahato, J. K.
    Gupta, S. K.
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL SCIENCE AND TECHNOLOGY, 2022, 19 (06) : 5275 - 5288
  • [50] Discussion of "Best Subset, Forward Stepwise or Lasso? Analysis and Recommendations Based on Extensive Comparisons"
    Mazumder, Rahul
    [J]. STATISTICAL SCIENCE, 2020, 35 (04) : 602 - 608