Enhancing crop yield prediction in Senegal using advanced machine learning techniques and synthetic data

Cited by: 1
Authors
Razavi, Mohammad Amin [1]
Nejadhashemi, A. Pouyan [2]
Majidi, Babak [3]
Razavi, Hoda S. [2]
Kpodo, Josue [2, 4]
Eeswaran, Rasu [2, 5, 6]
Ciampitti, Ignacio [6]
Prasad, P. V. Vara [7]
Affiliations
[1] Univ Tehran, Sch Elect & Comp Engn, Tehran, Iran
[2] Michigan State Univ, Dept Biosyst & Agr Engn, E Lansing, MI 48824 USA
[3] Khatam Univ, Dept Comp Engn, Tehran, Iran
[4] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI USA
[5] Univ Jaffna, Fac Agr, Dept Agron, Kilinochchi, Sri Lanka
[6] Kansas State Univ, Dept Agron, Manhattan, KS USA
[7] Kansas State Univ, Feed Future Sustainable Intensificat Innovat Lab, Manhattan, KS USA
Source
Funding
National Institute of Food and Agriculture (USA);
Keywords
Crop yield prediction; Variational auto encoder; Pattern recognition on spatiotemporal and physiographical variables; Synthetic tabular data generation; Ensemble learning; INTERPOLATION METHODS; CLIMATE-CHANGE; AGRICULTURE; MANAGEMENT; SYSTEMS
DOI
10.1016/j.aiia.2024.11.005
Chinese Library Classification
S [Agricultural Sciences];
Discipline classification code
09;
Abstract
In this study, we employ advanced data-driven techniques to investigate the complex relationships between the yields of five major crops and various geographical and spatiotemporal features in Senegal. We analyze how these features influence crop yields by utilizing remotely sensed data. Our methodology incorporates clustering algorithms and correlation matrix analysis to identify significant patterns and dependencies, offering a comprehensive understanding of the factors affecting agricultural productivity in Senegal. To optimize model performance and identify the optimal hyperparameters, we run a comprehensive grid search across four distinct machine learning regressors: Random Forest, Extreme Gradient Boosting (XGBoost), Categorical Boosting (CatBoost), and Light Gradient-Boosting Machine (LightGBM). Each regressor offers unique functionalities, enhancing our exploration of potential model configurations. The top-performing models are selected by evaluating multiple performance metrics, ensuring robust and accurate predictive capabilities. The results demonstrate that XGBoost and CatBoost perform better than the other two regressors. To address the challenges posed by limited agricultural datasets, we introduce synthetic crop data generated using a Variational Auto Encoder. By achieving high similarity scores with real-world data, our synthetic samples enhance model robustness, mitigate overfitting, and provide a viable solution for small dataset issues in agriculture. Our approach distinguishes itself by creating a single flexible model applicable to multiple crops. By integrating five crop datasets and generating high-quality synthetic data, we improve model performance, reduce overfitting, and enhance realism. Our findings provide crucial insights into productivity drivers in key cropping systems, enabling robust recommendations and strengthening the decision-making capabilities of policymakers and farmers in data-scarce regions.
(c) 2024 The Authors. Publishing services by Elsevier B.V. on behalf of KeAi Communications Co., Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
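The grid search described in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: to stay runnable on the Python standard library alone, a toy k-nearest-neighbour regressor stands in for Random Forest, XGBoost, CatBoost, and LightGBM, and the data are randomly generated rather than Senegalese yield records. All function names, the hyperparameter grid, and the choice of RMSE as the selection metric are assumptions made for illustration.

```python
import itertools
import math
import random
import statistics


def knn_predict(train_X, train_y, x, k, weighted):
    """Predict y at x from the k nearest 1-D training points (toy stand-in model)."""
    nearest = sorted(zip(train_X, train_y), key=lambda p: abs(p[0] - x))[:k]
    if not weighted:
        return statistics.fmean(yi for _, yi in nearest)
    # Inverse-distance weights; the small constant avoids division by zero.
    weights = [1.0 / (abs(xi - x) + 1e-9) for xi, _ in nearest]
    return sum(w * yi for w, (_, yi) in zip(weights, nearest)) / sum(weights)


def rmse(y_true, y_pred):
    """Root mean squared error."""
    return math.sqrt(statistics.fmean((a - b) ** 2 for a, b in zip(y_true, y_pred)))


def cross_val_rmse(X, y, k, weighted, n_folds=5):
    """Mean RMSE over n_folds interleaved folds for one hyperparameter setting."""
    scores = []
    for fold in range(n_folds):
        pairs = list(enumerate(zip(X, y)))
        train = [p for i, p in pairs if i % n_folds != fold]
        test = [p for i, p in pairs if i % n_folds == fold]
        train_X, train_y = zip(*train)
        preds = [knn_predict(train_X, train_y, xi, k, weighted) for xi, _ in test]
        scores.append(rmse([yi for _, yi in test], preds))
    return statistics.fmean(scores)


def grid_search(X, y, grid):
    """Evaluate every combination in the grid and return the lowest-RMSE setting."""
    names = list(grid)
    results = []
    for values in itertools.product(*(grid[name] for name in names)):
        params = dict(zip(names, values))
        results.append((cross_val_rmse(X, y, **params), params))
    best_score, best_params = min(results, key=lambda r: r[0])
    return best_params, best_score, results


# Toy data (an assumption, not real yield data): a noisy linear relationship.
random.seed(0)
X = [random.uniform(0, 10) for _ in range(60)]
y = [2 * xi + random.gauss(0, 1) for xi in X]

# Hypothetical hyperparameter grid for the stand-in model.
grid = {"k": [1, 3, 7], "weighted": [False, True]}
best_params, best_score, results = grid_search(X, y, grid)
print("best setting:", best_params, "cross-validated RMSE:", round(best_score, 3))
```

The same loop structure applies when each grid entry configures a tree-ensemble regressor instead, and when several metrics (for example RMSE, MAE, and R²) are recorded per setting so that model selection does not rest on a single score.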
Pages: 99 - 114
Number of pages: 16
Related papers
50 items in total
  • [31] Maximizing Crop Yield: Crop Yield Prediction using Advanced ML Algorithms
    Nitin, Narra Naga
    Srikar, R. Sai
    Dileep, P.
    Hema, Deva D.
    2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [32] Wheat Crop Field and Yield Prediction using Remote Sensing and Machine Learning
    Ayub, Maheen
    Khan, Najeed Ahmed
    Haider, Rana Zeeshan
    PROCEEDINGS OF 2ND IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (ICAI 2022), 2022, : 158 - 164
  • [33] Crop yield prediction using machine learning: An extensive and systematic literature review
    Shawon, Sarowar Morshed
    Ema, Falguny Barua
    Mahi, Asura Khanom
    Niha, Fahima Lokman
    Zubair, H. T.
    SMART AGRICULTURAL TECHNOLOGY, 2025, 10
  • [34] Intelligent Crop Recommender System for Yield Prediction Using Machine Learning Strategy
    Maheswary A.
    Nagendram S.
    Kiran K.U.
    Ahammad S.H.
    Priya P.P.
    Hossain M.A.
    Rashed A.N.Z.
    Journal of The Institution of Engineers (India): Series B, 2024, 105 (04) : 979 - 987
  • [35] Predicting crop yields in Senegal using machine learning methods
    Sarr, Alioune Badara
    Sultan, Benjamin
    INTERNATIONAL JOURNAL OF CLIMATOLOGY, 2023, 43 (04) : 1817 - 1838
  • [36] Development of multistage crop yield estimation model using machine learning and deep learning techniques
    Aravind, K. S.
    Vashisth, Ananta
    Krishnan, P.
    Kundu, Monika
    Prasad, Shiv
    Meena, M. C.
    Lama, Achal
    Das, Pankaj
    Das, Bappa
    INTERNATIONAL JOURNAL OF BIOMETEOROLOGY, 2025, 69 (02) : 499 - 515
  • [37] An Empirical Evaluation of Machine Learning Techniques for Crop Prediction
    Mariammal, G.
    Suruliandi, A.
    Raja, S. P.
    Poongothai, E.
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2023, 8 (04) : 96 - 104
  • [38] Early crop yield prediction for agricultural drought monitoring using drought indices, remote sensing, and machine learning techniques
    Pandya, Parthsarthi
    Gontia, Narendra Kumar
    JOURNAL OF WATER AND CLIMATE CHANGE, 2023, 14 (12) : 4729 - 4746
  • [39] IoT-Based Crop Yield Prediction System in Indian Sub-continent Using Machine Learning Techniques
    Nithya V.
    Josephine M.S.
    Jeyabalaraja V.
    Remote Sensing in Earth Systems Sciences, 2023, 6 (3-4) : 156 - 166
  • [40] Advanced prediction of rice yield gaps under climate uncertainty using machine learning techniques in Eastern India
    Sahoo, Satiprasad
    Singha, Chiranjit
    Govind, Ajit
    JOURNAL OF AGRICULTURE AND FOOD RESEARCH, 2024, 18