Crop Classification and Yield Prediction Using Robust Machine Learning Models for Agricultural Sustainability

被引:0
|
作者
Badshah, Abid [1 ]
Alkazemi, Basem Yousef [2 ]
Din, Fakhrud [1 ]
Zamli, Kamal Z. [3 ,4 ]
Haris, Muhammad [4 ]
机构
[1] Univ Malakand, Dept Comp Sci & IT, Fac Informat Technol IT, Chakdara 18800, Khyber Pakhtunk, Pakistan
[2] Umm Al Qura Univ, Coll Comp, Dept Software Engn, Mecca 24382, Saudi Arabia
[3] Univ Malaysia Pahang Al Sultan Abdullah UMPSA, Fac Comp, Kuantan 26600, Pahang, Malaysia
[4] Univ Airlangga, Fac Sci & Technol, C Campus JI Dr H Soekamo, Surabaya 60115, Indonesia
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Crops; Machine learning; Biological system modeling; Predictive models; Data models; Soil; Mathematical models; Production; Agriculture; Accuracy; Agricultural planning; crop recommendation; crop yield forecasting; explainable AI; K-fold cross-validation; machine learning;
D O I
10.1109/ACCESS.2024.3486653
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Agriculture is pivotal for the economy of a country as it is a major source of food, employment and raw materials. However, challenges such as diseases, soil degradation, and water scarcity persist. Technology adoption can address these issues, improving production and quality. Machine learning, a subset of Artificial Intelligence (AI), enables prediction, classification, and automation in agriculture. It optimizes irrigation, fertilization, and crop selection, aiding decision-making for food security and crop management. This study proposes two robust machine learning architectures for classification and regression based on distinct datasets. Firstly, we delve into a crop recommendation dataset obtained from Kaggle, consisting of various input attributes such as the pH of the soil, temperature, humidity, and nutrient levels. Leveraging machine learning classification techniques such as Extra Tree Classifier (ETC), Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), K-Nearest Neighbour (KNN), Gaussian Naive Bayes (GNB), and Support Vector Machine (SVM), we suggest twenty-two different crops founded on these inputs. Through the use of K-fold cross-validation, Explainable AI (XAI) and feature engineering, we identify the best-performing model, with Random Forest coming out on top scoring an accuracy of 99.7% with precision, recall, F1 score, and confusion matrix. Secondly, we investigate wheat yield prediction data snagged from the World Bank and Food and Agriculture Organization (FAO), covering the years 1992-2013 for Pakistan. Using Multivariate Imputation by Chained Equations (MICE) to tackle data restrictions, we gauge wheat production for 2014-2024 and forecast the 2025 yield using machine learning regression models. Once again, using hyper parameter tuning with K-fold cross-validation, Support Vector Regressor (SVR) stands out as the top-performing model, achieving an accuracy of 99.9% with R-2 Score. Transparency and confidence in agricultural decision-making are increased when machine learning decisions are made comprehensible using Explainable AI (XAI) approaches. Two widely used XAI approaches, namely Feature Importance and Local Interpretable Model-Agnostic Explanations (LIME) are used to interpret and explain outcomes of the proposed models. The study can increase agricultural productivity, minimize risks, enhance food security, and promote more environmentally friendly farming approaches.
引用
收藏
页码:162799 / 162813
页数:15
相关论文
共 50 条
  • [31] Crop Yield Prediction Using Deep Learning
    Jeny, J. R. V.
    Divya, Phulari
    Varsha, Kolanu
    Mrunalini, Anantha
    Irfan, S. K. M.
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, MACHINE LEARNING AND APPLICATIONS, VOL 1, ICDSMLA 2023, 2025, 1273 : 1192 - 1199
  • [32] Effect of Climate on Photovoltaic Yield Prediction Using Machine Learning Models
    Alcaniz, Alba
    Lindfors, Anders, V
    Zeman, Miro
    Ziar, Hesan
    Isabella, Olindo
    GLOBAL CHALLENGES, 2023, 7 (01)
  • [33] Crop Yield Prediction Based on Bacterial Biomarkers and Machine Learning
    Ma, Li
    Niu, Wenquan
    Li, Guochun
    Du, Yadan
    Sun, Jun
    Siddique, Kadambot H. M.
    JOURNAL OF SOIL SCIENCE AND PLANT NUTRITION, 2024, 24 (02) : 2798 - 2814
  • [34] A Comprehensive Review of Crop Yield Prediction Using Machine Learning Approaches With Special Emphasis on Palm Oil Yield Prediction
    Rashid, Mamunur
    Bari, Bifta Sama
    Yusup, Yusri
    Kamaruddin, Mohamad Anuar
    Khan, Nuzhat
    IEEE ACCESS, 2021, 9 : 63406 - 63439
  • [35] Validation of scenario generation for decision-making using machine learning prediction models A case study for crop yield
    Emirhuseyinoglu, Gorkem
    Shahhosseini, Mohsen
    Hu, Guiping
    Ryan, Sarah M.
    OPTIMIZATION LETTERS, 2024, 18 (09) : 2021 - 2032
  • [36] Stacked ensemble model for accurate crop yield prediction using machine learning techniques
    Ramesh, V
    Kumaresan, P.
    ENVIRONMENTAL RESEARCH COMMUNICATIONS, 2025, 7 (03):
  • [37] Prediction of crop yield using satellite vegetation indices combined with machine learning approaches
    Jhajharia, Kavita
    Mathur, Pratistha
    ADVANCES IN SPACE RESEARCH, 2023, 72 (09) : 3998 - 4007
  • [38] Analysis of Classification Models Based on Cuisine Prediction Using Machine Learning
    Jayaraman, Shobhna
    Choudhury, Tanupriya
    Kumar, Praveen
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES FOR SMART NATION (SMARTTECHCON), 2017, : 1485 - 1490
  • [39] Landslide Classification and Prediction of Debris Flow Using Machine Learning Models
    Shameem Ansar, A.
    Sudha, S.
    Vinayagamoorthi, Savita
    Menachery, Michelle Marianne
    Francis, Suresh
    IETE JOURNAL OF RESEARCH, 2024, 70 (04) : 3763 - 3779
  • [40] A Comparative Study for Depression Prediction Using Machine Learning Classification Models
    Pramanik, Rwittika
    Khare, Sandali
    Harshvardhan, G. M.
    Gourisaria, Mahendra Kumar
    ADVANCES IN DATA AND INFORMATION SCIENCES, 2022, 318 : 233 - 246