Using Machine Learning and Feature Selection for Alfalfa Yield Prediction

被引:31
|
作者
Whitmire, Christopher D. D. [1 ]
Vance, Jonathan M. M. [2 ]
Rasheed, Hend K. K.
Missaoui, Ali [3 ]
Rasheed, Khaled M. M. [1 ,2 ]
Maier, Frederick W. W. [1 ]
机构
[1] Univ Georgia, Inst Artificial Intelligence, 515 Boyd Grad Studies,200 DW Brooks Dr, Athens, GA 30602 USA
[2] Univ Georgia, Dept Comp Sci, 415 Boyd Grad Studies,200 D W Brooks Dr, Athens, GA 30602 USA
[3] Univ Georgia, Inst Plant Breeding Genet & Genom, Dept Crop & Soil Sci, 4317 Miller Plant Sci, Athens, GA 30602 USA
关键词
alfalfa; cross validation; feature selection; machine learning; regression; yield prediction;
D O I
10.3390/ai2010006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting alfalfa biomass and crop yield for livestock feed is important to the daily lives of virtually everyone, and many features of data from this domain combined with corresponding weather data can be used to train machine learning models for yield prediction. In this work, we used yield data of different alfalfa varieties from multiple years in Kentucky and Georgia, and we compared the impact of different feature selection methods on machine learning (ML) models trained to predict alfalfa yield. Linear regression, regression trees, support vector machines, neural networks, Bayesian regression, and nearest neighbors were all developed with cross validation. The features used included weather data, historical yield data, and the sown date. The feature selection methods that were compared included a correlation-based method, the ReliefF method, and a wrapper method. We found that the best method was the correlation-based method, and the feature set it found consisted of the Julian day of the harvest, the number of days between the sown and harvest dates, cumulative solar radiation since the previous harvest, and cumulative rainfall since the previous harvest. Using these features, the k-nearest neighbor and random forest methods achieved an average R value over 0.95, and average mean absolute error less than 200 lbs./acre. Our top R-2 of 0.90 beats a previous work's best R-2 of 0.87. Our primary contribution is the demonstration that ML, with feature selection, shows promise in predicting crop yields even on simple datasets with a handful of features, and that reporting accuracies in R and R-2 offers an intuitive way to compare results among various crops.
引用
收藏
页码:71 / 88
页数:18
相关论文
共 50 条
  • [31] Systematic Analysis of Machine Learning and Feature Selection Techniques for Prediction of the Kp Index
    Zhelayskaya, I. S.
    Vasile, R.
    Shprits, Y. Y.
    Stolle, C.
    Matzka, J.
    SPACE WEATHER-THE INTERNATIONAL JOURNAL OF RESEARCH AND APPLICATIONS, 2019, 17 (10): : 1461 - 1486
  • [32] Prediction of Korean Prosodic Phrase Boundary by Efficient Feature Selection in Machine Learning
    Kim, Minho
    Jung, Youngim
    Kwon, Hyuk-Chul
    ICTAI: 2009 21ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, 2009, : 323 - +
  • [33] Integrating Feature Selection with Machine Learning for Accurate Reservoir Landslide Displacement Prediction
    Ge, Qi
    Wang, Jingyong
    Liu, Cheng
    Wang, Xiaohong
    Deng, Yiyan
    Li, Jin
    WATER, 2024, 16 (15)
  • [34] Congestive heart failure prediction based on feature selection and machine learning algorithms
    Morillo-Velepucha, Diego
    Reategui, Ruth
    Valdiviezo-Diaz, Priscila
    Barba-Guaman, Luis
    2022 17TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2022,
  • [35] PREDICTION OF TYPE 2 DIABETES MELLITUS USING FEATURE SELECTION-BASED MACHINE LEARNING ALGORITHMS
    Yilmaz, Atinc
    HEALTH PROBLEMS OF CIVILIZATION, 2022, 16 (02) : 128 - 139
  • [36] Diabetes Prediction: Optimization of Machine Learning through Feature Selection and Dimensionality Reduction
    Aouragh, Abd Allah
    Bahaj, Mohamed
    Toufik, Fouad
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2024, 20 (08) : 100 - 114
  • [37] Sorghum Yield Prediction using Machine Learning
    Zannou, Judicael Geraud N.
    Houndji, Vinasetan Ratheil
    2019 3RD INTERNATIONAL CONFERENCE ON BIO-ENGINEERING FOR SMART TECHNOLOGIES (BIOSMART), 2019,
  • [38] Cooperative prediction method of gas emission from mining face based on feature selection and machine learning
    Zhou, Jie
    Lin, Haifei
    Jin, Hongwei
    Li, Shugang
    Yan, Zhenguo
    Huang, Shiyin
    INTERNATIONAL JOURNAL OF COAL SCIENCE & TECHNOLOGY, 2022, 9 (01)
  • [39] Prediction of Cardiovascular Disease by Feature Selection and Machine Learning Techniques
    Ranade, Aditya
    Pise, Nitin
    ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 2, AITA 2023, 2024, 844 : 457 - 472
  • [40] Feature selection for effective prediction of SARS-COV-2 using machine learning
    Punacha, Gagan
    Adiga, Rama
    GENES & GENOMICS, 2024, 46 (01) : 95 - 112