Application of Machine Learning Algorithms in Plant Breeding: Predicting Yield From Hyperspectral Reflectance in Soybean

Cited by: 119
Authors
Yoosefzadeh-Najafabadi, Mohsen [1]
Earl, Hugh J. [1]
Tulpan, Dan [2]
Sulik, John [1]
Eskandari, Milad [1]
Affiliations
[1] Univ Guelph, Dept Plant Agr, Guelph, ON, Canada
[2] Univ Guelph, Dept Anim Biosci, Guelph, ON, Canada
Keywords
artificial intelligence; data-driven model; ensemble methods; high-throughput phenotyping; random forest; recursive feature elimination; neural network; multilayer perceptron; cancer classification; regression models; gene selection; grain yield; index; stress; SVM; performance
DOI
10.3389/fpls.2020.624273
Chinese Library Classification (CLC) number
Q94 [Botany]
Discipline classification code
071001
Abstract
Recent substantial advances in high-throughput field phenotyping have provided plant breeders with affordable and efficient tools for evaluating large numbers of genotypes for important agronomic traits at early growth stages. Nevertheless, using the large datasets generated by high-throughput phenotyping tools such as hyperspectral reflectance in cultivar development programs remains challenging because it requires intensive knowledge of computational and statistical analyses. In this study, the robustness of three common machine learning (ML) algorithms, multilayer perceptron (MLP), support vector machine (SVM), and random forest (RF), was evaluated for predicting soybean (Glycine max) seed yield from hyperspectral reflectance. To this end, hyperspectral reflectance data covering the full spectrum from 395 to 1005 nm were collected at the R4 and R5 growth stages on 250 soybean genotypes grown in four environments. Recursive feature elimination (RFE) was used to reduce the dimensionality of the hyperspectral reflectance data and select the variables with the largest importance values. The results indicated that R5 is the more informative stage for measuring hyperspectral reflectance to predict seed yield. The 395 nm reflectance band was also identified as a high-ranked band for predicting soybean seed yield. Using either the full or the selected variables as inputs, the ML algorithms were evaluated individually and in combination, using the ensemble-stacking (E-S) method, to predict soybean yield. Among the individually tested algorithms, RF had the highest performance, with a yield classification accuracy of 84%. With RF selected as the metaClassifier for the E-S method, the prediction accuracy increased to 0.93 using all variables and 0.87 using the selected variables, showing the success of E-S as an ensemble technique. This study demonstrated that soybean breeders could implement the E-S algorithm, using either the full or the selected spectral reflectance, to select high-yielding soybean genotypes from among a large number of genotypes at early growth stages.
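The workflow summarized in the abstract (RFE for band selection, then MLP, SVM, and RF stacked under an RF meta-classifier) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not name the software used, so scikit-learn is an assumption here, as are the synthetic data (a stand-in for reflectance bands and yield classes), the number of retained features, and every hyperparameter.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.feature_selection import RFE
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in (assumption): 250 samples ("genotypes") by 60 features
# ("bands") with a binary label standing in for a yield class.
X, y = make_classification(
    n_samples=250, n_features=60, n_informative=10, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Recursive feature elimination: repeatedly fit a random forest and drop the
# 5 least important features until 15 remain (both numbers are illustrative).
selector = RFE(
    RandomForestClassifier(n_estimators=100, random_state=0),
    n_features_to_select=15,
    step=5,
)
selector.fit(X_train, y_train)
X_train_sel = selector.transform(X_train)
X_test_sel = selector.transform(X_test)

# Base learners: MLP and SVM are scale-sensitive, so each is wrapped in a
# pipeline with standardization; the random forest is used as-is.
base_learners = [
    ("mlp", make_pipeline(
        StandardScaler(),
        MLPClassifier(hidden_layer_sizes=(32,), max_iter=1000, random_state=0),
    )),
    ("svm", make_pipeline(StandardScaler(), SVC())),
    ("rf", RandomForestClassifier(n_estimators=100, random_state=0)),
]

# Ensemble stacking: cross-validated outputs of the base learners become the
# inputs of a random forest meta-classifier.
stack = StackingClassifier(
    estimators=base_learners,
    final_estimator=RandomForestClassifier(n_estimators=100, random_state=0),
    cv=5,
)
stack.fit(X_train_sel, y_train)
accuracy = stack.score(X_test_sel, y_test)
print(f"Stacked accuracy on held-out synthetic data: {accuracy:.2f}")
```

Fitting the selector on the training split only, and then applying it to the test split, keeps the held-out samples from influencing which features are retained. The accuracy printed here reflects the synthetic data and says nothing about the values reported in the abstract.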
Pages: 14