Application of Machine Learning Algorithms in Plant Breeding: Predicting Yield From Hyperspectral Reflectance in Soybean

被引:119
|
作者
Yoosefzadeh-Najafabadi, Mohsen [1 ]
Earl, Hugh J. [1 ]
Tulpan, Dan [2 ]
Sulik, John [1 ]
Eskandari, Milad [1 ]
机构
[1] Univ Guelph, Dept Plant Agr, Guelph, ON, Canada
[2] Univ Guelph, Dept Anim Biosci, Guelph, ON, Canada
来源
关键词
artificial intelligence; data-driven model; ensemble methods; high-throughput phenotyping; random forest; recursive feature elimination; NEURAL-NETWORK; MULTILAYER PERCEPTRON; CANCER CLASSIFICATION; REGRESSION-MODELS; GENE SELECTION; GRAIN-YIELD; INDEX; STRESS; SVM; PERFORMANCE;
D O I
10.3389/fpls.2020.624273
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Recent substantial advances in high-throughput field phenotyping have provided plant breeders with affordable and efficient tools for evaluating a large number of genotypes for important agronomic traits at early growth stages. Nevertheless, the implementation of large datasets generated by high-throughput phenotyping tools such as hyperspectral reflectance in cultivar development programs is still challenging due to the essential need for intensive knowledge in computational and statistical analyses. In this study, the robustness of three common machine learning (ML) algorithms, multilayer perceptron (MLP), support vector machine (SVM), and random forest (RF), were evaluated for predicting soybean (Glycine max) seed yield using hyperspectral reflectance. For this aim, the hyperspectral reflectance data for the whole spectra ranged from 395 to 1005 nm, which were collected at the R4 and R5 growth stages on 250 soybean genotypes grown in four environments. The recursive feature elimination (RFE) approach was performed to reduce the dimensionality of the hyperspectral reflectance data and select variables with the largest importance values. The results indicated that R5 is more informative stage for measuring hyperspectral reflectance to predict seed yields. The 395 nm reflectance band was also identified as the high ranked band in predicting the soybean seed yield. By considering either full or selected variables as the input variables, the ML algorithms were evaluated individually and combined-version using the ensemble-stacking (E-S) method to predict the soybean yield. The RF algorithm had the highest performance with a value of 84% yield classification accuracy among all the individual tested algorithms. Therefore, by selecting RF as the metaClassifier for E-S method, the prediction accuracy increased to 0.93, using all variables, and 0.87, using selected variables showing the success of using E-S as one of the ensemble techniques. This study demonstrated that soybean breeders could implement E-S algorithm using either the full or selected spectra reflectance to select the high-yielding soybean genotypes, among a large number of genotypes, at early growth stages.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Remote Prediction of Soybean Yield Using UAV-Based Hyperspectral Imaging and Machine Learning Models
    Berveglieri, Adilson
    Imai, Nilton Nobuhiro
    Watanabe, Fernanda Sayuri Yoshino
    Tommaselli, Antonio Maria Garcia
    Ederli, Gloria Maria Padovani
    de Araujo, Fabio Fernandes
    Lupatini, Gelci Carlos
    Honkavaara, Eija
    AGRIENGINEERING, 2024, 6 (03): : 3242 - 3260
  • [32] Soybean yield prediction by machine learning and climate
    Torsoni, Guilherme Botega
    Aparecido, Lucas Eduardo de Oliveira
    dos Santos, Gabriela Marins
    Chiquitto, Alisson Gaspar
    Moraes, Jose Reinaldo da Silva Cabral
    Rolim, Glauco de Souza
    THEORETICAL AND APPLIED CLIMATOLOGY, 2023, 151 (3-4) : 1709 - 1725
  • [33] Machine learning for soybean yield forecasting in Brazil
    von Bloh, Malte
    Noia, Rogerio de S.
    Wangerpohl, Xaver
    Saltik, Ahmet Oguz
    Haller, Vivian
    Kaiser, Leoni
    Asseng, Senthold
    AGRICULTURAL AND FOREST METEOROLOGY, 2023, 341
  • [34] Yield prediction in a peanut breeding program using remote sensing data and machine learning algorithms
    Pugh, N. Ace
    Young, Andrew
    Ojha, Manisha
    Emendack, Yves
    Sanchez, Jacobo
    Xin, Zhanguo
    Puppala, Naveen
    FRONTIERS IN PLANT SCIENCE, 2024, 15
  • [35] Non-Destructive Detection of Tea Leaf Chlorophyll Content Using Hyperspectral Reflectance and Machine Learning Algorithms
    Sonobe, Rei
    Hirono, Yuhei
    Oi, Ayako
    PLANTS-BASEL, 2020, 9 (03):
  • [36] Using Machine Learning Algorithms With In Situ Hyperspectral Reflectance Data to Assess Comprehensive Water Quality of Urban Rivers
    Cai, Jiannan
    Chen, Jun
    Dou, Xianhui
    Xing, Qianguo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [37] Utilizing Hyperspectral Reflectance and Machine Learning Algorithms for Non-Destructive Estimation of Chlorophyll Content in Citrus Leaves
    Li, Dasui
    Hu, Qingqing
    Ruan, Siqi
    Liu, Jun
    Zhang, Jinzhi
    Hu, Chungen
    Liu, Yongzhong
    Dian, Yuanyong
    Zhou, Jingjing
    REMOTE SENSING, 2023, 15 (20)
  • [38] Prediction of Soybean Yield in Jilin Based on Diverse Machine Learning Algorithms and Meteorological Disaster Indices
    Song, Lifang
    Song, Ruixiang
    Wang, Chengcheng
    Zhang, Xinjing
    Li, Yunfeng
    Li, Lei
    Li, Mingyan
    Zhang, Meng
    INTERNATIONAL JOURNAL OF MULTIPHYSICS, 2024, 18 (02) : 278 - 287
  • [39] Soybean yield prediction using machine learning algorithms under a cover crop management system
    Santos, Leticia Bernabe
    Gentry, Donna
    Tryforos, Alex
    Fultz, Lisa
    Beasley, Jeffrey
    Gentimis, Thanos
    SMART AGRICULTURAL TECHNOLOGY, 2024, 8
  • [40] Application of machine learning and genetic algorithms to the prediction and optimization of biodiesel yield from waste cooking oil
    Ahmad, Aqueel
    Yadav, Ashok Kumar
    Singh, Achhaibar
    KOREAN JOURNAL OF CHEMICAL ENGINEERING, 2023, 40 (12) : 2941 - 2956