Application of Machine Learning Algorithms in Plant Breeding: Predicting Yield From Hyperspectral Reflectance in Soybean

被引:119
|
作者
Yoosefzadeh-Najafabadi, Mohsen [1 ]
Earl, Hugh J. [1 ]
Tulpan, Dan [2 ]
Sulik, John [1 ]
Eskandari, Milad [1 ]
机构
[1] Univ Guelph, Dept Plant Agr, Guelph, ON, Canada
[2] Univ Guelph, Dept Anim Biosci, Guelph, ON, Canada
来源
关键词
artificial intelligence; data-driven model; ensemble methods; high-throughput phenotyping; random forest; recursive feature elimination; NEURAL-NETWORK; MULTILAYER PERCEPTRON; CANCER CLASSIFICATION; REGRESSION-MODELS; GENE SELECTION; GRAIN-YIELD; INDEX; STRESS; SVM; PERFORMANCE;
D O I
10.3389/fpls.2020.624273
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Recent substantial advances in high-throughput field phenotyping have provided plant breeders with affordable and efficient tools for evaluating a large number of genotypes for important agronomic traits at early growth stages. Nevertheless, the implementation of large datasets generated by high-throughput phenotyping tools such as hyperspectral reflectance in cultivar development programs is still challenging due to the essential need for intensive knowledge in computational and statistical analyses. In this study, the robustness of three common machine learning (ML) algorithms, multilayer perceptron (MLP), support vector machine (SVM), and random forest (RF), were evaluated for predicting soybean (Glycine max) seed yield using hyperspectral reflectance. For this aim, the hyperspectral reflectance data for the whole spectra ranged from 395 to 1005 nm, which were collected at the R4 and R5 growth stages on 250 soybean genotypes grown in four environments. The recursive feature elimination (RFE) approach was performed to reduce the dimensionality of the hyperspectral reflectance data and select variables with the largest importance values. The results indicated that R5 is more informative stage for measuring hyperspectral reflectance to predict seed yields. The 395 nm reflectance band was also identified as the high ranked band in predicting the soybean seed yield. By considering either full or selected variables as the input variables, the ML algorithms were evaluated individually and combined-version using the ensemble-stacking (E-S) method to predict the soybean yield. The RF algorithm had the highest performance with a value of 84% yield classification accuracy among all the individual tested algorithms. Therefore, by selecting RF as the metaClassifier for E-S method, the prediction accuracy increased to 0.93, using all variables, and 0.87, using selected variables showing the success of using E-S as one of the ensemble techniques. This study demonstrated that soybean breeders could implement E-S algorithm using either the full or selected spectra reflectance to select the high-yielding soybean genotypes, among a large number of genotypes, at early growth stages.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Dissection of hyperspectral reflectance to estimate nitrogen and chlorophyll contents in tea leaves based on machine learning algorithms
    Hiroto Yamashita
    Rei Sonobe
    Yuhei Hirono
    Akio Morita
    Takashi Ikka
    Scientific Reports, 10
  • [22] Hyperspectral Leaf Reflectance as Proxy for Photosynthetic Capacities: An Ensemble Approach Based on Multiple Machine Learning Algorithms
    Fu, Peng
    Meacham-Hensold, Katherine
    Guan, Kaiyu
    Bernacchi, Carl J.
    FRONTIERS IN PLANT SCIENCE, 2019, 10
  • [23] Dissection of hyperspectral reflectance to estimate nitrogen and chlorophyll contents in tea leaves based on machine learning algorithms
    Yamashita, Hiroto
    Sonobe, Rei
    Hirono, Yuhei
    Morita, Akio
    Ikka, Takashi
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [24] Machine Learning for Plant Breeding and Biotechnology
    Niazian, Mohsen
    Niedbala, Gniewko
    AGRICULTURE-BASEL, 2020, 10 (10): : 1 - 23
  • [25] Application of machine learning algorithms in predicting the photocatalytic degradation of perfluorooctanoic acid
    Navidpour, Amir H.
    Hosseinzadeh, Ahmad
    Huang, Zhenguo
    Li, Donghao
    Zhou, John L.
    CATALYSIS REVIEWS-SCIENCE AND ENGINEERING, 2024, 66 (02): : 687 - 712
  • [26] Predicting the Unpredictable: An Application of Machine Learning Algorithms in Indian Stock Market
    Saini A.
    Sharma A.
    Annals of Data Science, 2022, 9 (04) : 791 - 799
  • [27] Application of Machine Learning Algorithms in Predicting Extreme Rainfall Events in Rwanda
    Kagabo, James
    Kattel, Giri Raj
    Kazora, Jonah
    Shangwe, Charmant Nicolas
    Habiyakare, Fabien
    ATMOSPHERE, 2024, 15 (06)
  • [28] Application of two machine learning algorithms for predicting bridge vertical deflection
    Ha, Hoang
    Nguyen, Dam Duc
    Manh, Le Van
    Hiep, Le Van
    Pham, Binh Thai
    PROCEEDINGS OF THE INSTITUTION OF CIVIL ENGINEERS-ENGINEERING AND COMPUTATIONAL MECHANICS, 2024, 177 (01) : 15 - 21
  • [29] Predicting Soybean Relative Maturity and Seed Yield Using Canopy Reflectance
    Christenson, Brent S.
    Schapaugh, William T., Jr.
    An, Nan
    Price, Kevin P.
    Prasad, Vara
    Fritz, Allan K.
    CROP SCIENCE, 2016, 56 (02) : 625 - 643
  • [30] Genome-Wide Association Studies of Soybean Yield-Related Hyperspectral Reflectance Bands Using Machine Learning-Mediated Data Integration Methods
    Yoosefzadeh-Najafabadi, Mohsen
    Torabi, Sepideh
    Tulpan, Dan
    Rajcan, Istvan
    Eskandari, Milad
    FRONTIERS IN PLANT SCIENCE, 2021, 12