Application of Machine Learning Algorithms in Plant Breeding: Predicting Yield From Hyperspectral Reflectance in Soybean

被引:119
|
作者
Yoosefzadeh-Najafabadi, Mohsen [1 ]
Earl, Hugh J. [1 ]
Tulpan, Dan [2 ]
Sulik, John [1 ]
Eskandari, Milad [1 ]
机构
[1] Univ Guelph, Dept Plant Agr, Guelph, ON, Canada
[2] Univ Guelph, Dept Anim Biosci, Guelph, ON, Canada
来源
关键词
artificial intelligence; data-driven model; ensemble methods; high-throughput phenotyping; random forest; recursive feature elimination; NEURAL-NETWORK; MULTILAYER PERCEPTRON; CANCER CLASSIFICATION; REGRESSION-MODELS; GENE SELECTION; GRAIN-YIELD; INDEX; STRESS; SVM; PERFORMANCE;
D O I
10.3389/fpls.2020.624273
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Recent substantial advances in high-throughput field phenotyping have provided plant breeders with affordable and efficient tools for evaluating a large number of genotypes for important agronomic traits at early growth stages. Nevertheless, the implementation of large datasets generated by high-throughput phenotyping tools such as hyperspectral reflectance in cultivar development programs is still challenging due to the essential need for intensive knowledge in computational and statistical analyses. In this study, the robustness of three common machine learning (ML) algorithms, multilayer perceptron (MLP), support vector machine (SVM), and random forest (RF), were evaluated for predicting soybean (Glycine max) seed yield using hyperspectral reflectance. For this aim, the hyperspectral reflectance data for the whole spectra ranged from 395 to 1005 nm, which were collected at the R4 and R5 growth stages on 250 soybean genotypes grown in four environments. The recursive feature elimination (RFE) approach was performed to reduce the dimensionality of the hyperspectral reflectance data and select variables with the largest importance values. The results indicated that R5 is more informative stage for measuring hyperspectral reflectance to predict seed yields. The 395 nm reflectance band was also identified as the high ranked band in predicting the soybean seed yield. By considering either full or selected variables as the input variables, the ML algorithms were evaluated individually and combined-version using the ensemble-stacking (E-S) method to predict the soybean yield. The RF algorithm had the highest performance with a value of 84% yield classification accuracy among all the individual tested algorithms. Therefore, by selecting RF as the metaClassifier for E-S method, the prediction accuracy increased to 0.93, using all variables, and 0.87, using selected variables showing the success of using E-S as one of the ensemble techniques. This study demonstrated that soybean breeders could implement E-S algorithm using either the full or selected spectra reflectance to select the high-yielding soybean genotypes, among a large number of genotypes, at early growth stages.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Application of machine learning and genetic algorithms to the prediction and optimization of biodiesel yield from waste cooking oil
    Aqueel Ahmad
    Ashok Kumar Yadav
    Achhaibar Singh
    Korean Journal of Chemical Engineering, 2023, 40 : 2941 - 2956
  • [42] Bridging the gap between hyperspectral imaging and crop breeding: soybean yield prediction and lodging classification with prototype contrastive learning
    Sun, Guangyao
    Zhang, Yong
    Wang, Lei
    Zhou, Longyu
    Fei, Shuaipeng
    Han, Shiteng
    Xiao, Shunfu
    Che, Yingpu
    Yan, Long
    Xu, Yun
    Li, Yinghui
    Qiu, Lijuan
    Ma, Yuntao
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2025, 230
  • [43] Application and Comparison of Machine Learning Algorithms for Predicting Rock Deformation in Hydraulic Tunnels
    Liu, Yixin
    Ren, Xuhua
    Zhang, Jixun
    Zhang, Yuxian
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [44] Application and Comparison of Machine Learning Algorithms for Predicting Rock Deformation in Hydraulic Tunnels
    Liu, Yixin
    Ren, Xuhua
    Zhang, Jixun
    Zhang, Yuxian
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [45] Application of Machine-Learning Algorithms for Predicting California Bearing Ratio of Soil
    Bherde, Vaishnavi
    Mallikarjunappa, Likhith Kudlur
    Baadiga, Ramu
    Balunaini, Umashankar
    JOURNAL OF TRANSPORTATION ENGINEERING PART B-PAVEMENTS, 2023, 149 (04)
  • [46] Hyperspectral field reflectance measurements to estimate wheat grain yield and plant height
    Xavier, AC
    Rudorff, BFT
    Moreira, MA
    Alvarenga, BS
    de Freitas, JG
    Salomon, MV
    SCIENTIA AGRICOLA, 2006, 63 (02): : 130 - 138
  • [47] Machine Learning Approach for Prescriptive Plant Breeding
    Parmley, Kyle A.
    Higgins, Race H.
    Ganapathysubramanian, Baskar
    Sarkar, Soumik
    Singh, Asheesh K.
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [48] Predicting plant distribution on the River Nile islands in Egypt using machine learning algorithms
    Nahool, T. A.
    Ayed, F. A. A.
    Ahmed, D. A.
    Sheded, M. G.
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL SCIENCE AND TECHNOLOGY, 2025,
  • [49] Machine Learning Approach for Prescriptive Plant Breeding
    Kyle A. Parmley
    Race H. Higgins
    Baskar Ganapathysubramanian
    Soumik Sarkar
    Asheesh K. Singh
    Scientific Reports, 9
  • [50] Machine learning algorithms for predicting scapular kinematics
    Nicholson, Kristen F.
    Richardson, R. Tyler
    van Roden, Elizabeth A. Rapp
    Quinton, R. Garry
    Anzilotti, Kert F.
    Richards, James G.
    MEDICAL ENGINEERING & PHYSICS, 2019, 65 : 39 - 45