learnMET: an R package to apply machine learning methods for genomic prediction using multi-environment trial data

被引:6
|
作者
Westhues, Cathy C. [1 ,2 ]
Simianer, Henner [2 ,3 ]
Beissinger, Timothy M. [1 ,2 ]
机构
[1] Univ Goettingen, Dept Crop Sci, Div Plant Breeding Methodol, Carl Sprengel Weg 1, D-37075 Gottingen, Germany
[2] Univ Goettingen, Ctr Integrated Breeding Res, Carl Sprengel Weg 1, Gottingen, Germany
[3] Univ Goettingen, Dept Anim Sci, Anim Breeding & Genet Grp, Albrecht Thaer Weg 3, D-37075 Gottingen, Germany
来源
G3-GENES GENOMES GENETICS | 2022年 / 12卷 / 11期
关键词
multienvironment trials; machine learning; genotype x; environment interaction; genomic prediction; R software; SELECTION; REGRESSION; PLANT;
D O I
10.1093/g3journal/jkac226
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We introduce the R-package learnMET, developed as a flexible framework to enable a collection of analyses on multi-environment trial breeding data with machine learning-based models. learnMET allows the combination of genomic information with environmental data such as climate and/or soil characteristics. Notably, the package offers the possibility of incorporating weather data from field weather stations, or to retrieve global meteorological datasets from a NASA database. Daily weather data can be aggregated over specific periods of time based on naive (for instance, nonoverlapping 10-day windows) or phenological approaches. Different machine learning methods for genomic prediction are implemented, including gradient-boosted decision trees, random forests, stacked ensemble models, and multilayer perceptrons. These prediction models can be evaluated via a collection of cross-validation schemes that mimic typical scenarios encountered by plant breeders working with multi-environment trial experimental data in a user-friendly way. The package is published under an MIT license and accessible on GitHub.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] A Comparison of Three Machine Learning Methods for Multivariate Genomic Prediction Using the Sparse Kernels Method (SKM) Library
    Montesinos-Lopez, Osval A.
    Montesinos-Lopez, Abelardo
    Cano-Paez, Bernabe
    Moises Hernandez-Suarez, Carlos
    Santana-Mancilla, Pedro C.
    Crossa, Jose
    GENES, 2022, 13 (08)
  • [22] Prediction of prognosis in hepatocellular carcinoma using machine learning based on genomic expression data
    Wang, Fengyan
    Xue, Changqing
    JOURNAL OF GASTROENTEROLOGY AND HEPATOLOGY, 2023, 38 : 49 - 50
  • [23] Prediction of prognosis in hepatocellular carcinoma using machine learning based on genomic expression data
    Wang, Fengyan
    Xue, Changqing
    JOURNAL OF GASTROENTEROLOGY AND HEPATOLOGY, 2023, 38 : 49 - 50
  • [24] Machine learning approaches for the prediction of bone mineral density by using genomic and phenotypic data of 5130 older men
    Wu, Qing
    Nasoz, Fatma
    Jung, Jongyun
    Bhattarai, Bibek
    Han, Mira, V
    Greenes, Robert A.
    Saag, Kenneth G.
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [25] Prediction of Bitcoin Prices with Machine Learning Methods using Time Series Data
    Karasu, Seckin
    Altan, Aytac
    Sarac, Zehra
    Hacioglu, Rifat
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [26] Breast cancer prediction using different machine learning methods applying multi factors
    Elham Nazari
    Hamid Naderi
    Mahla Tabadkani
    Reza ArefNezhad
    Amir Hossein Farzin
    Mohammad Dashtiahangar
    Majid Khazaei
    Gordon A. Ferns
    Amin Mehrabian
    Hamed Tabesh
    Amir Avan
    Journal of Cancer Research and Clinical Oncology, 2023, 149 : 17133 - 17146
  • [27] Genomic prediction using machine learning: a comparison of the performance of regularized regression, ensemble, instance-based and deep learning methods on synthetic and empirical data
    Lourenco, Vanda M.
    Ogutu, Joseph O.
    Rodrigues, Rui A. P.
    Posekany, Alexandra
    Piepho, Hans-Peter
    BMC GENOMICS, 2024, 25 (01)
  • [28] Genomic prediction using machine learning: a comparison of the performance of regularized regression, ensemble, instance-based and deep learning methods on synthetic and empirical data
    Vanda M. Lourenço
    Joseph O. Ogutu
    Rui A.P. Rodrigues
    Alexandra Posekany
    Hans-Peter Piepho
    BMC Genomics, 25
  • [29] COVID-19 detection and classification for machine learning methods using human genomic data
    Ahemad M.T.
    Hameed M.A.
    Vankdothu R.
    Measurement: Sensors, 2022, 24
  • [30] MegaR: an interactive R package for rapid sample classification and phenotype prediction using metagenome profiles and machine learning
    Eliza Dhungel
    Yassin Mreyoud
    Ho-Jin Gwak
    Ahmad Rajeh
    Mina Rho
    Tae-Hyuk Ahn
    BMC Bioinformatics, 22