learnMET: an R package to apply machine learning methods for genomic prediction using multi-environment trial data

被引:6
作者
Westhues, Cathy C. [1 ,2 ]
Simianer, Henner [2 ,3 ]
Beissinger, Timothy M. [1 ,2 ]
机构
[1] Univ Goettingen, Dept Crop Sci, Div Plant Breeding Methodol, Carl Sprengel Weg 1, D-37075 Gottingen, Germany
[2] Univ Goettingen, Ctr Integrated Breeding Res, Carl Sprengel Weg 1, Gottingen, Germany
[3] Univ Goettingen, Dept Anim Sci, Anim Breeding & Genet Grp, Albrecht Thaer Weg 3, D-37075 Gottingen, Germany
来源
G3-GENES GENOMES GENETICS | 2022年 / 12卷 / 11期
关键词
multienvironment trials; machine learning; genotype x; environment interaction; genomic prediction; R software; SELECTION; REGRESSION; PLANT;
D O I
10.1093/g3journal/jkac226
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We introduce the R-package learnMET, developed as a flexible framework to enable a collection of analyses on multi-environment trial breeding data with machine learning-based models. learnMET allows the combination of genomic information with environmental data such as climate and/or soil characteristics. Notably, the package offers the possibility of incorporating weather data from field weather stations, or to retrieve global meteorological datasets from a NASA database. Daily weather data can be aggregated over specific periods of time based on naive (for instance, nonoverlapping 10-day windows) or phenological approaches. Different machine learning methods for genomic prediction are implemented, including gradient-boosted decision trees, random forests, stacked ensemble models, and multilayer perceptrons. These prediction models can be evaluated via a collection of cross-validation schemes that mimic typical scenarios encountered by plant breeders working with multi-environment trial experimental data in a user-friendly way. The package is published under an MIT license and accessible on GitHub.
引用
收藏
页数:13
相关论文
empty
未找到相关数据