Supervised Machine Learning Techniques for Breeding Value Prediction in Horses: An Example Using Gait Visual Scores

被引:1
作者
Bussiman, Fernando [1 ]
Alves, Anderson A. C. [1 ]
Richter, Jennifer [1 ]
Hidalgo, Jorge [1 ]
Veroneze, Renata [1 ,2 ]
Oliveira, Tiago [3 ]
机构
[1] Univ Georgia, Anim & Dairy Sci Dept, Athens, GA 30602 USA
[2] Univ Fed Vicosa, Anim Sci Dept, BR-36570900 Vicosa, Brazil
[3] State Univ Paraiba, Stat Dept, Campina Grande, Brazil
关键词
support vector regression; machine learning; gait prediction; visual scores; GENETIC-PARAMETERS; FUNCTIONAL TRAITS; REGRESSION; SELECTION; BIAS;
D O I
10.3390/ani14182723
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Simple Summary In the artificial intelligence era, much is speculated about the use of machine learning techniques for the most diverse scientific purposes. Machine learning methods are acknowledged to better handle non-linearity and subjectivity than traditional statistical methods. In the horse industry, visual scores are widely used to evaluate gaited horses. This phenotyping strategy is an effective low-cost alternative to more accurate methods. However, since it heavily depends on the person assessing the gait, subjectivity is introduced in the phenotype. Our study evaluated the application of machine learning techniques in the breeding value prediction for visual scores in Brazilian gaited horses. We used a dataset with horses that were measured for at least one of the following gait scores: dissociation, comfort, style, regularity, and development. Traditional methods, such as ordinary least-squares and multiple-trait models, were combined with artificial neural networks and other machine learning regression methods, and each model was evaluated according to its accuracy, bias, and dispersion. Machine learning techniques had accuracy comparable to traditional methods; however, they presented slightly more bias and were over-dispersed. For selection purposes, more studies are needed; however, machine learning techniques are a feasible alternative for unofficial evaluation runs.Abstract Gait scores are widely used in the genetic evaluation of horses. However, the nature of such measurement may limit genetic progress since there is subjectivity in phenotypic information. This study aimed to assess the application of machine learning techniques in the prediction of breeding values for five visual gait scores in Campolina horses: dissociation, comfort, style, regularity, and development. The dataset contained over 5000 phenotypic records with 107,951 horses (14 generations) in the pedigree. A fixed model was used to estimate least-square solutions for fixed effects and adjusted phenotypes. Variance components and breeding values (EBV) were obtained via a multiple-trait model (MTM). Adjusted phenotypes and fixed effects solutions were used to train machine learning models (using the EBV from MTM as target variable): artificial neural network (ANN), random forest regression (RFR) and support vector regression (SVR). To validate the models, the linear regression method was used. Accuracy was comparable across all models (but it was slightly higher for ANN). The highest bias was observed for ANN, followed by MTM. Dispersion varied according to the trait; it was higher for ANN and the lowest for MTM. Machine learning is a feasible alternative to EBV prediction; however, this method will be slightly biased and over-dispersed for young animals.
引用
收藏
页数:22
相关论文
共 72 条
[31]   Workload of official contests, net cost of transport, and metabolic power of Mangalarga Marchador horses of marcha batida or picada gaits [J].
Lage, J. ;
Fonseca, M. G. ;
de Barros, G. G. M. ;
Feringer-Junior, W. H. ;
Pereira, G. T. ;
Ferraz, G. C. .
JOURNAL OF ANIMAL SCIENCE, 2017, 95 (06) :2488-2495
[32]   Semi-parametric estimates of population accuracy and bias of predictions of breeding values and future phenotypes using the LR method [J].
Legarra, Andres ;
Reverter, Antonio .
GENETICS SELECTION EVOLUTION, 2018, 50
[33]   Application of support vector regression to genome-assisted prediction of quantitative traits [J].
Long, Nanye ;
Gianola, Daniel ;
Rosa, Guilherme J. M. ;
Weigel, Kent A. .
THEORETICAL AND APPLIED GENETICS, 2011, 123 (07) :1065-1074
[34]  
Lopez Osval Antonio Montesinos, 2022, Multivariate statistical machine learning methods for genomic prediction, P337, DOI 10.1007/978-3-030-89010-0_9
[35]  
lvares S.F.C., 2023, Masters Thesis
[36]   Removing data and using metafounders alleviates biases for all traits in Lacaune dairy sheep predictions [J].
Macedo, F. L. ;
Astruc, J. M. ;
Meuwissen, T. H. E. ;
Legarra, A. .
JOURNAL OF DAIRY SCIENCE, 2022, 105 (03) :2439-2452
[37]   Behavior of the Linear Regression method to estimate bias and accuracies with correct and incorrect genetic evaluation models [J].
Macedo, F. L. ;
Reverter, A. ;
Legarra, A. .
JOURNAL OF DAIRY SCIENCE, 2020, 103 (01) :529-544
[38]  
Meyer David, 2024, CRAN
[39]   Genetic principal components for live ultrasound scan traits of Angus cattle [J].
Meyer, K .
ANIMAL SCIENCE, 2005, 81 :337-345
[40]   Perils of Parsimony: Properties of Reduced-Rank Estimates of Genetic Covariance Matrices [J].
Meyer, Karin ;
Kirkpatrick, Mark .
GENETICS, 2008, 180 (02) :1153-1166