Classification of porcine reproductive and respiratory syndrome clinical impact in Ontario sow herds using machine learning approaches

被引:2
作者
Chadha, Akshay [1 ]
Dara, Rozita [1 ]
Pearl, David L. [2 ]
Gillis, Daniel [1 ]
Rosendal, Thomas [3 ]
Poljak, Zvonimir [2 ]
机构
[1] Univ Guelph, Sch Comp Sci, Guelph, ON, Canada
[2] Univ Guelph, Ontario Vet Coll, Dept Populat Med, Guelph, ON, Canada
[3] Natl Vet Inst, SVA, Uppsala, Sweden
基金
加拿大自然科学与工程研究理事会;
关键词
PRRS; pathogenicity; classifcation; machine learning; ORF5; gene; SYNDROME VIRUS PRRSV; SWINE HERDS; MORTALITY; ASSOCIATION; GENETICS; DISEASE; SIGNS;
D O I
10.3389/fvets.2023.1175569
中图分类号
S85 [动物医学(兽医学)];
学科分类号
0906 ;
摘要
Since the early 1990s, porcine reproductive and respiratory syndrome (PRRS) virus outbreaks have been reported across various parts of North America, Europe, and Asia. The incursion of PRRS virus (PRRSV) in swine herds could result in various clinical manifestations, resulting in a substantial impact on the incidence of respiratory morbidity, reproductive loss, and mortality. Veterinary experts, among others, regularly analyze the PRRSV open reading frame-5 (ORF-5) for prognostic purposes to assess the risk of severe clinical outcomes. In this study, we explored if predictive modeling techniques could be used to identify the severity of typical clinical signs observed during PRRS outbreaks in sow herds. Our study aimed to evaluate four baseline machine learning (ML) algorithms: logistic regression (LR) with ridge and lasso regularization techniques, random forest (RF), k-nearest neighbor (KNN), and support vector machine (SVM), for the clinical impact classification of ORF-5 sequences and demographic data into high impact and low impact categories. First, baseline classifiers were evaluated using different input representations of ORF-5 nucleotides, amino acid sequences, and demographic data using a 10-fold cross-validation technique. Then, we designed a consensus voting ensemble approach to aggregate the different types of input representations for genetic and demographic data for classifying clinical impact. In this study, we observed that: (a) for abortion and pre-weaning mortality (PWM), different classifiers gained improvement over baseline accuracy, which showed the plausible presence of both genotypic-phenotypic and demographic-phenotypic relationships, (b) for sow mortality (SM), no baseline classifier successfully established such linkages using either genetic or demographic input data, (c) baseline classifiers showed good performance with a moderate variance of the performance metrics, due to high-class overlap and the small dataset size used for training, and (d) the use of consensus voting ensemble techniques helped to make the predictions more robust and stabilized the performance evaluation metrics, but overall accuracy did not substantially improve the diagnostic metrics over baseline classifiers.
引用
收藏
页数:10
相关论文
共 52 条
[1]   Applications of Bayesian Phylodynamic Methods in a Recent US Porcine Reproductive and Respiratory Syndrome Virus Outbreak [J].
Alkhamis, Mohammad A. ;
Perez, Andres M. ;
Murtaugh, Michael P. ;
Wang, Xiong ;
Morrison, Robert B. .
FRONTIERS IN MICROBIOLOGY, 2016, 7
[2]  
Attaluri P.K., 2010, 2010 IEEE S COMP INT, P279
[3]  
Baekbo P., 2015, P INT PRRS C GHENT B
[4]  
Boes KatieM., 2017, Pathologic Basis of Veterinary Disease, VSixth, DOI DOI 10.1016/B978-0-323-35775-3.00009-6
[5]   Genomic sequence and virulence comparison of four Type 2 porcine reproductive and respiratory syndrome virus strains [J].
Brockmeier, Susan L. ;
Loving, Crystal L. ;
Vorwald, Ann C. ;
Kehrli, Marcus E., Jr. ;
Baker, Rodney B. ;
Nicholson, Tracy L. ;
Lager, Kelly M. ;
Miller, Laura C. ;
Faaberg, Kay S. .
VIRUS RESEARCH, 2012, 169 (01) :212-221
[6]  
Buitinck L., 2013, arXiv, DOI [DOI 10.48550/ARXIV.1309.0238,ARXIV, DOI 10.48550/ARXIV.1309.0238]
[7]  
Chadha Akshay, 2019, 2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA), P1570, DOI 10.1109/ICMLA.2019.00259
[8]  
CHAGNON M, 1991, CAN J VET RES, V55, P180
[9]   K-mer-Based Motif Analysis in Insect Species across Anopheles, Drosophila, and Glossina Genera and Its Application to Species Classification [J].
Cserhati, Matyas ;
Xiao, Peng ;
Guda, Chittibabu .
COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2019, 2019
[10]  
DAllaire S, 1996, CAN VET J, V37, P237