Comparison of classification algorithms to predict outcomes of feedlot cattle identified and treated for bovine respiratory disease

被引:31
作者
Amrine, David E. [1 ,2 ]
White, Brad J. [2 ]
Larson, Robert L. [2 ]
机构
[1] Kansas State Univ, Coll Vet Med, Dept Diagnost Med & Pathobiol, Manhattan, KS 66506 USA
[2] Kansas State Univ, Coll Vet Med, Dept Clin Sci, Manhattan, KS 66506 USA
基金
美国农业部;
关键词
Bovine respiratory disease; Machine learning; Cattle; Diagnosis; YOUDEN INDEX; PERFORMANCE; MORTALITY; RISK;
D O I
10.1016/j.compag.2014.04.009
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Bovine respiratory disease (BRD) continues to be the primary cause of morbidity and mortality in feedyard cattle. Accurate identification of those animals that will not finish the production cycle normally following initial treatment for BRD would provide feedyard managers with opportunities to more effectively manage those animals. Our objectives were to assess the ability of different classification algorithms to accurately predict an individual calf's outcome based on data available at first identification of and treatment for BRD and also to identify characteristics of calves where predictive models performed well as gauged by accuracy. Data from 23 feedyards in multiple geographic locations within the U.S. from 2000 to 2009 representing over one million animals were analyzed to identify animals clinically diagnosed with BRD and treated with an antimicrobial. These data were analyzed both as a single dataset and as multiple datasets based on individual feedyards and partitioned into training, testing, and validation datasets. Classifiers were trained and optimized to identify calves that did not finish the production cycle with their cohort. Following classifier training, accuracy was evaluated using validation data. Analysis was also done to identify sub-groups of calves within populations where classifiers performed better compared to other sub-groups. Accuracy of individual classifiers varied by dataset. The accuracy of the best performing classifier by dataset ranged from a low of 63% in one dataset up to 95% in a different dataset. Sub-groups of calves were identified within some datasets where accuracy of a classifiers were greater than 98%; however these accuracies must be interpreted in relation to the prevalence of the class of interest within those populations. We found that by pairing the correct classifier with the data available, accurate predictions could be made that would provide feedlot managers with valuable information. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:9 / 19
页数:11
相关论文
共 36 条
[1]  
[Anonymous], 2003, ICML-2003 Workshop on Learning from Imbalanced Data Sets II
[2]  
[Anonymous], NAT AN HLTH MON SYST
[3]   A multivariable assessment quantifying effects of cohort-level factors associated with combined mortality and culling risk in cohorts of U.S. commercial feedlot cattle [J].
Babcock, A. H. ;
Cernicchiaro, N. ;
White, B. J. ;
Dubnicka, S. R. ;
Thomson, D. U. ;
Ives, S. E. ;
Scott, H. M. ;
Milliken, G. A. ;
Renter, D. G. .
PREVENTIVE VETERINARY MEDICINE, 2013, 108 (01) :38-46
[4]   Temporal distributions of respiratory disease events within cohorts of feedlot cattle and associations with cattle health and performance indices [J].
Babcock, A. H. ;
Renter, D. G. ;
White, B. J. ;
Dubnicka, S. R. ;
Scott, H. M. .
PREVENTIVE VETERINARY MEDICINE, 2010, 97 (3-4) :198-219
[5]   Feedlot health and performance effects associated with the timing of respiratory disease treatment [J].
Babcock, A. H. ;
White, B. J. ;
Dritz, S. S. ;
Thomson, D. U. ;
Renter, D. G. .
JOURNAL OF ANIMAL SCIENCE, 2009, 87 (01) :314-327
[6]  
Babcock AH, 2013, CAN J VET RES, V77, P33
[7]  
Berckmans D., 2004, Proc. Int. Soc. Anim. Hyg. Conf. Anim. Prod. Eur.: The way forward in a changing world, V1, P31
[8]   KNIME:: The Konstanz Information Miner [J].
Berthold, Michael R. ;
Cebron, Nicolas ;
Dill, Fabian ;
Gabriel, Thomas R. ;
Koetter, Tobias ;
Meinl, Thorsten ;
Ohl, Peter ;
Sieb, Christoph ;
Thiel, Kilian ;
Wiswedel, Bernd .
DATA ANALYSIS, MACHINE LEARNING AND APPLICATIONS, 2008, :319-326
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   Associations between the distance traveled from sale barns to commercial feedlots in the United States and overall performance, risk of respiratory disease, and cumulative mortality in feeder cattle during 1997 to 2009 [J].
Cernicchiaro, N. ;
White, B. J. ;
Renter, D. G. ;
Babcock, A. H. ;
Kelly, L. ;
Slattery, R. .
JOURNAL OF ANIMAL SCIENCE, 2012, 90 (06) :1929-1939