Suitability of different machine learning algorithms for the classification of the proportion of grassland-based forages at the herd level using mid-infrared spectral information from routine milk control

被引:0
作者
Birkinshaw, A. [1 ]
Sutter, M. [1 ]
Nussbaum, M. [1 ,2 ]
Kreuzer, M. [3 ]
Reidy, B. [1 ]
机构
[1] Bern Univ Appl Sci BFH, Sch Agr Forest & Food Sci HAFL, CH-3052 Zollikofen, Switzerland
[2] Univ Utrecht, Dept Phys Geog, NL-3584 CB Utrecht, Netherlands
[3] Swiss Fed Inst Technol, Inst Agr Sci, CH-8315 Lindau, Switzerland
关键词
machine learning; mid-infrared spectroscopy; grassland-based feeding; seasonality index; FTIR SPECTROSCOPY; PASTURE; AUTHENTICATION; ORIGIN;
D O I
10.3168/jds.2024-25090
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
As the call for an international standard for milk from grassland-based production systems continues to grow, so too do the monitoring and evaluation policies surrounding this topic. Individual stipulations by countries and milk producers to market their milk under their own grass-fed labels include a compulsory number of grazing days per year (ranging from 120 d for certain labels to 180 d for others), a specified amount of herbage in the diet, or a prescribed dietary proportion of grassland- based forages (GBF) fed and produced on-farm. As these multifarious policy and label requirements are laborious and costly to monitor on-farm, fast economical proxies would be advantageous to verify the proportion of GBF consumed by the cows in the final product. With this in mind, we employed readily available mid-infrared spectral data (n = 1,132 spectra) from routine milk controls to develop binary classification models for 4 main feed groups from a primarily forage-based diet: total GBF (>= 50% [n = 955], >= 75% [n = 599], >= 85% [n = 356]), pasture (>= 20% [n = 451], >= 50% [n = 284], >= 70% [n = 152]), fresh herbage (pasture + fresh herbage indoor feeding; >= 20% [n = 517], >= 50% [n = 325], >= 70% [n = 182]), and whole plant corn (fresh + conserved; >= 10% [n = 646], >= 30% [n = 187]), with the latter as a negative control. We compared 4 machine learning methods to assess which statistical model performs best at discriminating these classes. Three of these models have not yet been tested for herd-level dietary proportion classification, and all 4 follow completely different approaches: least absolute shrinkage and selection operator (LASSO), partial least squares discriminant analysis (PLS-DA), random forest (RF), and support vector machines (SVM). Seasonal- ity has been a missing element from previous dietary herbage proportion classification models. As grazing and fresh herbage indoor feeding are highly dependent on the season, we developed an indicator to incorporate seasonality in a consistent, unbiased manner into our models. We also tested 3 sets of covariates. The first set included only mid-infrared spectra derived data, the second included mid-infrared spectra derived data plus seasonality indices and the third included mid-infrared spectra derived data, seasonality indices and additional herd specific information (DIM, breed, and parity). Of the 4 machine learning algorithms tested for the binary classification of GBF proportion at herd level, LASSO and PLS-DA performed best according to evaluation metrics; however, the RF and SVM models were not far behind the best performing model evaluation metrics in each feed category. Our best performing model, the LASSO model containing seasonality indices and herd specific information, classified total GBF >= 50% with an accuracy of 78.6%, precision of 85.1%, sensitivity of 90.6%, specificity of 14.1%, and F1 score (harmonic mean of precision and sensitivity) of 87.7%; this was very similar to the PLS-DA model. Our results suggest that in general, LASSO and PLS-DA machine learning algorithms perform better for dietary GBF classification than RF or SVM algorithms.
引用
收藏
页码:10724 / 10737
页数:14
相关论文
共 53 条
[11]   Verification of fresh grass feeding, pasture grazing and organic farming by FTIR spectroscopy analysis of bovine milk [J].
Capuano, Edoardo ;
Rademaker, Jan ;
van den Bijgaart, Harrie ;
van Ruth, Saskia M. .
FOOD RESEARCH INTERNATIONAL, 2014, 60 :59-65
[12]   Climate warming from managed grasslands cancels the cooling effect of carbon sinks in sparsely grazed and natural grasslands [J].
Chang, Jinfeng ;
Ciais, Philippe ;
Gasser, Thomas ;
Smith, Pete ;
Herrero, Mario ;
Havlik, Petr ;
Obersteiner, Michael ;
Guenet, Bertrand ;
Goll, Daniel S. ;
Li, Wei ;
Naipal, Victoria ;
Peng, Shushi ;
Qiu, Chunjing ;
Tian, Hanqin ;
Viovy, Nicolas ;
Yue, Chao ;
Zhu, Dan .
NATURE COMMUNICATIONS, 2021, 12 (01)
[13]   Prediction of indicators of cow diet composition and authentication of feeding specificateions of Protected Designation of Origin cheese using mid-infrared spectroscopy on milk [J].
Coppa, M. ;
Martin, B. ;
Hulin, S. ;
Guillemin, J. ;
Gauzentes, J., V ;
Pecou, A. ;
Andueza, D. .
JOURNAL OF DAIRY SCIENCE, 2021, 104 (01) :112-125
[14]   Authentication of cow feeding and geographic origin on milk using visible and near-infrared spectroscopy [J].
Coppa, M. ;
Martin, B. ;
Agabriel, C. ;
Chassaing, C. ;
Sibra, C. ;
Constant, I. ;
Graulet, B. ;
Andueza, D. .
JOURNAL OF DAIRY SCIENCE, 2012, 95 (10) :5544-5551
[15]   Comparison between Mid-Infrared (ATR-FTIR) Spectroscopy and Official Analysis Methods for Determination of the Concentrations of Alcohol, SO2, and Total Acids in Wine [J].
Dini, Irene ;
Di Lorenzo, Ritamaria ;
Senatore, Antonello ;
Coppola, Daniele ;
Laneri, Sonia .
SEPARATIONS, 2021, 8 (10)
[16]  
FOAG (Federal Office of Agriculture), 2023, Tierwohl
[17]  
FOAG (Federal Office of Agriculture), 2019, Animal Tracing Database TVD
[18]  
FOAG (Federal Office of Agriculture), 2022, Grassland Ecosystem Services
[19]  
FOAG (Federal Office of Agriculture), 2024, Beitrag fur graslandbasierte Milchund Fleischproduktion
[20]   Application of machine-learning methods to milk mid-infrared spectra for discrimination of cow milk from pasture or total mixed ration diets [J].
Frizzarin, M. ;
O'Callaghan, T. F. ;
Murphy, T. B. ;
Hennessy, D. ;
Casa, A. .
JOURNAL OF DAIRY SCIENCE, 2021, 104 (12) :12394-12402