A comparison of 4 different machine learning algorithms to predict lactoferrin content in bovine milk from mid-infrared spectra

被引:36
|
作者
Soyeurt, H. [1 ]
Grelet, C. [2 ]
McParland, S. [3 ]
Calmels, M. [4 ]
Coffey, M. [5 ]
Tedde, A. [1 ]
Delhez, P. [1 ,6 ]
Dehareng, F. [2 ]
Gengler, N. [1 ]
机构
[1] Univ Liege, TERRA Res & Teaching Ctr, Gembloux Agrobio Tech, Gembloux, Belgium
[2] Walloon Res Ctr, Valorisat Agr Prod, Gembloux, Belgium
[3] TEAGASC, Anim & Grassland Res & Innovat Ctr, Moorepk, Fermoy, Cork, Ireland
[4] Seenovia, Res & Dev, St Berthevin, France
[5] Scotlands Rural Coll, Livestock Breeding Anim & Vet Sci, Edinburgh, Midlothian, Scotland
[6] Natl Fund Sci Res, Brussels, Belgium
基金
爱尔兰科学基金会; 英国生物技术与生命科学研究理事会;
关键词
milk; lactoferrin; mid infrared; machine learning; BETA-HYDROXYBUTYRATE; INFRARED-SPECTRUM; DAIRY-COWS; STAGE; REGRESSION; MASTITIS; BREEDS;
D O I
10.3168/jds.2020-18870
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Lactoferrin (LF) is a glycoprotein naturally present in milk. Its content varies throughout lactation, but also with mastitis; therefore it is a potential additional indicator of udder health beyond somatic cell count. Condequently, there is an interest in quantifying this biomolecule routinely. First prediction equations proposed in the literature to predict the content in milk using milk mid-infrared spectrometry were built using partial least square regression (PLSR) due to the limited size of the data set. Thanks to a large data set, the current study aimed to test 4 different machine learning algorithms using a large data set comprising 6,619 records collected across different herds, breeds, and countries. The first algorithm was a PLSR, as used in past investigations. The second and third algorithms used partial least square (PLS) factors combined with a linear and polynomial support vector regression (PLS + SVR). The fourth algorithm also used PLS factors, but included in an artificial neural network with 1 hidden layer (PLS + ANN). The training and validation sets comprised 5,541 and 836 records, respectively. Even if the calibration prediction performances were the best for PLS + polynomial SVR, their validation prediction performances were the worst. The 3 other algorithms had similar validation performances. Indeed, the validation root mean squared error (RMSE) ranged between 162.17 and 166.75 mg/L of milk. However, the lower standard deviation of cross-validation RMSE and the better normality of the residual distribution observed for PLS + ANN suggest that this modeling was more suitable to predict the LF content in milk from milk mid-infrared spectra (R(2)v = 0.60 and validation RMSE = 162.17 mg/L of milk). This PLS +ANN model was then applied to almost 6 million spectral records. The predicted LF showed the expected relationships with milk yield, somatic cell score, somatic cell count, and stage of lactation. The model tended to underestimate high LF values (higher than 600 rng/L of milk). However, if the prediction threshold was set to 500 mg/L, 82% of samples from the validation having a content of LF higher than 600 mg/L were detected. Future research should aim to increase the number of those extremely high LF records in the calibration set.
引用
收藏
页码:11585 / 11596
页数:12
相关论文
共 46 条
  • [31] Comparing human milk macronutrients measured using analyzers based on mid-infrared spectroscopy and ultrasound and the application of machine learning in data fitting
    Ruan, Huijuan
    Tang, Qingya
    Zhang, Yajie
    Zhao, Xuelin
    Xiang, Yi
    Feng, Yi
    Cai, Wei
    BMC PREGNANCY AND CHILDBIRTH, 2022, 22 (01)
  • [32] Comparing human milk macronutrients measured using analyzers based on mid-infrared spectroscopy and ultrasound and the application of machine learning in data fitting
    Huijuan Ruan
    Qingya Tang
    Yajie Zhang
    Xuelin Zhao
    Yi Xiang
    Yi Feng
    Wei Cai
    BMC Pregnancy and Childbirth, 22
  • [33] Near- and mid-infrared determination of some quality parameters of cheese manufactured from the mixture of different milk species
    Huseyin Ayvaz
    Mustafa Mortas
    Muhammed Ali Dogan
    Mustafa Atan
    Gulgun Yildiz Tiryaki
    Yonca Karagul Yuceer
    Journal of Food Science and Technology, 2021, 58 : 3981 - 3992
  • [34] Near- and mid-infrared determination of some quality parameters of cheese manufactured from the mixture of different milk species
    Ayvaz, Huseyin
    Mortas, Mustafa
    Dogan, Muhammed Ali
    Atan, Mustafa
    Yildiz Tiryaki, Gulgun
    Karagul Yuceer, Yonca
    JOURNAL OF FOOD SCIENCE AND TECHNOLOGY-MYSORE, 2021, 58 (10): : 3981 - 3992
  • [35] Comparison of different Machine Learning algorithms for lithofacies classification from well logs
    Dell'Aversana, P.
    BOLLETTINO DI GEOFISICA TEORICA ED APPLICATA, 2019, 60 (01) : 69 - 80
  • [36] Blood serum lipid profiling may improve the management of recurrent miscarriage: a combination of machine learning of mid-infrared spectra and biochemical assays
    Guleken, Zozan
    Bahat, Pinar Yalcin
    Toto, Omer Faruk
    Bulut, Huri
    Jakubczyk, Pawel
    Cebulski, Jozef
    Paja, Wieslaw
    Pancerz, Krzysztof
    Wosiak, Agnieszka
    Depciuch, Joanna
    ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2022, 414 (29-30) : 8341 - 8352
  • [37] Estimation of body condition score change in dairy cows in a seasonal calving pasture-based system using routinely available milk mid-infrared spectra and machine learning techniques
    Frizzarin, M.
    Gormley, I. C.
    Berry, D. P.
    McParland, S.
    JOURNAL OF DAIRY SCIENCE, 2023, 106 (06) : 4232 - 4244
  • [38] Comparison of the Potential Abilities of Three Spectroscopy Methods: Near-Infrared, Mid-Infrared, and Molecular Fluorescence, to Predict Carotenoid, Vitamin and Fatty Acid Contents in Cow Milk
    Soulat, Julien
    Andueza, Donato
    Graulet, Benoit
    Girard, Christiane L.
    Labonne, Cyril
    Ait-Kaddour, Abderrahmane
    Martin, Bruno
    Ferlay, Anne
    FOODS, 2020, 9 (05)
  • [39] Blood serum lipid profiling may improve the management of recurrent miscarriage: a combination of machine learning of mid-infrared spectra and biochemical assays
    Zozan Guleken
    Pınar Yalçın Bahat
    Ömer Faruk Toto
    Huri Bulut
    Paweł Jakubczyk
    Jozef Cebulski
    Wiesław Paja
    Krzysztof Pancerz
    Agnieszka Wosiak
    Joanna Depciuch
    Analytical and Bioanalytical Chemistry, 2022, 414 : 8341 - 8352
  • [40] Can We Observe Expected Behaviors at Large and Individual Scales for Feed Efficiency-Related Traits Predicted Partly from Milk Mid-Infrared Spectra?
    Zhang, Lei
    Gengler, Nicolas
    Dehareng, Frederic
    Colinet, Frederic
    Froidmont, Eric
    Soyeurt, Helene
    ANIMALS, 2020, 10 (05):