Integrating diverse data sources to predict disease risk in dairy cattle-a machine learning approach

被引:8
|
作者
Lasser, Jana [1 ,2 ,3 ]
Matzhold, Caspar [1 ,3 ]
Egger-Danner, Christa [4 ]
Fuerst-Waltl, Birgit [5 ]
Steininger, Franz [4 ]
Wittek, Thomas [6 ]
Klimek, Peter [1 ,3 ]
机构
[1] Med Univ Vienna, Ctr Med Stat Informat & Intelligent Syst, Sect Sci Complex Syst, A-1090 Vienna, Austria
[2] Graz Univ Technol, Inst Interact Syst & Data Sci, A-8010 Graz, Austria
[3] Complex Sci Hub Vienna, A-1080 Vienna, Austria
[4] ZuchtData EDV Dienstleistungen GmbH, A-1200 Vienna, Austria
[5] Univ Nat Resources & Life Sci, Div Livestock Sci, A-1180 Vienna, Austria
[6] Vetmeduni Vienna, Univ Clin Ruminants, A-1210 Vienna, Austria
关键词
data integration; disease prediction; machine learning; precision livestock farming; LAMENESS SCORING SYSTEM; BODY CONDITION SCORE; TEST DAY MILK; COWS; MASTITIS; HEALTH; YIELD; ASSOCIATION; KETOSIS; TRAITS;
D O I
10.1093/jas/skab294
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Livestock farming is currently undergoing a digital revolution and becoming increasingly data-driven. Yet, such data often reside in disconnected silos making them impossible to leverage their full potential to improve animal well-being. Here, we introduce a precision livestock farming approach, bringing together information streams from a variety of life domains of dairy cattle to study whether including more and diverse data sources improves the quality of predictions for eight diseases and whether using more complex prediction algorithms can, to some extent, compensate for less diverse data. Using three machine learning approaches of varying complexity (from logistic regression to gradient boosted trees) trained on data from 5,828 animals in 165 herds in Austria, we show that the prediction of lameness, acute and chronic mastitis, anestrus, ovarian cysts, metritis, ketosis (hyperketonemia), and periparturient hypocalcemia (milk fever) from routinely available data gives encouraging results. For example, we can predict lameness with high sensitivity and specificity (F1= 0.74). An analysis of the importance of individual variables to prediction performance shows that disease in dairy cattle is a product of the complex interplay between a multitude of life domains, such as housing, nutrition, or climate, that including more and diverse data sources increases prediction performance, and that the reuse of existing data can create actionable information for preventive interventions. Our findings pave the way toward data-driven point-of-care interventions and demonstrate the added value of integrating all available data in the dairy industry to improve animal well-being and reduce disease risk.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Machine learning to predict stroke risk from routine hospital data: A systematic review
    Heseltine-Carp, William
    Courtman, Megan
    Browning, Daniel
    Kasabe, Aishwarya
    Allen, Michael
    Streeter, Adam
    Ifeachor, Emmanuel
    James, Martin
    Mullin, Stephen
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2025, 196
  • [32] A staged approach using machine learning and uncertainty quantification to predict the risk of hip fracture
    Shaik, Anjum
    Larsen, Kristoffer
    Lane, Nancy E.
    Zhao, Chen
    Su, Kuan-Jui
    Keyak, Joyce H.
    Tian, Qing
    Sha, Qiuying
    Shen, Hui
    Deng, Hong-Wen
    Zhou, Weihua
    BONE REPORTS, 2024, 22
  • [33] Integration of epigenomic and genomic data to predict residual feed intake and the feed conversion ratio in dairy sheep via machine learning algorithms
    Fonseca, Pablo Augusto de Souza
    Suarez-Vega, Aroa
    Esteban-Blanco, Cristina
    Marina, Hector
    Pelayo, Rocio
    Gutierrez-Gil, Beatriz
    Arranz, Juan-Jose
    BMC GENOMICS, 2025, 26 (01):
  • [34] A Machine Learning Model Based on Genetic and Traditional Cardiovascular Risk Factors to Predict Premature Coronary Artery Disease
    Liu, Benrong
    Fang, Lei
    Xiong, Yujuan
    Du, Qiqi
    Xiang, Yang
    Chen, Xiaohui
    Tian, Chao-Wei
    Liu, Shi-Ming
    FRONTIERS IN BIOSCIENCE-LANDMARK, 2022, 27 (07):
  • [35] Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease
    Lili Chan
    Girish N. Nadkarni
    Fergus Fleming
    James R. McCullough
    Patricia Connolly
    Gohar Mosoyan
    Fadi El Salem
    Michael W. Kattan
    Joseph A. Vassalotti
    Barbara Murphy
    Michael J. Donovan
    Steven G. Coca
    Scott M. Damrauer
    Diabetologia, 2021, 64 : 1504 - 1515
  • [36] Novel models by machine learning to predict the risk of cardiac disease-specific death in young patients with breast cancer
    Li, Yi
    Li, Handong
    Ye, Xuan
    Zhu, Zhigang
    Qiu, Yixuan
    DISCOVER ONCOLOGY, 2024, 15 (01)
  • [37] Neural Dynamics in Parkinson's Disease: Integrating Machine Learning and Stochastic Modelling with Connectomic Data
    Shaheen, Hina
    Melnik, Roderick
    COMPUTATIONAL SCIENCE, ICCS 2024, PT IV, 2024, 14835 : 46 - 60
  • [38] A machine-learning model to predict suicide risk in Japan based on national survey data
    Chou, Po-Han
    Wang, Shao-Cheng
    Wu, Chi-Shin
    Horikoshi, Masaru
    Ito, Masaya
    FRONTIERS IN PSYCHIATRY, 2022, 13
  • [39] A machine learning approach to Predict the Engineering Students at risk of dropout and factors behind: Bangladesh Perspective
    Ahmed, Sheikh Arif
    Khan, Shahidul Islam
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [40] A Machine Learning Approach to Predict MRI Brain Abnormalities in Preterm Infants Using Clinical Data
    Ortega-Leon, Arantxa
    Khaled, Roa'a
    Rodriguez-Garcia, Maria Inmaculada
    Urda, Daniel
    Turias, Ignacio J.
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2023, PT I, 2023, 13919 : 419 - 430