Can machine-learning improve cardiovascular risk prediction using routine clinical data?

Cited by: 689
Authors
Weng, Stephen F. [1 ,2 ]
Reps, Jenna [3 ,4 ]
Kai, Joe [1 ,2 ]
Garibaldi, Jonathan M. [3 ,4 ]
Qureshi, Nadeem [1 ,2 ]
Affiliations
[1] Univ Nottingham, NIHR Sch Primary Care Res, Nottingham, England
[2] Univ Nottingham, Sch Med, Div Primary Care, Nottingham, England
[3] Univ Nottingham, Adv Data Anal Ctr, Nottingham, England
[4] Univ Nottingham, Sch Comp Sci, Nottingham, England
Source
PLOS ONE | 2017 / Volume 12 / Issue 04
Keywords
CORONARY EVENTS; VALIDATION; MODELS; REGRESSION; DISEASE; MUNSTER; PROFILE; WOMEN; MEN;
DOI
10.1371/journal.pone.0174944
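Chinese Library Classification number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code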
07 ; 0710 ; 09 ;
Abstract
Background: Current approaches to predicting cardiovascular risk fail to identify many people who would benefit from preventive treatment, while others receive unnecessary intervention. Machine-learning offers an opportunity to improve accuracy by exploiting complex interactions between risk factors. We assessed whether machine-learning can improve cardiovascular risk prediction.

Methods: Prospective cohort study using routine clinical data of 378,256 patients from UK family practices, free from cardiovascular disease at outset. Four machine-learning algorithms (random forest, logistic regression, gradient boosting machines, neural networks) were compared to an established algorithm (American College of Cardiology guidelines) to predict first cardiovascular event over 10 years. Predictive accuracy was assessed by area under the receiver operating characteristic curve (AUC), and by sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) for predicting 7.5% cardiovascular risk (the threshold for initiating statins).

Findings: 24,970 incident cardiovascular events (6.6%) occurred. Compared to the established risk prediction algorithm (AUC 0.728, 95% CI 0.723-0.735), machine-learning algorithms improved prediction: random forest +1.7% (AUC 0.745, 95% CI 0.739-0.750), logistic regression +3.2% (AUC 0.760, 95% CI 0.755-0.766), gradient boosting +3.3% (AUC 0.761, 95% CI 0.755-0.766), neural networks +3.6% (AUC 0.764, 95% CI 0.759-0.769). The highest achieving algorithm (neural networks) correctly predicted 4,998/7,404 cases (sensitivity 67.5%, PPV 18.4%) and 53,458/75,585 non-cases (specificity 70.7%, NPV 95.7%), correctly predicting 355 (+7.6%) more patients who developed cardiovascular disease compared to the established algorithm.

Conclusions: Machine-learning significantly improves accuracy of cardiovascular risk prediction, increasing the number of patients identified who could benefit from preventive treatment, while avoiding unnecessary treatment of others.
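As a rough check on how the reported figures relate to each other, the sketch below recomputes sensitivity, specificity, PPV, and NPV from the neural-network counts in the abstract, and the AUC gains as absolute differences from the baseline AUC. The class and function names are hypothetical, and the false-negative and false-positive counts are derived by subtraction from the stated totals rather than reported directly. This is not the authors' code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Counts at a fixed decision threshold (here, 7.5% predicted risk)."""
    true_pos: int
    false_neg: int
    true_neg: int
    false_pos: int


def classification_metrics(c: ConfusionCounts) -> dict:
    """Return sensitivity, specificity, PPV and NPV as percentages (1 d.p.)."""
    return {
        "sensitivity": round(100 * c.true_pos / (c.true_pos + c.false_neg), 1),
        "specificity": round(100 * c.true_neg / (c.true_neg + c.false_pos), 1),
        "ppv": round(100 * c.true_pos / (c.true_pos + c.false_pos), 1),
        "npv": round(100 * c.true_neg / (c.true_neg + c.false_neg), 1),
    }


# Neural-network counts from the abstract: 4,998 of 7,404 cases and
# 53,458 of 75,585 non-cases were correctly predicted.
# Assumption: the remaining cases and non-cases are the misclassified ones.
neural_net = ConfusionCounts(
    true_pos=4998,
    false_neg=7404 - 4998,
    true_neg=53458,
    false_pos=75585 - 53458,
)
metrics = classification_metrics(neural_net)

# AUC gains in the abstract match absolute differences from the baseline,
# expressed in percentage points.
baseline_auc = 0.728
model_auc = {
    "random forest": 0.745,
    "logistic regression": 0.760,
    "gradient boosting": 0.761,
    "neural networks": 0.764,
}
auc_gain = {
    name: round(100 * (auc - baseline_auc), 1) for name, auc in model_auc.items()
}

if __name__ == "__main__":
    print(metrics)
    print(auc_gain)
```

The low PPV alongside the high NPV follows from the low event rate (6.6%): even at roughly 70% specificity, false positives outnumber true positives several times over.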
Pages: 14
Related papers
50 in total
  • [41] Prediction of Recurrent Atherosclerotic Cardiovascular Disease Risk Using Machine Learning and Electronic Health Record Data
    Sarraju, Ashish
    Ward, Andrew
    Chung, Sukyung
    Li, Jiang
    Scheinker, David
    Rodriguez, Fatima
    CIRCULATION, 2020, 142
  • [42] RETRACTED: Clinical Data Analysis for Prediction of Cardiovascular Disease Using Machine Learning Techniques (Retracted Article)
    Nadakinamani, Rajkumar Gangappa
    Reyana, A.
    Kautish, Sandeep
    Vibith, A. S.
    Gupta, Yogita
    Abdelwahab, Sayed F.
    Mohamed, Ali Wagdy
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [43] COMPARISON OF A MACHINE-LEARNING PREDICTION ALGORITHM WITH CLINICAL TOOLS FOR THE IDENTIFICATION OF DIABETIC PATIENTS AT RISK FOR NASH
    Tietz, Andreas
    Bader, Giovanni
    Doherty, Matt
    Reinhart, Brenda
    Balp, Maria-Magdalena
    Pedrosa, Marcos C.
    Acharya, Sandip
    Loeffler, Juergen
    Schattenberg, Joern M.
    HEPATOLOGY, 2020, 72 : 907A - 908A
  • [44] Machine-learning algorithm that can improve the diagnostic accuracy of septic arthritis of the knee
    Eun-Seok Choi
    Jae Ang Sim
    Young Gon Na
    Jong-Keun Seon
    Hyun Dae Shin
    Knee Surgery, Sports Traumatology, Arthroscopy, 2021, 29 : 3142 - 3148
  • [45] Machine-learning algorithm that can improve the diagnostic accuracy of septic arthritis of the knee
    Choi, Eun-Seok
    Sim, Jae Ang
    Na, Young Gon
    Seon, Jong-Keun
    Shin, Hyun Dae
    KNEE SURGERY SPORTS TRAUMATOLOGY ARTHROSCOPY, 2021, 29 (10) : 3142 - 3148
  • [46] Can machine-learning algorithms improve upon classical palaeoenvironmental reconstruction models?
    Sun, Peng
    Holden, Philip B.
    Birks, H. John B.
    CLIMATE OF THE PAST, 2024, 20 (10) : 2373 - 2398
  • [47] Credit Risk Analysis Using Machine-Learning Algorithms
    Alagoz, Gokhan
    Canakoglu, Ethem
    29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [48] Classification of Cardiovascular Risk Using Accelerometer Data and Machine Learning Algorithms
    Boiarskaia, Elena
    Liang, Feng
    Zhu, Weimo
    MEDICINE AND SCIENCE IN SPORTS AND EXERCISE, 2014, 46 (05) : 717 - 717
  • [49] Characterizing EMG data using machine-learning tools
    Yousefi, Jamileh
    Hamilton-Wright, Andrew
    COMPUTERS IN BIOLOGY AND MEDICINE, 2014, 51 : 1 - 13
  • [50] Machine-learning for the prediction of one-year seizure recurrence based on routine electroencephalography
    Émile Lemoine
    Denahin Toffa
    Geneviève Pelletier-Mc Duff
    An Qi Xu
    Mezen Jemel
    Jean-Daniel Tessier
    Frédéric Lesage
    Dang K. Nguyen
    Elie Bou Assi
    Scientific Reports, 13