Personalized prediction of incident hospitalization for cardiovascular disease in patients with hypertension using machine learning

被引:9
作者
Feng, Yuanchao [1 ,2 ]
Leung, Alexander A. [1 ,2 ,3 ]
Lu, Xuewen [4 ]
Liang, Zhiying [1 ,2 ]
Quan, Hude [1 ,2 ,5 ,6 ]
Walker, Robin L. [1 ,5 ,6 ]
机构
[1] Univ Calgary, Ctr Hlth informat, Cumming Sch Med, Dept Community Hlth Sci, Calgary, AB, Canada
[2] Univ Calgary, Libin Cardiovasc Inst, Calgary, AB, Canada
[3] Univ Calgary, Cumming Sch Med, Dept Med, Calgary, AB, Canada
[4] Univ Calgary, Dept Math & Stat, Calgary, AB, Canada
[5] OBrien Inst Publ Hlth, 3280 Hosp Drive NW, Calgary, AB T2N 4Z6, Canada
[6] Alberta Hlth Serv, 3280 Hosp Drive NW, Calgary, AB T2N 4Z6, Canada
关键词
Administrative health data; Machine learning; Personalized prediction; Hypertension patients; Cardiovascular disease; PREVENTION; GUIDELINES; MORTALITY;
D O I
10.1186/s12874-022-01814-3
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Prognostic information for patients with hypertension is largely based on population averages. The purpose of this study was to compare the performance of four machine learning approaches for personalized prediction of incident hospitalization for cardiovascular disease among newly diagnosed hypertensive patients. Methods: Using province-wide linked administrative health data in Alberta, we analyzed a cohort of 259,873 newly-diagnosed hypertensive patients from 2009 to 2015 who collectively had 11,863 incident hospitalizations for heart failure, myocardial infarction, and stroke. Linear multi-task logistic regression, neural multi-task logistic regression, random survival forest and Cox proportional hazard models were used to determine the number of event-free survivors at each time-point and to construct individual event-free survival probability curves. The predictive performance was evaluated by root mean squared error, mean absolute error, concordance index, and the Brier score. Results: The random survival forest model has the lowest root mean squared error value at 33.94 and lowest mean absolute error value at 28.37. Machine learning methods provide similar discrimination and calibration in the personalized survival prediction of hospitalizations for cardiovascular events in patients with hypertension. Neural multi-task logistic regression model has the highest concordance index at 0.8149 and lowest Brier score at 0.0242 for the personalized survival prediction. Conclusions: This is the first personalized survival prediction for cardiovascular diseases among hypertensive patients using administrative data. The four models tested in this analysis exhibited a similar discrimination and calibration ability in predicting personalized survival prediction of hypertension patients.
引用
收藏
页数:11
相关论文
共 28 条
[1]   Artificial neural networks for diagnosis and survival prediction in colon cancer [J].
Ahmed, Farid E. .
MOLECULAR CANCER, 2005, 4 (1)
[2]   Cardiovascular Event Prediction by Machine Learning The Multi-Ethnic Study of Atherosclerosis [J].
Ambale-Venkatesh, Bharath ;
Yang, Xiaoying ;
Wu, Colin O. ;
Liu, Kiang ;
Hundley, W. Gregory ;
McClelland, Robyn ;
Gomes, Antoinette S. ;
Folsom, Aaron R. ;
Shea, Steven ;
Guallar, Eliseo ;
Bluemke, David A. ;
Lima, Joao A. C. .
CIRCULATION RESEARCH, 2017, 121 (09) :1092-+
[3]  
Feng Y., 2020, PERSONALIZED SURVIVA
[4]   Saving 100 million lives by improving global treatment of hypertension and reducing cardiovascular disease risk factors [J].
Frieden, Thomas R. ;
Jaffe, Marc G. .
JOURNAL OF CLINICAL HYPERTENSION, 2018, 20 (02) :208-211
[5]  
Institute S, 2016, SAS SYST WIND
[6]  
Jachan M., 2009, PROBABILISTIC FORECA
[7]   Machine learning prediction in cardiovascular diseases: a meta-analysis [J].
Krittanawong, Chayakrit ;
Virk, Hafeez Ul Hassan ;
Bangalore, Sripal ;
Wang, Zhen ;
Johnson, Kipp W. ;
Pinotti, Rachel ;
Zhang, HongJu ;
Kaplin, Scott ;
Narasimhan, Bharat ;
Kitai, Takeshi ;
Baber, Usman ;
Halperin, Jonathan L. ;
Tang, W. H. Wilson .
SCIENTIFIC REPORTS, 2020, 10 (01)
[8]   Artificial Intelligence in Precision Cardiovascular Medicine [J].
Krittanawong, Chayakrit ;
Zhang, HongJu ;
Wang, Zhen ;
Aydar, Mehmet ;
Kitai, Takeshi .
JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2017, 69 (21) :2657-2664
[9]   The application of big data to cardiovascular disease: paths to precision medicine [J].
Leopold, Jane A. ;
Maron, Bradley A. ;
Loscalzo, Joseph .
JOURNAL OF CLINICAL INVESTIGATION, 2020, 130 (01) :29-38
[10]   A Multi-Task Learning Formulation for Survival Analysis [J].
Li, Yan ;
Wang, Jie ;
Ye, Jieping ;
Reddy, Chandan K. .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :1715-1724