Predicting survival in heart failure: a risk score based on machine-learning and change point algorithm

被引:8
作者
Kim, Wonse [1 ,14 ]
Park, Jin Joo [2 ]
Lee, Hae-Young [3 ]
Kim, Kye Hun [4 ]
Yoo, Byung-Su [5 ]
Kang, Seok-Min [6 ]
Baek, Sang Hong [7 ]
Jeon, Eun-Seok [8 ]
Kim, Jae-Joong [9 ]
Cho, Myeong-Chan [10 ]
Chae, Shung Chull [11 ]
Oh, Byung-Hee [12 ]
Kook, Woong [1 ]
Choi, Dong-Ju [2 ,13 ]
机构
[1] Seoul Natl Univ, Dept Math Sci, Gwanak Ro 1, Seoul, South Korea
[2] Seoul Natl Univ, Cardiovasc Ctr, Dept Internal Med, Div Cardiol,Bundang Hosp, Seongnam, South Korea
[3] Seoul Natl Univ Hosp, Dept Internal Med, Seoul, South Korea
[4] Chonnam Natl Univ, Heart Res Ctr, Gwangju, South Korea
[5] Yonsei Univ, Dept Internal Med, Wonju Coll Med, Wonju, South Korea
[6] Yonsei Univ, Dept Internal Med, Coll Med, Seoul, South Korea
[7] Catholic Univ Korea, Dept Internal Med, Seoul, South Korea
[8] Sungkyunkwan Univ, Dept Internal Med, Coll Med, Seoul, South Korea
[9] Asan Med Ctr, Dept Internal Med, Seoul, South Korea
[10] Chungbuk Natl Univ, Dept Internal Med, Coll Med, Cheongju, South Korea
[11] Kyungpook Natl Univ, Dept Internal Med, Coll Med, Daegu, South Korea
[12] Mediplex Sejong Hosp, Dept Internal Med, Incheon, South Korea
[13] Seoul Natl Univ, Cardiovasc Ctr, Dept Internal Med, Div Cardiol,Bundang Hosp, Gumiro 166, Gyeonggi Do, Seongnam, South Korea
[14] MetaEyes, 41 Yonsei Ro 5Da Gil, Seoul, South Korea
关键词
Heart failure; Machine learning; Grouped Lasso; Prognostic model; Mortality; Change-point analysis; OPERATING CHARACTERISTIC CURVES; MORTALITY; MODEL; HOSPITALIZATION; DISCHARGE;
D O I
10.1007/s00392-021-01870-7
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objective Machine learning (ML) algorithm can improve risk prediction because ML can select features and segment continuous variables effectively unbiased. We generated a risk score model for mortality with ML algorithms in East-Asian patients with heart failure (HF). Methods From the Korean Acute Heart Failure (KorAHF) registry, we used the data of 3683 patients with 27 continuous and 44 categorical variables. Grouped Lasso algorithm was used for the feature selection, and a novel continuous variable segmentation algorithm which is based on change-point analysis was developed for effectively segmenting the ranges of the continuous variables. Then, a risk score was assigned to each feature reflecting nonlinear relationship between features and survival times, and an integer score of maximum 100 was calculated for each patient. Results During 3-year follow-up time, 32.8% patients died. Using grouped Lasso, we identified 15 highly significant independent clinical features. The calculated risk score of each patient ranged between 1 and 71 points with a median of 36 (interquartile range: 27-45). The 3-year survival differed according to the quintiles of the risk score, being 80% and 17% in the 1st and 5th quintile, respectively. In addition, ML risk score had higher AUCs than MAGGIC-HF score to predict 1-year mortality (0.751 vs. 0.711, P < 0.001). Conclusions In East-Asian patients with HF, a novel risk score model based on ML and the new continuous variable segmentation algorithm performs better for mortality prediction than conventional prediction models.
引用
收藏
页码:1321 / 1333
页数:13
相关论文
共 29 条
[1]   Improving risk prediction in heart failure using machine learning [J].
Adler, Eric D. ;
Voors, Adriaan A. ;
Klein, Liviu ;
Macheret, Fima ;
Braun, Oscar O. ;
Urey, Marcus A. ;
Zhu, Wenhong ;
Sama, Iziah ;
Tadel, Matevz ;
Campagnari, Claudio ;
Greenberg, Barry ;
Yagil, Avi .
EUROPEAN JOURNAL OF HEART FAILURE, 2020, 22 (01) :139-147
[2]   Estimating and comparing time-dependent areas under receiver operating characteristic curves for censored event times with competing risks [J].
Blanche, Paul ;
Dartigues, Jean-Francois ;
Jacqmin-Gadda, Helene .
STATISTICS IN MEDICINE, 2013, 32 (30) :5381-5397
[3]  
Breheny P, 2009, STAT INTERFACE, V2, P369
[4]  
Bühlmann P, 2011, SPRINGER SER STAT, P1, DOI 10.1007/978-3-642-20192-9
[5]  
Chen J., 2000, Parametric Statistical Change Point Analysis with Applications to Genetics, V2nd, DOI 10.1007/978-1-4757-3131-6
[6]  
Chen Jie, 2011, Parametric Statistical Change Point Analysis With Applications to Genetics, Medicine, and Finance
[7]  
Geron A., 2019, Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow Concepts, Tools, and Techniques to Build Intelligent Systems
[8]  
Hanushek E.A., 2013, Statistical methods for social scientists
[9]  
Hastie T, 2009, Springer Series in Statistics
[10]   Artificial Intelligence in Cardiology [J].
Johnson, Kipp W. ;
Soto, Jessica Torres ;
Glicksberg, Benjamin S. ;
Shameer, Khader ;
Miotto, Riccardo ;
Ali, Mohsin ;
Ashley, Euan ;
Dudley, Joel T. .
JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2018, 71 (23) :2668-2679