Prediction of Long-Term Stroke Recurrence Using Machine Learning Models

被引：32

作者：

Abedi, Vida ^{[1
,2
]}

Avula, Venkatesh ^{[1
]}

Chaudhary, Durgesh ^{[3
]}

Shahjouei, Shima ^{[3
]}

Khan, Ayesha ^{[3
]}

Griessenauer, Christoph J. ^{[3
,4
]}

Li, Jiang ^{[1
]}

Zand, Ramin ^{[3
]}

机构：

[1] Geisinger Hlth Syst, Dept Mol & Funct Genom, Danville, PA 17822 USA

[2] Virginia Tech, Biocomplex Inst, Blacksburg, VA 24061 USA

[3] Geisinger Hlth Syst, Geisinger Neurosci Inst, Danville, PA 17822 USA

[4] Paracelsus Med Univ, Res Inst Neurointervent, A-5020 Salzburg, Austria

来源：

JOURNAL OF CLINICAL MEDICINE | 2021年 / 10卷 / 06期

关键词：

healthcare; artificial intelligence; machine learning; interpretable machine learning; explainable machine learning; ischemic stroke; clinical decision support system; electronic health record; outcome prediction; recurrent stroke; INSTRUMENT-II; RISK SCORE; VALIDATION;

D O I：

10.3390/jcm10061286

中图分类号：

R5 [内科学];

学科分类号：

1002 ; 100201 ;

摘要：

Background: The long-term risk of recurrent ischemic stroke, estimated to be between 17% and 30%, cannot be reliably assessed at an individual level. Our goal was to study whether machine-learning can be trained to predict stroke recurrence and identify key clinical variables and assess whether performance metrics can be optimized. Methods: We used patient-level data from electronic health records, six interpretable algorithms (Logistic Regression, Extreme Gradient Boosting, Gradient Boosting Machine, Random Forest, Support Vector Machine, Decision Tree), four feature selection strategies, five prediction windows, and two sampling strategies to develop 288 models for up to 5-year stroke recurrence prediction. We further identified important clinical features and different optimization strategies. Results: We included 2091 ischemic stroke patients. Model area under the receiver operating characteristic (AUROC) curve was stable for prediction windows of 1, 2, 3, 4, and 5 years, with the highest score for the 1-year (0.79) and the lowest score for the 5-year prediction window (0.69). A total of 21 (7%) models reached an AUROC above 0.73 while 110 (38%) models reached an AUROC greater than 0.7. Among the 53 features analyzed, age, body mass index, and laboratory-based features (such as high-density lipoprotein, hemoglobin A1c, and creatinine) had the highest overall importance scores. The balance between specificity and sensitivity improved through sampling strategies. Conclusion: All of the selected six algorithms could be trained to predict the long-term stroke recurrence and laboratory-based variables were highly associated with stroke recurrence. The latter could be targeted for personalized interventions. Model performance metrics could be optimized, and models can be implemented in the same healthcare system as intelligent decision support for targeted intervention.

引用

页码：1 / 16

页数：16

共 50 条

[21] Comparative assessment of Artificial Intelligence-Machine Learning (AI-ML) models for prediction of the long-term clinical outcome in stroke patients
Vaidya, Bhalchandra
Saraf, Amit
Mathew, Manu
Parsons, Mark
Singh, Sanjay
CEREBROVASCULAR DISEASES, 2024, 53 : 145 - 145
[22] DEEP PHENOTYPING AND PREDICTION OF LONG-TERM HEART FAILURE BY MACHINE LEARNING
Zhuang, Xiaodong
Sun, Xiuting
Zhong, Xiangbin
Zhou, Huimin
Zhang, Shaozhao
Liao, Xinxue
JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2019, 73 (09) : 690 - 690
[23] Application of Innovative Machine Learning Techniques for Long-Term Rainfall Prediction
Markuna, Suman
Kumar, Pankaj
Ali, Rawshan
Vishwkarma, Dinesh Kumar
Kushwaha, Kuldeep Singh
Kumar, Rohitashw
Singh, Vijay Kumar
Chaudhary, Sumit
Kuriqi, Alban
PURE AND APPLIED GEOPHYSICS, 2023, 180 (01) : 335 - 363
[24] Application of Innovative Machine Learning Techniques for Long-Term Rainfall Prediction
Suman Markuna
Pankaj Kumar
Rawshan Ali
Dinesh Kumar Vishwkarma
Kuldeep Singh Kushwaha
Rohitashw Kumar
Vijay Kumar Singh
Sumit Chaudhary
Alban Kuriqi
Pure and Applied Geophysics, 2023, 180 : 335 - 363
[25] A Review of Machine Learning Methods for Long-Term Time Series Prediction
Ptotic, Milan P.
Stojanovic, Milos B.
Popovic, Predrag M.
2022 57TH INTERNATIONAL SCIENTIFIC CONFERENCE ON INFORMATION, COMMUNICATION AND ENERGY SYSTEMS AND TECHNOLOGIES (ICEST), 2022, : 205 - 208
[26] Evaluation of Machine Learning Methods for the Long-Term Prediction of Cardiac Diseases
Schlemmer, Alexander
Zwirnmann, Henning
Zabel, Markus
Parlitz, Ulrich
Luther, Stefan
2014 8TH CONFERENCE OF THE EUROPEAN STUDY GROUP ON CARDIOVASCULAR OSCILLATIONS (ESGCO), 2014, : 157 - +
[27] Pavement Roughness Prediction Using Explainable and Supervised Machine Learning Technique for Long-Term Performance
Sandamal, Kelum
Shashiprabha, Sachini
Muttil, Nitin
Rathnayake, Upaka
SUSTAINABILITY, 2023, 15 (12)
[28] Prediction of long-term prestress loss for prestressed concrete cylinder structures using machine learning
Zhang, Hang
Guo, Quan-Quan
Xu, Li-Yan
ENGINEERING STRUCTURES, 2023, 279
[29] Prediction of long-term creep modulus of thermoplastics using brief tests and interpretable machine learning
Lobato, Hector
Cernuda, Carlos
Zulueta, Kepa
Arriaga, Aitor
Matxain, Jon M.
Burgoa, Aizeti
INTERNATIONAL JOURNAL OF SOLIDS AND STRUCTURES, 2024, 304
[30] Forecasting of Mid- and Long-Term Wind Power Using Machine Learning and Regression Models
Ahmed, Sina Ibne
Ranganathan, Prakash
Salehfar, Hossein
2021 IEEE KANSAS POWER AND ENERGY CONFERENCE (KPEC), 2021,

← 1 2 3 4 5 →