Prediction of Long-Term Stroke Recurrence Using Machine Learning Models

被引:32
|
作者
Abedi, Vida [1 ,2 ]
Avula, Venkatesh [1 ]
Chaudhary, Durgesh [3 ]
Shahjouei, Shima [3 ]
Khan, Ayesha [3 ]
Griessenauer, Christoph J. [3 ,4 ]
Li, Jiang [1 ]
Zand, Ramin [3 ]
机构
[1] Geisinger Hlth Syst, Dept Mol & Funct Genom, Danville, PA 17822 USA
[2] Virginia Tech, Biocomplex Inst, Blacksburg, VA 24061 USA
[3] Geisinger Hlth Syst, Geisinger Neurosci Inst, Danville, PA 17822 USA
[4] Paracelsus Med Univ, Res Inst Neurointervent, A-5020 Salzburg, Austria
关键词
healthcare; artificial intelligence; machine learning; interpretable machine learning; explainable machine learning; ischemic stroke; clinical decision support system; electronic health record; outcome prediction; recurrent stroke; INSTRUMENT-II; RISK SCORE; VALIDATION;
D O I
10.3390/jcm10061286
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: The long-term risk of recurrent ischemic stroke, estimated to be between 17% and 30%, cannot be reliably assessed at an individual level. Our goal was to study whether machine-learning can be trained to predict stroke recurrence and identify key clinical variables and assess whether performance metrics can be optimized. Methods: We used patient-level data from electronic health records, six interpretable algorithms (Logistic Regression, Extreme Gradient Boosting, Gradient Boosting Machine, Random Forest, Support Vector Machine, Decision Tree), four feature selection strategies, five prediction windows, and two sampling strategies to develop 288 models for up to 5-year stroke recurrence prediction. We further identified important clinical features and different optimization strategies. Results: We included 2091 ischemic stroke patients. Model area under the receiver operating characteristic (AUROC) curve was stable for prediction windows of 1, 2, 3, 4, and 5 years, with the highest score for the 1-year (0.79) and the lowest score for the 5-year prediction window (0.69). A total of 21 (7%) models reached an AUROC above 0.73 while 110 (38%) models reached an AUROC greater than 0.7. Among the 53 features analyzed, age, body mass index, and laboratory-based features (such as high-density lipoprotein, hemoglobin A1c, and creatinine) had the highest overall importance scores. The balance between specificity and sensitivity improved through sampling strategies. Conclusion: All of the selected six algorithms could be trained to predict the long-term stroke recurrence and laboratory-based variables were highly associated with stroke recurrence. The latter could be targeted for personalized interventions. Model performance metrics could be optimized, and models can be implemented in the same healthcare system as intelligent decision support for targeted intervention.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [21] Comparative assessment of Artificial Intelligence-Machine Learning (AI-ML) models for prediction of the long-term clinical outcome in stroke patients
    Vaidya, Bhalchandra
    Saraf, Amit
    Mathew, Manu
    Parsons, Mark
    Singh, Sanjay
    CEREBROVASCULAR DISEASES, 2024, 53 : 145 - 145
  • [22] DEEP PHENOTYPING AND PREDICTION OF LONG-TERM HEART FAILURE BY MACHINE LEARNING
    Zhuang, Xiaodong
    Sun, Xiuting
    Zhong, Xiangbin
    Zhou, Huimin
    Zhang, Shaozhao
    Liao, Xinxue
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2019, 73 (09) : 690 - 690
  • [23] Application of Innovative Machine Learning Techniques for Long-Term Rainfall Prediction
    Markuna, Suman
    Kumar, Pankaj
    Ali, Rawshan
    Vishwkarma, Dinesh Kumar
    Kushwaha, Kuldeep Singh
    Kumar, Rohitashw
    Singh, Vijay Kumar
    Chaudhary, Sumit
    Kuriqi, Alban
    PURE AND APPLIED GEOPHYSICS, 2023, 180 (01) : 335 - 363
  • [24] Application of Innovative Machine Learning Techniques for Long-Term Rainfall Prediction
    Suman Markuna
    Pankaj Kumar
    Rawshan Ali
    Dinesh Kumar Vishwkarma
    Kuldeep Singh Kushwaha
    Rohitashw Kumar
    Vijay Kumar Singh
    Sumit Chaudhary
    Alban Kuriqi
    Pure and Applied Geophysics, 2023, 180 : 335 - 363
  • [25] A Review of Machine Learning Methods for Long-Term Time Series Prediction
    Ptotic, Milan P.
    Stojanovic, Milos B.
    Popovic, Predrag M.
    2022 57TH INTERNATIONAL SCIENTIFIC CONFERENCE ON INFORMATION, COMMUNICATION AND ENERGY SYSTEMS AND TECHNOLOGIES (ICEST), 2022, : 205 - 208
  • [26] Evaluation of Machine Learning Methods for the Long-Term Prediction of Cardiac Diseases
    Schlemmer, Alexander
    Zwirnmann, Henning
    Zabel, Markus
    Parlitz, Ulrich
    Luther, Stefan
    2014 8TH CONFERENCE OF THE EUROPEAN STUDY GROUP ON CARDIOVASCULAR OSCILLATIONS (ESGCO), 2014, : 157 - +
  • [27] Pavement Roughness Prediction Using Explainable and Supervised Machine Learning Technique for Long-Term Performance
    Sandamal, Kelum
    Shashiprabha, Sachini
    Muttil, Nitin
    Rathnayake, Upaka
    SUSTAINABILITY, 2023, 15 (12)
  • [28] Prediction of long-term prestress loss for prestressed concrete cylinder structures using machine learning
    Zhang, Hang
    Guo, Quan-Quan
    Xu, Li-Yan
    ENGINEERING STRUCTURES, 2023, 279
  • [29] Prediction of long-term creep modulus of thermoplastics using brief tests and interpretable machine learning
    Lobato, Hector
    Cernuda, Carlos
    Zulueta, Kepa
    Arriaga, Aitor
    Matxain, Jon M.
    Burgoa, Aizeti
    INTERNATIONAL JOURNAL OF SOLIDS AND STRUCTURES, 2024, 304
  • [30] Forecasting of Mid- and Long-Term Wind Power Using Machine Learning and Regression Models
    Ahmed, Sina Ibne
    Ranganathan, Prakash
    Salehfar, Hossein
    2021 IEEE KANSAS POWER AND ENERGY CONFERENCE (KPEC), 2021,