Prediction of Long-Term Stroke Recurrence Using Machine Learning Models

Cited by: 32
Authors
Abedi, Vida [1, 2]
Avula, Venkatesh [1]
Chaudhary, Durgesh [3]
Shahjouei, Shima [3]
Khan, Ayesha [3]
Griessenauer, Christoph J. [3, 4]
Li, Jiang [1]
Zand, Ramin [3]
Affiliations
[1] Geisinger Hlth Syst, Dept Mol & Funct Genom, Danville, PA 17822 USA
[2] Virginia Tech, Biocomplex Inst, Blacksburg, VA 24061 USA
[3] Geisinger Hlth Syst, Geisinger Neurosci Inst, Danville, PA 17822 USA
[4] Paracelsus Med Univ, Res Inst Neurointervent, A-5020 Salzburg, Austria
Keywords
healthcare; artificial intelligence; machine learning; interpretable machine learning; explainable machine learning; ischemic stroke; clinical decision support system; electronic health record; outcome prediction; recurrent stroke; INSTRUMENT-II; RISK SCORE; VALIDATION
DOI
10.3390/jcm10061286
Chinese Library Classification number
R5 [Internal Medicine]
Discipline classification code
1002; 100201
Abstract
Background: The long-term risk of recurrent ischemic stroke, estimated at 17% to 30%, cannot be reliably assessed at the individual level. Our goal was to study whether machine learning models can be trained to predict stroke recurrence, to identify key clinical variables, and to assess whether performance metrics can be optimized. Methods: We used patient-level data from electronic health records, six interpretable algorithms (Logistic Regression, Extreme Gradient Boosting, Gradient Boosting Machine, Random Forest, Support Vector Machine, Decision Tree), four feature selection strategies, five prediction windows, and two sampling strategies to develop 288 models for predicting stroke recurrence up to 5 years. We further identified important clinical features and evaluated different optimization strategies. Results: We included 2091 ischemic stroke patients. The area under the receiver operating characteristic curve (AUROC) was stable across prediction windows of 1, 2, 3, 4, and 5 years, with the highest score for the 1-year window (0.79) and the lowest for the 5-year window (0.69). A total of 21 (7%) models reached an AUROC above 0.73, while 110 (38%) models reached an AUROC greater than 0.7. Among the 53 features analyzed, age, body mass index, and laboratory-based features (such as high-density lipoprotein, hemoglobin A1c, and creatinine) had the highest overall importance scores. The balance between specificity and sensitivity improved through sampling strategies. Conclusion: All six selected algorithms could be trained to predict long-term stroke recurrence, and laboratory-based variables were highly associated with stroke recurrence. The latter could be targeted for personalized interventions. Model performance metrics could be optimized, and the models can be implemented within the same healthcare system as intelligent decision support for targeted intervention.
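The abstract reports its results as AUROC, sensitivity, and specificity. As a minimal sketch of what those metrics measure, the Python below computes them on a small made-up set of labels and risk scores. The data, the function names, and the decision thresholds are illustrative assumptions and are not taken from the study, which reports improving the sensitivity/specificity balance through sampling strategies rather than threshold changes.

```python
def auroc(labels, scores):
    """AUROC as the probability that a randomly chosen positive case
    receives a higher score than a randomly chosen negative case
    (ties count as half a win)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def sensitivity_specificity(labels, scores, threshold):
    """Sensitivity and specificity when scores >= threshold are
    classified as positive (here: predicted recurrence)."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < threshold)
    tn = sum(1 for y, s in zip(labels, scores) if y == 0 and s < threshold)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical toy data: 1 = recurrence, 0 = no recurrence.
toy_labels = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
toy_scores = [0.9, 0.6, 0.3, 0.7, 0.4, 0.2, 0.2, 0.1, 0.1, 0.05]

toy_auroc = auroc(toy_labels, toy_scores)
sens_high, spec_high = sensitivity_specificity(toy_labels, toy_scores, 0.5)
sens_low, spec_low = sensitivity_specificity(toy_labels, toy_scores, 0.25)

print(f"AUROC: {toy_auroc:.3f}")
print(f"threshold 0.50: sensitivity={sens_high:.3f}, specificity={spec_high:.3f}")
print(f"threshold 0.25: sensitivity={sens_low:.3f}, specificity={spec_low:.3f}")
```

On this toy data, lowering the threshold raises sensitivity and lowers specificity. That trade-off is why a single AUROC value does not by itself describe how a model behaves at a chosen operating point.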
Pages: 1 / 16
Page count: 16