Machine learning to predict stroke risk from routine hospital data: A systematic review

被引:1
作者
Heseltine-Carp, William [1 ]
Courtman, Megan [1 ,2 ]
Browning, Daniel [1 ]
Kasabe, Aishwarya [1 ]
Allen, Michael [3 ]
Streeter, Adam [4 ]
Ifeachor, Emmanuel [4 ,5 ]
James, Martin [6 ]
Mullin, Stephen [1 ]
机构
[1] Univ Plymouth, Room N6 ITTC Bldg Plymouth Sci Pk, Plymouth PL68BX, England
[2] Univ Plymouth, Plymouth PL4 8AA, England
[3] Univ Exeter, Med Sch, St Lukes Campus Heavitree Rd SC 2-30, Exeter EX4 4QJ, England
[4] Univ Plymouth, N15 ITTC1 Plymouth Sci Pk, Plymouth PL6 8BX, England
[5] Univ Plymouth, Sch Engn Comp & Math, Plymouth PL4 8AA, England
[6] Univ Exeter, Royal Devon & Exeter Hosp, Acad Dept Healthcare Older People, Exeter EX2 5DW, England
基金
英国医学研究理事会;
关键词
Stroke; Machine learning; Artificial intelligence; Routine hospital data; Risk evaluation; Ischaemic stroke; ATRIAL-FIBRILLATION; THROMBOEMBOLISM; STRATIFICATION;
D O I
10.1016/j.ijmedinf.2025.105811
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose: Stroke remains a leading cause of morbidity and mortality. Despite this, current risk stratification tools such as CHA2DS2-VASc and QRISK3 are of limited accuracy, particularly in those without a diagnosis of atrial- fibrillation. Hence, there is a need for more accurate stroke risk prediction models. Machine-learning (ML) may provide a solution to this by leveraging existing routine hospital databases to build accurate stroke risk prediction models and identify novel risk factors for stroke. Aims: In this systematic review we appraise current research using ML to predict stroke risk from routine hospital data. Based on these findings we then highlight common methodological limitations and recommendations for future research. Methods: In this review we identify 49 original research (38 in the general population and 11 in AF specific populations) articles from the PUBMED database from January-2013 to December-2024 using ML and routine hospital data to predict the risk of stroke. Results: ML models were able to accurately predict stroke risk in both AF specific and general populations, with AUCs ranging from 0.64 to 0.99. Where tested, ML also consistently outperformed traditional risk stratification tool, such as CHA2DS2-VASc. ML also appeared useful in identifying several novel risk factors from electrocardiogram, laboratory test and echocardiography data. However, the quality of datasets were often limited, there was a high suspicion of overfitting and models often lacked calibration, external validation and explainability analysis. Conclusion: Whilst ML has shown great potential in stroke prediction and identifying novel risk factors for stroke, improvements in study methodology is required prior to integration of ML into routine healthcare. Future research should adhere to the EQUATOR guidance on prediction models and encourage interdisciplinary collaboration between computer scientists and clinicians. Further prospective RCTs are also required to validate models in the clinical setting and the identify barriers of integrating ML into routine healthcare.
引用
收藏
页数:19
相关论文
共 50 条
[41]   Machine learning vs. regression models to predict the risk of Legionella contamination in a hospital water network [J].
De Giglio, Osvalda ;
Fasano, Fabrizio ;
Diella, Giusy ;
Spagnuolo, Valentina ;
Triggiano, Francesco ;
Lopuzzo, Marco ;
Apollonio, Francesca ;
Leone, Carla Maria ;
Montagna, Maria Teresa .
ANNALI DI IGIENE MEDICINA PREVENTIVA E DI COMUNITA, 2025, 37 (01) :128-140
[42]   Machine Learning for Detecting Atrial Fibrillation from ECGs: Systematic Review and Meta-Analysis [J].
Xie, Chenggong ;
Wang, Zhao ;
Yang, Chenglong ;
Liu, Jianhe ;
Liang, Hao .
REVIEWS IN CARDIOVASCULAR MEDICINE, 2024, 25 (01)
[43]   Machine Learning Approaches for Stroke Risk Prediction: Findings from the Suita Study [J].
Vu, Thien ;
Kokubo, Yoshihiro ;
Inoue, Mai ;
Yamamoto, Masaki ;
Mohsen, Attayeb ;
Martin-Morales, Agustin ;
Inoue, Takao ;
Dawadi, Research ;
Araki, Michihiro .
JOURNAL OF CARDIOVASCULAR DEVELOPMENT AND DISEASE, 2024, 11 (07)
[44]   Using Machine Learning to Predict 30-Day Hospital Readmissions in Patients with Atrial Fibrillation Undergoing Catheter Ablation [J].
Hung, Man ;
Lauren, Evelyn ;
Hon, Eric ;
Xu, Julie ;
Ruiz-Negron, Bianca ;
Rosales, Megan ;
Li, Wei ;
Barton, Tanner ;
O'Brien, Jacob ;
Su, Weicong .
JOURNAL OF PERSONALIZED MEDICINE, 2020, 10 (03) :1-10
[45]   Machine Learning for Credit Risk Prediction: A Systematic Literature Review [J].
Noriega, Jomark Pablo ;
Rivera, Luis Antonio ;
Herrera, Jose Alfredo .
DATA, 2023, 8 (11)
[46]   Machine learning approaches for the discovery of clinical pathways from patient data: A systematic review [J].
Muyama, Lillian ;
Neuraz, Antoine ;
Coulet, Adrien .
JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 160
[47]   Machine Learning and Big Data for Cybersecurity: Systematic Literature Review [J].
El Bouchtioui, En Naji ;
Bentaleb, Asmae ;
Abouchabaka, Jaafar .
DIGITAL TECHNOLOGIES AND APPLICATIONS, ICDTA 2024, VOL 1, 2024, 1098 :97-106
[48]   Machine Learning Approach to Predict Risk of 90-Day Hospital Readmissions in Patients With Atrial Fibrillation: Implications for Quality Improvement in Healthcare [J].
Hung, Man ;
Hon, Eric S. ;
Lauren, Evelyn ;
Xu, Julie ;
Judd, Gary ;
Su, Weicong .
HEALTH SERVICES RESEARCH AND MANAGERIAL EPIDEMIOLOGY, 2020, 7
[49]   Machine Learning to Predict Outcomes in Patients with Acute Gastrointestinal Bleeding: A Systematic Review [J].
Shung, Dennis ;
Simonov, Michael ;
Gentry, Mark ;
Au, Benjamin ;
Laine, Loren .
DIGESTIVE DISEASES AND SCIENCES, 2019, 64 (08) :2078-2087