Predictive Performance of Machine Learning-Based Models for Poststroke Clinical Outcomes in Comparison With Conventional Prognostic Scores: Multicenter, Hospital-Based Observational Study

被引:0
作者
Irie, Fumi [1 ,2 ]
Matsumoto, Koutarou [3 ]
Matsuo, Ryu [1 ,2 ]
Nohara, Yasunobu [4 ]
Wakisaka, Yoshinobu [2 ]
Ago, Tetsuro [2 ,5 ]
Nakashima, Naoki [6 ]
Kitazono, Takanari [2 ,5 ]
Kamouchi, Masahiro [1 ,5 ]
机构
[1] Kyushu Univ, Grad Sch Med Sci, Dept Hlth Care Adm & Management, 3-1-1 Maidashi,Higashi Ku, Fukuoka, 8128582, Japan
[2] Kyushu Univ, Grad Sch Med Sci, Dept Med & Clin Sci, Fukuoka, Japan
[3] Kurume Univ, Biostat Ctr, Grad Sch Med, Kurume, Japan
[4] Kumamoto Univ, Fac Adv Sci & Technol, Big Data Sci & Technol, Kumamoto, Japan
[5] Kyushu Univ, Ctr Cohort Studies, Grad Sch Med Sci, Fukuoka, Japan
[6] Kyushu Univ Hosp, Med Informat Ctr, Fukuoka, Japan
来源
JMIR AI | 2024年 / 3卷
关键词
brain infarction; outcome; prediction; machine learning; prognostic score; ACUTE ISCHEMIC-STROKE; HEALTH-CARE PROFESSIONALS; BIG DATA; LOGISTIC-REGRESSION; EARLY MANAGEMENT; HIGH-RISK; ALGORITHMS; GUIDELINES;
D O I
10.2196/46840
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Although machine learning is a promising tool for making prognoses, the performance of machine learning in predicting outcomes after stroke remains to be examined. Objective: This study aims to examine how much data-driven models with machine learning improve predictive performance for poststroke outcomes compared with conventional stroke prognostic scores and to elucidate how explanatory variables in machine learning-based models differ from the items of the stroke prognostic scores. Methods: We used data from 10,513 patients who were registered in a multicenter prospective stroke registry in Japan between 2007 and 2017. The outcomes were poor functional outcome (modified Rankin Scale score >2) and death at 3 months after stroke. Machine learning-based models were developed using all variables with regularization methods, random forests, or boosted trees. Item-based regression models were developed using the items of these 3 scores. The model performance was assessed in terms of discrimination and calibration. To compare the predictive performance of the data-driven model with that of the item-based model, we performed internal validation after random splits of identical populations into 80% of patients as a training set and 20% of patients as a test set; the models were developed in the training set and were validated in the test set. We evaluated the contribution of each variable to the models and compared the predictors used in the machine learning-based models with the items of the stroke prognostic scores. Results: The mean age of the study patients was 73.0 (SD 12.5) years, and 59.1% (6209/10,513) of them were men. The area under the receiver operating characteristic curves and the area under the precision-recall curves for predicting poststroke outcomes were higher for machine learning-based models than for item-based models in identical populations after random splits. Machine learning-based modelsalso performed better than item-based modelsin terms of the Brier score. Machine learning-based models used different explanatory variables, such as laboratory data, from the items of the conventional stroke prognostic scores. Including these data in the machine learning-based models as explanatory variables improved performance in predicting outcomesafter stroke, especially poststroke death. Conclusions: Machine learning-based models performed better in predicting poststroke outcomes than regression models using the items of conventional stroke prognostic scores, although they required additional variables, such as laboratory data, to attain improved performance. Further studies are warranted to validate the usefulness of machine learning in clinical settings.
引用
收藏
页数:20
相关论文
共 44 条
[1]   Functional Outcome Prediction in Ischemic Stroke: A Comparison of Machine Learning Algorithms and Regression Models [J].
Alaka, Shakiru A. ;
Menon, Bijoy K. ;
Brobbey, Anita ;
Williamson, Tyler ;
Goyal, Mayank ;
Demchuk, Andrew M. ;
Hill, Michael D. ;
Sajobi, Tolulope T. .
FRONTIERS IN NEUROLOGY, 2020, 11
[2]   Machine Learning for Outcome Prediction of Acute Ischemic Stroke Post Intra-Arterial Therapy [J].
Asadi, Hamed ;
Dowling, Richard ;
Yan, Bernard ;
Mitchell, Peter .
PLOS ONE, 2014, 9 (02)
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]   Multimodal Predictive Modeling of Endovascular Treatment Outcome for Acute Ischemic Stroke Using Machine-Learning [J].
Brugnara, Gianluca ;
Neuberger, Ulf ;
Mahmutoglu, Mustafa A. ;
Foltyn, Martha ;
Herweh, Christian ;
Nagel, Simon ;
Schonenberger, Silvia ;
Heiland, Sabine ;
Ulfert, Christian ;
Ringleb, Peter Arthur ;
Bendszus, Martin ;
Mohlenbruch, Markus A. ;
Pfaff, Johannes A. R. ;
Vollmuth, Philipp .
STROKE, 2020, 51 (12) :3541-3551
[5]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[6]   A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models [J].
Christodoulou, Evangelia ;
Ma, Jie ;
Collins, Gary S. ;
Steyerberg, Ewout W. ;
Verbakel, Jan Y. ;
Van Calster, Ben .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2019, 110 :12-22
[7]  
Collins GS, 2015, ANN INTERN MED, V162, P55, DOI [10.1016/j.jclinepi.2014.11.010, 10.7326/M14-0697, 10.1002/bjs.9736, 10.1016/j.eururo.2014.11.025, 10.7326/M14-0698, 10.1038/bjc.2014.639, 10.1136/bmj.g7594, 10.1186/s12916-014-0241-z, 10.1111/eci.12376]
[8]   Logistic regression and machine learning predicted patient mortality from large sets of diagnosis codes comparably [J].
Cowling, Thomas E. ;
Cromwell, David A. ;
Bellot, Alexis ;
Sharples, Linda D. ;
van der Meulen, Jan .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2021, 133 :43-52
[9]   Logistic regression and artificial neural network classification models: a methodology review [J].
Dreiseitl, S ;
Ohno-Machado, L .
JOURNAL OF BIOMEDICAL INFORMATICS, 2002, 35 (5-6) :352-359
[10]   Comprehensive Stroke Care and Outcomes Time for a Paradigm Shift [J].
Duncan, Pamela W. ;
Bushnell, Cheryl ;
Sissine, Mysha ;
Coleman, Sylvia ;
Lutz, Barbara J. ;
Johnson, Anna M. ;
Radman, Meghan ;
Bettger, Janet Pvru ;
Zorowitz, Richard D. ;
Stein, Joel .
STROKE, 2021, 52 (01) :385-393