Development and validation of a machine learning-based predictive model for assessing the 90-day prognostic outcome of patients with spontaneous intracerebral hemorrhage

被引:16
作者
Geng, Zhi [1 ,2 ,3 ]
Yang, Chaoyi [1 ,2 ,3 ]
Zhao, Ziye [1 ,2 ,3 ]
Yan, Yibing [1 ,2 ,3 ]
Guo, Tao [4 ]
Liu, Chaofan [4 ]
Wu, Aimei [5 ]
Wu, Xingqi [1 ,2 ,3 ]
Wei, Ling [1 ,2 ,3 ]
Tian, Yanghua [1 ,2 ,3 ,8 ]
Hu, Panpan [1 ,2 ,3 ,6 ,7 ]
Wang, Kai [1 ,2 ,3 ,6 ,7 ]
机构
[1] Anhui Med Univ, Affiliated Hosp 1, Dept Neurol, Hefei, Peoples R China
[2] Anhui Prov Key Lab Cognit & Neuropsychiat Disorder, Hefei, Peoples R China
[3] Collaborat Innovat Ctr Neuropsychiat Disorder & Me, Hefei 230000, Peoples R China
[4] Univ Sci & Technol China, Ctr Biomed Imaging, Hefei 230026, Anhui, Peoples R China
[5] Anhui Med Univ, Hefei Hosp, Peoples Hosp Hefei 2, Dept Neurol, Hefei, Peoples R China
[6] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei, Peoples R China
[7] Anhui Med Univ, Anhui Prov Inst Translat Med, Hefei, Peoples R China
[8] Anhui Med Univ, Hosp 2, Dept Sleep Psychol, Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
Spontaneous intracerebral hemorrhage; Prognosis; Prediction model; Machine learning; RISK-FACTORS; STROKE;
D O I
10.1186/s12967-024-04896-3
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Background Spontaneous intracerebral hemorrhage (sICH) is associated with significant mortality and morbidity. Predicting the prognosis of patients with sICH remains an important issue, which significantly affects treatment decisions. Utilizing readily available clinical parameters to anticipate the unfavorable prognosis of sICH patients holds notable clinical significance. This study employs five machine learning algorithms to establish a practical platform for the prediction of short-term prognostic outcomes in individuals afflicted with sICH. Methods Within the framework of this retrospective analysis, the model underwent training utilizing data gleaned from 413 cases from the training center, with subsequent validation employing data from external validation center. Comprehensive clinical information, laboratory analysis results, and imaging features pertaining to sICH patients were harnessed as training features for machine learning. We developed and validated the model efficacy using all the selected features of the patients using five models: Support Vector Machine (SVM), Logistic Regression (LR), Random Forest (RF), XGboost and LightGBM, respectively. The process of Recursive Feature Elimination (RFE) was executed for optimal feature screening. An internal five-fold cross-validation was employed to pinpoint the most suitable hyperparameters for the model, while an external five-fold cross-validation was implemented to discern the machine learning model demonstrating the superior average performance. Finally, the machine learning model with the best average performance is selected as our final model while using it for external validation. Evaluation of the machine learning model's performance was comprehensively conducted through the utilization of the ROC curve, accuracy, and other relevant indicators. The SHAP diagram was utilized to elucidate the variable importance within the model, culminating in the amalgamation of the above metrics to discern the most succinct features and establish a practical prognostic prediction platform. Results A total of 413 patients with sICH patients were collected in the training center, of which 180 were patients with poor prognosis. A total of 74 patients with sICH were collected in the external validation center, of which 26 were patients with poor prognosis. Within the training set, the test set AUC values for SVM, LR, RF, XGBoost, and LightGBM models were recorded as 0.87, 0.896, 0.916, 0.885, and 0.912, respectively. The best average performance of the machine learning models in the training set was the RF model (average AUC: 0.906 +/- 0.029, P < 0.01). The model still maintains a good performance in the external validation center, with an AUC of 0.817 (95% CI 0.705-0.928). Pertaining to feature importance for short-term prognostic attributes of sICH patients, the NIHSS score reigned supreme, succeeded by AST, Age, white blood cell, and hematoma volume, among others. In culmination, guided by the RF model's variable importance weight and the model's ROC curve insights, the NIHSS score, AST, Age, white blood cell, and hematoma volume were integrated to forge a short-term prognostic prediction platform tailored for sICH patients. Conclusion We constructed a prediction model based on the results of the RF model incorporating five clinically accessible predictors with reliable predictive efficacy for the short-term prognosis of sICH patients. Meanwhile, the performance of the external validation set was also more stable, which can be used for accurate prediction of short-term prognosis of sICH patients.
引用
收藏
页数:12
相关论文
共 40 条
[1]   National Institutes of Health Stroke Scale Item Profiles as Predictor of Patient Outcome External Validation on Safe Implementation of Thrombolysis in Stroke-Monitoring Study Data [J].
Abdul-Rahim, Azmil H. ;
Fulton, Rachael L. ;
Sucharew, Heidi ;
Kleindorfer, Dawn ;
Khatri, Pooja ;
Broderick, Joseph P. ;
Lees, Kennedy R. .
STROKE, 2015, 46 (10) :2779-2785
[2]   Rapid Blood-Pressure Lowering in Patients with Acute Intracerebral Hemorrhage [J].
Anderson, Craig S. ;
Heeley, Emma ;
Huang, Yining ;
Wang, Jiguang ;
Stapf, Christian ;
Delcourt, Candice ;
Lindley, Richard ;
Robinson, Thompson ;
Lavados, Pablo ;
Neal, Bruce ;
Hata, Jun ;
Arima, Hisatomi ;
Parsons, Mark ;
Li, Yuechun ;
Wang, Jinchao ;
Heritier, Stephane ;
Li, Qiang ;
Woodward, Mark ;
Simes, R. John ;
Davis, Stephen M. ;
Chalmers, John .
NEW ENGLAND JOURNAL OF MEDICINE, 2013, 368 (25) :2355-2365
[3]   Recursive Feature Elimination by Sensitivity Testing [J].
Escanilla, Nicholas Sean ;
Hellerstein, Lisa ;
Kleiman, Ross ;
Kuang, Zhaobin ;
Shull, James D. ;
Page, David .
2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, :40-47
[4]   Worldwide stroke incidence and early case fatality reported in 56 population-based studies: a systematic review [J].
Feigin, Valery L. ;
Lawes, Carlene M. M. ;
Bennett, Derrick A. ;
Barker-Collo, Suzanne I. ;
Parag, Varsha .
LANCET NEUROLOGY, 2009, 8 (04) :355-369
[5]   Prediction of 30-Day All-Cause Readmissions in Patients Hospitalized for Heart Failure Comparison of Machine Learning and Other Statistical Approaches [J].
Frizzell, Jarrod D. ;
Liang, Li ;
Schulte, Phillip J. ;
Yancy, Clyde W. ;
Heidenreich, Paul A. ;
Hernandez, Adrian F. ;
Bhatt, Deepak L. ;
Fonarow, Gregg C. ;
Laskey, Warren K. .
JAMA CARDIOLOGY, 2017, 2 (02) :204-209
[6]   Development and validation of a novel clinical prediction model to predict the 90-day functional outcome of spontaneous intracerebral hemorrhage [J].
Geng, Zhi ;
Guo, Tao ;
Cao, Ziwei ;
He, Xiaolu ;
Chen, Jing ;
Yue, Hong ;
Wu, Aimei ;
Wei, Lichao .
FRONTIERS IN NEUROLOGY, 2023, 14
[7]   The Flavonoid Quercetin Induces AP-1 Activation in FRTL-5 Thyroid Cells [J].
Giuliani, Cesidio .
ANTIOXIDANTS, 2019, 8 (05)
[8]   A guide to machine learning for biologists [J].
Greener, Joe G. ;
Kandathil, Shaun M. ;
Moffat, Lewis ;
Jones, David T. .
NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2022, 23 (01) :40-55
[9]   Long-term Outcomes and Risk Factors Related to Hydrocephalus After Intracerebral Hemorrhage [J].
Hu, Rong ;
Zhang, Chao ;
Xia, Jiesheng ;
Ge, Hongfei ;
Zhong, Jun ;
Fang, Xuanyu ;
Zou, Yongjie ;
Lan, Chuan ;
Li, Lan ;
Feng, Hua .
TRANSLATIONAL STROKE RESEARCH, 2021, 12 (01) :31-38
[10]   Mortality prediction for patients with acute respiratory distress syndrome based on machine learning: a population-based study [J].
Huang, Bingsheng ;
Liang, Dong ;
Zou, Rushi ;
Yu, Xiaxia ;
Dan, Guo ;
Huang, Haofan ;
Liu, Heng ;
Liu, Yong .
ANNALS OF TRANSLATIONAL MEDICINE, 2021, 9 (09)