Causative Classification of Ischemic Stroke by the Machine Learning Algorithm Random Forests

被引:11
作者
Wang, Jianan [1 ]
Gong, Xiaoxian [1 ]
Chen, Hongfang [2 ]
Zhong, Wansi [1 ]
Chen, Yi [1 ]
Zhou, Ying [1 ]
Zhang, Wenhua [1 ]
He, Yaode [1 ]
Lou, Min [1 ]
机构
[1] Zhejiang Univ, Sch Med, Dept Neurol, Affiliated Hosp 2, Hangzhou, Peoples R China
[2] Zhejiang Univ, Jinhua Municipal Cent Hosp, Dept Neurol, Jinhua Hosp, Jinhua, Zhejiang, Peoples R China
来源
FRONTIERS IN AGING NEUROSCIENCE | 2022年 / 14卷
基金
中国国家自然科学基金;
关键词
machine learning; cardioembolism; large-artery atherosclerosis; small-artery occlusion; stroke; SUBTYPE; RECURRENCE; SYSTEMS; TOAST;
D O I
10.3389/fnagi.2022.788637
中图分类号
R592 [老年病学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 100203 ;
摘要
Background: Prognosis, recurrence rate, and secondary prevention strategies differ by different etiologies in acute ischemic stroke. However, identifying its cause is challenging. Objective: This study aimed to develop a model to identify the cause of stroke using machine learning (ML) methods and test its accuracy. Methods: We retrospectively reviewed the data of patients who had determined etiology defined by the Trial of ORG 10172 in Acute Stroke Treatment (TOAST) from CASE-II (NCT04487340) to train and evaluate six ML models, namely, Random Forests (RF), Logistic Regression (LR), Extreme Gradient Boosting (XGBoost), K-Nearest Neighbor (KNN), Ada Boosting, Gradient Boosting Machine (GBM), for the detection of cardioembolism (CE), large-artery atherosclerosis (LAA), and small-artery occlusion (SAO). Between October 2016 and April 2020, patients were enrolled consecutively for algorithm development (phase one). Between June 2020 and December 2020, patients were enrolled consecutively in a test set for algorithm test (phase two). Area under the curve (AUC), precision, recall, accuracy, and F1 score were calculated for the prediction model. Results: Finally, a total of 18,209 patients were enrolled in phase one, including 13,590 patients (i.e., 6,089 CE, 4,539 LAA, and 2,962 SAO) in the model, and a total of 3,688 patients were enrolled in phase two, including 3,070 patients (i.e., 1,103 CE, 1,269 LAA, and 698 SAO) in the model. Among the six models, the best models were RF, XGBoost, and GBM, and we chose the RF model as our final model. Based on the test set, the AUC values of the RF model to predict CE, LAA, and SAO were 0.981 (95%CI, 0.978-0.986), 0.919 (95%CI, 0.911-0.928), and 0.918 (95%CI, 0.908-0.927), respectively. The most important items to identify CE, LAA, and SAO were atrial fibrillation and degree of stenosis of intracranial arteries. Conclusion: The proposed RF model could be a useful diagnostic tool to help neurologists categorize etiologies of stroke.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Predicting 6-Month Unfavorable Outcome of Acute Ischemic Stroke Using Machine Learning
    Li, Xiang
    Pan, XiDing
    Jiang, ChunLian
    Wu, MingRu
    Liu, YuKai
    Wang, FuSang
    Zheng, XiaoHan
    Yang, Jie
    Sun, Chao
    Zhu, YuBing
    Zhou, JunShan
    Wang, ShiHao
    Zhao, Zheng
    Zou, JianJun
    FRONTIERS IN NEUROLOGY, 2020, 11
  • [42] COMPARISON OF DIFFERENT MACHINE LEARNING APPROACHES TO MODEL STROKE SUBTYPE CLASSIFICATION AND RISK PREDICTION
    Garcia-Terriza, Luis
    Risco-Martin, Jose L.
    Ayala, Jose L.
    Reig Rosello, Gemma
    Camarasaltas, Juan M.
    2019 SPRING SIMULATION CONFERENCE (SPRINGSIM), 2019,
  • [43] Machine Learning Based Stroke Segmentation and Classification from CT-Scan: A Survey
    Zewde, Elbetel Taye
    Motuma, Mersibon Melese
    Ayano, Yehualashet Megersa
    Debelee, Taye Girma
    Feyisa, Degaga Wolde
    PAN-AFRICAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PT I, PANAFRICON AI 2023, 2024, 2068 : 3 - 45
  • [44] Machine Learning-Based Etiologic Subtyping of Ischemic Stroke Using Circulating Exosomal microRNAs
    Bang, Ji Hoon
    Kim, Eun Hee
    Kim, Hyung Jun
    Chung, Jong-Won
    Seo, Woo-Keun
    Kim, Gyeong-Moon
    Lee, Dong-Ho
    Kim, Heewon
    Bang, Oh Young
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (12)
  • [45] Systematic Review of Machine Learning Applied to the Secondary Prevention of Ischemic Stroke
    Chen, Meng
    Qian, Dongbao
    Wang, Yixuan
    An, Junyan
    Meng, Ke
    Xu, Shuai
    Liu, Sheng
    Sun, Meiyan
    Li, Miao
    Pang, Chunying
    JOURNAL OF MEDICAL SYSTEMS, 2024, 48 (01)
  • [46] Machine Learning Approach for Sepsis Risk Assessment in Ischemic Stroke Patients
    Mao, Fengkai
    Lin, Leqing
    Liang, Dongcheng
    Cheng, Weiling
    Zhang, Ning
    Li, Ji
    Wu, Siming
    JOURNAL OF INTENSIVE CARE MEDICINE, 2025,
  • [47] Predicting the Risk of Ischemic Stroke in Patients Treated with Novel Oral Anticoagulants: A Machine Learning Approach
    Kostev, Karel
    Wu, Tong
    Wang, Yue
    Chaudhuri, Kal
    Reeve, Russel
    Tanislav, Christian
    NEUROEPIDEMIOLOGY, 2021, 55 (05) : 387 - 392
  • [48] A Machine Learning Approach to Predicting Readmission or Mortality in Patients Hospitalized for Stroke or Transient Ischemic Attack
    Hung, Ling-Chien
    Sung, Sheng-Feng
    Hu, Ya-Han
    APPLIED SCIENCES-BASEL, 2020, 10 (18):
  • [49] Machine learning-based patient classification system for adults with stroke: A systematic review
    Ruksakulpiwat, Suebsarn
    Thongking, Witchuda
    Zhou, Wendie
    Benjasirisan, Chitchanok
    Phianhasin, Lalipat
    Schiltz, Nicholas K.
    Brahmbhatt, Smit
    CHRONIC ILLNESS, 2023, 19 (01) : 26 - 39
  • [50] Identification of anoikis-related genes classification patterns and immune infiltration characterization in ischemic stroke based on machine learning
    Qin, Xiaohong
    Yi, Shangfeng
    Rong, Jingtong
    Lu, Haoran
    Ji, Baowei
    Zhang, Wenfei
    Ding, Rui
    Wu, Liquan
    Chen, Zhibiao
    FRONTIERS IN AGING NEUROSCIENCE, 2023, 15