Causative Classification of Ischemic Stroke by the Machine Learning Algorithm Random Forests

被引:11
|
作者
Wang, Jianan [1 ]
Gong, Xiaoxian [1 ]
Chen, Hongfang [2 ]
Zhong, Wansi [1 ]
Chen, Yi [1 ]
Zhou, Ying [1 ]
Zhang, Wenhua [1 ]
He, Yaode [1 ]
Lou, Min [1 ]
机构
[1] Zhejiang Univ, Sch Med, Dept Neurol, Affiliated Hosp 2, Hangzhou, Peoples R China
[2] Zhejiang Univ, Jinhua Municipal Cent Hosp, Dept Neurol, Jinhua Hosp, Jinhua, Zhejiang, Peoples R China
来源
FRONTIERS IN AGING NEUROSCIENCE | 2022年 / 14卷
基金
中国国家自然科学基金;
关键词
machine learning; cardioembolism; large-artery atherosclerosis; small-artery occlusion; stroke; SUBTYPE; RECURRENCE; SYSTEMS; TOAST;
D O I
10.3389/fnagi.2022.788637
中图分类号
R592 [老年病学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 100203 ;
摘要
Background: Prognosis, recurrence rate, and secondary prevention strategies differ by different etiologies in acute ischemic stroke. However, identifying its cause is challenging. Objective: This study aimed to develop a model to identify the cause of stroke using machine learning (ML) methods and test its accuracy. Methods: We retrospectively reviewed the data of patients who had determined etiology defined by the Trial of ORG 10172 in Acute Stroke Treatment (TOAST) from CASE-II (NCT04487340) to train and evaluate six ML models, namely, Random Forests (RF), Logistic Regression (LR), Extreme Gradient Boosting (XGBoost), K-Nearest Neighbor (KNN), Ada Boosting, Gradient Boosting Machine (GBM), for the detection of cardioembolism (CE), large-artery atherosclerosis (LAA), and small-artery occlusion (SAO). Between October 2016 and April 2020, patients were enrolled consecutively for algorithm development (phase one). Between June 2020 and December 2020, patients were enrolled consecutively in a test set for algorithm test (phase two). Area under the curve (AUC), precision, recall, accuracy, and F1 score were calculated for the prediction model. Results: Finally, a total of 18,209 patients were enrolled in phase one, including 13,590 patients (i.e., 6,089 CE, 4,539 LAA, and 2,962 SAO) in the model, and a total of 3,688 patients were enrolled in phase two, including 3,070 patients (i.e., 1,103 CE, 1,269 LAA, and 698 SAO) in the model. Among the six models, the best models were RF, XGBoost, and GBM, and we chose the RF model as our final model. Based on the test set, the AUC values of the RF model to predict CE, LAA, and SAO were 0.981 (95%CI, 0.978-0.986), 0.919 (95%CI, 0.911-0.928), and 0.918 (95%CI, 0.908-0.927), respectively. The most important items to identify CE, LAA, and SAO were atrial fibrillation and degree of stenosis of intracranial arteries. Conclusion: The proposed RF model could be a useful diagnostic tool to help neurologists categorize etiologies of stroke.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Automating Ischemic Stroke Subtype Classification Using Machine Learning and Natural Language Processing
    Garg, Ravi
    Oh, Elissa
    Naidech, Andrew
    Kording, Konrad
    Prabhakaran, Shyam
    JOURNAL OF STROKE & CEREBROVASCULAR DISEASES, 2019, 28 (07) : 2045 - 2051
  • [2] Enriching the Study Population For Ischemic Stroke Therapeutic Trials Using a Machine Learning Algorithm
    Maharjan, Jenish
    Barnes, Gina
    Saxena, Abigail Green
    Calvert, Jacob
    CIRCULATION RESEARCH, 2021, 129 (12) : E241 - E241
  • [3] MRI-based Algorithm for Acute Ischemic Stroke Subtype Classification
    Ko, Youngchai
    Lee, SooJoo
    Chung, Jong-Won
    Han, Moon-Ku
    Park, Jong-Moo
    Kang, Kyusik
    Park, Tai Hwan
    Park, Sang-Soon
    Cho, Yong-Jin
    Hong, Keun-Sik
    Lee, Kyung Bok
    Lee, Jun
    Kim, Dong-Eog
    Kim, Dae-Hyun
    Cha, Jae-Kwan
    Kim, Joon-Tae
    Choi, Jay Chol
    Shin, Dong-Ick
    Lee, Ji Sung
    Lee, Juneyoung
    Yu, Kyung-Ho
    Lee, Byung-Chul
    Bae, Hee-Joon
    JOURNAL OF STROKE, 2014, 16 (03) : 161 - 172
  • [4] Predictive etiological classification of acute ischemic stroke through interpretable machine learning algorithms: a multicenter, prospective cohort study
    Chen, Siding
    Yang, Xiaomeng
    Gu, Hongqiu
    Wang, Yanzhao
    Xu, Zhe
    Jiang, Yong
    Wang, Yongjun
    BMC MEDICAL RESEARCH METHODOLOGY, 2024, 24 (01)
  • [5] Classification of stroke disease using machine learning algorithms
    Govindarajan, Priya
    Soundarapandian, Ravichandran Kattur
    Gandomi, Amir H.
    Patan, Rizwan
    Jayaraman, Premaladha
    Manikandan, Ramachandran
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (03) : 817 - 828
  • [6] Enriching the Study Population for Ischemic Stroke Therapeutic Trials Using a Machine Learning Algorithm
    Maharjan, Jenish
    Ektefaie, Yasha
    Ryan, Logan
    Mataraso, Samson
    Barnes, Gina
    Shokouhi, Sepideh
    Green-Saxena, Abigail
    Calvert, Jacob
    Mao, Qingqing
    Das, Ritankar
    FRONTIERS IN NEUROLOGY, 2022, 12
  • [7] Machine Learning in Acute Ischemic Stroke Neuroimaging
    Kamal, Haris
    Lopez, Victor
    Sheth, Sunil A.
    FRONTIERS IN NEUROLOGY, 2018, 9
  • [8] Machine Learning Models for Predicting Early Neurological Deterioration and Risk Classification of Acute Ischemic Stroke
    Yang, Huan
    Lv, Zhe
    Wang, Wenxi
    Wang, Yaohui
    Chen, Jie
    Wang, Zhanqiu
    CLINICAL AND APPLIED THROMBOSIS-HEMOSTASIS, 2023, 29
  • [9] Application of Machine Learning and Deep Learning in Imaging of Ischemic Stroke
    Cho, Ara
    Do, Luu-Ngoc
    Kim, Seul Kee
    Yoon, Woong
    Baek, Byung Hyun
    Park, Ilwoo
    INVESTIGATIVE MAGNETIC RESONANCE IMAGING, 2022, 26 (04) : 191 - 199
  • [10] Predicting Ischemic Stroke in Patients with Atrial Fibrillation Using Machine Learning
    Jung, Seonwoo
    Song, Min-Keun
    Lee, Eunjoo
    Bae, Sejin
    Kim, Yeon-Yong
    Lee, Doheon
    Lee, Myoung Jin
    Yoo, Sunyong
    FRONTIERS IN BIOSCIENCE-LANDMARK, 2022, 27 (03):