Stroke risk prediction using machine learning: a prospective cohort study of 0.5 million Chinese adults

被引:31
|
作者
Chun, Matthew [1 ,2 ,3 ]
Clarke, Robert [1 ,2 ]
Cairns, Benjamin J. [1 ,2 ,4 ]
Clifton, David [3 ,5 ]
Bennett, Derrick [1 ,2 ]
Chen, Yiping [1 ,2 ,4 ]
Guo, Yu [6 ]
Pei, Pei [6 ]
Lv, Jun [7 ]
Yu, Canqing [7 ]
Yang, Ling [1 ,2 ]
Li, Liming [7 ]
Chen, Zhengming [4 ]
Zhu, Tingting [3 ]
机构
[1] Univ Oxford, Nuffield Dept Populat Hlth, Clin Trial Serv Unit, Oxford, England
[2] Univ Oxford, Nuffield Dept Populat Hlth, Epidemiol Studies, Oxford, England
[3] Univ Oxford, Dept Engn Sci, Oxford, England
[4] Univ Oxford, Med Res Council, Populat Hlth Res Unit, Oxford, England
[5] Oxford Suzhou Ctr Adv Res, Suzhou, Peoples R China
[6] Chinese Acad Med Sci, Beijing, Peoples R China
[7] Peking Univ, Sch Publ Hlth, Dept Epidemiol & Biostat, Hlth Sci Ctr, Beijing, Peoples R China
基金
英国惠康基金; 英国医学研究理事会;
关键词
stroke; cardiovascular diseases; machine learning; risk assessment; China; PRIMARY PREVENTION; INCIDENT STROKE; VALIDATION; DISEASE; POPULATION; DERIVATION; STATEMENT; KADOORIE; PROFILE; SCORE;
D O I
10.1093/jamia/ocab068
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: To compare Cox models, machine learning (ML), and ensemble models combining both approaches, for prediction of stroke risk in a prospective study of Chinese adults. Materials and Methods: We evaluated models for stroke risk at varying intervals of follow-up (<9 years, 0-3 years, 3-6 years, 6-9 years) in 503 842 adults without prior history of stroke recruited from 10 areas in China in 2004-2008. Inputs included sociodemographic factors, diet, medical history, physical activity, and physical measurements. We compared discrimination and calibration of Cox regression, logistic regression, support vector machines, random survival forests, gradient boosted trees (GBT), and multilayer perceptrons, benchmarking performance against the 2017 Framingham Stroke Risk Profile. We then developed an ensemble approach to identify individuals at high risk of stroke (>10% predicted 9-yr stroke risk) by selectively applying either a GBT or Cox model based on individual-level characteristics. Results: For 9-yr stroke risk prediction, GBT provided the best discrimination (AUROC: 0.833 in men, 0.836 in women) and calibration, with consistent results in each interval of follow-up. The ensemble approach yielded incrementally higher accuracy (men: 76%, women: 80%), specificity (men: 76%, women: 81%), and positive predictive value (men: 26%, women: 24%) compared to any of the single-model approaches. Discussion and Conclusion: Among several approaches, an ensemble model combining both GBT and Cox models achieved the best performance for identifying individuals at high risk of stroke in a contemporary study of Chinese adults. The results highlight the potential value of expanding the use of ML in clinical practice.
引用
收藏
页码:1719 / 1727
页数:9
相关论文
共 50 条
  • [41] Self-Rated Health Status and Risk of Incident Stroke in 0.5 Million Chinese Adults: The China Kadoorie Biobank Study
    Dong, Wenhong
    Pan, Xiong-Fei
    Yu, Canqing
    Lv, Jun
    Guo, Yu
    Bian, Zheng
    Yang, Ling
    Chen, Yiping
    Wu, Tangchun
    Chen, Zhengming
    Pan, An
    Li, Liming
    JOURNAL OF STROKE, 2018, 20 (02) : 247 - +
  • [42] Associations of sedentary behaviour with myocardial infarction and stroke: findings from a 10-year prospective study of 0.5 million chinese adults
    Bennett, D.
    Du, H.
    Guo, Y.
    Bian, Z.
    Chen, J.
    Collins, R.
    Peto, R.
    Li, L.
    Clarke, R.
    Chen, Z.
    EUROPEAN HEART JOURNAL, 2018, 39 : 92 - 93
  • [43] Severity prediction markers in dengue: a prospective cohort study using machine learning approach
    Jean Pierre, Aashika Raagavi
    Green, Siva Ranganathan
    Anandaraj, Lokeshmaran
    Sivaprakasam, Manikandan
    Kasirajan, Anand
    Devaraju, Panneer
    Anumulapuri, Srilekha
    Mutheneni, Srinivasa Rao
    Balakrishna Pillai, Agieshkumar
    BIOMARKERS, 2024, 29 (08) : 557 - 564
  • [44] Comparison of Machine Learning Techniques for Mortality Prediction in a Prospective Cohort of Older Adults
    Tedesco, Salvatore
    Andrulli, Martina
    Larsson, Markus akerlund
    Kelly, Daniel
    Alamaeki, Antti
    Timmons, Suzanne
    Barton, John
    Condell, Joan
    O'Flynn, Brendan
    Nordstroem, Anna
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (23)
  • [45] Family history, waist circumference and risk of ischemic stroke: A prospective cohort study among Chinese adults
    Liu, Lei
    Xue, Xiaojia
    Zhang, Hua
    Tian, Xiaocao
    Chen, Yunhui
    Guo, Yu
    Pei, Pei
    Wang, Shaojie
    Duan, Haiping
    Gao, Ruqin
    Pang, Zengchang
    Chen, Zhengming
    Li, Liming
    NUTRITION METABOLISM AND CARDIOVASCULAR DISEASES, 2023, 33 (04) : 758 - 769
  • [46] Sleep duration and risk of incident of ischaemic heart disease: a 7-year prospective study of 0.5 million Chinese adults
    Chen, Y.
    Li, L.
    Wang, J.
    Gao, H.
    Clarke, R.
    Guo, Y.
    Bian, Z.
    Peto, R.
    Chen, J.
    Chen, Z.
    EUROPEAN HEART JOURNAL, 2015, 36 : 811 - 811
  • [47] Cardiovascular Disease Risk Prediction Using Machine Learning: A Prospective Cohort Study of 423,604 Participants.
    Alaa, Ahmed M.
    Bolton, Thomas
    Di Angelantonio, Emanuele
    Rudd, James H.
    van der Schaar, Mihaela
    CIRCULATION, 2018, 138
  • [48] Interpretable Stroke Risk Prediction Using Machine Learning Algorithms
    Zafeiropoulos, Nikolaos
    Mavrogiorgou, Argyro
    Kleftakis, Spyridon
    Mavrogiorgos, Konstantinos
    Kiourtis, Athanasios
    Kyriazis, Dimosthenis
    INTELLIGENT SUSTAINABLE SYSTEMS, WORLDS4 2022, VOL 2, 2023, 579 : 647 - 656
  • [49] Predicting post-stroke cognitive impairment using machine learning: A prospective cohort study
    Ji, Wencan
    Wang, Canjun
    Chen, Hanqing
    Liang, Yan
    Wang, Shaohua
    JOURNAL OF STROKE & CEREBROVASCULAR DISEASES, 2023, 32 (11):
  • [50] Diabetes, plasma glucose and incidence of colorectal cancer in Chinese adults: a prospective study of 0.5 million people
    Pang, Yuanjie
    Kartsonaki, Christiana
    Guo, Yu
    Chen, Yiping
    Yang, Ling
    Bian, Zheng
    Bragg, Fiona
    Millwood, Iona Y.
    Shen, Leijia
    Zhou, Songgen
    Liu, Jiben
    Chen, Junshi
    Li, Liming
    Holmes, Michael V.
    Chen, Zhengming
    JOURNAL OF EPIDEMIOLOGY AND COMMUNITY HEALTH, 2018, 72 (10) : 919 - 925