Predicting atrial fibrillation in primary care using machine learning

被引:84
作者
Hill, Nathan R. [1 ]
Ayoubkhani, Daniel [2 ]
McEwan, Phil [2 ]
Sugrue, Daniel M. [2 ]
Farooqui, Usman [1 ]
Lister, Steven [1 ]
Lumley, Matthew [3 ]
Bakhai, Ameet [4 ]
Cohen, Alexander T. [5 ]
O'Neill, Mark [6 ]
Clifton, David [7 ]
Gordon, Jason [2 ]
机构
[1] Bristol Myers Squibb Pharmaceut Ltd, Uxbridge, Middx, England
[2] Hlth Econ & Outcomes Res Ltd, Cardiff, Wales
[3] Pfizer Ltd, Walton Oaks, Surrey, England
[4] Royal Free Hosp, Dept Cardiol, London, England
[5] Kings Coll London, Guys & St Thomas NHS Fdn Trust, Dept Haematol Med, London, England
[6] Kings Coll London, Guys & St Thomas NHS Fdn Trust, Div Cardiovasc Med, London, England
[7] Univ Oxford, Dept Engn Sci, Oxford, England
来源
PLOS ONE | 2019年 / 14卷 / 11期
基金
英国工程与自然科学研究理事会;
关键词
RISK SCORE; STROKE; PULSE;
D O I
10.1371/journal.pone.0224582
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Atrial fibrillation (AF) is the most common sustained heart arrhythmia. However, as many cases are asymptomatic, a large proportion of patients remain undiagnosed until serious complications arise. Efficient, cost-effective detection of the undiagnosed may be supported by risk-prediction models relating patient factors to AF risk. However, there exists a need for an implementable risk model that is contemporaneous and informed by routinely collected patient data, reflecting the real-world pathology of AF. Methods This study sought to develop and evaluate novel and conventional statistical and machine learning models for risk-predication of AF. This was a retrospective, cohort study of adults (aged >= 30 years) without a history of AF, listed on the Clinical Practice Research Datalink, from January 2006 to December 2016. Models evaluated included published risk models (Framingham, ARIC, CHARGE-AF), machine learning models, which evaluated baseline and time-updated information (neural network, LASSO, random forests, support vector machines), and Cox regression. Results Analysis of 2,994,837 individuals (3.2% AF) identified time-varying neural networks as the optimal model achieving an AUROC of 0.827 vs. 0.725, with number needed to screen of 9 vs. 13 patients at 75% sensitivity, when compared with the best existing model CHARGE AF. The optimal model confirmed known baseline risk factors (age, previous cardiovascular disease, antihypertensive medication usage) and identified additional time-varying predictors (proximity of cardiovascular events, body mass index (both levels and changes), pulse pressure, and the frequency of blood pressure measurements). Conclusion The optimal time-varying machine learning model exhibited greater predictive performance than existing AF risk models and reflected known and new patient risk factors for AF.
引用
收藏
页数:13
相关论文
共 38 条
  • [1] Simple Risk Model Predicts Incidence of Atrial Fibrillation in a Racially and Geographically Diverse Population: the CHARGE-AF Consortium
    Alonso, Alvaro
    Krijthe, Bouwe P.
    Aspelund, Thor
    Stepas, Katherine A.
    Pencina, Michael J.
    Moser, Carlee B.
    Sinner, Moritz F.
    Sotoodehnia, Nona
    Fontes, Joao D.
    Janssens, A. Cecile J. W.
    Kronmal, Richard A.
    Magnani, Jared W.
    Witteman, Jacqueline C.
    Chamberlain, Alanna M.
    Lubitz, Steven A.
    Schnabel, Renate B.
    Agarwal, Sunil K.
    McManus, David D.
    Ellinor, Patrick T.
    Larson, Martin G.
    Burke, Gregory L.
    Launer, Lenore J.
    Hofman, Albert
    Levy, Daniel
    Gottdiener, John S.
    Kaeaeb, Stefan
    Couper, David
    Harris, Tamara B.
    Soliman, Elsayed Z.
    Stricker, Bruno H. C.
    Gudnason, Vilmundur
    Heckbert, Susan R.
    Benjamin, Emelia J.
    [J]. JOURNAL OF THE AMERICAN HEART ASSOCIATION, 2013, 2 (02): : e000102
  • [2] [Anonymous], 1988, Statistical power analysis for the behavioral sciences
  • [3] [Anonymous], ATR FIBR PREV EST EN
  • [4] [Anonymous], HLTH TECHNOL ASSESS
  • [5] [Anonymous], 2015, Nature, DOI [10.1038/nature14539, DOI 10.1038/NATURE14539]
  • [6] [Anonymous], XGBOOST DOCUMENTATIO
  • [7] [Anonymous], GRASP AF
  • [8] [Anonymous], AF REP ATR FIBR PREV
  • [9] [Anonymous], 2001, The elements of statistical learning: data mining, inference, and prediction
  • [10] [Anonymous], MACH LEARN MACH LEARN