Predicting adverse outcomes due to diabetes complications with machine learning using administrative health data

被引:0
作者
Mathieu Ravaut
Hamed Sadeghi
Kin Kwan Leung
Maksims Volkovs
Kathy Kornas
Vinyas Harish
Tristan Watson
Gary F. Lewis
Alanna Weisman
Tomi Poutanen
Laura Rosella
机构
[1] Layer 6 AI,Department of Computer Science
[2] University of Toronto,Dalla Lana School of Public Health
[3] University of Toronto,MD/PhD Program, Temerty Faculty of Medicine
[4] University of Toronto,Department of Medicine, Temerty Faculty of Medicine
[5] ICES,Department of Physiology, Temerty Faculty of Medicine
[6] University of Toronto,Lunenfeld
[7] University of Toronto,Tanenbaum Research Institute
[8] Mt. Sinai Hospital,Division of Endocrinology and Metabolism, Temerty Faculty of Medicine
[9] University of Toronto,Institute for Better Health
[10] Vector Institute,Department of Laboratory Medicine & Pathology, Temerty Faculty of Medicine
[11] Trillium Health Partners,undefined
[12] University of Toronto,undefined
来源
npj Digital Medicine | / 4卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Across jurisdictions, government and health insurance providers hold a large amount of data from patient interactions with the healthcare system. We aimed to develop a machine learning-based model for predicting adverse outcomes due to diabetes complications using administrative health data from the single-payer health system in Ontario, Canada. A Gradient Boosting Decision Tree model was trained on data from 1,029,366 patients, validated on 272,864 patients, and tested on 265,406 patients. Discrimination was assessed using the AUC statistic and calibration was assessed visually using calibration plots overall and across population subgroups. Our model predicting three-year risk of adverse outcomes due to diabetes complications (hyper/hypoglycemia, tissue infection, retinopathy, cardiovascular events, amputation) included 700 features from multiple diverse data sources and had strong discrimination (average test AUC = 77.7, range 77.7–77.9). Through the design and validation of a high-performance model to predict diabetes complications adverse outcomes at the population level, we demonstrate the potential of machine learning and administrative health data to inform health planning and healthcare resource allocation for diabetes management.
引用
收藏
相关论文
共 153 条
[1]  
Guariguata L(2014)Global estimates of diabetes prevalence for 2013 and projections for 2035 Diabetes Res. Clin. Pract. 103 137-149
[2]  
Deshpande AD(2008)Epidemiology of diabetes and diabetes-related complications Phys. Ther. 88 1254-1264
[3]  
Harris-Hayes M(2019)Global trends in diabetes complications: a review of current evidence Diabetologia 62 3-16
[4]  
Schootman M(2002)Lifetime costs of complications resulting from type 2 diabetes in the U.S Diabetes Care 25 476-481
[5]  
Harding JL(2015)Relationship of diabetes complications severity to healthcare utilization and costs among Medicare Advantage beneficiaries Am. J. Manag. Care 21 e62-e70
[6]  
Pavkov ME(1993)The effect of intensive treatment of diabetes on the development and progression of long-term complications in insulin-dependent diabetes mellitus N. Engl. J. Med. 329 977-986
[7]  
Magliano DJ(2004)Primary prevention of cardiovascular disease with atorvastatin in type 2 diabetes in the Collaborative Atorvastatin Diabetes Study (CARDS): multicentre randomised placebo-controlled trial Lancet 364 685-696
[8]  
Shaw JE(2003)Multifactorial intervention and cardiovascular disease in patients with type 2 diabetes N. Engl. J. Med. 348 383-393
[9]  
Gregg EW(2015)Relation of smoking with total mortality and cardiovascular events among patients with diabetes mellitus Circulation 132 1795-1804
[10]  
Caro JJ(2001)External barriers to diabetes care: addressing personal and health systems issues Diabetes Spectr. 14 23-28