Chronic disease prediction using administrative data and graph theory: The case of type 2 diabetes

Cited by: 38
Authors
Khan, Arif [1, 2]
Uddin, Shahadat [1]
Srinivasan, Uma [2, 3]
Affiliations
[1] Univ Sydney, Fac Engn & IT, Sydney, NSW, Australia
[2] Capital Markets CRC, Hlth Market Qual Res Program, Level 2,55 Harrington St, Sydney, NSW, Australia
[3] Digital Hlth CRC, Level 3,55 Harrington St, Sydney, NSW, Australia
Keywords
Disease prediction; Electronic medical records; Medical information systems; Network theory; Prediction theory; Type 2 diabetes; PREVALENCE; MELLITUS; ASSOCIATION; NETWORKS; OBESITY; IMPACT; COST; CARE
DOI
10.1016/j.eswa.2019.05.048
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Clinical diagnosis and regular monitoring of the population at risk of chronic diseases are clinically and financially resource-intensive. Mining administrative data could be an effective alternative way to identify this high-risk cohort. In this research, we apply data mining and network analysis techniques to hospital admission and discharge data to understand the disease or comorbidity footprints of chronic patients. Based on this understanding, we have developed a chronic disease risk prediction framework. The framework is then tested in the Australian healthcare context to predict type 2 diabetes (T2D) risk. The dataset contained approximately 1.4 million admission records from 0.75 million patients. From this, we filtered and sampled the records of 2300 patients having comorbidities including T2D and another 2300 patients having comorbidities other than T2D. Along with demographic and behavioral risk factors for prediction, we propose several graph theory and social network-based measures which indicate the prevalence of comorbidities, transition patterns, and clustering membership. We use an exploratory approach to understand the relative impact of these risk factors and evaluate the prediction performance using three different predictive methods: regression, parameter optimization, and tree classification. All three prediction methods gave the highest ranking to the graph theory-based 'comorbidity prevalence' and 'transition pattern match' scores, showing the effectiveness of the proposed network theory-based measures. Overall, the prediction accuracy of between 82% and 87% shows the potential of the framework utilizing administrative data. The proposed framework could be useful for governments and health insurers to identify high-risk chronic disease cohorts. Developing preventive strategies can then, over a period of time, reduce the burden of acute care hospitalization. (C) 2019 Elsevier Ltd. All rights reserved.
Pages: 230-241
Page count: 12
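The abstract describes graph-based measures such as a 'comorbidity prevalence' score without giving their definitions. The following is only a minimal sketch of the general idea, not the authors' formula: it builds a weighted co-occurrence network from a reference cohort and scores a new patient by the share of the network's node and edge weight that the patient's diagnoses match. The function names `build_network` and `prevalence_score`, the diagnosis labels, and the normalization are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def build_network(cohort):
    """Build node and edge weights from a reference cohort.

    cohort maps a patient id to the set of diagnosis codes recorded
    for that patient (hypothetical input layout).
    """
    node_weight = Counter()  # how many patients carry each diagnosis
    edge_weight = Counter()  # how many patients carry each diagnosis pair
    for codes in cohort.values():
        node_weight.update(codes)
        for a, b in combinations(sorted(codes), 2):
            edge_weight[(a, b)] += 1
    return node_weight, edge_weight


def prevalence_score(codes, node_weight, edge_weight):
    """Illustrative score in [0, 1]: matched weight over total weight.

    This normalization is an assumption, not the paper's definition.
    """
    total = sum(node_weight.values()) + sum(edge_weight.values())
    if total == 0:
        return 0.0
    matched = sum(node_weight[c] for c in codes)
    matched += sum(
        edge_weight[(a, b)] for a, b in combinations(sorted(codes), 2)
    )
    return matched / total


# Toy reference cohort with placeholder diagnosis labels.
reference = {
    "p1": {"A", "B"},
    "p2": {"A", "B", "C"},
    "p3": {"B"},
}
nodes, edges = build_network(reference)
print(prevalence_score({"A", "B"}, nodes, edges))
print(prevalence_score({"C"}, nodes, edges))
```

A patient whose diagnoses overlap heavily with the reference cohort's frequent diagnoses and diagnosis pairs receives a higher score. In a setup like the one the abstract describes, such a score would be one feature alongside demographic and behavioral risk factors.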