Predictive models to assess risk of type 2 diabetes, hypertension and comorbidity: machine-learning algorithms and validation using national health data from Kuwait - a cohort study

Cited by: 84
Authors
Farran, Bassam [1]
Channanath, Arshad Mohamed [1]
Behbehani, Kazem [1]
Thanaraj, Thangavel Alphonse [1]
Affiliations
[1] Dasman Diabet Inst, Dasman, Kuwait
Keywords
PREVALENCE; ETHNICITY; GENES
DOI
10.1136/bmjopen-2012-002457
Chinese Library Classification number
R5 [Internal Medicine]
Discipline classification code
1002; 100201
Abstract
Objective: We build classification models and risk assessment tools for diabetes, hypertension and comorbidity using machine-learning algorithms on data from Kuwait. We model the increased proneness of diabetic patients to develop hypertension and vice versa. We ascertain the importance of ethnicity (and of natives vs expatriate migrants) and of using regional data in risk assessment.
Design: Retrospective cohort study. Four machine-learning techniques were used: logistic regression, k-nearest neighbours (k-NN), multifactor dimensionality reduction and support vector machines. The study uses fivefold cross validation to obtain generalisation accuracies and errors.
Setting: Kuwait Health Network (KHN), which integrates data from primary health centres and hospitals in Kuwait.
Participants: 270 172 hospital visitors (of whom 89 858 are diabetic, 58 745 hypertensive and 30 522 comorbid) comprising Kuwaiti natives, Asian and Arab expatriates.
Outcome measures: Incident type 2 diabetes, hypertension and comorbidity.
Results: Classification accuracies of >85% (for diabetes) and >90% (for hypertension) are achieved using only simple non-laboratory-based parameters. Risk assessment tools based on k-NN classification models are able to assign 'high' risk to 75% of diabetic patients and to 94% of hypertensive patients. Only 5% of diabetic patients are assigned 'low' risk. Asian-specific models and assessments perform even better. Pathological conditions of diabetes in the general population or in the hypertensive population, and those of hypertension, are modelled. Two-stage aggregate classification models and risk assessment tools, built by combining both the component models on diabetes (or on hypertension), perform better than individual models.
Conclusions: Data on diabetes, hypertension and comorbidity from the cosmopolitan State of Kuwait are available for the first time. This enabled us to apply four different case-control models to assess risks. These tools aid in the preliminary non-intrusive assessment of the population. Ethnicity is seen to be significant to the predictive models. Risk assessments need to be developed using regional data, as we demonstrate the inapplicability of the American Diabetes Association online calculator on data from Kuwait.
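The abstract names k-NN classification, fivefold cross validation and 'high'/'low' risk assignment without giving details. The following is a minimal sketch of how those three pieces can fit together, not the authors' implementation. It runs on synthetic data; the two features (age and a BMI-like value), the function names, k = 5 and the band thresholds 0.25 and 0.75 are all assumptions made for illustration.

```python
import math
import random


def make_synthetic(n, seed=0):
    """Generate synthetic ((age, bmi), label) rows. Not study data."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        label = int(rng.random() < 0.4)
        age = rng.gauss(55 if label else 40, 10)
        bmi = rng.gauss(31 if label else 26, 4)
        rows.append(((age, bmi), label))
    return rows


def knn_risk(train, x, k):
    """Risk score: fraction of the k nearest training rows labelled positive."""
    nearest = sorted(train, key=lambda row: math.dist(row[0], x))[:k]
    return sum(label for _, label in nearest) / k


def risk_band(score, low=0.25, high=0.75):
    """Map a score to a band. Thresholds are hypothetical, not from the paper."""
    if score >= high:
        return "high"
    if score <= low:
        return "low"
    return "medium"


def fivefold_accuracy(rows, k=5, folds=5, seed=0):
    """Mean held-out accuracy of k-NN over `folds` disjoint test folds."""
    rows = rows[:]
    random.Random(seed).shuffle(rows)
    accuracies = []
    for f in range(folds):
        test = rows[f::folds]  # every folds-th row, starting at f
        train = [r for i, r in enumerate(rows) if i % folds != f]
        correct = sum(
            (knn_risk(train, x, k) >= 0.5) == bool(y) for x, y in test
        )
        accuracies.append(correct / len(test))
    return sum(accuracies) / folds


if __name__ == "__main__":
    data = make_synthetic(300)
    print("fivefold accuracy:", round(fivefold_accuracy(data), 3))
    print("band for (60, 33):", risk_band(knn_risk(data, (60.0, 33.0), 5)))
```

Each row is used for testing exactly once, so the averaged accuracy estimates performance on unseen records. In practice the features would be scaled before computing distances, since unscaled age would otherwise dominate the BMI-like value.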
Pages: 10