The Prediction of Diabetes Development: A Machine Learning Framework

被引:0
作者
Islam, Md Shafiqul [1 ]
Qaraqe, Marwa K. [1 ]
Abbas, Hasan T. [2 ]
Erraguntla, Madhav [3 ]
Abdul-Ghani, Muhammad [4 ]
机构
[1] Hamad Bin Khalifa Univ, Dept Comp Sci & Engn, Doha, Qatar
[2] Texas A&M Univ Qatar, Dept Elect Engn, Doha, Qatar
[3] Texas A&M Univ, Dept Ind Engn, College Stn, TX 77843 USA
[4] UT Hlth, Dept Med, San Antonio, TX USA
来源
2020 IEEE 5TH MIDDLE EAST AND AFRICA CONFERENCE ON BIOMEDICAL ENGINEERING (MECBME) | 2020年
关键词
Machine Learning; Ensemble of Classifiers; Feature Selection; Diabetes Prediction; GLUCOSE-TOLERANCE; INSULIN SENSITIVITY; HEALTH; MELLITUS;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
The development of diabetes occurs due to elevated glucose levels in the bloodstream. Prevention of diabetes or the delayed onset of diabetes is crucial. It can be achieved if there exists a screening process that can accurately identify individuals who are at a higher risk of developing diabetes in the future. Although there are many works employing machine learning techniques in medical diagnostics, there is little work done regarding the long term prediction of disease, type 2 diabetes in particular. In this study, we propose a machine learning framework consists of finding the best features that are highly correlated with the future development of diabetes, followed by developing diabetes prediction models. The proposed models are evaluated using data from a longitudinal clinical study known as the San Antonio Heart Study. Our approach has managed to achieve a long-term prediction accuracy of 81.01%, a specificity of 81.2%, a sensitivity of 79.5%, and an AUC score of 87.1%.
引用
收藏
页码:154 / 159
页数:6
相关论文
共 26 条
[1]   What is the best predictor of future type 2 diabetes? [J].
Abdul-Ghani, Muhammad A. ;
Williams, Ken ;
DeFronzo, Ralph A. ;
Stern, Michael .
DIABETES CARE, 2007, 30 (06) :1544-1548
[2]   Two-Step Approach for the Prediction of Future Type 2 Diabetes Risk [J].
Abdul-Ghani, Muhammad A. ;
Abdul-Ghani, Tamam ;
Stern, Michael P. ;
Karavic, Jasmina ;
Tuomi, Tiinamaija ;
Bo, Insoma ;
DeFronzo, Ralph A. ;
Groop, Leif .
DIABETES CARE, 2011, 34 (09) :2108-2112
[3]  
Alberti KGMM, 1998, DIABETIC MED, V15, P539, DOI 10.1002/(SICI)1096-9136(199807)15:7<539::AID-DIA668>3.0.CO
[4]  
2-S
[5]   Predicting diabetes mellitus using SMOTE and ensemble machine learning approach: The Henry Ford ExercIse Testing (FIT) project [J].
Alghamdi, Manal ;
Al-Mallah, Mouaz ;
Keteylan, Steven ;
Brawner, Clinton ;
Ehrman, Jonathan ;
Sakr, Sherif .
PLOS ONE, 2017, 12 (07)
[6]   Application of data mining: Diabetes health care in young and old patients [J].
Aljumah, Abdullah A. ;
Ahamad, Mohammed Gulam ;
Siddiqui, Mohammad Khubeb .
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2013, 25 (02) :127-136
[7]  
[Anonymous], 2006, NAIVE BAYES CLASSIFI
[8]  
[Anonymous], 2013, MATLAB, V488
[9]   E-health: transforming the physician/patient relationship [J].
Ball, MJ ;
Lillis, J .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2001, 61 (01) :1-10
[10]   San Antonio heart study diabetes prediction model applicable to a Middle Eastern population? Tehran glucose and lipid study [J].
Bozorgmanesh, Mohammadreza ;
Hadaegh, Farzad ;
Zabetian, Azadeh ;
Azizi, Fereidoun .
INTERNATIONAL JOURNAL OF PUBLIC HEALTH, 2010, 55 (04) :315-323