Machine-learning algorithms in screening for type 2 diabetes mellitus: Data from Fasa Adults Cohort Study

被引:1
作者
Karmand, Hanieh [1 ]
Andishgar, Aref [2 ]
Tabrizi, Reza [3 ]
Sadeghi, Alireza [4 ,5 ]
Pezeshki, Babak [6 ]
Ravankhah, Mahdi [4 ]
Taherifard, Erfan [4 ,5 ]
Ahmadizar, Fariba [7 ]
机构
[1] Fasa Univ Med Sci, Student Res Comm, Sch Med, Fasa, Iran
[2] Fasa Univ Med Sci, USERN Off, Fasa, Iran
[3] Fasa Univ Med Sci, Noncommunicable Dis Res Ctr, Fasa 6688874616, Iran
[4] Shiraz Univ Med Sci, Student Res Comm, Sch Med, Shiraz, Iran
[5] Shiraz Univ Med Sci, Hlth Policy Res Ctr, Sch Med, Shiraz, Iran
[6] Fasa Univ Med Sci, Valiasr Hosp, Clin Res Dev Unit, Fasa, Iran
[7] Julius Global Hlth, Data Sci & Biostat Dept, Utrecht, Netherlands
关键词
machine learning; screening; type 2 diabetes mellitus; COMPLICATIONS;
D O I
10.1002/edm2.472
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
IntroductionThe application of machine learning (ML) is increasingly growing in biomedical sciences. This study aimed to evaluate factors associated with type 2 diabetes mellitus (T2DM) and compare the performance of ML methods in identifying individuals with the disease in an Iranian setting.MethodsUsing the baseline data from Fasa Adult Cohort Study (FACS) and in a sex-stratified manner, we studied factors associated with T2DM by applying seven different ML methods including Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbours (KNN), Gradient Boosting Machine (GBM), Extreme Gradient Boosting (XGB) and Bagging classifier (BAG). We further compared the performance of these methods; for each algorithm, accuracy, precision, sensitivity, specificity, F1 score, and Area Under Curve (AUC) were calculated.Results10,112 participants were recruited between 2014 and 2016, of whom 1246 had T2DM at baseline. 4566 (45%) participants were males, aged between 35 and 70 years. For males, age, sugar consumption, and history of hospitalization were the most weighted variables regarding their importance in screening for T2DM using the GBM model, respectively; these variables were sugar consumption, urine blood, and age for females. GBM outperformed other models for both males and females with AUC of 0.75 (0.69-0.82) and 0.76 (0.71-0.80), and F1 score of 0.33 (0.27-0.39) and 0.42 (0.38-0.46), respectively. GBM also showed a sensitivity of 0.24 (0.19-0.29) and a specificity of 0.98 (0.96-1.0) in males and a sensitivity of 0.38 (0.34-0.42) and specificity of 0.92 (0.89-0.95) in females. Notably, close performance characteristics were detected among other ML models.ConclusionsGBM model might achieve better performance in screening for T2DM in a south Iranian population. Gradient Boosting Machine model might achieve better performance in screening for T2DM in a south Iranian population.image
引用
收藏
页数:11
相关论文
共 23 条
[1]   Artificial Intelligence Applications in Type 2 Diabetes Mellitus Care: Focus on Machine Learning Methods [J].
Abhari, Shahabeddin ;
Kalhori, Sharareh R. Niakan ;
Ebrahimi, Mehdi ;
Hasannejadasl, Hajar ;
Garavand, Ali .
HEALTHCARE INFORMATICS RESEARCH, 2019, 25 (04) :248-261
[2]   Recent applications of machine learning and deep learning models in the prediction, diagnosis, and management of diabetes: a comprehensive review [J].
Afsaneh, Elaheh ;
Sharifdini, Amin ;
Ghazzaghi, Hadi ;
Ghobadi, Mohadeseh Zarei .
DIABETOLOGY & METABOLIC SYNDROME, 2022, 14 (01)
[3]  
[Anonymous], 2022, Int. J. Intell. Netw., DOI DOI 10.1016/J.IJIN.2022.05.002
[4]   A systematic review of real-world diabetes prevention programs: learnings from the last 15 years [J].
Aziz, Zahra ;
Absetz, Pilvikki ;
Oldroyd, John ;
Pronk, Nicolaas P. ;
Oldenburg, Brian .
IMPLEMENTATION SCIENCE, 2015, 10
[5]   IDF Diabetes Atlas: Global estimates of diabetes prevalence for 2017 and projections for 2045 [J].
Cho, N. H. ;
Shaw, J. E. ;
Karuranga, S. ;
Huang, Y. ;
Fernandes, J. D. da Rocha ;
Ohlrogge, A. W. ;
Malanda, B. .
DIABETES RESEARCH AND CLINICAL PRACTICE, 2018, 138 :271-281
[6]   Use and performance of machine learning models for type 2 diabetes prediction in community settings: A systematic review and meta-analysis [J].
De Silva, Kushan ;
Lee, Wai Kit ;
Forbes, Andrew ;
Demmer, Ryan T. ;
Barton, Christopher ;
Enticott, Joanne .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2020, 143
[7]   A cohort study protocol to analyze the predisposing factors to common chronic non-communicable diseases in rural areas: Fasa Cohort Study [J].
Farjam, Mojtaba ;
Bahrami, Hossein ;
Bahramali, Ehsan ;
Jamshidi, Javad ;
Askari, Alireza ;
Zakeri, Habibollah ;
Homayounfar, Reza ;
Poustchi, Hossein ;
Malekzadeh, Reza .
BMC PUBLIC HEALTH, 2016, 16 :1-8
[8]   Machine learning and deep learning predictive models for type 2 diabetes: a systematic review [J].
Fregoso-Aparicio, Luis ;
Noguez, Julieta ;
Montesinos, Luis ;
Garcia-Garcia, Jose A. .
DIABETOLOGY & METABOLIC SYNDROME, 2021, 13 (01)
[9]   Cohort Profile: The Fasa Adults Cohort Study (FACS): a prospective study of non-communicable diseases risks [J].
Homayounfar, Reza ;
Farjam, Mojtaba ;
Bahramali, Ehsan ;
Sharafi, Mehdi ;
Poustchi, Hossein ;
Malekzadeh, Reza ;
Mansoori, Yaser ;
Naghizadeh, Mohammad Mehdi ;
Vakil, Mohammad Kazem ;
Dehghan, Azizallah .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2023, 52 (03) :E172-E178
[10]  
Jovic A, 2015, 2015 8TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), P1200, DOI 10.1109/MIPRO.2015.7160458