Predicting Type 2 Diabetes Using Logistic Regression and Machine Learning Approaches

被引:92
作者
Joshi, Ram D. [1 ]
Dhakal, Chandra K. [2 ]
机构
[1] Texas Tech Univ, Dept Econ, Lubbock, TX 79409 USA
[2] Univ Georgia, Dept Agr & Appl Econ, Athens, GA 30602 USA
关键词
decision tree; diabetes risk factors; machine learning; prediction accuracy; INSULIN-RESISTANCE; RISK-FACTORS; LIFE-STYLE; MELLITUS; RECOMMENDATIONS; POPULATION; DISEASES; OBESITY; TOOL;
D O I
10.3390/ijerph18147346
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Diabetes mellitus is one of the most common human diseases worldwide and may cause several health-related complications. It is responsible for considerable morbidity, mortality, and economic loss. A timely diagnosis and prediction of this disease could provide patients with an opportunity to take the appropriate preventive and treatment strategies. To improve the understanding of risk factors, we predict type 2 diabetes for Pima Indian women utilizing a logistic regression model and decision tree-a machine learning algorithm. Our analysis finds five main predictors of type 2 diabetes: glucose, pregnancy, body mass index (BMI), diabetes pedigree function, and age. We further explore a classification tree to complement and validate our analysis. The six-fold classification tree indicates glucose, BMI, and age are important factors, while the ten-node tree implies glucose, BMI, pregnancy, diabetes pedigree function, and age as the significant predictors. Our preferred specification yields a prediction accuracy of 78.26% and a cross-validation error rate of 21.74%. We argue that our model can be applied to make a reasonable prediction of type 2 diabetes, and could potentially be used to complement existing preventive measures to curb the incidence of diabetes and reduce associated costs.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Comparison of logistic regression and machine learning methods for predicting postoperative delirium in elderly patients: A retrospective study
    Song, Yu-Xiang
    Yang, Xiao-Dong
    Luo, Yun-Gen
    Ouyang, Chun-Lei
    Yu, Yao
    Ma, Yu-Long
    Li, Hao
    Lou, Jing-Sheng
    Liu, Yan-Hong
    Chen, Yi-Qiang
    Cao, Jiang-Bei
    Mi, Wei-Dong
    CNS NEUROSCIENCE & THERAPEUTICS, 2023, 29 (01) : 158 - 167
  • [22] Identification of the risk factors of type 2 diabetes and its prediction using machine learning techniques
    Islam, Md Merajul
    Rahman, Md Jahanur
    Abedin, Md Menhazul
    Ahammed, Benojir
    Ali, Mohammad
    Ahmed, N. A. M. Faisal
    Maniruzzaman, Md
    HEALTH SYSTEMS, 2023, 12 (02) : 243 - 254
  • [23] Predicting Diabetes Using Machine Learning Techniques
    Kirgil, Elif Nur Haner
    Erkal, Begum
    Ayyildiz, Tulin Ercelebi
    2022 INTERNATIONAL CONFERENCE ON THEORETICAL AND APPLIED COMPUTER SCIENCE AND ENGINEERING (ICTASCE), 2022, : 137 - 141
  • [24] Mortality risk prediction in burn injury: Comparison of logistic regression with machine learning approaches
    Stylianou, Neophytos
    Akbarov, Artur
    Kontopantelis, Evangelos
    Buchan, Iain
    Dunn, Ken W.
    BURNS, 2015, 41 (05) : 925 - 934
  • [25] Predicting the Risk of Incident Type 2 Diabetes Mellitus in Chinese Elderly Using Machine Learning Techniques
    Liu, Qing
    Zhang, Miao
    He, Yifeng
    Zhang, Lei
    Zou, Jingui
    Yan, Yaqiong
    Guo, Yan
    JOURNAL OF PERSONALIZED MEDICINE, 2022, 12 (06):
  • [26] Predicting Diabetes Disease Occurrence Using Logistic Regression: An Early Detection Approach
    Abdalrada A.S.
    Neamah A.F.
    Murad H.
    Iraqi Journal for Computer Science and Mathematics, 2024, 5 (01): : 160 - 167
  • [27] Predicting Location of Tweets Using Machine Learning Approaches
    Alsaqer, Mohammed
    Alelyani, Salem
    Mohana, Mohamed
    Alreemy, Khalid
    Alqahtani, Ali
    APPLIED SCIENCES-BASEL, 2023, 13 (05):
  • [28] Predicting Employee Attrition Using Machine Learning Approaches
    Raza, Ali
    Munir, Kashif
    Almutairi, Mubarak
    Younas, Faizan
    Fareed, Mian Muhammad Sadiq
    APPLIED SCIENCES-BASEL, 2022, 12 (13):
  • [29] Logistic Regression for Machine Learning in Process Tomography
    Rymarczyk, Tomasz
    Kozlowski, Edward
    Klosowski, Grzegorz
    Niderla, Konrad
    SENSORS, 2019, 19 (15)
  • [30] Predicting diabetic nephropathy in type 2 diabetic patients using machine learning algorithms
    Seyyed Mahdi Hosseini Sarkhosh
    Alireza Esteghamati
    Mahboobeh Hemmatabadi
    Morteza Daraei
    Journal of Diabetes & Metabolic Disorders, 2022, 21 : 1433 - 1441