Predicting Type 2 Diabetes Using Logistic Regression and Machine Learning Approaches

被引:92
作者
Joshi, Ram D. [1 ]
Dhakal, Chandra K. [2 ]
机构
[1] Texas Tech Univ, Dept Econ, Lubbock, TX 79409 USA
[2] Univ Georgia, Dept Agr & Appl Econ, Athens, GA 30602 USA
关键词
decision tree; diabetes risk factors; machine learning; prediction accuracy; INSULIN-RESISTANCE; RISK-FACTORS; LIFE-STYLE; MELLITUS; RECOMMENDATIONS; POPULATION; DISEASES; OBESITY; TOOL;
D O I
10.3390/ijerph18147346
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Diabetes mellitus is one of the most common human diseases worldwide and may cause several health-related complications. It is responsible for considerable morbidity, mortality, and economic loss. A timely diagnosis and prediction of this disease could provide patients with an opportunity to take the appropriate preventive and treatment strategies. To improve the understanding of risk factors, we predict type 2 diabetes for Pima Indian women utilizing a logistic regression model and decision tree-a machine learning algorithm. Our analysis finds five main predictors of type 2 diabetes: glucose, pregnancy, body mass index (BMI), diabetes pedigree function, and age. We further explore a classification tree to complement and validate our analysis. The six-fold classification tree indicates glucose, BMI, and age are important factors, while the ten-node tree implies glucose, BMI, pregnancy, diabetes pedigree function, and age as the significant predictors. Our preferred specification yields a prediction accuracy of 78.26% and a cross-validation error rate of 21.74%. We argue that our model can be applied to make a reasonable prediction of type 2 diabetes, and could potentially be used to complement existing preventive measures to curb the incidence of diabetes and reduce associated costs.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Comparison of Statistical Logistic Regression and RandomForest Machine Learning Techniques in Predicting Diabetes
    Daghistani, Tahani
    Alshammari, Riyad
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2020, 11 (02) : 78 - 83
  • [2] Comparison of Logistic Regression and Machine Learning Approaches in Predicting Depressive Symptoms: A National-Based Study
    Dong, Xing-Xuan
    Liu, Jian-Hua
    Zhang, Tian-Yang
    Pan, Chen-Wei
    Zhao, Chun-Hua
    Wu, Yi-Bo
    Chen, Dan-Dan
    PSYCHIATRY INVESTIGATION, 2025, 22 (03) : 267 - 278
  • [3] Comparison between multiple logistic regression and machine learning methods in prediction of abnormal thallium scans in type 2 diabetes
    Yang, Chung-Chi
    Peng, Chung-Hsin
    Huang, Li-Ying
    Chen, Fang Yu
    Kuo, Chun-Heng
    Wu, Chung-Ze
    Hsia, Te-Lin
    Lin, Chung-Yu
    WORLD JOURNAL OF CLINICAL CASES, 2023, 11 (33)
  • [4] Advancing Breast Cancer Prediction using Logistic Regression and Machine Learning Techniques
    Bhuria, Ruchika
    Gill, Kanwarpartap Singh
    Malhotra, Sonal
    Singh, Mukesh
    2ND INTERNATIONAL CONFERENCE ON SUSTAINABLE COMPUTING AND SMART SYSTEMS, ICSCSS 2024, 2024, : 1374 - 1377
  • [5] Prediction of preterm birth in multiparous women using logistic regression and machine learning approaches
    Belaghi, Reza Arabi
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [6] Logistic regression was as good as machine learning for predicting major chronic diseases
    Nusinovici, Simon
    Tham, Yih Chung
    Yan, Marco Yu Chak
    Ting, Daniel Shu Wei
    Li, Jialiang
    Sabanayagam, Charumathi
    Wong, Tien Yin
    Cheng, Ching-Yu
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2020, 122 : 56 - 69
  • [7] Predicting Glycemic Control in a Small Cohort of Children with Type 1 Diabetes Using Machine Learning Algorithms
    Neamtu, Bogdan
    Negrea, Mihai Octavian
    Neagu, Iuliana
    MATHEMATICS, 2023, 11 (20)
  • [8] Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study
    Zhang, Lei
    Shang, Xianwen
    Sreedharan, Subhashaan
    Yan, Xixi
    Liu, Jianbin
    Keel, Stuart
    Wu, Jinrong
    Peng, Wei
    He, Mingguang
    JMIR MEDICAL INFORMATICS, 2020, 8 (07)
  • [9] Using Machine Learning to Predict Abnormal Carotid Intima-Media Thickness in Type 2 Diabetes
    Wu, Chung-Ze
    Huang, Li-Ying
    Chen, Fang-Yu
    Kuo, Chun-Heng
    Yeih, Dong-Feng
    DIAGNOSTICS, 2023, 13 (11)
  • [10] Predicting diabetic nephropathy in type 2 diabetic patients using machine learning algorithms
    Sarkhosh, Seyyed Mahdi Hosseini
    Esteghamati, Alireza
    Hemmatabadi, Mahboobeh
    Daraei, Morteza
    JOURNAL OF DIABETES AND METABOLIC DISORDERS, 2022, 21 (02) : 1433 - 1441