Predicting Type 2 Diabetes Using Logistic Regression and Machine Learning Approaches

被引:92
作者
Joshi, Ram D. [1 ]
Dhakal, Chandra K. [2 ]
机构
[1] Texas Tech Univ, Dept Econ, Lubbock, TX 79409 USA
[2] Univ Georgia, Dept Agr & Appl Econ, Athens, GA 30602 USA
关键词
decision tree; diabetes risk factors; machine learning; prediction accuracy; INSULIN-RESISTANCE; RISK-FACTORS; LIFE-STYLE; MELLITUS; RECOMMENDATIONS; POPULATION; DISEASES; OBESITY; TOOL;
D O I
10.3390/ijerph18147346
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Diabetes mellitus is one of the most common human diseases worldwide and may cause several health-related complications. It is responsible for considerable morbidity, mortality, and economic loss. A timely diagnosis and prediction of this disease could provide patients with an opportunity to take the appropriate preventive and treatment strategies. To improve the understanding of risk factors, we predict type 2 diabetes for Pima Indian women utilizing a logistic regression model and decision tree-a machine learning algorithm. Our analysis finds five main predictors of type 2 diabetes: glucose, pregnancy, body mass index (BMI), diabetes pedigree function, and age. We further explore a classification tree to complement and validate our analysis. The six-fold classification tree indicates glucose, BMI, and age are important factors, while the ten-node tree implies glucose, BMI, pregnancy, diabetes pedigree function, and age as the significant predictors. Our preferred specification yields a prediction accuracy of 78.26% and a cross-validation error rate of 21.74%. We argue that our model can be applied to make a reasonable prediction of type 2 diabetes, and could potentially be used to complement existing preventive measures to curb the incidence of diabetes and reduce associated costs.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Exploratory analysis using machine learning of predictive factors for falls in type 2 diabetes
    Suzuki, Yasuhiro
    Suzuki, Hiroaki
    Ishikawa, Tatsuya
    Yamada, Yasunori
    Yatoh, Shigeru
    Sugano, Yoko
    Iwasaki, Hitoshi
    Sekiya, Motohiro
    Yahagi, Naoya
    Hada, Yasushi
    Shimano, Hitoshi
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [42] Predicting Overweight and Obesity Status Among Malaysian Working Adults With Machine Learning or Logistic Regression: Retrospective Comparison Study
    Wong, Jyh Eiin
    Yamaguchi, Miwa
    Nishi, Nobuo
    Araki, Michihiro
    Wee, Lei Hum
    JMIR FORMATIVE RESEARCH, 2022, 6 (12)
  • [43] Comparison between traditional logistic regression and machine learning for predicting mortality in adult sepsis patients
    Wu, Hongsheng
    Liao, Biling
    Ji, Tengfei
    Ma, Keqiang
    Luo, Yumei
    Zhang, Shengmin
    FRONTIERS IN MEDICINE, 2025, 11
  • [44] Prediction of Acute Kidney Injury after Liver Transplantation: Machine Learning Approaches vs. Logistic Regression Model
    Lee, Hyung-Chul
    Yoon, Soo Bin
    Yang, Seong-Mi
    Kim, Won Ho
    Ryu, Ho-Geol
    Jung, Chul-Woo
    Suh, Kyung-Suk
    Lee, Kook Hyun
    JOURNAL OF CLINICAL MEDICINE, 2018, 7 (11)
  • [45] Prediction of Heart Disease using Decision Tree over Logistic Regression using Machine Learning with Improved Accuracy
    Raj, K. N. S. Shanmukha
    Thinakaran, K.
    CARDIOMETRY, 2022, (25): : 1514 - 1519
  • [46] Prediction of the rate of penetration using logistic regression algorithm of machine learning model
    Deng S.
    Wei M.
    Xu M.
    Cai W.
    Arabian Journal of Geosciences, 2021, 14 (21)
  • [47] Predicting Aquaculture Water Quality Using Machine Learning Approaches
    Li, Tingting
    Lu, Jian
    Wu, Jun
    Zhang, Zhenhua
    Chen, Liwei
    WATER, 2022, 14 (18)
  • [48] Predicting Methylphenidate Response in ADHD Using Machine Learning Approaches
    Kim, Jae-Won
    Sharma, Vinod
    Ryan, Neal D.
    INTERNATIONAL JOURNAL OF NEUROPSYCHOPHARMACOLOGY, 2015, 18 (11)
  • [49] Logistic regression has similar performance to optimised machine learning algorithms in a clinical setting: application to the discrimination between type 1 and type 2 diabetes in young adults
    Anita L. Lynam
    John M. Dennis
    Katharine R. Owen
    Richard A. Oram
    Angus G. Jones
    Beverley M. Shields
    Lauric A. Ferrat
    Diagnostic and Prognostic Research, 4 (1)
  • [50] Machine Learning Approaches to Predict Risks of Diabetic Complications and Poor Glycemic Control in Nonadherent Type 2 Diabetes
    Fan, Yuting
    Long, Enwu
    Cai, Lulu
    Cao, Qiyuan
    Wu, Xingwei
    Tong, Rongsheng
    FRONTIERS IN PHARMACOLOGY, 2021, 12