Predicting Type 2 Diabetes Using Logistic Regression and Machine Learning Approaches

被引:92
作者
Joshi, Ram D. [1 ]
Dhakal, Chandra K. [2 ]
机构
[1] Texas Tech Univ, Dept Econ, Lubbock, TX 79409 USA
[2] Univ Georgia, Dept Agr & Appl Econ, Athens, GA 30602 USA
关键词
decision tree; diabetes risk factors; machine learning; prediction accuracy; INSULIN-RESISTANCE; RISK-FACTORS; LIFE-STYLE; MELLITUS; RECOMMENDATIONS; POPULATION; DISEASES; OBESITY; TOOL;
D O I
10.3390/ijerph18147346
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Diabetes mellitus is one of the most common human diseases worldwide and may cause several health-related complications. It is responsible for considerable morbidity, mortality, and economic loss. A timely diagnosis and prediction of this disease could provide patients with an opportunity to take the appropriate preventive and treatment strategies. To improve the understanding of risk factors, we predict type 2 diabetes for Pima Indian women utilizing a logistic regression model and decision tree-a machine learning algorithm. Our analysis finds five main predictors of type 2 diabetes: glucose, pregnancy, body mass index (BMI), diabetes pedigree function, and age. We further explore a classification tree to complement and validate our analysis. The six-fold classification tree indicates glucose, BMI, and age are important factors, while the ten-node tree implies glucose, BMI, pregnancy, diabetes pedigree function, and age as the significant predictors. Our preferred specification yields a prediction accuracy of 78.26% and a cross-validation error rate of 21.74%. We argue that our model can be applied to make a reasonable prediction of type 2 diabetes, and could potentially be used to complement existing preventive measures to curb the incidence of diabetes and reduce associated costs.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Comparison of multiple linear regression and machine learning methods in predicting cognitive function in older Chinese type 2 diabetes patients
    Chi-Hao Liu
    Chung-Hsin Peng
    Li-Ying Huang
    Fang-Yu Chen
    Chun-Heng Kuo
    Chung-Ze Wu
    Yu-Fang Cheng
    BMC Neurology, 24
  • [32] Machine learning approach to predicting albuminuria in persons with type 2 diabetes: An analysis of the LOOK AHEAD Cohort
    Khitan, Zeid
    Nath, Tanmay
    Santhanam, Prasanna
    JOURNAL OF CLINICAL HYPERTENSION, 2021, 23 (12) : 2137 - 2145
  • [33] Predicting Antidiabetic Peptide Activity: A Machine Learning Perspective on Type 1 and Type 2 Diabetes
    Cai, Kaida
    Zhang, Zhe
    Zhu, Wenzhou
    Liu, Xiangwei
    Yu, Tingqing
    Liao, Wang
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (18)
  • [34] Genome-wide discovery of miRNAs using ensembles of machine learning algorithms and logistic regression
    Ulfenborg, Benjamin
    Klinga-Levan, Karin
    Olsson, Bjorn
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 13 (04) : 338 - 359
  • [35] Classification and prediction of diabetes disease using machine learning paradigm
    Maniruzzaman, Md.
    Rahman, Md. Jahanur
    Ahammed, Benojir
    Abedin, Md. Menhazul
    HEALTH INFORMATION SCIENCE AND SYSTEMS, 2020, 8 (01)
  • [36] The diabacare cloud: predicting diabetes using machine learning
    Alam, Mehtab
    Khan, Ihtiram Raza
    Alam, Mohammad Afshar
    Siddiqui, Farheen
    Tanweer, Safdar
    ACTA SCIENTIARUM-TECHNOLOGY, 2024, 46 (01)
  • [37] Predicting ipsilateral recurrence in women treated for ductal carcinoma in situ using machine learning and multivariable logistic regression models
    Lamb, Leslie R.
    Mercaldo, Sarah
    Kim, Geunwon
    Hovis, Keegan
    Oseni, Tawakalitu O.
    Bahl, Manisha
    CLINICAL IMAGING, 2022, 92 : 94 - 100
  • [38] Diabetes Predicting mHealth Application Using Machine Learning
    Khan, Nabila Shahnaz
    Muaz, Mehedi Hasan
    Kabir, Anusha
    Islam, Muhammad Nazrul
    2017 IEEE INTERNATIONAL WIE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (IEEE WIECON-ECE 2017), 2017, : 237 - 240
  • [39] Nutritional intake of micronutrient and macronutrient and type 2 diabetes: machine learning schemes
    Rashidmayvan, Mohammad
    Mansoori, Amin
    Derakhshan-Nezhad, Elahe
    Tanbakuchi, Davoud
    Sangin, Fatemeh
    Mohammadi-Bajgiran, Maryam
    Abedsaeidi, Malihehsadat
    Ghazizadeh, Sara
    Sarabi, MohammadReza Mohammad Taghizadeh
    Rezaee, Ali
    Ferns, Gordon
    Esmaily, Habibollah
    Ghayour-Mobarhan, Majid
    JOURNAL OF HEALTH POPULATION AND NUTRITION, 2025, 44 (01) : 31
  • [40] Predicting the risk of osteoporosis in older Vietnamese women using machine learning approaches
    Hanh My Bui
    Minh Hoang Ha
    Hoang Giang Pham
    Thang Phuoc Dao
    Thuy-Trang Thi Nguyen
    Minh Loi Nguyen
    Ngan Thi Vuong
    Xuyen Hong Thi Hoang
    Loc Tien Do
    Thanh Xuan Dao
    Cuong Quang Le
    SCIENTIFIC REPORTS, 2022, 12 (01)