Logistic regression was as good as machine learning for predicting major chronic diseases

被引:259
作者
Nusinovici, Simon [1 ]
Tham, Yih Chung [1 ,3 ]
Yan, Marco Yu Chak [1 ]
Ting, Daniel Shu Wei [1 ,3 ]
Li, Jialiang [1 ,4 ]
Sabanayagam, Charumathi [1 ,3 ]
Wong, Tien Yin [1 ,2 ,3 ]
Cheng, Ching-Yu [1 ,2 ,3 ]
机构
[1] Singapore Natl Eye Ctr, Singapore Eye Res Inst, Singapore, Singapore
[2] Natl Univ Singapore, Yong Loo Lin Sch Med, Dept Ophthalmol, Singapore, Singapore
[3] Duke NUS Med Sch, Ophthalmol & Visual Sci Acad Clin Programme, Singapore, Singapore
[4] Natl Univ Singapore, Dept Stat & Appl Probabil, Singapore, Singapore
基金
英国医学研究理事会;
关键词
Machine learning; Logistic regression; Prognostic modeling; Chronic diseases; Interaction; Nonlinearity; SINGAPORE MALAY EYE; CONVENTIONAL REGRESSION; CARDIOVASCULAR-DISEASE; RISK PREDICTION; METHODOLOGY; CLASSIFICATION; RATIONALE; PROGNOSIS; MORTALITY; DIAGNOSIS;
D O I
10.1016/j.jclinepi.2020.03.002
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objective: To evaluate the performance of machine learning (ML) algorithms and to compare them with logistic regression for the prediction of risk of cardiovascular diseases (CVDs), chronic kidney disease (CKD), diabetes (DM), and hypertension (HTN) and in a prospective cohort study using simple clinical predictors. Study Design and Setting: We conducted analyses in a population-based cohort study in Asian adults (n = 6,762). Five different ML models were considered-single-hidden-layer neural network, support vector machine, random forest, gradient boosting machine, and k-nearest neighbor-and were compared with standard logistic regression. Results: The incidences at 6 years of CVD, CKD, DM, and HTN cases were 4.0%, 7.0%, 9.2%, and 34.6%, respectively. Logistic regression reached the highest area under the receiver operating characteristic curve for CKD (0.905 [0.88, 0.93]) and DM (0.768 [0.73, 0.81]) predictions. For CVD and HTN, the best models were neural network (0.753 [0.70, 0.81]) and support vector machine (0.780 [0.747, 0.812]), respectively. However, the differences with logistic regression were small (less than 1%) and nonsignificant. Logistic regression, gradient boosting machine, and neural network were systematically ranked among the best models. Conclusion: Logistic regression yields as good performance as ML models to predict the risk of major chronic diseases with low incidence and simple clinical predictors. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:56 / 69
页数:14
相关论文
共 50 条
  • [21] Prediction of preterm birth in multiparous women using logistic regression and machine learning approaches
    Belaghi, Reza Arabi
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [22] Comparison of machine learning and logistic regression models in predicting acute kidney injury: A systematic review and meta-analysis
    Song, Xuan
    Liu, Xinyan
    Liu, Fei
    Wang, Chunting
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2021, 151
  • [23] Predicting Overweight and Obesity Status Among Malaysian Working Adults With Machine Learning or Logistic Regression: Retrospective Comparison Study
    Wong, Jyh Eiin
    Yamaguchi, Miwa
    Nishi, Nobuo
    Araki, Michihiro
    Wee, Lei Hum
    JMIR FORMATIVE RESEARCH, 2022, 6 (12)
  • [24] Does Good ESG Lead to Better Financial Performances by Firms? Machine Learning and Logistic Regression Models of Public Enterprises in Europe
    De Lucia, Caterina
    Pazienza, Pasquale
    Bartlett, Mark
    SUSTAINABILITY, 2020, 12 (13)
  • [25] Machine Learning Model for Predicting Mortality Risk in Patients With Complex Chronic Conditions: Retrospective Analysis
    Guillamet, Guillem Hernandez
    Pallaruelo, Ariadna Ning Morancho
    Mezquita, Laura Miro
    Miralles, Ramon
    Mas, Miquel angel
    Papaseit, Maria Jose Ulldemolins
    Cuxart, Oriol Estrada
    Segui, Francesc Lopez
    ONLINE JOURNAL OF PUBLIC HEALTH INFORMATICS, 2024, 15
  • [26] Comparison of machine learning and the regression-based EHMRG model for predicting early mortality in acute heart failure
    Austin, David E.
    Lee, Douglas S.
    Wang, Chloe X.
    Ma, Shihao
    Wang, Xuesong
    Porter, Joan
    Wang, Bo
    INTERNATIONAL JOURNAL OF CARDIOLOGY, 2022, 365 : 78 - 84
  • [27] Predicting skilled delivery service use in Ethiopia: dual application of logistic regression and machine learning algorithms
    Brook Tesfaye
    Suleman Atique
    Tariq Azim
    Mihiretu M. Kebede
    BMC Medical Informatics and Decision Making, 19
  • [28] Predicting synkinesis caused by Bell's palsy or Ramsay Hunt syndrome using machine learning-based logistic regression
    Kishimoto-Urata, Megumi
    Urata, Shinji
    Nishijima, Hironobu
    Baba, Shintaro
    Fujimaki, Yoko
    Kondo, Kenji
    Yamasoba, Tatsuya
    LARYNGOSCOPE INVESTIGATIVE OTOLARYNGOLOGY, 2023, 8 (05): : 1189 - 1195
  • [29] Comparison of logistic regression and machine learning methods for predicting postoperative delirium in elderly patients: A retrospective study
    Song, Yu-Xiang
    Yang, Xiao-Dong
    Luo, Yun-Gen
    Ouyang, Chun-Lei
    Yu, Yao
    Ma, Yu-Long
    Li, Hao
    Lou, Jing-Sheng
    Liu, Yan-Hong
    Chen, Yi-Qiang
    Cao, Jiang-Bei
    Mi, Wei-Dong
    CNS NEUROSCIENCE & THERAPEUTICS, 2023, 29 (01) : 158 - 167
  • [30] Testing and Validating Two Morphological Flare Predictors by Logistic Regression Machine Learning
    Korsos, M. B.
    Erdelyi, R.
    Liu, J.
    Morgan, H.
    FRONTIERS IN ASTRONOMY AND SPACE SCIENCES, 2021, 7