Using machine learning algorithms to identify chronic heart disease: National Health and Nutrition Examination Survey 2011-2018

被引:2
作者
Chen, Xiaofei [1 ]
Guo, Dingjie [1 ]
Wang, Yashan [1 ]
Qu, Zihan [1 ]
He, Guangliang [1 ]
Sui, Chuanying [1 ]
Lan, Linwei [1 ]
Zhang, Xin [1 ]
Duan, Yuqing [1 ]
Meng, Hengyu [1 ]
Wang, Chunpeng [2 ]
Liu, Xin [1 ]
机构
[1] Jilin Univ, Sch Publ Hlth, Epidemiol & Stat, Changchun 130021, Jilin, Peoples R China
[2] Northeast Normal Univ, Sch Math & Stat, Changchun, Jilin, Peoples R China
关键词
chronic heart disease; machine learning; National Health and Nutrition Examination Survey; LOGISTIC-REGRESSION; PREDICTION;
D O I
10.2459/JCM.0000000000001497
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
ObjectiveThe number of heart disease patients is increasing. Establishing a risk assessment model for chronic heart disease (CHD) based on risk factors is beneficial for early diagnosis and timely treatment of high-risk populations.MethodsFour machine learning models, including logistic regression, support vector machines (SVM), random forests, and extreme gradient boosting (XGBoost), were used to evaluate the CHD among 14 971 participants in the National Health and Nutrition Examination Survey from 2011 to 2018. The area under the receiver-operator curve (AUC) is the indicator that we evaluate the model.ResultsIn four kinds of models, SVM has the best classification performance (AUC = 0.898), and the AUC value of logistic regression and random forest were 0.895 and 0.894, respectively. Although XGBoost performed the worst with an AUC value of 0.891. There was no significant difference among the four algorithms. In the importance analysis of variables, the three most important variables were taking low-dose aspirin, chest pain or discomfort, and total amount of dietary supplements taken.ConclusionAll four machine learning classifiers can identify the occurrence of CHD based on population survey data. We also determined the contribution of variables in the prediction, which can further explore their effectiveness in actual clinical data.
引用
收藏
页码:461 / 466
页数:6
相关论文
共 30 条
  • [1] Machine learning-based heart disease diagnosis: A systematic literature review
    Ahsan, Md Manjurul
    Siddique, Zahed
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2022, 128
  • [2] Heart disease prediction using supervised machine learning algorithms: Performance analysis and comparison
    Ali, Md Mamun
    Paul, Bikash Kumar
    Ahmed, Kawsar
    Bui, Francis M.
    Quinn, Julian M. W.
    Moni, Mohammad Ali
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 136
  • [3] Framingham Heart Study
    Andersson, Charlotte
    Nayor, Matthew
    Tsao, Connie W.
    Levy, Daniel
    Vasan, Ramachandran S.
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2021, 77 (21) : 2680 - 2692
  • [4] Significance of Visible Non-Invasive Risk Attributes for the Initial Prediction of Heart Disease Using Different Machine Learning Techniques
    Ansarullah, Syed Immamul
    Saif, Syed Mohsin
    Kumar, Pradeep
    Kirmani, Mudasir Manzoor
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [5] Data preprocessing for heart disease classification: A systematic literature review
    Benhar, H.
    Idri, A.
    Fernandez-Aleman, J. L.
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 195
  • [6] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [7] Current Applications and Future Impact of Machine Learning in Radiology
    Choy, Garry
    Khalilzadeh, Omid
    Michalski, Mark
    Do, Synho
    Samir, Anthony E.
    Pianykh, Oleg S.
    Geis, J. Raymond
    Pandharipande, Pari V.
    Brink, James A.
    Dreyer, Keith J.
    [J]. RADIOLOGY, 2018, 288 (02) : 318 - 328
  • [8] A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models
    Christodoulou, Evangelia
    Ma, Jie
    Collins, Gary S.
    Steyerberg, Ewout W.
    Verbakel, Jan Y.
    Van Calster, Ben
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2019, 110 : 12 - 22
  • [9] Deep learning to detect significant coronary artery disease from plain chest radiographs AI4CAD
    D'Ancona, Giuseppe
    Massussi, Mauro
    Savardi, Mattia
    Signoroni, Alberto
    Di Bacco, Lorenzo
    Farina, Davide
    Metra, Marco
    Maroldi, Roberto
    Muneretto, Claudio
    Ince, Huseyin
    Costabile, Davide
    Murero, Monica
    Chizzola, Giuliano
    Curello, Salvatore
    Benussi, Stefano
    [J]. INTERNATIONAL JOURNAL OF CARDIOLOGY, 2023, 370 : 435 - 441
  • [10] A data-driven approach to predicting diabetes and cardiovascular disease with machine learning
    Dinh, An
    Miertschin, Stacey
    Young, Amber
    Mohanty, Somya D.
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (01)