Construction and Validation of a Predictive Model for Coronary Artery Disease Using Extreme Gradient Boosting

Cited by: 1
Authors
Zhang, Zheng [1,2]
Shao, Binbin [3]
Liu, Hongzhou [2,4]
Huang, Ben [2,5]
Gao, Xuechen [1]
Qiu, Jun [1]
Wang, Chen [1,2]
Affiliations
[1] Soochow Univ, Ctr Clin Lab, Affiliated Hosp 1, 899 Pinghai Rd, Suzhou 215006, Jiangsu, Peoples R China
[2] Wuhan Univ, Ctr Gene Diag, Dept Lab Med, Zhongnan Hosp, Wuhan, Hubei, Peoples R China
[3] Nanjing Med Univ, Nanjing Women & Childrens Healthcare Hosp, Dept Prenatal Diag, Womens Hosp, Nanjing, Jiangsu, Peoples R China
[4] Chengdu Med Coll, Sch Clin Med, Affiliated Hosp 1, Chengdu, Sichuan Prov, Peoples R China
[5] Nanjing Med Univ, Dept Lab Med, Affiliated Hosp 1, Nanjing, Jiangsu, Peoples R China
Keywords
coronary artery disease; predictive model; machine learning; XGBoost; primary prevention; CARDIOVASCULAR-DISEASE; HEART-DISEASE; RISK; ATHEROSCLEROSIS; TRIGLYCERIDES; CHOLESTEROL; MORTALITY; HEALTH; LIPOPROTEINS; INSIGHTS;
DOI
10.2147/JIR.S464489
CLC number
R392 [Medical Immunology]; Q939.91 [Immunology];
Subject classification code
100102;
Abstract
Purpose: Early recognition of coronary artery disease (CAD) could delay its progression and significantly reduce mortality. Sensitive, specific, cost-efficient, and non-invasive indicators for assessing individual CAD risk in community population screening are urgently needed. Patients and Methods: A total of 3112 patients with CAD and 3182 controls were recruited from three clinical centers in China, and their baseline and clinical characteristics were compared. In the discovery cohort, least absolute shrinkage and selection operator (LASSO) regression was used to identify significant features, and four machine learning algorithms (logistic regression, support vector machine (SVM), random forest (RF), and extreme gradient boosting (XGBoost)) were applied to construct models for CAD risk assessment. Receiver operating characteristic (ROC) and precision-recall (PR) curves were used to evaluate their predictive accuracy. The optimal model was interpreted by Shapley additive explanations (SHAP) analysis, assessed by the ROC curve, calibration curve, and decision curve analysis (DCA), and validated in two external cohorts. Results: After LASSO filtration, all included variables were considered statistically significant. Four machine learning models were constructed from these features, and the ROC and PR curves indicated that the XGBoost model had the highest predictive performance, with an area under the ROC curve (AUC) of 0.988 (95% CI: 0.986-0.991) for distinguishing CAD patients from controls, a sensitivity of 94.6%, and a specificity of 94.6%. The calibration curve showed that predicted results agreed well with actual observations, and DCA showed a better net benefit across a wide range of threshold probabilities. External validation also showed favorable discriminatory performance, with an AUC, sensitivity, and specificity of 0.953 (95% CI: 0.945-0.960), 89.9%, and 87.1% in the validation cohort, and 0.935 (95% CI: 0.915-0.955), 82.0%, and 90.3% in the replication cohort. Conclusion: Our model is highly informative for clinical practice and will be conducive to primary prevention and to tailoring precise management for CAD patients.
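The abstract reports discrimination as AUC, sensitivity, and specificity. As a minimal sketch of how such figures can be computed from predicted risk scores, the Python below uses only the standard library. The function names, the toy labels and scores, and the 0.5 cut-off are all illustrative assumptions; they are not the study's data, code, or chosen threshold.

```python
from typing import Sequence, Tuple


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random case outscores a random control.

    Ties between a case and a control count as half a win.
    """
    cases = [s for y, s in zip(labels, scores) if y == 1]
    controls = [s for y, s in zip(labels, scores) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for c in cases:
        for k in controls:
            if c > k:
                wins += 1.0
            elif c == k:
                wins += 0.5
    return wins / (len(cases) * len(controls))


def sensitivity_specificity(
    labels: Sequence[int], scores: Sequence[float], threshold: float
) -> Tuple[float, float]:
    """Sensitivity and specificity when scores >= threshold are called positive."""
    pairs = list(zip(labels, scores))
    tp = sum(1 for y, s in pairs if y == 1 and s >= threshold)
    fn = sum(1 for y, s in pairs if y == 1 and s < threshold)
    tn = sum(1 for y, s in pairs if y == 0 and s < threshold)
    fp = sum(1 for y, s in pairs if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical toy data: 1 = CAD case, 0 = control; scores are made-up risks.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1]

auc = roc_auc(labels, scores)
sens, spec = sensitivity_specificity(labels, scores, threshold=0.5)
print(f"AUC={auc:.4f} sensitivity={sens:.2f} specificity={spec:.2f}")
```

The pairwise comparison is quadratic in sample size, which is fine for illustration; for cohorts of several thousand participants a rank-based computation or a library routine would be the more practical choice.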
Pages: 4163-4174
Page count: 12