Predicting the risk of cardiovascular disease in adults exposed to heavy metals: Interpretable machine learning

被引:2
作者
Shen, Meiyue [1 ]
Zhang, Yine [2 ]
Zhan, Runqing [3 ]
Du, Tingwei [1 ]
Shen, Peixuan [1 ]
Lu, Xiaochuan [1 ]
Liu, Shengnan [1 ,2 ,3 ]
Guo, Rongrong [1 ]
Shen, Xiaoli [1 ]
机构
[1] Qingdao Univ, Sch Publ Hlth, Dept Epidemiol & Hlth Stat, 308 Ningxia Rd, Qingdao 266071, Peoples R China
[2] Ningxia Ctr Dis Control & Prevent, Yinchuan, Peoples R China
[3] Qingdao Haici Hosp, Qingdao 266033, Peoples R China
关键词
Cardiovascular disease; Heavy metals; Machine learning; Random forest; AdaBoost; Partial dependence plot; MECHANISMS; CADMIUM; METAANALYSIS; HYPERTENSION; BIOMARKERS; TUNGSTEN; BARIUM; HEALTH;
D O I
10.1016/j.ecoenv.2024.117570
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Machine learning exhibits excellent performance in terms of predictive power. We aimed to construct an interpretable machine learning model utilizing National Health and Nutrition Examination Survey data to investigate the relationship between heavy metal exposure and cardiovascular disease (CVD). A total of 4600 adults were included in the analysis. The Least Absolute Shrinkage and Selection Operator regression method was employed to select relevant feature variables. Subsequently, six machine learning models were constructed, including random forest, decision tree, gradient boosting decision tree, k-nearest neighbor, support vector machine, and AdaBoost algorithms. Feature importance analysis, partial dependence plot, and shapley additive explanations were integrated to enhance the interpretability of the CVD prediction model. Among all models, the random forest exhibited the best performance, with an accuracy of 90 %, an area under the curve of 0.85, and an F1 score of 0.86. Urine cadmium (Cd), blood lead (Pb), urine thallium (Tl), and urine tungsten (W) were identified as the most significant predictors of CVD, with importance scores of 0.062, 0.057, 0.051, and 0.050, respectively. At the overall level, higher levels of urine Cd, blood Pb, and urine W were associated with an increased risk of CVD, whereas a lower level of urine Tl was linked to a reduced CVD risk. Additionally, the analysis of synergistic effects revealed that Cd was the predominant determinant of CVD risk. The random forestbased CVD prediction model demonstrated excellent predictive power and provided valuable insights for personalized patient care and optimal resource allocation in populations exposed to heavy metals.
引用
收藏
页数:12
相关论文
共 88 条
[1]   Effect on the offspring of pregnant females CD-1 mice treated with a single thallium(I) application [J].
Alvarez-Barrera, Lucila ;
Rodriguez-Mercado, Juan. J. ;
Mateos-Nava, Rodrigo A. ;
Vazquez-Martinez, Yazmin ;
Altamirano-Lozano, Mario A. .
REPRODUCTIVE TOXICOLOGY, 2019, 90 :1-7
[2]   The acute systemic toxicity of thallium in rats produces oxidative stress: attenuation by metallothionein and Prussian blue [J].
Anaya-Ramos, Laura ;
Diaz-Ruiz, Araceli ;
Rios, Camilo ;
Mendez-Armenta, Marisela ;
Montes, Sergio ;
Aguirre-Vidal, Yoshajandith ;
Garcia-Jimenez, Sara ;
Baron-Flores, Veronica ;
Monroy-Noyola, Antonio .
BIOMETALS, 2021, 34 (06) :1295-1311
[3]   Application of machine learning techniques for predicting survival in ovarian cancer [J].
Azar, Amir Sorayaie ;
Rikan, Samin Babaei ;
Naemi, Amin ;
Mohasefi, Jamshid Bagherzadeh ;
Pirnejad, Habibollah ;
Mohasefi, Matin Bagherzadeh ;
Wiil, Uffe Kock .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
[4]   Heavy metal pollution in the aquatic environment: efficient and low-cost removal approaches to eliminate their toxicity: a review [J].
Aziz, Kosar Hikmat Hama ;
Mustafa, Fryad S. ;
Omer, Khalid M. ;
Hama, Sarkawt ;
Hamarawf, Rebaz Fayaq ;
Rahman, Kaiwan Othman .
RSC ADVANCES, 2023, 13 (26) :17595-17610
[5]   Decision trees and random forests [J].
Becker, Thijs ;
Rousseau, Axel-Jan ;
Geubbelmans, Melvin ;
Burzykowski, Tomasz ;
Valkenborg, Dirk .
AMERICAN JOURNAL OF ORTHODONTICS AND DENTOFACIAL ORTHOPEDICS, 2023, 164 (06) :894-897
[6]   Interpretable machine learning with tree-based shapley additive explanations: Application to metabolomics datasets for binary classification [J].
Bifarin, Olatomiwa O. .
PLOS ONE, 2023, 18 (05)
[7]   The endocrine disruptor cadmium: a new player in the pathophysiology of metabolic diseases [J].
Bimonte, V. M. ;
Besharat, Z. M. ;
Antonioni, A. ;
Cella, V. ;
Lenzi, A. ;
Ferretti, E. ;
Migliaccio, S. .
JOURNAL OF ENDOCRINOLOGICAL INVESTIGATION, 2021, 44 (07) :1363-1377
[8]   Machine learning models for predicting the risk factor of carotid plaque in cardiovascular disease [J].
Bin, Chengling ;
Li, Qin ;
Tang, Jing ;
Dai, Chaorong ;
Jiang, Ting ;
Xie, Xiufang ;
Qiu, Min ;
Chen, Lumiao ;
Yang, Shaorong .
FRONTIERS IN CARDIOVASCULAR MEDICINE, 2023, 10
[9]   CARDIOVASCULAR-DISEASE DEATH RATES IN COMMUNITIES WITH ELEVATED LEVELS OF BARIUM IN DRINKING-WATER [J].
BRENNIMAN, GR ;
NAMEKATA, T ;
KOJOLA, WH ;
CARNOW, BW ;
LEVY, PS .
ENVIRONMENTAL RESEARCH, 1979, 20 (02) :318-324
[10]   Environmental pollution and the global burden of disease [J].
Briggs, D .
BRITISH MEDICAL BULLETIN, 2003, 68 :1-24