An explainable machine learning approach for automated medical decision support of heart disease

Cited by: 5
Authors
Mesquita, Francisco [1]
Marques, Goncalo [1]
Affiliations
[1] Polytech Inst Coimbra, Technol & Management Sch Oliveira Do Hosp, Rua Gen Santos Costa, P-3400124 Oliveira Do Hosp, Portugal
Keywords
Coronary heart disease; Disease prediction; Interpretation; Machine learning; SHAP method; Random forest; Care
DOI
10.1016/j.datak.2024.102339
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Coronary Heart Disease (CHD) is the leading cause of mortality worldwide. Every year, it causes about 3.9 million deaths in Europe and 1.8 million in the European Union (EU). It is responsible for 45% and 37% of all deaths in Europe and the EU, respectively. Using machine learning (ML) to predict heart disease is one of the most promising research topics, as it can improve healthcare and consequently increase people's longevity. However, although the ability to interpret the results of a predictive model is essential, most related studies do not propose explainable methods. To address this problem, this paper presents a classification method that not only exhibits reliable performance but is also interpretable, ensuring transparency in its decision-making process. SHapley Additive exPlanations (SHAP) was chosen for model interpretability. The paper compares different classifiers and parameter tuning techniques, providing all the details necessary to replicate the experiment and help future researchers working in the field. The proposed model achieves performance similar to that of models proposed in the literature, and its predictions are fully interpretable.
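
The idea behind Shapley-based explanations mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline and does not use the SHAP library; it computes exact Shapley values by brute force for a single prediction. The scoring function `toy_risk`, the feature names, the patient values, and the baseline are all hypothetical and chosen only for illustration.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, instance, baseline):
    """Exact Shapley values of one prediction relative to a baseline input.

    Features outside a coalition are replaced by their baseline value,
    which is one common (assumed) way to 'remove' a feature.
    """
    names = list(instance)
    n = len(names)

    def value(coalition):
        # Coalition members take the instance's values; the rest stay at baseline.
        mixed = {k: (instance[k] if k in coalition else baseline[k]) for k in names}
        return predict(mixed)

    phi = {}
    for feature in names:
        others = [k for k in names if k != feature]
        total = 0.0
        for size in range(n):
            # Shapley weight for coalitions of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                total += weight * (value(s | {feature}) - value(s))
        phi[feature] = total
    return phi


def toy_risk(x):
    # Hypothetical risk score with one interaction term (age x smoker).
    return (0.02 * x["age"] + 0.01 * x["cholesterol"]
            + 0.5 * x["smoker"] + 0.001 * x["age"] * x["smoker"])


# Hypothetical patient and reference (baseline) inputs.
patient = {"age": 60, "cholesterol": 250, "smoker": 1}
baseline = {"age": 50, "cholesterol": 200, "smoker": 0}

phi = shapley_values(toy_risk, patient, baseline)
for name, contribution in phi.items():
    print(f"{name}: {contribution:+.3f}")
```

The per-feature contributions sum to the difference between the patient's score and the baseline score, which is the additivity property that makes such explanations easy to read. Brute-force enumeration grows exponentially with the number of features, so practical tools rely on approximations or model-specific algorithms.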
Pages: 15