An explainable machine learning approach for automated medical decision support of heart disease

被引:4
作者
Mesquita, Francisco [1 ]
Marques, Goncalo [1 ]
机构
[1] Polytech Inst Coimbra, Technol & Management Sch Oliveira Do Hosp, Rua Gen Santos Costa, P-3400124 Oliveira Do Hosp, Portugal
关键词
Coronary heart disease; Disease prediction; Interpretation; Machine learning; SHAP method; RANDOM FOREST; CARE;
D O I
10.1016/j.datak.2024.102339
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Coronary Heart Disease (CHD) is the dominant cause of mortality around the world. Every year, it causes about 3.9 million deaths in Europe and 1.8 million in the European Union (EU). It is responsible for 45 % and 37 % of all deaths in Europe and the European Union, respectively. Using machine learning (ML) to predict heart diseases is one of the most promising research topics, as it can improve healthcare and consequently increase the longevity of people 's lives. However, although the ability to interpret the results of the predictive model is essential, most of the related studies do not propose explainable methods. To address this problem, this paper presents a classification method that not only exhibits reliable performance but is also interpretable, ensuring transparency in its decision-making process. SHapley Additive exPlanations, known as the SHAP method was chosen for model interpretability. This approach presents a comparison between different classifiers and parameter tuning techniques, providing all the details necessary to replicate the experiment and help future researchers working in the field. The proposed model achieves similar performance to those proposed in the literature, and its predictions are fully interpretable.
引用
收藏
页数:15
相关论文
共 88 条
[1]   Estimating the reproducibility of psychological science [J].
Aarts, Alexander A. ;
Anderson, Joanna E. ;
Anderson, Christopher J. ;
Attridge, Peter R. ;
Attwood, Angela ;
Axt, Jordan ;
Babel, Molly ;
Bahnik, Stepan ;
Baranski, Erica ;
Barnett-Cowan, Michael ;
Bartmess, Elizabeth ;
Beer, Jennifer ;
Bell, Raoul ;
Bentley, Heather ;
Beyan, Leah ;
Binion, Grace ;
Borsboom, Denny ;
Bosch, Annick ;
Bosco, Frank A. ;
Bowman, Sara D. ;
Brandt, Mark J. ;
Braswell, Erin ;
Brohmer, Hilmar ;
Brown, Benjamin T. ;
Brown, Kristina ;
Bruening, Jovita ;
Calhoun-Sauls, Ann ;
Callahan, Shannon P. ;
Chagnon, Elizabeth ;
Chandler, Jesse ;
Chartier, Christopher R. ;
Cheung, Felix ;
Christopherson, Cody D. ;
Cillessen, Linda ;
Clay, Russ ;
Cleary, Hayley ;
Cloud, Mark D. ;
Cohn, Michael ;
Cohoon, Johanna ;
Columbus, Simon ;
Cordes, Andreas ;
Costantini, Giulio ;
Alvarez, Leslie D. Cramblet ;
Cremata, Ed ;
Crusius, Jan ;
DeCoster, Jamie ;
DeGaetano, Michelle A. ;
Della Penna, Nicolas ;
den Bezemer, Bobby ;
Deserno, Marie K. .
SCIENCE, 2015, 349 (6251)
[2]  
Abdollahi B, 2018, HUM-COMPUT INT-SPRIN, P21, DOI 10.1007/978-3-319-90403-0_2
[3]  
Ahmad MA, 2018, ACM-BCB'18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, P559, DOI [10.1145/3233547.3233667, 10.1109/ICHI.2018.00095]
[4]   Predictive modelling for solar thermal energy systems: A comparison of support vector regression, random forest, extra trees and regression trees [J].
Ahmad, Muhammad Waseem ;
Reynolds, Jonathan ;
Rezgui, Yacine .
JOURNAL OF CLEANER PRODUCTION, 2018, 203 :810-821
[5]   Machine-Learning-Based Disease Diagnosis: A Comprehensive Review [J].
Ahsan, Md Manjurul ;
Luna, Shahana Akter ;
Siddique, Zahed .
HEALTHCARE, 2022, 10 (03)
[6]   Optuna: A Next-generation Hyperparameter Optimization Framework [J].
Akiba, Takuya ;
Sano, Shotaro ;
Yanase, Toshihiko ;
Ohta, Takeru ;
Koyama, Masanori .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :2623-2631
[7]  
Alalawi H, 2021, Detection of cardiovascular disease using machine learning classification models, V10, P151
[8]   Data preprocessing in predictive data mining [J].
Alexandropoulos, Stamatios-Aggelos N. ;
Kotsiantis, Sotiris B. ;
Vrahatis, Michael N. .
KNOWLEDGE ENGINEERING REVIEW, 2019, 34
[9]  
Ali M, 2020, PyCaret: An open source, low-code machine learning library in Python
[10]   Machine learning-based coronary artery disease diagnosis: A comprehensive review [J].
Alizadehsani, Roohallah ;
Abdar, Moloud ;
Roshanzamir, Mohamad ;
Khosravi, Abbas ;
Kebria, Parham M. ;
Khozeimeh, Fahime ;
Nahavandi, Saeid ;
Sarrafzadegan, Nizal ;
Acharya, U. Rajendra .
COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 111