Evaluation of uncertainty quantification methods in multi-label classification: A case study with automatic diagnosis of electrocardiogram

被引:16
作者
Barandas, Marilia [1 ,2 ]
Famiglini, Lorenzo [3 ]
Campagner, Andrea [3 ,4 ]
Folgado, Duarte [1 ,2 ]
Simao, Raquel [2 ]
Cabitza, Federico [3 ,4 ]
Gamboa, Hugo [1 ,2 ]
机构
[1] Assoc Fraunhofer Portugal Res, Rua Alfredo Allen 455-461, P-4200135 Porto, Portugal
[2] NOVA Sch Sci & Technol, LIBPhys Lab Instrumentat Biomed Engn & Radiat Phys, Campus Caparica, P-2829516 Caparica, Portugal
[3] Univ Milano Bicocca, Dept Informat Syst & Commun, Viale Sarca 336, I-20126 Milan, Italy
[4] IRCCS Ist Ortoped Galeazzi, Via Riccardo Galeazzi 4, I-20161 Milan, Italy
关键词
Artificial Intelligence; Uncertainty quantification; Multi-label classification; Cardiology; ARRHYTHMIA DETECTION; MODELS; NEED;
D O I
10.1016/j.inffus.2023.101978
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Artificial Intelligence (AI) use in automated Electrocardiogram (ECG) classification has continuously attracted the research community's interest, motivated by their promising results. Despite their great promise, limited attention has been paid to the robustness of their results, which is a key element for their implementation in clinical practice. Uncertainty Quantification (UQ) is a critical for trustworthy and reliable AI, particularly in safety-critical domains such as medicine. Estimating uncertainty in Machine Learning (ML) model predictions has been extensively used for Out-of-Distribution (OOD) detection under single-label tasks. However, the use of UQ methods in multi-label classification remains underexplored. This study goes beyond developing highly accurate models comparing five uncertainty quantification methods using the same Deep Neural Network (DNN) architecture across various validation scenarios, including internal and external validation as well as OOD detection, taking multi-label ECG classification as the example domain. We show the importance of external validation and its impact on classification performance, uncer-tainty estimates quality, and calibration. Ensemble-based methods yield more robust uncertainty estimations than single network or stochastic methods. Although current methods still have limitations in accurately quantifying uncertainty, particularly in the case of dataset shift, incorporating uncertainty estimates with a classification with a rejection option improves the ability to detect such changes. Moreover, we show that using uncertainty estimates as a criterion for sample selection in active learning setting results in greater improvements in classification performance compared to random sampling.
引用
收藏
页数:19
相关论文
共 103 条
[1]   UncertaintyFuseNet: Robust uncertainty-aware hierarchical feature fusion model with Ensemble Monte Carlo Dropout for COVID-19 detection [J].
Abdar, Moloud ;
Salari, Soorena ;
Qahremani, Sina ;
Lam, Hak-Keung ;
Karray, Fakhri ;
Hussain, Sadiq ;
Khosravi, Abbas ;
Acharya, U. Rajendra ;
Makarenkov, Vladimir ;
Nahavandi, Saeid .
INFORMATION FUSION, 2023, 90 :364-381
[2]   Uncertainty quantification in skin cancer classification using three-way decision-based Bayesian deep learning [J].
Abdar, Moloud ;
Samami, Maryam ;
Mahmoodabad, Sajjad Dehghani ;
Doan, Thang ;
Mazoure, Bogdan ;
Hashemifesharaki, Reza ;
Liu, Li ;
Khosravi, Abbas ;
Acharya, U. Rajendra ;
Makarenkov, Vladimir ;
Nahavandi, Saeid .
COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 135
[3]   A review of uncertainty quantification in deep learning: Techniques, applications and challenges [J].
Abdar, Moloud ;
Pourpanah, Farhad ;
Hussain, Sadiq ;
Rezazadegan, Dana ;
Liu, Li ;
Ghavamzadeh, Mohammad ;
Fieguth, Paul ;
Cao, Xiaochun ;
Khosravi, Abbas ;
Acharya, U. Rajendra ;
Makarenkov, Vladimir ;
Nahavandi, Saeid .
INFORMATION FUSION, 2021, 76 :243-297
[4]   ECG Heartbeat Classification Using Multimodal Fusion [J].
Ahmad, Zeeshan ;
Tabassum, Anika ;
Guan, Ling ;
Khan, Naimul Mefraz .
IEEE ACCESS, 2021, 9 :100615-100626
[5]   ECG heartbeat arrhythmias classification: a comparison study between different types of spectrum representation and convolutional neural networks architectures [J].
Alqudah, Ali Mohammad ;
Qazan, Shoroq ;
Al-Ebbini, Lina ;
Alquran, Hiam ;
Abu Qasmieh, Isam .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 13 (10) :4877-4907
[6]  
Nguyen A, 2015, PROC CVPR IEEE, P427, DOI 10.1109/CVPR.2015.7298640
[7]  
[Anonymous], 1990, NeurIPS, DOI 10.5555/2986766.2986882
[8]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[9]   Uncertainty-Aware Deep Learning-Based Cardiac Arrhythmias Classification Model of Electrocardiogram Signals [J].
Aseeri, Ahmad O. .
COMPUTERS, 2021, 10 (06)
[10]   Objective evaluation of deep uncertainty predictions for COVID-19 detection [J].
Asgharnezhad, Hamzeh ;
Shamsi, Afshar ;
Alizadehsani, Roohallah ;
Khosravi, Abbas ;
Nahavandi, Saeid ;
Sani, Zahra Alizadeh ;
Srinivasan, Dipti ;
Islam, Sheikh Mohammed Shariful .
SCIENTIFIC REPORTS, 2022, 12 (01)