GANDALF: Graph-based transformer and Data Augmentation Active Learning Framework with interpretable features for multi-label chest Xray classification

被引:6
作者
Mahapatra, Dwarikanath [1 ,4 ]
Bozorgtabar, Behzad [2 ,3 ]
Ge, Zongyuan [4 ]
Reyes, Mauricio [5 ]
机构
[1] Incept Inst AI, Abu Dhabi, U Arab Emirates
[2] Ecole Polytech Fed Lausanne EPFL, Lausanne, Switzerland
[3] Lausanne Univ Hosp CHUV, Lausanne, Switzerland
[4] Monash Univ, Fac IT, Melbourne, Australia
[5] Univ Bern, ARTORG Ctr Biomed Engn Res, Bern, Switzerland
关键词
Multi-label; Informative samples; Active learning; Data augmentation;
D O I
10.1016/j.media.2023.103075
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Informative sample selection in an active learning (AL) setting helps a machine learning system attain optimum performance with minimum labeled samples, thus reducing annotation costs and boosting performance of computer-aided diagnosis systems in the presence of limited labeled data. Another effective technique to enlarge datasets in a small labeled data regime is data augmentation. An intuitive active learning approach thus consists of combining informative sample selection and data augmentation to leverage their respective advantages and improve the performance of AL systems. In this paper, we propose a novel approach called GANDALF (Graph-based TrANsformer and Data Augmentation Active Learning Framework) to combine sample selection and data augmentation in a multi-label setting. Conventional sample selection approaches in AL have mostly focused on the single-label setting where a sample has only one disease label. These approaches do not perform optimally when a sample can have multiple disease labels (e.g., in chest X-ray images). We improve upon state-of-the-art multi-label active learning techniques by representing disease labels as graph nodes and use graph attention transformers (GAT) to learn more effective inter-label relationships. We identify the most informative samples by aggregating GAT representations. Subsequently, we generate transformations of these informative samples by sampling from a learned latent space. From these generated samples, we identify informative samples via a novel multi-label informativeness score, which beyond the state of the art, ensures that (i) generated samples are not redundant with respect to the training data and (ii) make important contributions to the training stage. We apply our method to two public chest X-ray datasets, as well as breast, dermatology, retina and kidney tissue microscopy MedMNIST datasets, and report improved results over state-of-the-art multi-label AL techniques in terms of model performance, learning rates, and robustness.
引用
收藏
页数:15
相关论文
共 73 条
[1]   A review of uncertainty quantification in deep learning: Techniques, applications and challenges [J].
Abdar, Moloud ;
Pourpanah, Farhad ;
Hussain, Sadiq ;
Rezazadegan, Dana ;
Liu, Li ;
Ghavamzadeh, Mohammad ;
Fieguth, Paul ;
Cao, Xiaochun ;
Khosravi, Abbas ;
Acharya, U. Rajendra ;
Makarenkov, Vladimir ;
Nahavandi, Saeid .
INFORMATION FUSION, 2021, 76 :243-297
[2]   Dataset of breast ultrasound images [J].
Al-Dhabyani, Walid ;
Gomaa, Mohammed ;
Khaled, Hussien ;
Fahmy, Aly .
DATA IN BRIEF, 2020, 28
[3]  
Alber M, 2019, J MACH LEARN RES, V20
[4]  
Ash Jordan T., 2020, Deep batch active learning by diverse, uncertain gradient lower bounds
[5]  
Ba J, 2014, ACS SYM SER
[6]  
Ba J.L., 2016, arXiv preprint arXiv:1607.06450, DOI DOI 10.48550/ARXIV.1607.06450
[7]  
Baek J., 2021, INT C LEARNING REPRE
[8]   Informative sample generation using class aware generative adversarial networks for classification of chest Xrays [J].
Bozorgtabar, Behzad ;
Mahapatra, Dwarikanath ;
von Teng, Hendrik ;
Pollinger, Alexander ;
Ebner, Lukas ;
Thiran, Jean-Phillipe ;
Reyes, Mauricio .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2019, 184 :57-65
[9]   A survey on active learning and human-in-the-loop deep learning for medical image analysis [J].
Budd, Samuel ;
Robinson, Emma C. ;
Kainz, Bernhard .
MEDICAL IMAGE ANALYSIS, 2021, 71
[10]  
Chen Liangyu, 2023, Medical Imaging with Deep Learning