GANDALF: Graph-based transformer and Data Augmentation Active Learning Framework with interpretable features for multi-label chest Xray classification

Cited by: 6

Authors:

Mahapatra, Dwarikanath [1, 4]

Bozorgtabar, Behzad [2, 3]

Ge, Zongyuan [4]

Reyes, Mauricio [5]

Affiliations:

[1] Incept Inst AI, Abu Dhabi, U Arab Emirates

[2] Ecole Polytech Fed Lausanne EPFL, Lausanne, Switzerland

[3] Lausanne Univ Hosp CHUV, Lausanne, Switzerland

[4] Monash Univ, Fac IT, Melbourne, Australia

[5] Univ Bern, ARTORG Ctr Biomed Engn Res, Bern, Switzerland

Source:

MEDICAL IMAGE ANALYSIS | 2024 / Vol. 93

Keywords:

Multi-label; Informative samples; Active learning; Data augmentation;

DOI:

10.1016/j.media.2023.103075

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Informative sample selection in an active learning (AL) setting helps a machine learning system attain optimum performance with minimum labeled samples, thus reducing annotation costs and boosting performance of computer-aided diagnosis systems in the presence of limited labeled data. Another effective technique to enlarge datasets in a small labeled data regime is data augmentation. An intuitive active learning approach thus consists of combining informative sample selection and data augmentation to leverage their respective advantages and improve the performance of AL systems. In this paper, we propose a novel approach called GANDALF (Graph-based TrANsformer and Data Augmentation Active Learning Framework) to combine sample selection and data augmentation in a multi-label setting. Conventional sample selection approaches in AL have mostly focused on the single-label setting where a sample has only one disease label. These approaches do not perform optimally when a sample can have multiple disease labels (e.g., in chest X-ray images). We improve upon state-of-the-art multi-label active learning techniques by representing disease labels as graph nodes and using graph attention transformers (GAT) to learn more effective inter-label relationships. We identify the most informative samples by aggregating GAT representations. Subsequently, we generate transformations of these informative samples by sampling from a learned latent space. From these generated samples, we identify informative samples via a novel multi-label informativeness score which, going beyond the state of the art, ensures that generated samples (i) are not redundant with respect to the training data and (ii) make important contributions to the training stage. We apply our method to two public chest X-ray datasets, as well as breast, dermatology, retina and kidney tissue microscopy MedMNIST datasets, and report improved results over state-of-the-art multi-label AL techniques in terms of model performance, learning rates, and robustness.
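
The idea of informative sample selection in a multi-label setting can be illustrated with a generic uncertainty baseline. The sketch below is not the GANDALF method: it uses no graph attention, no learned latent space, and not the paper's informativeness score. It only ranks unlabeled samples by the sum of per-label binary entropies, a common multi-label AL heuristic. The function names and the probability values in `pool` are hypothetical and chosen for illustration only.

```python
import math


def binary_entropy(p: float) -> float:
    """Entropy (in nats) of a Bernoulli variable with success probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0  # a fully certain prediction carries no uncertainty
    return -(p * math.log(p) + (1.0 - p) * math.log(1.0 - p))


def multilabel_uncertainty(probs: list[float]) -> float:
    """Sum of per-label binary entropies for one sample (assumed scoring rule)."""
    return sum(binary_entropy(p) for p in probs)


def select_informative(pool_probs: list[list[float]], budget: int) -> list[int]:
    """Return the indices of the `budget` most uncertain unlabeled samples."""
    scores = [multilabel_uncertainty(p) for p in pool_probs]
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:budget]


# Hypothetical predicted probabilities for four unlabeled images,
# each with three disease labels (values invented for this example).
pool = [
    [0.99, 0.01, 0.02],  # confident on every label
    [0.50, 0.45, 0.55],  # uncertain on every label
    [0.90, 0.50, 0.10],  # uncertain on one label
    [0.05, 0.95, 0.03],  # fairly confident
]
chosen = select_informative(pool, budget=2)
print(chosen)
```

In a full AL loop the selected indices would be sent for annotation, added to the labeled set, and the model retrained before the next selection round.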


Pages: 15

