Evaluating Task-Specific Augmentations in Self-Supervised Pre-Training for 3D Medical Image Analysis

Times Cited: 1
Authors
Claessens, C. H. B. [1 ]
Hamm, J. J. M. [2 ]
Viviers, C. G. A. [1 ]
Nederend, J. [3 ]
Grunhagen, D. J. [2 ]
Tanis, P. J. [2 ]
de With, P. H. N. [1 ]
van der Sommen, F. [1 ]
Affiliations
[1] Eindhoven Univ Technol, Eindhoven, Netherlands
[2] Erasmus MC, Rotterdam, Netherlands
[3] Catharina Hosp, Eindhoven, Netherlands
Source
MEDICAL IMAGING 2024: IMAGE PROCESSING | 2024, Vol. 12926
Keywords
self-supervised learning; pre-training; medical imaging; three-dimensional; augmentations; self-distillation;
DOI
10.1117/12.3000850
CLC Number
R5 [Internal Medicine]
Subject Classification Codes
1002; 100201
Abstract
Self-supervised learning (SSL) has become a crucial approach for pre-training deep learning models in natural and medical image analysis. However, applying transformations designed for natural images to three-dimensional (3D) medical data poses challenges. This study explores the efficacy of specific augmentations in the context of self-supervised pre-training for volumetric medical images. A 3D non-contrastive framework is proposed for in-domain self-supervised pre-training on 3D gray-scale thorax CT data, incorporating four spatial and two intensity augmentations commonly used in 3D medical image analysis. The pre-trained models, adapted versions of ResNet-50 and Vision Transformer (ViT)-S, are evaluated on lung nodule classification and lung tumor segmentation tasks. The results indicate a significant impact of SSL, with a remarkable increase in AUC and DSC as compared to training from scratch. For classification, random scalings and random rotations play a fundamental role in achieving higher downstream performance, while intensity augmentations show limited contribution and may even degrade performance. For segmentation, random intensity histogram shifting enhances robustness, while other augmentations have marginal or negative impacts. These findings underscore the necessity of tailored data augmentations within SSL for medical imaging, emphasizing the importance of task-specific transformations for optimal model performance in complex 3D medical datasets.
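As an illustration of the augmentation families discussed in the abstract (random rotations, random scalings, and intensity histogram shifting), the following is a minimal sketch for a 3D CT patch, assuming NumPy/SciPy and a volume stored as a (D, H, W) float array. The function names, parameter ranges, and chaining order are illustrative assumptions and are not taken from the paper or its implementation.

```python
# Minimal sketch of 3D spatial and intensity augmentations (illustrative only).
import numpy as np
from scipy.ndimage import rotate, zoom


def random_rotation(volume, max_angle=15.0, rng=None):
    """Rotate the volume in the axial (H, W) plane by a random angle in degrees."""
    rng = rng or np.random.default_rng()
    angle = rng.uniform(-max_angle, max_angle)
    # reshape=False keeps the original volume shape; linear interpolation (order=1)
    return rotate(volume, angle, axes=(1, 2), reshape=False, order=1, mode="nearest")


def random_scaling(volume, scale_range=(0.9, 1.1), rng=None):
    """Isotropically rescale the volume, then centre-crop or zero-pad to the original shape."""
    rng = rng or np.random.default_rng()
    factor = rng.uniform(*scale_range)
    scaled = zoom(volume, factor, order=1)
    out = np.zeros_like(volume)
    # source slices: centre-crop if enlarged; destination slices: centre-pad if shrunk
    src = [slice(max(0, (s - t) // 2), max(0, (s - t) // 2) + min(s, t))
           for s, t in zip(scaled.shape, volume.shape)]
    dst = [slice(max(0, (t - s) // 2), max(0, (t - s) // 2) + min(s, t))
           for s, t in zip(scaled.shape, volume.shape)]
    out[tuple(dst)] = scaled[tuple(src)]
    return out


def random_intensity_shift(volume, max_shift=0.1, rng=None):
    """Shift the intensity histogram by a random offset (volume assumed normalized to [0, 1])."""
    rng = rng or np.random.default_rng()
    return volume + rng.uniform(-max_shift, max_shift)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    vol = rng.random((32, 64, 64), dtype=np.float32)  # stand-in for a thorax CT patch
    aug = random_intensity_shift(random_scaling(random_rotation(vol, rng=rng), rng=rng), rng=rng)
    print(aug.shape)  # (32, 64, 64)
```

In a non-contrastive SSL setup of the kind described here, two differently augmented views of the same volume would typically be produced by sampling these transforms independently and feeding both views to the student/teacher branches.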
Pages: 8