Evaluating Task-Specific Augmentations in Self-Supervised Pre-Training for 3D Medical Image Analysis

Times Cited: 1
Authors
Claessens, C. H. B. [1 ]
Hamm, J. J. M. [2 ]
Viviers, C. G. A. [1 ]
Nederend, J. [3 ]
Grunhagen, D. J. [2 ]
Tanis, P. J. [2 ]
de With, P. H. N. [1 ]
van der Sommen, F. [1 ]
Affiliations
[1] Eindhoven Univ Technol, Eindhoven, Netherlands
[2] Erasmus MC, Rotterdam, Netherlands
[3] Catharina Hosp, Eindhoven, Netherlands
Source
MEDICAL IMAGING 2024: IMAGE PROCESSING | 2024, Vol. 12926
Keywords
self-supervised learning; pre-training; medical imaging; three-dimensional; augmentations; self-distillation;
DOI
10.1117/12.3000850
CLC Number
R5 [Internal Medicine]
Subject Classification Codes
1002; 100201
Abstract
Self-supervised learning (SSL) has become a crucial approach for pre-training deep learning models in natural and medical image analysis. However, applying transformations designed for natural images to three-dimensional (3D) medical data poses challenges. This study explores the efficacy of specific augmentations in the context of self-supervised pre-training for volumetric medical images. A 3D non-contrastive framework is proposed for in-domain self-supervised pre-training on 3D gray-scale thorax CT data, incorporating four spatial and two intensity augmentations commonly used in 3D medical image analysis. The pre-trained models, adapted versions of ResNet-50 and Vision Transformer (ViT)-S, are evaluated on lung nodule classification and lung tumor segmentation tasks. The results indicate a significant impact of SSL, with a remarkable increase in AUC and DSC as compared to training from scratch. For classification, random scalings and random rotations play a fundamental role in achieving higher downstream performance, while intensity augmentations show limited contribution and may even degrade performance. For segmentation, random intensity histogram shifting enhances robustness, while other augmentations have marginal or negative impacts. These findings underscore the necessity of tailored data augmentations within SSL for medical imaging, emphasizing the importance of task-specific transformations for optimal model performance in complex 3D medical datasets.
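As an illustration of the augmentation families discussed in the abstract (random rotations, random scalings, and intensity histogram shifting), the following is a minimal sketch for a 3D CT patch, assuming NumPy/SciPy and a volume stored as a (D, H, W) float array. The function names, parameter ranges, and chaining order are illustrative assumptions and are not taken from the paper or its implementation.

```python
# Minimal sketch of 3D spatial and intensity augmentations (illustrative only).
import numpy as np
from scipy.ndimage import rotate, zoom


def random_rotation(volume, max_angle=15.0, rng=None):
    """Rotate the volume in the axial (H, W) plane by a random angle in degrees."""
    rng = rng or np.random.default_rng()
    angle = rng.uniform(-max_angle, max_angle)
    # reshape=False keeps the original volume shape; linear interpolation (order=1)
    return rotate(volume, angle, axes=(1, 2), reshape=False, order=1, mode="nearest")


def random_scaling(volume, scale_range=(0.9, 1.1), rng=None):
    """Isotropically rescale the volume, then centre-crop or zero-pad to the original shape."""
    rng = rng or np.random.default_rng()
    factor = rng.uniform(*scale_range)
    scaled = zoom(volume, factor, order=1)
    out = np.zeros_like(volume)
    # source slices: centre-crop if enlarged; destination slices: centre-pad if shrunk
    src = [slice(max(0, (s - t) // 2), max(0, (s - t) // 2) + min(s, t))
           for s, t in zip(scaled.shape, volume.shape)]
    dst = [slice(max(0, (t - s) // 2), max(0, (t - s) // 2) + min(s, t))
           for s, t in zip(scaled.shape, volume.shape)]
    out[tuple(dst)] = scaled[tuple(src)]
    return out


def random_intensity_shift(volume, max_shift=0.1, rng=None):
    """Shift the intensity histogram by a random offset (volume assumed normalized to [0, 1])."""
    rng = rng or np.random.default_rng()
    return volume + rng.uniform(-max_shift, max_shift)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    vol = rng.random((32, 64, 64), dtype=np.float32)  # stand-in for a thorax CT patch
    aug = random_intensity_shift(random_scaling(random_rotation(vol, rng=rng), rng=rng), rng=rng)
    print(aug.shape)  # (32, 64, 64)
```

In a non-contrastive SSL setup of the kind described here, two differently augmented views of the same volume would typically be produced by sampling these transforms independently and feeding both views to the student/teacher branches.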
Pages: 8