Comparison of fine-tuning strategies for transfer learning in medical image classification

被引：15

作者：

Davila, Ana ^{[1
]}

Colan, Jacinto ^{[2
]}

Hasegawa, Yasuhisa ^{[1
]}

机构：

[1] Nagoya Univ, Inst Innovat Future Soc, Furo Cho,Chikusa Ku, Nagoya, Aichi 4648601, Japan

[2] Nagoya Univ, Dept Micronano Mech Sci & Engn, Furo Cho,Chikusa Ku, Nagoya, Aichi 4648603, Japan

来源：

IMAGE AND VISION COMPUTING | 2024年 / 146卷

基金：

日本学术振兴会;

关键词：

Medical image analysis; Fine-tuning; Transfer learning; Convolutional neural network; Image classification;

D O I：

10.1016/j.imavis.2024.105012

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the context of medical imaging and machine learning, one of the most pressing challenges is the effective adaptation of pre-trained models to specialized medical contexts. Despite the availability of advanced pre-trained models, their direct application to the highly specialized and diverse field of medical imaging often falls short due to the unique characteristics of medical data. This study provides a comprehensive analysis on the performance of various fine-tuning methods applied to pre-trained models across a spectrum of medical imaging domains, including X-ray, MRI, Histology, Dermoscopy, and Endoscopic surgery. We evaluated eight fine-tuning strategies, including standard techniques such as fine-tuning all layers or fine-tuning only the classifier layers, alongside methods such as gradually unfreezing layers, regularization based fine-tuning and adaptive learning rates. We selected three well-established CNN architectures (ResNet-50, DenseNet-121, and VGG-19) to cover a range of learning and feature extraction scenarios. Although our results indicate that the efficacy of these finetuning methods significantly varies depending on both the architecture and the medical imaging type, strategies such as combining Linear Probing with Full Fine-tuning resulted in notable improvements in over 50% of the evaluated cases, demonstrating general effectiveness across medical domains. Moreover, Auto-RGN, which dynamically adjusts learning rates, led to performance enhancements of up to 11% for specific modalities. Additionally, the DenseNet architecture showed more pronounced benefits from alternative fine-tuning approaches compared to traditional full fine-tuning. This work not only provides valuable insights for optimizing pre-trained models in medical image analysis but also suggests the potential for future research into more advanced architectures and fine-tuning methods.

引用

页数：17

共 87 条

[1] Deep transfer learning-based automated detection of COVID-19 from lung CT scan slices [J].

Ahuja, Sakshi ;

Panigrahi, Bijaya Ketan ;

Dey, Nilanjan ;

Rajinikanth, Venkatesan ;

Gandhi, Tapan Kumar .

APPLIED INTELLIGENCE, 2021, 51 (01) :571-585

[2] Covid-19: automatic detection from X-ray images utilizing transfer learning with convolutional neural networks [J].

Apostolopoulos, Ioannis D. ;

Mpesiana, Tzani A. .

PHYSICAL AND ENGINEERING SCIENCES IN MEDICINE, 2020, 43 (02) :635-640

[3] BACH: Grand challenge on breast cancer histology images [J].

Aresta, Guilherme ;

Araujo, Teresa ;

Kwok, Scotty ;

Chennamsetty, Sai Saketh ;

Safwan, Mohammed ;

Alex, Varghese ;

Marami, Bahram ;

Prastawa, Marcel ;

Chan, Monica ;

Donovan, Michael ;

Fernandez, Gerardo ;

Zeineh, Jack ;

Kohl, Matthias ;

Walz, Christoph ;

Ludwig, Florian ;

Braunewell, Stefan ;

Baust, Maximilian ;

Quoc Dang Vu ;

Minh Nguyen Nhat To ;

Kim, Eal ;

Kwak, Jin Tae ;

Galal, Sameh ;

Sanchez-Freire, Veronica ;

Brancati, Nadia ;

Frucci, Maria ;

Riccio, Daniel ;

Wang, Yaqi ;

Sun, Lingling ;

Ma, Kaiqiang ;

Fang, Jiannan ;

Kone, Ismael ;

Boulmane, Lahsen ;

Campilho, Aurelio ;

Eloy, Catarina ;

Polonia, Antonio ;

Aguiar, Paulo .

MEDICAL IMAGE ANALYSIS, 2019, 56 :122-139

[4]

Arjovsky M, 2020, Arxiv, DOI [arXiv:1907.02893, 10.48550/arXiv.1907.02893]

[5] Automated classification of Alzheimer's disease and mild cognitive impairment using a single MRI and deep neural networks [J].

Basaia, Silvia ;

Agosta, Federica ;

Wagner, Luca ;

Canu, Elisa ;

Magnani, Giuseppe ;

Santangelo, Roberto ;

Filippi, Massimo .

NEUROIMAGE-CLINICAL, 2019, 21

[6] Transfer Learning for Cell Nuclei Classification in Histopathology Images [J].

Bayramoglu, Neslihan ;

Heikkila, Janne .

COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III, 2016, 9915 :532-539

[7] Understanding Robustness of Transformers for Image Classification [J].

Bhojanapalli, Srinadh ;

Chakrabarti, Ayan ;

Glasner, Daniel ;

Li, Daliang ;

Unterthiner, Thomas ;

Veit, Andreas .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :10211-10221

[8] PadChest: A large chest x-ray image dataset with multi-label annotated reports [J].

Bustos, Aurelia ;

Pertusa, Antonio ;

Salinas, Jose-Maria ;

de la Iglesia-Vaya, Maria .

MEDICAL IMAGE ANALYSIS, 2020, 66

[9] Online Continual Learning with Natural Distribution Shifts: An Empirical Study with Visual Data [J].

Cai, Zhipeng ;

Sener, Ozan ;

Koltun, Vladlen .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :8261-8270

[10] Analysis of the ISIC image datasets: Usage, benchmarks and recommendations [J].

Cassidy, Bill ;

Kendrick, Connah ;

Brodzicki, Andrzej ;

Jaworek-Korjakowska, Joanna ;

Yap, Moi Hoon .

MEDICAL IMAGE ANALYSIS, 2022, 75

← 1 2 3 4 5 6 7 8 9 →