Distilling Knowledge From an Ensemble of Vision Transformers for Improved Classification of Breast Ultrasound

Cited by: 4
Authors
Zhou, George [1 ]
Mosadegh, Bobak [2 ]
Affiliations
[1] Weill Cornell Med, New York, NY 10021 USA
[2] Weill Cornell Med, Dalio Inst Cardiovasc Imaging, Dept Radiol, New York, NY USA
Keywords
Breast ultrasound; Deep learning; Vision transformer; Ensemble learning; Knowledge distillation; NEURAL-NETWORK; CANCER; MAMMOGRAPHY;
DOI
10.1016/j.acra.2023.08.006
Chinese Library Classification number
R8 [Special Medicine]; R445 [Diagnostic Imaging];
Discipline classification code
1002 ; 100207 ; 1009 ;
Abstract
Rationale and Objectives: To develop a deep learning model for the automated classification of breast ultrasound images as benign or malignant. Specifically, the application of vision transformers, ensemble learning, and knowledge distillation to breast ultrasound classification is explored. Materials and Methods: Single-view, B-mode ultrasound images were curated from the publicly available Breast Ultrasound Image (BUSI) dataset, which has categorical ground truth labels (benign vs malignant) assigned by radiologists, with malignant cases confirmed by biopsy. The performance of vision transformers (ViT) is compared with that of convolutional neural networks (CNN), followed by a comparison between supervised, self-supervised, and randomly initialized ViTs. Subsequently, an ensemble of 10 independently trained ViTs, in which the ensemble output is the unweighted average of the outputs of the individual models, is compared with each ViT alone. Finally, a single ViT is trained to emulate the ensemble using knowledge distillation. Results: With models trained using five-fold cross-validation on this dataset, ViTs outperform CNNs, and self-supervised ViTs outperform supervised and randomly initialized ViTs. The ensemble model achieves an area under the receiver operating characteristic curve (AuROC) and an area under the precision-recall curve (AuPRC) of 0.977 and 0.965, respectively, on the test set, outperforming the average AuROC and AuPRC of the independently trained ViTs (0.958 +/- 0.05 and 0.931 +/- 0.016). The distilled ViT achieves an AuROC and AuPRC of 0.972 and 0.960. Conclusion: Transfer learning and ensemble learning each improve performance independently and can be combined sequentially to further improve the final model. Furthermore, a single vision transformer can be trained with knowledge distillation to match the performance of an ensemble of vision transformers.
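The two mechanisms named in the abstract, unweighted ensemble averaging and knowledge distillation, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation. The function names (`ensemble_average`, `distillation_loss`), the toy probabilities, and the choice of binary cross-entropy against the ensemble's soft outputs as the distillation objective are all assumptions made for illustration; the abstract does not state which loss was used.

```python
import math


def ensemble_average(member_probs):
    """Unweighted mean of per-model malignancy probabilities.

    member_probs: one inner list per model, each holding P(malignant)
    for the same images in the same order.
    """
    n_models = len(member_probs)
    n_images = len(member_probs[0])
    return [sum(m[i] for m in member_probs) / n_models for i in range(n_images)]


def distillation_loss(student_probs, teacher_probs, eps=1e-12):
    """Mean binary cross-entropy of the student against the teacher's
    soft labels (an assumed objective, not taken from the paper)."""
    total = 0.0
    for s, t in zip(student_probs, teacher_probs):
        s = min(max(s, eps), 1.0 - eps)  # clamp to avoid log(0)
        total += -(t * math.log(s) + (1.0 - t) * math.log(1.0 - s))
    return total / len(student_probs)


# Toy outputs from three hypothetical models on three images
# (illustrative values only, not results from the paper).
member_probs = [
    [0.9, 0.2, 0.6],
    [0.8, 0.1, 0.4],
    [0.7, 0.3, 0.5],
]
teacher = ensemble_average(member_probs)

# A student that reproduces the ensemble output attains the lowest
# possible loss; an uninformative student scores worse.
loss_matched = distillation_loss(teacher, teacher)
loss_uninformed = distillation_loss([0.5, 0.5, 0.5], teacher)
```

In a real training loop the student's probabilities would come from a single ViT and the loss would be minimized by gradient descent, so that one model approximates the averaged output of the ten-member ensemble at a fraction of the inference cost.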
Pages: 104-120
Number of pages: 17
Related papers
50 in total
  • [21] Breast Ultrasound Image BI-RADS Classification Based on Vision Transformer
    Wei, Yanbo
    Ye, Junbo
    Li, Xiaofeng
    Zhao, Yuanyuan
    Wang, Yanwei
    INTERNATIONAL JOURNAL OF MULTIPHYSICS, 2024, 18 (02) : 32 - 39
  • [22] A VISION TRANSFORMER NETWORK WITH WAVELET-BASED FEATURES FOR BREAST ULTRASOUND CLASSIFICATION
    He, Chenyang
    Diao, Yan
    Ma, Xingcong
    Yu, Shuo
    He, Xin
    Mao, Guochao
    Wei, Xinyu
    Zhang, Yu
    Zhao, Yang
    IMAGE ANALYSIS & STEREOLOGY, 2024, 43 (02) : 185 - 194
  • [23] Gaussian Dropout Based Stacked Ensemble CNN for Classification of Breast Tumor in Ultrasound Images
    Karthik, R.
    Menaka, R.
    Kathiresan, G. S.
    Anirudh, M.
    Nagharjun, M.
    IRBM, 2022, 43 (06) : 715 - 733
  • [24] Breast UltraSound Image classification using fuzzy-rank-based ensemble network
    Deb, Sagar Deep
    Jha, Rajib Kumar
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85
  • [25] Classification of Brain Tumor from Magnetic Resonance Imaging Using Vision Transformers Ensembling
    Tummala, Sudhakar
    Kadry, Seifedine
    Bukhari, Syed Ahmad Chan
    Rauf, Hafiz Tayyab
    CURRENT ONCOLOGY, 2022, 29 (10) : 7498 - 7511
  • [26] A VGG attention vision transformer network for benign and malignant classification of breast ultrasound images
    Qu, Xiaolei
    Lu, Hongyan
    Tang, Wenzhong
    Wang, Shuai
    Zheng, Dezhi
    Hou, Yaxin
    Jiang, Jue
    MEDICAL PHYSICS, 2022, 49 (09) : 5787 - 5798
  • [27] SEMIAUTOMATED BREAST CANCER CLASSIFICATION FROM ULTRASOUND VIDEO
    Bocchi, L.
    Gritti, F.
    Manfredi, C.
    Giannotti, E.
    Nori, J.
    2012 9TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2012, : 1112 - 1115
  • [28] Performance Analysis of Breast Cancer Classification from Mammogram Images Using Vision Transformer
    Borah, Naiwrita
    Varma, Sai Pratyush P.
    Datta, Ashis
    Kumar, Amish
    Baruah, Udayan
    Ghosal, Palash
    2022 IEEE CALCUTTA CONFERENCE, CALCON, 2022, : 238 - 243
  • [29] Optimal fusion of features from decomposed ultrasound RF data with adaptive weighted ensemble classifier to improve breast lesion classification
    Yao, Ruihan
    He, Bingbing
    Zhang, Yufeng
    Li, Zhiyao
    Zhu, Jingying
    Lang, Xun
    IMAGE AND VISION COMPUTING, 2024, 146
  • [30] Automated Detection and Classification of Mass from Breast Ultrasound Images
    Menon, Radhika V.
    Raha, Poulami
    Kothari, Shweta
    Chakraborty, Sumit
    Chakrabarti, Indrajit
    Karim, Rezaul
    2015 FIFTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2015,