Multi-modal medical Transformers: A meta-analysis for medical image segmentation in oncology

被引:21
作者
Andrade-Miranda, Gustavo [1 ]
Jaouen, Vincent [1 ,2 ]
Tankyevych, Olena [1 ,3 ]
Le Rest, Catherine Cheze [1 ,3 ]
Visvikis, Dimitris [1 ]
Conze, Pierre-Henri [1 ,2 ]
机构
[1] Inserm, LaTIM UMR 1101, Brest, France
[2] IMT Atlantique, Brest, France
[3] Univ Hosp Poitiers, Nucl Med, Poitiers, France
关键词
Medical imaging; Multi-modality; Tumor segmentation; Vision transformers; CNN; Oncology; NEURAL-NETWORK;
D O I
10.1016/j.compmedimag.2023.102308
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Multi-modal medical image segmentation is a crucial task in oncology that enables the precise localization and quantification of tumors. The aim of this work is to present a meta-analysis of the use of multi-modal medical Transformers for medical image segmentation in oncology, specifically focusing on multi-parametric MR brain tumor segmentation (BraTS2021), and head and neck tumor segmentation using PET-CT images (HECKTOR2021). The multi-modal medical Transformer architectures presented in this work exploit the idea of modality interaction schemes based on visio-linguistic representations: (i) single-stream, where modalities are jointly processed by one Transformer encoder, and (ii) multiple-stream, where the inputs are encoded separately before being jointly modeled. A total of fourteen multi-modal architectures are evaluated using different ranking strategies based on dice similarity coefficient (DSC) and average symmetric surface distance (ASSD) metrics. In addition, cost indicators such as the number of trainable parameters and the number of multiply-accumulate operations (MACs) are reported. The results demonstrate that multi-path hybrid CNN Transformer-based models improve segmentation accuracy when compared to traditional methods, but come at the cost of increased computation time and potentially larger model size.
引用
收藏
页数:21
相关论文
共 100 条
[41]   CHAOS Challenge- combined (CT-MR) healthy abdominal organ segmentation [J].
Kavur, A. Emre ;
Gezer, N. Sinem ;
Baris, Mustafa ;
Aslan, Sinem ;
Conze, Pierre-Henri ;
Groza, Vladimir ;
Duc Duy Pham ;
Chatterjee, Soumick ;
Ernst, Philipp ;
Ozkan, Savas ;
Baydar, Bora ;
Lachinov, Dmitry ;
Han, Shuo ;
Pauli, Josef ;
Isensee, Fabian ;
Perkonigg, Matthias ;
Sathish, Rachana ;
Rajan, Ronnie ;
Sheet, Debdoot ;
Dovletov, Gurbandurdy ;
Speck, Oliver ;
Nurnberger, Andreas ;
Maier-Hein, Klaus H. ;
Akar, Gozde Bozdagi ;
Unal, Gozde ;
Dicle, Oguz ;
Selver, M. Alper .
MEDICAL IMAGE ANALYSIS, 2021, 69
[42]  
Kim W, 2021, PR MACH LEARN RES, V139
[43]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[44]   Automatic segmentation of organs-at-risk from head-and-neck CT using separable convolutional neural network with hard-region-weighted loss [J].
Lei, Wenhui ;
Mei, Haochen ;
Sun, Zhengwentai ;
Ye, Shan ;
Gu, Ran ;
Wang, Huan ;
Huang, Rui ;
Zhang, Shichuan ;
Zhang, Shaoting ;
Wang, Guotai .
NEUROCOMPUTING, 2021, 442 :184-199
[45]  
Li J., 2022, TransBTSV2: Towards Better and More Efficient Volumetric Segmentation of Medical Images
[46]  
Li J, 2022, Arxiv, DOI arXiv:2206.01136
[47]  
Li S., 2021, arXiv
[48]  
Li X., 2022, arXiv
[49]  
Lin A., 2021, arXiv
[50]  
Lin JW, 2022, Arxiv, DOI arXiv:2207.07370