Multi-modal medical Transformers: A meta-analysis for medical image segmentation in oncology

被引:21
作者
Andrade-Miranda, Gustavo [1 ]
Jaouen, Vincent [1 ,2 ]
Tankyevych, Olena [1 ,3 ]
Le Rest, Catherine Cheze [1 ,3 ]
Visvikis, Dimitris [1 ]
Conze, Pierre-Henri [1 ,2 ]
机构
[1] Inserm, LaTIM UMR 1101, Brest, France
[2] IMT Atlantique, Brest, France
[3] Univ Hosp Poitiers, Nucl Med, Poitiers, France
关键词
Medical imaging; Multi-modality; Tumor segmentation; Vision transformers; CNN; Oncology; NEURAL-NETWORK;
D O I
10.1016/j.compmedimag.2023.102308
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Multi-modal medical image segmentation is a crucial task in oncology that enables the precise localization and quantification of tumors. The aim of this work is to present a meta-analysis of the use of multi-modal medical Transformers for medical image segmentation in oncology, specifically focusing on multi-parametric MR brain tumor segmentation (BraTS2021), and head and neck tumor segmentation using PET-CT images (HECKTOR2021). The multi-modal medical Transformer architectures presented in this work exploit the idea of modality interaction schemes based on visio-linguistic representations: (i) single-stream, where modalities are jointly processed by one Transformer encoder, and (ii) multiple-stream, where the inputs are encoded separately before being jointly modeled. A total of fourteen multi-modal architectures are evaluated using different ranking strategies based on dice similarity coefficient (DSC) and average symmetric surface distance (ASSD) metrics. In addition, cost indicators such as the number of trainable parameters and the number of multiply-accumulate operations (MACs) are reported. The results demonstrate that multi-path hybrid CNN Transformer-based models improve segmentation accuracy when compared to traditional methods, but come at the cost of increased computation time and potentially larger model size.
引用
收藏
页数:21
相关论文
共 100 条
[1]  
Akbari H, 2021, ADV NEUR IN
[2]   ViViT: A Video Vision Transformer [J].
Arnab, Anurag ;
Dehghani, Mostafa ;
Heigold, Georg ;
Sun, Chen ;
Lucic, Mario ;
Schmid, Cordelia .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :6816-6826
[3]  
Baid U, 2021, Arxiv, DOI [arXiv:2107.02314, 10.48550/arXiv:2107.02314]
[4]   Data Descriptor: Advancing The Cancer Genome Atlas glioma MRI collections with expert segmentation labels and radiomic features [J].
Bakas, Spyridon ;
Akbari, Hamed ;
Sotiras, Aristeidis ;
Bilello, Michel ;
Rozycki, Martin ;
Kirby, Justin S. ;
Freymann, John B. ;
Farahani, Keyvan ;
Davatzikos, Christos .
SCIENTIFIC DATA, 2017, 4
[5]   STORM-GAN: Spatio-Temporal Meta-GAN for Cross-City Estimation of Human Mobility Responses to COVID- [J].
Bao, Han ;
Zhou, Xun ;
Xie, Yiqun ;
Li, Yanhua ;
Jia, Xiaowei .
2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, :1-10
[6]  
Bao R., 2023, bioRxiv
[7]   Attention Augmented Convolutional Networks [J].
Bello, Irwan ;
Zoph, Barret ;
Vaswani, Ashish ;
Shlens, Jonathon ;
Le, Quoc V. .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3285-3294
[8]   The Liver Tumor Segmentation Benchmark (LiTS) [J].
Bilic, Patrick ;
Christ, Patrick ;
Li, Hongwei Bran ;
Vorontsov, Eugene ;
Ben-Cohen, Avi ;
Kaissis, Georgios ;
Szeskin, Adi ;
Jacobs, Colin ;
Mamani, Gabriel Efrain Humpire ;
Chartrand, Gabriel ;
Lohoefer, Fabian ;
Holch, Julian Walter ;
Sommer, Wieland ;
Hofmann, Felix ;
Hostettler, Alexandre ;
Lev-Cohain, Naama ;
Drozdzal, Michal ;
Amitai, Michal Marianne ;
Vivanti, Refael ;
Sosna, Jacob ;
Ezhov, Ivan ;
Sekuboyina, Anjany ;
Navarro, Fernando ;
Kofler, Florian ;
Paetzold, Johannes C. ;
Shit, Suprosanna ;
Hu, Xiaobin ;
Lipkova, Jana ;
Rempfler, Markus ;
Piraud, Marie ;
Kirschke, Jan ;
Wiestler, Benedikt ;
Zhang, Zhiheng ;
Huelsemeyer, Christian ;
Beetz, Marcel ;
Ettlinger, Florian ;
Antonelli, Michela ;
Bae, Woong ;
Bellver, Miriam ;
Bi, Lei ;
Chen, Hao ;
Chlebus, Grzegorz ;
Dam, Erik B. ;
Dou, Qi ;
Fu, Chi-Wing ;
Georgescu, Bogdan ;
Giro-I-Nieto, Xavier ;
Gruen, Felix ;
Han, Xu ;
Heng, Pheng-Ann .
MEDICAL IMAGE ANALYSIS, 2023, 84
[9]  
Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
[10]   GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond [J].
Cao, Yue ;
Xu, Jiarui ;
Lin, Stephen ;
Wei, Fangyun ;
Hu, Han .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :1971-1980