ResMT: A hybrid CNN-transformer framework for glioma grading with 3D MRI

被引:5
作者
Cui, Honghao [1 ]
Ruan, Zhuoying [2 ]
Xu, Zhijian [1 ]
Luo, Xiao [1 ]
Dai, Jian [1 ]
Geng, Daoying [1 ,2 ]
机构
[1] Fudan Univ, Acad Engn & Technol, Shanghai 200433, Peoples R China
[2] Fudan Univ, Huashan Hosp, Dept Radiol, Shanghai 200040, Peoples R China
关键词
Glioma grading; Deep learning; Hybrid architecture; Attention mechanisms; Transformer; Magnetic resonance imaging; CENTRAL-NERVOUS-SYSTEM; NETWORK;
D O I
10.1016/j.compeleceng.2024.109745
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Accurate grading of gliomas is crucial for treatment strategies and prognosis. While convolutional neural networks (CNNs) have proven effective in classifying medical images, they struggle with capturing long-range dependencies among pixels. Transformer-based networks can address this issue, but CNN-based methods often perform better when trained on small datasets. Additionally, tumor segmentation is essential for classification models, but training an additional segmentation model significantly increases workload. To address these challenges, we propose ResMT, which combines CNN and transformer architectures for glioma grading, extracting both local and global features efficiently. Specifically, we designed a spatial residual module (SRM) where a 3D CNN captures glioma's volumetric complexity, and Swin UNETR, a pre-trained segmentation model, enhances the network without extra training. Our model also includes a multi-plane channel and spatial attention module (MCSA) to refine the analysis by focusing on critical features across multiple planes (axial, coronal, and sagittal). Transformer blocks establish long-range relationships among planes and slices. We evaluated ResMT on the BraTs19 dataset, comparing it with baselines and state-of-the-art models. Results demonstrate that ResMT achieves the highest prediction performance with an AUC of 0.9953, highlighting hybrid CNN-transformer models' potential for 3D MRI classification.
引用
收藏
页数:17
相关论文
共 50 条
[21]   A CNN-transformer hybrid approach for an intrusion detection system in advanced metering infrastructure [J].
Yao, Ruizhe ;
Wang, Ning ;
Chen, Peng ;
Ma, Di ;
Sheng, Xianjun .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (13) :19463-19486
[22]   Air Quality Assessment Based on CNN-Transformer Hybrid Architecture [J].
Zhang, Yuchen ;
Thinakaran, Rajermani .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (04) :273-279
[23]   A hybrid CNN-Transformer model for Historical Document Image Binarization [J].
Rezanezhad, Vahid ;
Baierer, Konstantin ;
Neudecker, Clemens .
PROCEEDINGS OF THE 2023 INTERNATIONAL WORKSHOP ON HISTORICAL DOCUMENT IMAGING AND PROCESSING, HIP 2023, 2023, :79-84
[24]   IC Packaging Material Identification via a Hybrid Deep Learning Framework with CNN-Transformer Bidirectional Interaction [J].
Zhang, Chengbin ;
Zhou, Xuankai ;
Cai, Nian ;
Zhou, Shuai ;
Wang, Han .
MICROMACHINES, 2024, 15 (03)
[25]   A 3D reconstruction method based on multi-views of contours segmented with CNN-transformer for long bones [J].
Ge, Yunfei ;
Zhang, Qing ;
Shen, Yidong ;
Sun, Yuantao ;
Huang, Chongyang .
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2022, 17 (10) :1891-1902
[26]   TractGraphFormer: Anatomically informed hybrid graph CNN-transformer network for interpretable sex and age prediction from diffusion MRI tractography [J].
Chen, Yuqian ;
Zhang, Fan ;
Wang, Meng ;
Zekelman, Leo R. ;
Cetin-Karayumak, Suheyla ;
Xue, Tengfei ;
Zhang, Chaoyi ;
Song, Yang ;
Rushmore, Jarrett ;
Makris, Nikos ;
Rathi, Yogesh ;
Cai, Weidong ;
O'Donnell, Lauren J. .
MEDICAL IMAGE ANALYSIS, 2025, 101
[27]   A 3D reconstruction method based on multi-views of contours segmented with CNN-transformer for long bones [J].
Yunfei Ge ;
Qing Zhang ;
Yidong Shen ;
Yuantao Sun ;
Chongyang Huang .
International Journal of Computer Assisted Radiology and Surgery, 2022, 17 :1891-1902
[28]   Polarformer: Optic Disc and Cup Segmentation Using a Hybrid CNN-Transformer and Polar Transformation [J].
Feng, Yaowei ;
Li, Zhendong ;
Yang, Dong ;
Hu, Hongkai ;
Guo, Hui ;
Liu, Hao .
APPLIED SCIENCES-BASEL, 2023, 13 (01)
[29]   Parkinson's Disease Recognition Using Hybrid CNN-Transformer Model [J].
Khushbu, Al-Nahiyan ;
Yang, Zhenxing ;
Liu, Yutao ;
Zhang, Xiaobo .
2024 4TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AUTOMATION, ROBOTICS AND CONTROL ENGINEERING, IARCE, 2024, :307-311
[30]   Rethinking Image Deblurring via CNN-Transformer Multiscale Hybrid Architecture [J].
Zhao, Qian ;
Yang, Hao ;
Zhou, Dongming ;
Cao, Jinde .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72