An efficient medical image classification network based on multi-branch CNN, token grouping Transformer and mixer MLP

Cited by: 25
Authors
Liu, Shiwei [1]
Wang, Liejun [1]
Yue, Wenwen [1]
Affiliations
[1] Xinjiang Univ, Sch Comp Sci & Technol, Urumqi 830017, Xinjiang, Peoples R China
Funding
U.S. National Science Foundation;
Keywords
CNN; MLP; Transformer; Medical image classification;
DOI
10.1016/j.asoc.2024.111323
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
In recent years, medical image classification techniques based on deep learning have made remarkable achievements, but most current models sacrifice efficiency for performance improvements, which poses a great challenge in practical clinical applications. Meanwhile, Convolutional Neural Network (CNN)-based, Visual Transformer (ViT)-based and Multi-layer Perceptron (MLP)-based methods each have their own advantages and disadvantages in capturing local and global features of medical images, and there is no good method that combines the three to achieve a better trade-off between model scale and performance. To address these problems, we propose Eff-CTM: a hybrid, efficient medical image classification network based on multi-branch CNN, token grouping Transformer and mixer MLP. It combines the advantages of all three and uses a small number of parameters to classify pneumonia, colon cancer histopathology and dermatology images quickly and accurately. Eff-CTM uses an efficient CNN module with a multi-branch structure to learn local detail information in the shallow CNN stage of the network, and an efficient CNN and Transformer (ECT) module and an efficient MLP (EM) module in the middle stage of the network to extract local and global features. An efficient Transformer (ET) module is used in the final stage to fuse the rich feature information. We have conducted extensive experiments on three publicly available medical image classification datasets, and the experimental results show that our proposed Eff-CTM achieves a better trade-off between efficiency and performance than methods based on CNN, Transformer and MLP.
Pages: 16