CSMViT: A Lightweight Transformer and CNN fusion Network for Lymph Node Pathological Images Diagnosis

被引：1

作者：

Jiang, Peihe ^{[1
]}

Xu, Yukun ^{[1
]}

Wang, Chunni ^{[2
]}

Zhang, Wei ^{[3
]}

Lu, Ning ^{[3
]}

机构：

[1] Yantai Univ, Sch Phys & Elect Informat, Yantai 264005, Peoples R China

[2] Shandong First Med Univ & Shandong Acad Med Sci, Shandong Canc Hosp & Inst, Dept Radiat Oncol, Jinan 250117, Peoples R China

[3] Yantaishan Hosp, Dept Pathol, Yantai 264003, Peoples R China

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Pathology; Feature extraction; Computational modeling; Transformers; Computer vision; Accuracy; Lymph nodes; Image segmentation; Convolutional neural networks; Metastasis; Classification algorithms; Biomedical imaging; Classification; lightweight network; lymph node; transformer; pathological images;

D O I：

10.1109/ACCESS.2024.3483769

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To address the burdensome and time-consuming nature of manual diagnosis of pathological sections, this study proposes an automated pathological image detection system. This system can directly detect pathological images and accurately locate lesion tissues, providing a reference for pathological diagnosis. We propose an improved MobileViT model for feature extraction in the system, which we have named CSMViT. Considering the complexity and multi-scale characteristics of pathological images, we made three significant modifications to the MobileViT model. First, the original MV2 module was replaced with an improved Ghost module to reduce the model's parameter count, enhance detection accuracy, and accelerate inference speed. Second, we improved the backbone structure of the network to achieve multi-scale feature learning, which not only further reduces the parameter count but also allows for more effective capture of features at different scales. Lastly, we introduced a new CSA module that can simultaneously accept two feature maps of different sizes as input. Through internal attention mechanisms and feature fusion, this module achieves cross-scale feature learning. Experimental results indicate that the CSMViT model achieved accuracy, F1-score, and specificity of 99.42%, 99.4%, and 99.6%, respectively. Additionally, the detection accuracy of CSMViT for the entire pathological image is 84%, representing an 8% improvement over the original network. Notably, the FLOPs of CSMViT is 1.461G, which is a 72.19% reduction compared to the original network, significantly decreasing the model's complexity. These results thoroughly demonstrate the effectiveness and substantial value of CSMViT in pathological image detection.

引用

页码：155365 / 155378

页数：14

共 26 条

[1] Patterns and trends in esophageal cancer incidence and mortality in China: An analysis based on cancer registry data
Chen, Ru
Zheng, Rongshou
Zhang, Siwei
Wang, Shaoming
Sun, Kexin
Zeng, Hongmei
Li, Li
Wei, Wenqiang
He, Jie
[J]. JOURNAL OF THE NATIONAL CANCER CENTER, 2023, 3 (02): : 21 - 27
[2] Deep Metric Learning-Based for Multi-Target Few-Shot Pavement Distress Classification
Dong, Hongwen
Song, Kechen
Wang, Qi
Yan, Yunhui
Jiang, Peng
[J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (03) : 1801 - 1810
[3] Dordevic D, 2024, AAAI CONF ARTIF INTE, P23477
[4] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[5] Predictive factors for central lymph node and lateral cervical lymph node metastases in papillary thyroid carcinoma
Feng, J. -W.
Yang, X. -H.
Wu, B. -Q.
Sun, D. -L.
Jiang, Y.
Qu, Z.
[J]. CLINICAL & TRANSLATIONAL ONCOLOGY, 2019, 21 (11) : 1482 - 1491
[6] Guo JL, 2024, Arxiv, DOI arXiv:2405.11582
[7] CMT: Convolutional Neural Networks Meet Vision Transformers
Guo, Jianyuan
Han, Kai
Wu, Han
Tang, Yehui
Chen, Xinghao
Wang, Yunhe
Xu, Chang
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12165 - 12175
[8] GhostNet: More Features from Cheap Operations
Han, Kai
Wang, Yunhe
Tian, Qi
Guo, Jianyuan
Xu, Chunjing
Xu, Chang
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1577 - 1586
[9] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[10] Deep Multi-Magnification Networks for multi-class breast cancer image segmentation
Ho, David Joon
Yarlagadda, Dig V. K.
D'Alfonso, Timothy M.
Hanna, Matthew G.
Grabenstetter, Anne
Ntiamoah, Peter
Brogi, Edi
Tan, Lee K.
Fuchs, Thomas J.
[J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2021, 88

← 1 2 3 →