CSMViT: A Lightweight Transformer and CNN fusion Network for Lymph Node Pathological Images Diagnosis

被引:1
作者
Jiang, Peihe [1 ]
Xu, Yukun [1 ]
Wang, Chunni [2 ]
Zhang, Wei [3 ]
Lu, Ning [3 ]
机构
[1] Yantai Univ, Sch Phys & Elect Informat, Yantai 264005, Peoples R China
[2] Shandong First Med Univ & Shandong Acad Med Sci, Shandong Canc Hosp & Inst, Dept Radiat Oncol, Jinan 250117, Peoples R China
[3] Yantaishan Hosp, Dept Pathol, Yantai 264003, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Pathology; Feature extraction; Computational modeling; Transformers; Computer vision; Accuracy; Lymph nodes; Image segmentation; Convolutional neural networks; Metastasis; Classification algorithms; Biomedical imaging; Classification; lightweight network; lymph node; transformer; pathological images;
D O I
10.1109/ACCESS.2024.3483769
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To address the burdensome and time-consuming nature of manual diagnosis of pathological sections, this study proposes an automated pathological image detection system. This system can directly detect pathological images and accurately locate lesion tissues, providing a reference for pathological diagnosis. We propose an improved MobileViT model for feature extraction in the system, which we have named CSMViT. Considering the complexity and multi-scale characteristics of pathological images, we made three significant modifications to the MobileViT model. First, the original MV2 module was replaced with an improved Ghost module to reduce the model's parameter count, enhance detection accuracy, and accelerate inference speed. Second, we improved the backbone structure of the network to achieve multi-scale feature learning, which not only further reduces the parameter count but also allows for more effective capture of features at different scales. Lastly, we introduced a new CSA module that can simultaneously accept two feature maps of different sizes as input. Through internal attention mechanisms and feature fusion, this module achieves cross-scale feature learning. Experimental results indicate that the CSMViT model achieved accuracy, F1-score, and specificity of 99.42%, 99.4%, and 99.6%, respectively. Additionally, the detection accuracy of CSMViT for the entire pathological image is 84%, representing an 8% improvement over the original network. Notably, the FLOPs of CSMViT is 1.461G, which is a 72.19% reduction compared to the original network, significantly decreasing the model's complexity. These results thoroughly demonstrate the effectiveness and substantial value of CSMViT in pathological image detection.
引用
收藏
页码:155365 / 155378
页数:14
相关论文
共 26 条
  • [1] Patterns and trends in esophageal cancer incidence and mortality in China: An analysis based on cancer registry data
    Chen, Ru
    Zheng, Rongshou
    Zhang, Siwei
    Wang, Shaoming
    Sun, Kexin
    Zeng, Hongmei
    Li, Li
    Wei, Wenqiang
    He, Jie
    [J]. JOURNAL OF THE NATIONAL CANCER CENTER, 2023, 3 (02): : 21 - 27
  • [2] Deep Metric Learning-Based for Multi-Target Few-Shot Pavement Distress Classification
    Dong, Hongwen
    Song, Kechen
    Wang, Qi
    Yan, Yunhui
    Jiang, Peng
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (03) : 1801 - 1810
  • [3] Dordevic D, 2024, AAAI CONF ARTIF INTE, P23477
  • [4] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
  • [5] Predictive factors for central lymph node and lateral cervical lymph node metastases in papillary thyroid carcinoma
    Feng, J. -W.
    Yang, X. -H.
    Wu, B. -Q.
    Sun, D. -L.
    Jiang, Y.
    Qu, Z.
    [J]. CLINICAL & TRANSLATIONAL ONCOLOGY, 2019, 21 (11) : 1482 - 1491
  • [6] Guo JL, 2024, Arxiv, DOI arXiv:2405.11582
  • [7] CMT: Convolutional Neural Networks Meet Vision Transformers
    Guo, Jianyuan
    Han, Kai
    Wu, Han
    Tang, Yehui
    Chen, Xinghao
    Wang, Yunhe
    Xu, Chang
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12165 - 12175
  • [8] GhostNet: More Features from Cheap Operations
    Han, Kai
    Wang, Yunhe
    Tian, Qi
    Guo, Jianyuan
    Xu, Chunjing
    Xu, Chang
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1577 - 1586
  • [9] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [10] Deep Multi-Magnification Networks for multi-class breast cancer image segmentation
    Ho, David Joon
    Yarlagadda, Dig V. K.
    D'Alfonso, Timothy M.
    Hanna, Matthew G.
    Grabenstetter, Anne
    Ntiamoah, Peter
    Brogi, Edi
    Tan, Lee K.
    Fuchs, Thomas J.
    [J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2021, 88