Efficient UNet fusion of convolutional neural networks and state space models for medical image segmentation

被引:3
作者
Meng, Wenjie [1 ]
Mu, Aiming [1 ]
Wang, Huajun [2 ]
机构
[1] Chengdu Univ Technol, Coll Comp & Cyber Secur, Chengdu 610059, Sichuan, Peoples R China
[2] Chengdu Univ Technol, Coll Math & Phys, Chengdu 610059, Peoples R China
关键词
Cross fusion module; CNS-UNet; Lightweight attention gate; Medical image segmentation;
D O I
10.1016/j.dsp.2024.104937
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In current medical image segmentation research, convolutional neural networks (CNNs) excel in local feature extraction but struggle with global context modeling. Although the self-attention mechanism in Transformers effectively captures global dependencies, its quadratic time complexity limits its efficient application on largescale medical image datasets. To address these issues and develop a high-precision lightweight model, this study proposes an innovative model, CNS-UNet: Combined Neural Network and State Space Model in UNet, which integrates the State Space Model (SSM) architecture with CNNs. This combination allows the model to achieve effective global information modeling while maintaining low computational complexity. We adopt a U-shaped encoder-decoder framework, integrating newly developed Double Visual Space State (DVSS) and Residual Convolution (Res-Conv) modules as dual encoders for feature extraction. Additionally, we design a Cross-Fusion Module (CFM) to integrate global and local features from the dual encoders and incorporate a Lightweight Attention Gate (LAG) mechanism to enhance the recognition of key features and filter out irrelevant information. Experimental results show that CNS-UNet achieves mean Intersection over Union (mIoU) scores of 89.67%, 95.18%, and 87.96% on the Kvasir-SEG, CVC-ClinicDB, and ISIC2018 datasets, respectively, with a reduction in model parameters of 71.8% compared to UNet. These results validate the unique advantages of CNS-UNet in medical image segmentation tasks and highlight its potential for broad application.
引用
收藏
页数:11
相关论文
共 54 条
[1]   SAA-UNet: Spatial Attention and Attention Gate UNet for COVID-19 Pneumonia Segmentation from Computed Tomography [J].
Alshomrani, Shroog ;
Arif, Muhammad ;
Al Ghamdi, Mohammed A. .
DIAGNOSTICS, 2023, 13 (09)
[2]   A lightweight neural network with multiscale feature enhancement for liver CT segmentation [J].
Ansari, MohammedYusuf ;
Yang, Yin ;
Balakrishnan, Shidin ;
Abinahed, Julien ;
Al-Ansari, Abdulla ;
Warfa, Mohamed ;
Almokdad, Omran ;
Barah, Ali ;
Omer, Ahmed ;
Singh, AjayVikram ;
Meher, Pramod Kumar ;
Bhadra, Jolly ;
Halabi, Osama ;
Azampour, Mohammad Farid ;
Navab, Nassir ;
Wendler, Thomas ;
Dakua, Sarada Prasad .
SCIENTIFIC REPORTS, 2022, 12 (01)
[3]  
Archit A, 2024, Arxiv, DOI [arXiv:2404.07705, 10.48550/arXiv.2404.07705, DOI 10.48550/ARXIV.2404.07705]
[4]   WM-DOVA maps for accurate polyp highlighting in colonoscopy: Validation vs. saliency maps from physicians [J].
Bernal, Jorge ;
Javier Sanchez, F. ;
Fernandez-Esparrach, Gloria ;
Gil, Debora ;
Rodriguez, Cristina ;
Vilarino, Fernando .
COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2015, 43 :99-111
[5]   Developing Deep LSTMs With Later Temporal Attention for Predicting COVID-19 Severity, Clinical Outcome, and Antibody Level by Screening Serological Indicators Over Time [J].
Cai, Jiaxin ;
Li, Yang ;
Liu, Baichen ;
Wu, Zhixi ;
Zhu, Shengjun ;
Chen, Qiliang ;
Lei, Qing ;
Hou, Hongyan ;
Guo, Zhibin ;
Jiang, Hewei ;
Guo, Shujuan ;
Wang, Feng ;
Huang, Shengjing ;
Zhu, Shunzhi ;
Fan, Xionglin ;
Tao, Shengce .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (07) :4204-4215
[6]   Deep historical long short-term memory network for action recognition [J].
Cai, Jiaxin ;
Hu, Junlin ;
Tang, Xin ;
Hung, Tzu-Yi ;
Tan, Yap-Peng .
NEUROCOMPUTING, 2020, 407 :428-438
[7]  
Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
[8]  
Chen J., 2021, arXiv
[9]  
Codella N, 2019, Arxiv, DOI arXiv:1902.03368
[10]   Optimization of U-shaped pure transformer medical image segmentation network [J].
Dan, Yongping ;
Jin, Weishou ;
Wang, Zhida ;
Sun, Changhao .
PEERJ COMPUTER SCIENCE, 2023, 9