RM-UNet: UNet-like Mamba with rotational SSM module for medical image segmentation

被引:3
|
作者
Tang, Hao [1 ]
Huang, Guoheng [1 ]
Cheng, Lianglun [1 ]
Yuan, Xiaochen [2 ]
Tao, Qi [3 ]
Chen, Xuhang [4 ]
Zhong, Guo [5 ]
Yang, Xiaohui [6 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Peoples R China
[2] Macao Polytech Univ, Fac Appl Sci, Macau 999078, Peoples R China
[3] Guangdong Technion Israel Inst Technol, Dept Mech Engn Robot, Shantou 515063, Peoples R China
[4] Huizhou Univ, Sch Comp Sci & Engn, Huizhou 516007, Peoples R China
[5] Guangdong Univ Foreign Studies, Sch Informat Sci & Technol, Guangzhou 510006, Peoples R China
[6] Sun Yat sen Univ, Affiliated Hosp 3, Dept Gynecol, Guangzhou, Peoples R China
关键词
U-Net; State Space Models; Medical image segmentation; Mamba; LSIL; U-NET ARCHITECTURE; TRANSFORMER;
D O I
10.1007/s11760-024-03484-8
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Accurate segmentation of tissues and lesions is crucial for disease diagnosis, treatment planning, and surgical navigation. Yet, the complexity of medical images presents significant challenges for traditional Convolutional Neural Networks and Transformer models due to their limited receptive fields or high computational complexity. State Space Models (SSMs) have recently shown notable vision performance, particularly Mamba and its variants. However, their feature extraction methods may not be sufficiently effective and retain some redundant structures, leaving room for parameter reduction. In response to these challenges, we introduce a methodology called Rotational Mamba-UNet, characterized by Residual Visual State Space (ResVSS) block and Rotational SSM Module. The ResVSS block is devised to mitigate network degradation caused by the diminishing efficacy of information transfer from shallower to deeper layers. Meanwhile, the Rotational SSM Module is devised to tackle the challenges associated with channel feature extraction within State Space Models. Finally, we propose a weighted multi-level loss function, which fully leverages the outputs of the decoder's three stages for supervision. We conducted experiments on ISIC17, ISIC18, CVC-300, Kvasir-SEG, CVC-ColonDB, Kvasir-Instrument datasets, and Low-grade Squamous Intraepithelial Lesion datasets provided by The Third Affiliated Hospital of Sun Yat-sen University, demonstrating the superior segmentation performance of our proposed RM-UNet. Additionally, compared to the previous VM-UNet, our model achieves a one-third reduction in parameters. Our code is available at https://github.com/Halo2Tang/RM-UNet.
引用
收藏
页码:8427 / 8443
页数:17
相关论文
共 50 条
  • [21] TCI-UNet: transformer-CNN interactive module for medical image segmentation
    Bian, Xuan
    Wang, Guanglei
    Li, Yan
    Wang, Hongrui
    BIOMEDICAL OPTICS EXPRESS, 2023, 14 (11) : 5904 - 5920
  • [22] EGCM-UNet: Edge Guided Hybrid CNN-Mamba UNet for farmland remote sensing image semantic segmentation
    Zheng, Jianhua
    Fu, Yusha
    Chen, Xiaohan
    Zhao, Ruolin
    Lu, Junde
    Zhao, Huanghui
    Chen, Qian
    GEOCARTO INTERNATIONAL, 2025, 40 (01)
  • [23] CSWin-UNet: Transformer UNet with cross-shaped windows for medical image segmentation
    Liu, Xiao
    Gao, Peng
    Yu, Tao
    Wang, Fei
    Yuan, Ru-Yue
    INFORMATION FUSION, 2025, 113
  • [24] DEA-UNet: a dense-edge-attention UNet architecture for medical image segmentation
    Zeng, Zhenhuan
    Fan, Chaodong
    Xiao, Leyi
    Qu, Xilong
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
  • [25] MLFA-UNet: A multi-level feature assembly UNet for medical image segmentation
    Garbaz, Anass
    Oukdacha, Yassine
    Charfi, Said
    El Ansari, Mohamed
    Koutti, Lahcen
    Salihoun, Mouna
    METHODS, 2024, 232 : 52 - 64
  • [26] UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery
    Wang, Libo
    Li, Rui
    Zhang, Ce
    Fang, Shenghui
    Duan, Chenxi
    Meng, Xiaoliang
    Atkinson, Peter M.
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 190 : 196 - 214
  • [27] EMED-UNet: An Efficient Multi-Encoder-Decoder Based UNet for Medical Image Segmentation
    Shah, Kashish D.
    Patel, Dhaval K.
    Thaker, Minesh P.
    Patel, Harsh A.
    Saikia, Manob Jyoti
    Ranger, Bryan J.
    IEEE ACCESS, 2023, 11 : 95253 - 95266
  • [28] TransCUNet: UNet cross fused transformer for medical image segmentation
    Jiang, Shen
    Li, Jinjiang
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 150
  • [29] SMESwin Unet: Merging CNN and Transformer for Medical Image Segmentation
    Wang, Ziheng
    Min, Xiongkuo
    Shi, Fangyu
    Jin, Ruinian
    Nawrin, Saida S.
    Yu, Ichen
    Nagatomi, Ryoichi
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 517 - 526
  • [30] Light-UNet: An Efficient Segmentation Network for Medical Image
    Zhang, Yue
    Xu, Chao
    Zhang, Zhifan
    Wang, Jianjun
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14867 : 302 - 313