RM-UNet: UNet-like Mamba with rotational SSM module for medical image segmentation

被引:3
|
作者
Tang, Hao [1 ]
Huang, Guoheng [1 ]
Cheng, Lianglun [1 ]
Yuan, Xiaochen [2 ]
Tao, Qi [3 ]
Chen, Xuhang [4 ]
Zhong, Guo [5 ]
Yang, Xiaohui [6 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Peoples R China
[2] Macao Polytech Univ, Fac Appl Sci, Macau 999078, Peoples R China
[3] Guangdong Technion Israel Inst Technol, Dept Mech Engn Robot, Shantou 515063, Peoples R China
[4] Huizhou Univ, Sch Comp Sci & Engn, Huizhou 516007, Peoples R China
[5] Guangdong Univ Foreign Studies, Sch Informat Sci & Technol, Guangzhou 510006, Peoples R China
[6] Sun Yat sen Univ, Affiliated Hosp 3, Dept Gynecol, Guangzhou, Peoples R China
关键词
U-Net; State Space Models; Medical image segmentation; Mamba; LSIL; U-NET ARCHITECTURE; TRANSFORMER;
D O I
10.1007/s11760-024-03484-8
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Accurate segmentation of tissues and lesions is crucial for disease diagnosis, treatment planning, and surgical navigation. Yet, the complexity of medical images presents significant challenges for traditional Convolutional Neural Networks and Transformer models due to their limited receptive fields or high computational complexity. State Space Models (SSMs) have recently shown notable vision performance, particularly Mamba and its variants. However, their feature extraction methods may not be sufficiently effective and retain some redundant structures, leaving room for parameter reduction. In response to these challenges, we introduce a methodology called Rotational Mamba-UNet, characterized by Residual Visual State Space (ResVSS) block and Rotational SSM Module. The ResVSS block is devised to mitigate network degradation caused by the diminishing efficacy of information transfer from shallower to deeper layers. Meanwhile, the Rotational SSM Module is devised to tackle the challenges associated with channel feature extraction within State Space Models. Finally, we propose a weighted multi-level loss function, which fully leverages the outputs of the decoder's three stages for supervision. We conducted experiments on ISIC17, ISIC18, CVC-300, Kvasir-SEG, CVC-ColonDB, Kvasir-Instrument datasets, and Low-grade Squamous Intraepithelial Lesion datasets provided by The Third Affiliated Hospital of Sun Yat-sen University, demonstrating the superior segmentation performance of our proposed RM-UNet. Additionally, compared to the previous VM-UNet, our model achieves a one-third reduction in parameters. Our code is available at https://github.com/Halo2Tang/RM-UNet.
引用
收藏
页码:8427 / 8443
页数:17
相关论文
共 50 条
  • [1] ConvWin-UNet: UNet-like hierarchical vision Transformer combined with convolution for medical image segmentation
    Feng, Xiaomeng
    Wang, Taiping
    Yang, Xiaohang
    Zhang, Minfei
    Guo, Wanpeng
    Wang, Weina
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (01) : 128 - 144
  • [2] LKM-UNet: Large Kernel Vision Mamba UNet for Medical Image Segmentation
    Wang, Jinhong
    Chen, Jintai
    Chen, Danny
    Wu, Jian
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VIII, 2024, 15008 : 360 - 370
  • [3] Vision Mamba and xLSTM-UNet for medical image segmentation
    Zhong, Xin
    Lu, Gehao
    Li, Hao
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [4] VM-UNET-V2: Rethinking Vision Mamba UNet for Medical Image Segmentation
    Zhang, Mingya
    Yu, Yue
    Jin, Sun
    Gu, Limei
    Ling, Tingsheng
    Tao, Xianping
    BIOINFORMATICS RESEARCH AND APPLICATIONS, PT I, ISBRA 2024, 2024, 14954 : 335 - 346
  • [5] UTR: A UNet-like transformer for efficient unsupervised medical image registration
    Qiu, Wei
    Xiong, Lianjin
    Li, Ning
    Wang, Yaobin
    Zhang, Yangsong
    IMAGE AND VISION COMPUTING, 2024, 150
  • [6] UNetMamba: An Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images
    Zhu, Enze
    Chen, Zhan
    Wang, Dingkai
    Shi, Hanru
    Liu, Xiaoxuan
    Wang, Lei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [7] TMU: Transmission-Enhanced Mamba-UNet for Medical Image Segmentation
    Yang, Xiongfeng
    Luo, Ziyang
    Wu, Yanlin
    Xie, Xueshuo
    Nan, Li
    Li, Tao
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT X, ICIC 2024, 2024, 14871 : 428 - 438
  • [8] Multi-scale context UNet-like network with redesigned skip connections for medical image segmentation
    Qian, Ledan
    Wen, Caiyun
    Li, Yi
    Hu, Zhongyi
    Zhou, Xiao
    Xia, Xiaonyu
    Kim, Soo-Hyung
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 243
  • [9] PCAT-UNet: UNet-like network fused convolution and transformer for retinal vessel segmentation
    Chen, Danny
    Yang, Wenzhong
    Wang, Liejun
    Tan, Sixiang
    Lin, Jiangzhaung
    Bu, Wenxiu
    PLOS ONE, 2022, 17 (01):
  • [10] DRD-UNet, a UNet-Like Architecture for Multi-Class Breast Cancer Semantic Segmentation
    Ortega-Ruiz, Mauricio Alberto
    Karabag, Cefa
    Roman-Rangel, Edgar
    Reyes-Aldasoro, Constantino Carlos
    IEEE ACCESS, 2024, 12 : 40412 - 40424