RM-UNet: UNet-like Mamba with rotational SSM module for medical image segmentation

被引:3
作者
Tang, Hao [1 ]
Huang, Guoheng [1 ]
Cheng, Lianglun [1 ]
Yuan, Xiaochen [2 ]
Tao, Qi [3 ]
Chen, Xuhang [4 ]
Zhong, Guo [5 ]
Yang, Xiaohui [6 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Peoples R China
[2] Macao Polytech Univ, Fac Appl Sci, Macau 999078, Peoples R China
[3] Guangdong Technion Israel Inst Technol, Dept Mech Engn Robot, Shantou 515063, Peoples R China
[4] Huizhou Univ, Sch Comp Sci & Engn, Huizhou 516007, Peoples R China
[5] Guangdong Univ Foreign Studies, Sch Informat Sci & Technol, Guangzhou 510006, Peoples R China
[6] Sun Yat sen Univ, Affiliated Hosp 3, Dept Gynecol, Guangzhou, Peoples R China
关键词
U-Net; State Space Models; Medical image segmentation; Mamba; LSIL; U-NET ARCHITECTURE; TRANSFORMER;
D O I
10.1007/s11760-024-03484-8
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Accurate segmentation of tissues and lesions is crucial for disease diagnosis, treatment planning, and surgical navigation. Yet, the complexity of medical images presents significant challenges for traditional Convolutional Neural Networks and Transformer models due to their limited receptive fields or high computational complexity. State Space Models (SSMs) have recently shown notable vision performance, particularly Mamba and its variants. However, their feature extraction methods may not be sufficiently effective and retain some redundant structures, leaving room for parameter reduction. In response to these challenges, we introduce a methodology called Rotational Mamba-UNet, characterized by Residual Visual State Space (ResVSS) block and Rotational SSM Module. The ResVSS block is devised to mitigate network degradation caused by the diminishing efficacy of information transfer from shallower to deeper layers. Meanwhile, the Rotational SSM Module is devised to tackle the challenges associated with channel feature extraction within State Space Models. Finally, we propose a weighted multi-level loss function, which fully leverages the outputs of the decoder's three stages for supervision. We conducted experiments on ISIC17, ISIC18, CVC-300, Kvasir-SEG, CVC-ColonDB, Kvasir-Instrument datasets, and Low-grade Squamous Intraepithelial Lesion datasets provided by The Third Affiliated Hospital of Sun Yat-sen University, demonstrating the superior segmentation performance of our proposed RM-UNet. Additionally, compared to the previous VM-UNet, our model achieves a one-third reduction in parameters. Our code is available at https://github.com/Halo2Tang/RM-UNet.
引用
收藏
页码:8427 / 8443
页数:17
相关论文
共 50 条
  • [21] DSKCA-UNet: Dynamic selective kernel channel attention for medical image segmentation
    Shen, Longfeng
    Wang, Qiong
    Zhang, Yingjie
    Qin, Fenglan
    Jin, Hengjun
    Zhao, Wei
    MEDICINE, 2023, 102 (39) : E35328
  • [22] DI-Unet: Dimensional interaction self-attention for medical image segmentation
    Wu, Yanlin
    Wang, Guanglei
    Wang, Zhongyang
    Wang, Hongrui
    Li, Yan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 78
  • [23] Light-UNet: An Efficient Segmentation Network for Medical Image
    Zhang, Yue
    Xu, Chao
    Zhang, Zhifan
    Wang, Jianjun
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14867 : 302 - 313
  • [24] Dynamic neighbourhood-enhanced UNet with interwoven fusion for medical image segmentation
    Wan, Liming
    Song, Lin
    Zhou, Ying
    Kang, Chenrui
    Zheng, Shijian
    Chen, Guo
    VISUAL COMPUTER, 2025,
  • [25] LeViT-UNet: Make Faster Encoders with Transformer for Medical Image Segmentation
    Xu, Guoping
    Zhang, Xuan
    He, Xinwei
    Wu, Xinglong
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VIII, 2024, 14432 : 42 - 53
  • [26] ERDUnet: An Efficient Residual Double-Coding Unet for Medical Image Segmentation
    Li, Hao
    Zhai, Di-Hua
    Xia, Yuanqing
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2083 - 2096
  • [27] Multiresolution Aggregation Transformer UNet Based on Multiscale Input and Coordinate Attention for Medical Image Segmentation
    Chen, Shaolong
    Qiu, Changzhen
    Yang, Weiping
    Zhang, Zhiyong
    SENSORS, 2022, 22 (10)
  • [28] EPolar-UNet: An edge-attending polar UNet for automatic medical image segmentation with small datasets
    Ling, Yating
    Wang, Yuling
    Liu, Qian
    Yu, Jie
    Xu, Lei
    Zhang, Xiaoqian
    Liang, Ping
    Kong, Dexing
    MEDICAL PHYSICS, 2024, 51 (03) : 1702 - 1713
  • [29] MCNMF-Unet: a mixture Conv-MLP network with multi-scale features fusion Unet for medical image segmentation
    Yuan, Lei
    Song, Jianhua
    Fan, Yazhuo
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [30] TMD-Unet: Triple-Unet with Multi-Scale Input Features and Dense Skip Connection for Medical Image Segmentation
    Tran, Song-Toan
    Cheng, Ching-Hwa
    Nguyen, Thanh-Tuan
    Le, Minh-Hai
    Liu, Don-Gey
    HEALTHCARE, 2021, 9 (01)