RAMIS: Increasing robustness and accuracy in medical image segmentation with hybrid CNN-transformer synergy

Cited by: 2

Authors:

Gu, Jia ^{[1]}

Tian, Fangzheng ^{[1]}

Oh, Il-Seok ^{[1,2]}

Affiliations:

[1] Jeonbuk Natl Univ, Dept Comp Sci & Artificial Intelligence, Jeonju Si 54896, South Korea

[2] Jeonbuk Natl Univ, Ctr Adv Image Informat Technol, Jeonju 54896, South Korea

Source:

NEUROCOMPUTING | 2025 / Vol. 618

Funding:

National Research Foundation, Singapore;

Keywords:

Medical image segmentation; Hybrid models; Implicit representation; Self-distillation; Multi-resolution network; UNET PLUS PLUS; LESION SEGMENTATION; FUSION NETWORK; U-NET; CLASSIFICATION; ARCHITECTURE;

DOI:

10.1016/j.neucom.2024.129009

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Hybrid architectures based on Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) have become an important research direction in medical image segmentation in recent years. However, currently popular hybrid architectures weaken the decision-making process within the Transformer model, and post-processing the Transformer output by upsampling through a convolution stack makes it difficult to restore the blurred boundaries of the target area. To improve feature learning capability by addressing these issues, we propose RAMIS, a novel hybrid architecture for general medical image segmentation. RAMIS develops implicit neural representation and self-distillation to simultaneously obtain the super-resolution details and core features of the image as input to the Transformer encoder. Meanwhile, RAMIS explores an unsupervised learning CNN to obtain the initial input to the Transformer decoder, which not only explicitly considers the correlation among different samples and reduces the constraints on small datasets, but also fully leverages the potential of the Transformer's cross-attention for optimizing segmentation results. RAMIS designs a multi-resolution interaction network to post-process the Transformer output and solves the problem of blurred segmentation boundaries by incorporating the super-resolution image. We extensively evaluate RAMIS on five datasets from three typical publicly available medical image segmentation datasets. Extensive experimental results demonstrate the general applicability and superior performance of the proposed method. The code and pre-trained models are available on our website https://ramis.netlify.app.

Pages: 14
