LATrans-Unet: Improving CNN-Transformer with Location Adaptive for Medical Image Segmentation

被引:0
作者
Lin, Qiqin [1 ]
Yao, Junfeng [1 ,2 ,3 ]
Hong, Qingqi [1 ,3 ,4 ]
Cao, Xianpeng [1 ]
Zhou, Rongzhou [1 ]
Xie, Weixing [1 ]
机构
[1] Xiamen Univ, Sch Film, Sch Informat, Ctr Digital Media Comp, Xiamen 361005, Peoples R China
[2] Minist Culture & Tourism, Key Lab Digital Protect & Intelligent Proc Intang, Xiamen, Peoples R China
[3] Xiamen Univ, Inst Artificial Intelligence, Xiamen 361005, Peoples R China
[4] Hong Kong Ctr Cerebrocardiovasc Hlth Engn COCHE, Hong Kong, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII | 2024年 / 14437卷
关键词
Medical image segmentation; Transformer; Location information; Skip connection; NET;
D O I
10.1007/978-981-99-8558-6_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) have been widely employed in medical image segmentation. While CNNs excel in local feature encoding, their ability to capture long-range dependencies is limited. In contrast, ViTs have strong global modeling capabilities. However, existing attention-based ViT models face difficulties in adaptively preserving accurate location information, rendering them unable to handle variations in important information within medical images. To inherit the merits of CNN and ViT while avoiding their respective limitations, we propose a novel framework called LATrans-Unet. By comprehensively enhancing the representation of information in both shallow and deep levels, LATrans-Unet maximizes the integration of location information and contextual details. In the shallow levels, based on a skip connection called SimAM-skip, we emphasize information boundaries and bridge the encoder-decoder semantic gap. Additionally, to capture organ shape and location variations in medical images, we propose Location-Adaptive Attention in the deep levels. It enables accurate segmentation by guiding the model to track changes globally and adaptively. Extensive experiments on multi-organ and cardiac segmentation tasks validate the superior performance of LATrans-Unet compared to previous state-of-the-art methods. The codes and trained models will be available soon.
引用
收藏
页码:223 / 234
页数:12
相关论文
共 50 条
  • [41] CvT-UNet: A weld pool segmentation method integrating a CNN and a transformer
    Yang, Longcheng
    Wang, Huajun
    Meng, Wenjie
    Pan, Hongyu
    HELIYON, 2024, 10 (15)
  • [42] GSAC-UFormer: Groupwise Self-Attention Convolutional Transformer-Based UNet for Medical Image Segmentation
    Garbaz, Anass
    Oukdach, Yassine
    Charfi, Said
    El Ansari, Mohamed
    Koutti, Lahcen
    Salihoun, Mouna
    COGNITIVE COMPUTATION, 2025, 17 (02)
  • [43] FAFuse: A Four-Axis Fusion framework of CNN and Transformer for medical image segmentation
    Xu, Shoukun
    Xiao, Dehao
    Yuan, Baohua
    Liu, Yi
    Wang, Xueyuan
    Li, Ning
    Shi, Lin
    Chen, Jialu
    Zhang, Ju-Xiao
    Wang, Yanhao
    Cao, Jianfeng
    Shao, Yeqin
    Jiang, Mingjie
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 166
  • [44] FTransCNN: Fusing Transformer and a CNN based on fuzzy logic for uncertain medical image segmentation
    Ding, Weiping
    Wang, Haipeng
    Huang, Jiashuang
    Ju, Hengrong
    Geng, Yu
    Lin, Chin-Teng
    Pedrycz, Witold
    INFORMATION FUSION, 2023, 99
  • [45] D-TrAttUnet: Toward hybrid CNN-transformer architecture for generic and subtle segmentation in medical images
    Bougourzi F.
    Dornaika F.
    Distante C.
    Taleb-Ahmed A.
    Computers in Biology and Medicine, 2024, 176
  • [46] Image Deblurring Based on an Improved CNN-Transformer Combination Network
    Chen, Xiaolin
    Wan, Yuanyuan
    Wang, Donghe
    Wang, Yuqing
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [47] MSCT-UNET: multi-scale contrastive transformer within U-shaped network for medical image segmentation
    Xi, Heran
    Dong, Haoji
    Sheng, Yue
    Cui, Hui
    Huang, Chengying
    Li, Jinbao
    Zhu, Jinghua
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (01)
  • [48] TACT: Text attention based CNN-Transformer network for polyp segmentation
    Zhao, Yiyang
    Li, Jinjiang
    Hua, Zhen
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (02)
  • [49] Medical Image Segmentation Based on Transformer and HarDNet Structures
    Shen, Tongping
    Xu, Huanqing
    IEEE ACCESS, 2023, 11 : 16621 - 16630
  • [50] ScribFormer: Transformer Makes CNN Work Better for Scribble-Based Medical Image Segmentation
    Li, Zihan
    Zheng, Yuan
    Shan, Dandan
    Yang, Shuzhou
    Li, Qingde
    Wang, Beizhan
    Zhang, Yuanting
    Hong, Qingqi
    Shen, Dinggang
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (06) : 2254 - 2265