LATrans-Unet: Improving CNN-Transformer with Location Adaptive for Medical Image Segmentation

被引:0
作者
Lin, Qiqin [1 ]
Yao, Junfeng [1 ,2 ,3 ]
Hong, Qingqi [1 ,3 ,4 ]
Cao, Xianpeng [1 ]
Zhou, Rongzhou [1 ]
Xie, Weixing [1 ]
机构
[1] Xiamen Univ, Sch Film, Sch Informat, Ctr Digital Media Comp, Xiamen 361005, Peoples R China
[2] Minist Culture & Tourism, Key Lab Digital Protect & Intelligent Proc Intang, Xiamen, Peoples R China
[3] Xiamen Univ, Inst Artificial Intelligence, Xiamen 361005, Peoples R China
[4] Hong Kong Ctr Cerebrocardiovasc Hlth Engn COCHE, Hong Kong, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII | 2024年 / 14437卷
关键词
Medical image segmentation; Transformer; Location information; Skip connection; NET;
D O I
10.1007/978-981-99-8558-6_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) have been widely employed in medical image segmentation. While CNNs excel in local feature encoding, their ability to capture long-range dependencies is limited. In contrast, ViTs have strong global modeling capabilities. However, existing attention-based ViT models face difficulties in adaptively preserving accurate location information, rendering them unable to handle variations in important information within medical images. To inherit the merits of CNN and ViT while avoiding their respective limitations, we propose a novel framework called LATrans-Unet. By comprehensively enhancing the representation of information in both shallow and deep levels, LATrans-Unet maximizes the integration of location information and contextual details. In the shallow levels, based on a skip connection called SimAM-skip, we emphasize information boundaries and bridge the encoder-decoder semantic gap. Additionally, to capture organ shape and location variations in medical images, we propose Location-Adaptive Attention in the deep levels. It enables accurate segmentation by guiding the model to track changes globally and adaptively. Extensive experiments on multi-organ and cardiac segmentation tasks validate the superior performance of LATrans-Unet compared to previous state-of-the-art methods. The codes and trained models will be available soon.
引用
收藏
页码:223 / 234
页数:12
相关论文
共 50 条
  • [1] AFC-Unet: Attention-fused full-scale CNN-transformer unet for medical image segmentation
    Meng, Wenjie
    Liu, Shujun
    Wang, Huajun
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 99
  • [2] TFCNs: A CNN-Transformer Hybrid Network for Medical Image Segmentation
    Li, Zihan
    Li, Dihan
    Xu, Cangbai
    Wang, Weice
    Hong, Qingqi
    Li, Qingde
    Tian, Jie
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 781 - 792
  • [3] UCTNet: Uncertainty-guided CNN-Transformer hybrid networks for medical image segmentation
    Guo, Xiayu
    Lin, Xian
    Yang, Xin
    Yu, Li
    Cheng, Kwang-Ting
    Yan, Zengqiang
    PATTERN RECOGNITION, 2024, 152
  • [4] RAMIS: Increasing robustness and accuracy in medical image segmentation with hybrid CNN-transformer synergy
    Gu, Jia
    Tian, Fangzheng
    Oh, Il-Seok
    NEUROCOMPUTING, 2025, 618
  • [5] HCT-Unet: multi-target medical image segmentation via a hybrid CNN-transformer Unet incorporating multi-axis gated multi-layer perceptron
    Fan, Yazhuo
    Song, Jianhua
    Yuan, Lei
    Jia, Yunlin
    VISUAL COMPUTER, 2024, : 3457 - 3472
  • [6] HTC-Net: A hybrid CNN-transformer framework for medical image segmentation
    Tang, Hui
    Chen, Yuanbin
    Wang, Tao
    Zhou, Yuanbo
    Zhao, Longxuan
    Gao, Qinquan
    Du, Min
    Tan, Tao
    Zhang, Xinlin
    Tong, Tong
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 88
  • [7] Alternate encoder and dual decoder CNN-Transformer networks for medical image segmentation
    Zhang, Lin
    Guo, Xinyu
    Sun, Hongkun
    Wang, Weigang
    Yao, Liwei
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [8] Multi-Scale Orthogonal Model CNN-Transformer for Medical Image Segmentation
    Zhou, Wuyi
    Zeng, Xianhua
    Zhou, Mingkun
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (10)
  • [9] TransUMobileNet: Integrating multi-channel attention fusion with hybrid CNN-Transformer architecture for medical image segmentation
    Cai, Sijing
    Jiang, Yukun
    Xiao, Yuwei
    Zeng, Jian
    Zhou, Guangming
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 107
  • [10] MCT-Net: a multi-branch hybrid CNN-transformer model for medical image segmentation
    Longfeng Shen
    Liangjin Diao
    Rui Peng
    Jiacong Chen
    Zhengtian Lu
    Fangzhen Ge
    Pattern Analysis and Applications, 2025, 28 (2)