LATrans-Unet: Improving CNN-Transformer with Location Adaptive for Medical Image Segmentation

被引:0
作者
Lin, Qiqin [1 ]
Yao, Junfeng [1 ,2 ,3 ]
Hong, Qingqi [1 ,3 ,4 ]
Cao, Xianpeng [1 ]
Zhou, Rongzhou [1 ]
Xie, Weixing [1 ]
机构
[1] Xiamen Univ, Sch Film, Sch Informat, Ctr Digital Media Comp, Xiamen 361005, Peoples R China
[2] Minist Culture & Tourism, Key Lab Digital Protect & Intelligent Proc Intang, Xiamen, Peoples R China
[3] Xiamen Univ, Inst Artificial Intelligence, Xiamen 361005, Peoples R China
[4] Hong Kong Ctr Cerebrocardiovasc Hlth Engn COCHE, Hong Kong, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII | 2024年 / 14437卷
关键词
Medical image segmentation; Transformer; Location information; Skip connection; NET;
D O I
10.1007/978-981-99-8558-6_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) have been widely employed in medical image segmentation. While CNNs excel in local feature encoding, their ability to capture long-range dependencies is limited. In contrast, ViTs have strong global modeling capabilities. However, existing attention-based ViT models face difficulties in adaptively preserving accurate location information, rendering them unable to handle variations in important information within medical images. To inherit the merits of CNN and ViT while avoiding their respective limitations, we propose a novel framework called LATrans-Unet. By comprehensively enhancing the representation of information in both shallow and deep levels, LATrans-Unet maximizes the integration of location information and contextual details. In the shallow levels, based on a skip connection called SimAM-skip, we emphasize information boundaries and bridge the encoder-decoder semantic gap. Additionally, to capture organ shape and location variations in medical images, we propose Location-Adaptive Attention in the deep levels. It enables accurate segmentation by guiding the model to track changes globally and adaptively. Extensive experiments on multi-organ and cardiac segmentation tasks validate the superior performance of LATrans-Unet compared to previous state-of-the-art methods. The codes and trained models will be available soon.
引用
收藏
页码:223 / 234
页数:12
相关论文
共 50 条
  • [31] Semhybridnet: a semantically enhanced hybrid CNN-transformer network for radar pulse image segmentation
    Hongjia Liu
    Yubin Xiao
    Xuan Wu
    Yuanshu Li
    Peng Zhao
    Yanchun Liang
    Liupu Wang
    You Zhou
    Complex & Intelligent Systems, 2024, 10 : 2851 - 2868
  • [32] FFSwinNet: CNN-Transformer Combined Network With FFT for Shale Core SEM Image Segmentation
    Feng, Yilong
    Jia, Lijuan
    Zhang, Jinchuan
    Chen, Junqi
    IEEE ACCESS, 2024, 12 : 73021 - 73032
  • [33] ConTrans: Improving Transformer with Convolutional Attention for Medical Image Segmentation
    Lin, Ailiang
    Xu, Jiayu
    Li, Jinxing
    Lu, Guangming
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 297 - 307
  • [34] AdaMCT: Adaptive Mixture of CNN-Transformer for Sequential Recommendation
    Jiang, Juyong
    Zhang, Peiyan
    Luo, Yingtao
    Li, Chaozhuo
    Kim, Jae Boum
    Zhang, Kai
    Wang, Senzhang
    Xie, Xing
    Kim, Sunghun
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 976 - 986
  • [35] Parallel Transformer-CNN Model for Medical Image Segmentation
    Zhou, Mingkun
    Nie, Xueyun
    Liu, Yuhang
    Li, Doudou
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024, 2024, : 1048 - 1051
  • [36] Hierarchical Decoder with Parallel Transformer and CNN for Medical Image Segmentation
    Li, Shijie
    Gong, Yu
    Xiang, Qingyuan
    Li, Zheng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XIV, 2025, 15044 : 133 - 147
  • [37] TAC-UNet: transformer-assisted convolutional neural network for medical image segmentation
    He, Jingliu
    Ma, Yuqi
    Yang, Mingyue
    Yang, Wensong
    Wu, Chunming
    Chen, Shanxiong
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2024, 14 (12) : 8824 - 8839
  • [38] Multiresolution Aggregation Transformer UNet Based on Multiscale Input and Coordinate Attention for Medical Image Segmentation
    Chen, Shaolong
    Qiu, Changzhen
    Yang, Weiping
    Zhang, Zhiyong
    SENSORS, 2022, 22 (10)
  • [39] Progressive CNN-transformer semantic compensation network for polyp segmentation
    Li, Daxiang
    Li, Denghui
    Liu, Ying
    Tang, Yao
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (16): : 2523 - 2536
  • [40] CCTrans: Improving Medical Image Segmentation with Contoured Convolutional Transformer Network
    Wang, Jingling
    Zhang, Haixian
    Yi, Zhang
    MATHEMATICS, 2023, 11 (09)