Encoder Activation Diffusion and Decoder Transformer Fusion Network for Medical Image Segmentation

被引:0
|
作者
Li, Xueru [1 ]
Xu, Guoxia [2 ]
Zhao, Meng [1 ]
Shi, Fan [1 ]
Wang, Hao [2 ]
机构
[1] Tianjin Univ Technol, Key Lab Comp Vis & Syst, Minist Educ, Sch Comp Sci & Engn, Tianjin 300384, Peoples R China
[2] Xidian Univ, Sch Cyber Engn, Xian 710126, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical Image Segmentation; Lightweight Convolution Modulation; Encoder Activation Diffusion; Multi-scale Decoding Fusion With Transformer;
D O I
10.1007/978-981-99-8558-6_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the years, medical image segmentation has played a vital role in assisting healthcare professionals in disease treatment. Convolutional neural networks have demonstrated remarkable success in this domain. Among these networks, the encoder-decoder architecture stands out as a classic and effective model for medical image segmentation. However, several challenges remain to be addressed, including segmentation issues arising from indistinct boundaries, difficulties in segmenting images with irregular shapes, and accurate segmentation of lesions with small targets. To address these limitations, we propose Encoder Activation Diffusion and Decoder Transformer Fusion Network (ADTF). Specifically, we propose a novel Lightweight Convolution Modulation (LCM) formed by a gated attention mechanism, using convolution to encode spatial features. LCM replaces the convolutional layer in the encoder-decoder network. Additionally, to enhance the integration of spatial information and dynamically extract more valuable high-order semantic information, we introduce Activation Diffusion Blocks after the encoder (EAD), so that the network can segment a complete medical segmentation image. Furthermore, we utilize a Transformer-based multi-scale feature fusion module on the decoder (MDFT) to achieve global interaction of multi-scale features. To validate our approach, we conduct experiments on multiple medical image segmentation datasets. Experimental results demonstrate that our model outperforms other state-of-the-art (SOTA) methods on commonly used evaluation metrics.
引用
收藏
页码:185 / 197
页数:13
相关论文
共 50 条
  • [1] Alternate encoder and dual decoder CNN-Transformer networks for medical image segmentation
    Zhang, Lin
    Guo, Xinyu
    Sun, Hongkun
    Wang, Weigang
    Yao, Liwei
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [2] Iterative Deep Convolutional Encoder-Decoder Network for Medical Image Segmentation
    Kim, Jung Uk
    Kim, Hak Gu
    Ro, Yong Man
    2017 39TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2017, : 685 - 688
  • [3] MULTI-ENCODER PARSE-DECODER NETWORK FOR SEQUENTIAL MEDICAL IMAGE SEGMENTATION
    Shi, Dachuan
    Liu, Ruiyang
    Tao, Linmi
    He, Zuoxiang
    Huo, Li
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 31 - 35
  • [4] LAEDNet: A Lightweight Attention Encoder-Decoder Network for ultrasound medical image segmentation
    Zhou, Quan
    Wang, Qianwen
    Bao, Yunchao
    Kong, Lingjun
    Jin, Xin
    Ou, Weihua
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 99
  • [5] A combined encoder–transformer–decoder network for volumetric segmentation of adrenal tumors
    Liping Wang
    Mingtao Ye
    Yanjie Lu
    Qicang Qiu
    Zhongfeng Niu
    Hengfeng Shi
    Jian Wang
    BioMedical Engineering OnLine, 22
  • [6] An Improved Encoder-Decoder Network for Ore Image Segmentation
    Yang, Hao
    Huang, Chao
    Wang, Long
    Luo, Xiong
    IEEE SENSORS JOURNAL, 2021, 21 (10) : 11469 - 11475
  • [7] CPFTransformer: transformer fusion context pyramid medical image segmentation network
    Li, Jiao
    Ye, Jinyu
    Zhang, Ruixin
    Wu, Yue
    Berhane, Gebremedhin Samuel
    Deng, Hongxia
    Shi, Hong
    FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [8] Diverter transformer-based multi-encoder-multi-decoder network model for medical retinal blood vessel image segmentation
    Wu, Chengwei
    Guo, Min
    Ma, Miao
    Wang, Kaiguang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 93
  • [9] SwinE-UNet3+: swin transformer encoder network for medical image segmentation
    Zou, Ping
    Wu, Jian-Sheng
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2023, 12 (01) : 99 - 105
  • [10] SwinE-UNet3+: swin transformer encoder network for medical image segmentation
    Ping Zou
    Jian-Sheng Wu
    Progress in Artificial Intelligence, 2023, 12 : 99 - 105