Diffusion model-based text-guided enhancement network for medical image segmentation

被引:7
|
作者
Dong, Zhiwei [1 ]
Yuan, Genji [1 ]
Hua, Zhen [1 ]
Li, Jinjiang [2 ]
机构
[1] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai, Peoples R China
[2] Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai, Peoples R China
基金
中国国家自然科学基金;
关键词
Denoising diffusion model; Text attention mechanism; Guided feature enhancement; Medical image segmentation; CONVOLUTIONAL NEURAL-NETWORK; CELL-NUCLEI; MISDIAGNOSIS; CLASSIFICATION;
D O I
10.1016/j.eswa.2024.123549
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, denoising diffusion models have achieved remarkable success in generating pixel-level representations with semantic values for image generation modeling. In this study, we propose a novel end -toend framework, called TGEDiff, focusing on medical image segmentation. TGEDiff fuses a textual attention mechanism with the diffusion model by introducing an additional auxiliary categorization task to guide the diffusion model with textual information to generate excellent pixel-level representations. To overcome the limitation of limited perceptual fields for independent feature encoders within the diffusion model, we introduce a multi-kernel excitation module to extend the model's perceptual capability. Meanwhile, a guided feature enhancement module is introduced in Denoising-UNet to focus the model's attention on important regions and attenuate the influence of noise and irrelevant background in medical images. We critically evaluated TGEDiff on three datasets (Kvasir-SEG, Kvasir-Sessile, and GLaS), and TGEDiff achieved significant improvements over the state -of -the -art approach on all three datasets, with F1 scores and mIoU improving by 0.88% and 1.09%, 3.21% and 3.43%, respectively, 1.29% and 2.34%. These data validate that TGEDiff has excellent performance in medical image segmentation. TGEDiff is expected to facilitate accurate diagnosis and treatment of medical diseases through more precise deconvolutional structural segmentation.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] BerDiff: Conditional Bernoulli Diffusion Model for Medical Image Segmentation
    Chen, Tao
    Wang, Chenhui
    Shan, Hongming
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 491 - 501
  • [22] Evolutionary Attention Network for Medical Image Segmentation
    Hassanzadeh, Tahereh
    Essam, Daryl
    Sarker, Ruhul
    2020 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2020,
  • [23] Guided-attention and gated-aggregation network for medical image segmentation
    Fiaz, Mustansar
    Noman, Mubashir
    Cholakkal, Hisham
    Anwer, Rao Muhammad
    Hanna, Jacob
    Khan, Fahad Shahbaz
    PATTERN RECOGNITION, 2024, 156
  • [24] Edge-guided and hierarchical aggregation network for robust medical image segmentation
    Tang, Yi
    Zhao, Di
    Pertsau, Dmitry
    Gourinovitch, Alevtina
    Kupryianava, Dziana
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 101
  • [25] BGF-Net: Boundary guided filter network for medical image segmentation
    He, Yanlin
    Yi, Yugen
    Zheng, Caixia
    Kong, Jun
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 171
  • [26] DCACNet: Dual context aggregation and attention-guided cross deconvolution network for medical image segmentation
    Lu, Hongchun
    Tian, Shengwei
    Yu, Long
    Liu, Lu
    Cheng, Junlong
    Wu, Weidong
    Kang, Xiaojing
    Zhang, Dezhi
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 214
  • [27] Medical image segmentation network based on feature filtering with low number of parameters
    Ren, Zitong
    Guo, Zhiqing
    Wang, Liejun
    Xu, Lianghui
    Liu, Chao
    APPLIED SOFT COMPUTING, 2024, 167
  • [28] Active Contour Model Coupling with Backward Diffusion for Medical Image Segmentation
    Wang, Guodong
    Pan, Zhenkuan
    Zhang, Weizhong
    Dong, Qian
    PROCEEDINGS OF THE 2013 6TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS (BMEI 2013), VOLS 1 AND 2, 2013, : 101 - 105
  • [29] Fuzzy model-based clustering and its application in image segmentation
    Choy, Siu Kai
    Lam, Shu Yan
    Yu, Kwok Wai
    Lee, Wing Yan
    Leung, King Tai
    PATTERN RECOGNITION, 2017, 68 : 141 - 157
  • [30] Encoder Activation Diffusion and Decoder Transformer Fusion Network for Medical Image Segmentation
    Li, Xueru
    Xu, Guoxia
    Zhao, Meng
    Shi, Fan
    Wang, Hao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII, 2024, 14437 : 185 - 197