Diffusion model-based text-guided enhancement network for medical image segmentation

Cited by: 7
Authors
Dong, Zhiwei [1]
Yuan, Genji [1]
Hua, Zhen [1]
Li, Jinjiang [2]
Affiliations
[1] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai, Peoples R China
[2] Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Denoising diffusion model; Text attention mechanism; Guided feature enhancement; Medical image segmentation; CONVOLUTIONAL NEURAL-NETWORK; CELL-NUCLEI; MISDIAGNOSIS; CLASSIFICATION;
DOI
10.1016/j.eswa.2024.123549
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In recent years, denoising diffusion models have achieved remarkable success in generating semantically meaningful pixel-level representations for image generation. In this study, we propose a novel end-to-end framework, TGEDiff, for medical image segmentation. TGEDiff fuses a textual attention mechanism with the diffusion model by introducing an auxiliary classification task, so that textual information guides the diffusion model toward better pixel-level representations. To overcome the limited receptive fields of the independent feature encoders within the diffusion model, we introduce a multi-kernel excitation module that extends the model's perceptual capability. In addition, a guided feature enhancement module is introduced in the Denoising-UNet to focus the model's attention on important regions and attenuate the influence of noise and irrelevant background in medical images. We evaluated TGEDiff on three datasets (Kvasir-SEG, Kvasir-Sessile, and GLaS), where it improved on the state-of-the-art approach on all three, with F1 score and mIoU gains of 0.88% and 1.09%, 3.21% and 3.43%, and 1.29% and 2.34%, respectively. These results indicate that TGEDiff performs strongly in medical image segmentation. TGEDiff is expected to facilitate accurate diagnosis and treatment of disease through more precise deconvolutional structural segmentation.
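The abstract reports results as F1 score and mIoU. As a rough illustration of how these two overlap metrics are commonly computed for binary segmentation masks, here is a minimal sketch in plain Python. It is not taken from the paper: the function names (`confusion_counts`, `f1_score`, `iou_score`, `mean_iou`) are hypothetical, and averaging the foreground IoU over images is an assumption, since the record does not say how the authors define mIoU.

```python
from typing import List, Sequence, Tuple

Mask = Sequence[Sequence[int]]  # binary mask: 1 = foreground, 0 = background


def confusion_counts(pred: Mask, target: Mask) -> Tuple[int, int, int]:
    """Count true positives, false positives and false negatives pixel by pixel."""
    tp = fp = fn = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p and t:
                tp += 1
            elif p and not t:
                fp += 1
            elif t and not p:
                fn += 1
    return tp, fp, fn


def f1_score(pred: Mask, target: Mask) -> float:
    """F1 (equivalently Dice) overlap: 2*TP / (2*TP + FP + FN)."""
    tp, fp, fn = confusion_counts(pred, target)
    denom = 2 * tp + fp + fn
    # Assumption: two empty masks count as a perfect match.
    return 1.0 if denom == 0 else 2 * tp / denom


def iou_score(pred: Mask, target: Mask) -> float:
    """Intersection over union: TP / (TP + FP + FN)."""
    tp, fp, fn = confusion_counts(pred, target)
    denom = tp + fp + fn
    return 1.0 if denom == 0 else tp / denom


def mean_iou(preds: List[Mask], targets: List[Mask]) -> float:
    """Assumed definition: average foreground IoU over a list of images."""
    scores = [iou_score(p, t) for p, t in zip(preds, targets)]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    pred = [[1, 1], [0, 0]]
    target = [[1, 0], [1, 0]]
    print(f1_score(pred, target), iou_score(pred, target))
```

For a single mask pair the two metrics are monotonically related (F1 = 2·IoU / (1 + IoU)), so F1 is never lower than IoU for the same prediction, which is consistent with papers reporting both side by side.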
Pages: 18
Related papers
50 in total
  • [41] Medical image segmentation with transform and moment based features and incremental supervised neural network
    Iscan, Zafer
    Yuksel, Ayhan
    Dokur, Zuemray
    Korurek, Mehmet
    Olmez, Tamer
    DIGITAL SIGNAL PROCESSING, 2009, 19 (05) : 890 - 901
  • [42] A swin-transformer-based network with inductive bias ability for medical image segmentation
    Gao, Yan
    Xu, Huan
    Liu, Quanle
    Bie, Mei
    Che, Xiangjiu
    APPLIED INTELLIGENCE, 2025, 55 (02)
  • [43] Medical Image Segmentation based on Fully Convolutional Network and Minimizing Energy Between Curves
    Vo Thi Hong Tuyet
    Nguyen Thanh Binh
    TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS, 2020, 9 (04) : 1348 - 1356
  • [44] DBEF-Net: Diffusion-Based Boundary-Enhanced Fusion Network for medical image segmentation
    Huang, Zhenyang
    Li, Jianjun
    Mao, Ning
    Yuan, Genji
    Li, Jinjiang
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [45] A Novel Model-Based Approach for Medical Image Segmentation Using Spatially Constrained Inverted Dirichlet Mixture Models
    Fan, Wentao
    Hu, Can
    Du, Jixiang
    Bouguila, Nizar
    NEURAL PROCESSING LETTERS, 2018, 47 (02) : 619 - 639
  • [46] Conditional Diffusion Model with Spatial Attention and Latent Embedding for Medical Image Segmentation
    Hejrati, Behzad
    Banerjee, Soumyanil
    Glide-Hurst, Carri
    Dong, Ming
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT IX, 2024, 15009 : 202 - 212
  • [47] CSAP-UNet: Convolution and self-attention paralleling network for medical image segmentation with edge enhancement
    Fan X.
    Zhou J.
    Jiang X.
    Xin M.
    Hou L.
    Computers in Biology and Medicine, 2024, 172
  • [48] Model-Based Learning of Local Image Features for Unsupervised Texture Segmentation
    Kiechle, Martin
    Storath, Martin
    Weinmann, Andreas
    Kleinsteuber, Martin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (04) : 1994 - 2007
  • [49] Query-guided generalizable medical image segmentation
    Yang, Zhiyi
    Zhao, Zhou
    Gu, Yuliang
    Xu, Yongchao
    PATTERN RECOGNITION LETTERS, 2024, 184 : 52 - 58
  • [50] Medical image segmentation based on Mumford-Shah Model
    Lin, P
    Yan, XG
    Zheng, CX
    Yang, Y
    2004 INTERNATIONAL CONFERENCE ON COMMUNICATION, CIRCUITS, AND SYSTEMS, VOLS 1 AND 2: VOL 1: COMMUNICATION THEORY AND SYSTEMS - VOL 2: SIGNAL PROCESSING, CIRCUITS AND SYSTEMS, 2004 : 942 - 945