Diffusion model-based text-guided enhancement network for medical image segmentation

Cited by: 7
Authors
Dong, Zhiwei [1]
Yuan, Genji [1]
Hua, Zhen [1]
Li, Jinjiang [2]
Affiliations
[1] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai, Peoples R China
[2] Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Denoising diffusion model; Text attention mechanism; Guided feature enhancement; Medical image segmentation; CONVOLUTIONAL NEURAL-NETWORK; CELL-NUCLEI; MISDIAGNOSIS; CLASSIFICATION;
DOI
10.1016/j.eswa.2024.123549
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In recent years, denoising diffusion models have achieved remarkable success in generating pixel-level representations with semantic values for image generation modeling. In this study, we propose a novel end-to-end framework, called TGEDiff, for medical image segmentation. TGEDiff fuses a textual attention mechanism with the diffusion model by introducing an auxiliary categorization task, so that textual information guides the diffusion model toward better pixel-level representations. To overcome the limited perceptual fields of the independent feature encoders within the diffusion model, we introduce a multi-kernel excitation module that extends the model's perceptual capability. In addition, a guided feature enhancement module is introduced in Denoising-UNet to focus the model's attention on important regions and attenuate the influence of noise and irrelevant background in medical images. We evaluated TGEDiff on three datasets (Kvasir-SEG, Kvasir-Sessile, and GLaS), where it improved on the state-of-the-art approach in every case: F1 score and mIoU increased by 0.88% and 1.09% on Kvasir-SEG, 3.21% and 3.43% on Kvasir-Sessile, and 1.29% and 2.34% on GLaS, respectively. These results validate the performance of TGEDiff in medical image segmentation. TGEDiff is expected to facilitate accurate diagnosis and treatment of medical diseases through more precise deconvolutional structural segmentation.
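The abstract reports results as F1 score and mIoU. As a reading aid, the following is a minimal sketch of how these two overlap metrics are commonly computed for binary segmentation masks. It is not taken from the paper: the function names `f1_and_iou` and `mean_iou` are hypothetical, masks are assumed to be nested lists of 0/1 values, and mIoU is assumed here to be the IoU averaged over images (some works average over classes instead).

```python
def f1_and_iou(pred, target):
    """Return (F1, IoU) for two binary masks given as nested lists of 0/1.

    Hypothetical helper for illustration only; not the authors' code.
    """
    tp = fp = fn = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p and t:
                tp += 1          # predicted foreground, truly foreground
            elif p and not t:
                fp += 1          # predicted foreground, truly background
            elif t and not p:
                fn += 1          # missed foreground pixel
    f1_denom = 2 * tp + fp + fn
    iou_denom = tp + fp + fn
    # Assumption: two empty masks count as a perfect match.
    f1 = 2 * tp / f1_denom if f1_denom else 1.0
    iou = tp / iou_denom if iou_denom else 1.0
    return f1, iou


def mean_iou(mask_pairs):
    """Average IoU over (pred, target) pairs (assumed per-image averaging)."""
    ious = [f1_and_iou(pred, target)[1] for pred, target in mask_pairs]
    return sum(ious) / len(ious)


# Toy 2x3 masks: 2 true positives, 1 false positive, 1 false negative.
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
f1, iou = f1_and_iou(pred, target)
print(f"F1 = {f1:.4f}, IoU = {iou:.4f}")
```

On binary masks F1 coincides with the Dice coefficient, and the two metrics are related by F1 = 2·IoU / (1 + IoU), so they rank a single prediction the same way while differing in scale.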
Pages: 18