Semi-Supervised Semantic Image Segmentation by Deep Diffusion Models and Generative Adversarial Networks

被引:5
作者
Diaz-Frances, Jose Angel [1 ]
Fernandez-Rodriguez, Jose David [1 ]
Thurnhofer-Hemsi, Karl [1 ]
Lopez-Rubio, Ezequiel [1 ]
机构
[1] Univ Malaga, ITIS Software, Calle Arquitecto Francisco Penalosa 18, Malaga 29010, Spain
关键词
Semantic segmentation; semi-supervised; diffusion model;
D O I
10.1142/S0129065724500576
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Typically, deep learning models for image segmentation tasks are trained using large datasets of images annotated at the pixel level, which can be expensive and highly time-consuming. A way to reduce the amount of annotated images required for training is to adopt a semi-supervised approach. In this regard, generative deep learning models, concretely Generative Adversarial Networks (GANs), have been adapted to semi-supervised training of segmentation tasks. This work proposes MaskGDM, a deep learning architecture combining some ideas from EditGAN, a GAN that jointly models images and their segmentations, together with a generative diffusion model. With careful integration, we find that using a generative diffusion model can improve EditGAN performance results in multiple segmentation datasets, both multi-class and with binary labels. According to the quantitative results obtained, the proposed model improves multi-class image segmentation when compared to the EditGAN and DatasetGAN models, respectively, by 4.5% and 5.0%. Moreover, using the ISIC dataset, our proposal improves the results from other models by up to 11% for the binary image segmentation approach.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Information Transfer in Semi-Supervised Semantic Segmentation
    Wu, Jiawei
    Fan, Haoyi
    Li, Zuoyong
    Liu, Guang-Hai
    Lin, Shouying
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (02) : 1174 - 1185
  • [22] Revisiting Consistency for Semi-Supervised Semantic Segmentation
    Grubisic, Ivan
    Orsic, Marin
    Segvic, Sinisa
    SENSORS, 2023, 23 (02)
  • [23] Semi-supervised Deep Learning via Transformation Consistency Regularization for Remote Sensing Image Semantic Segmentation
    Zhang, Bin
    Zhang, Yongjun
    Li, Yansheng
    Wan, Yi
    Guo, Haoyu
    Zheng, Zhi
    Yang, Kun
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 5782 - 5796
  • [24] Semi-supervised Segmentation of Liver Using Adversarial Learning with Deep Atlas Prior
    Zheng, Han
    Lin, Lanfen
    Hu, Hongjie
    Zhang, Qiaowei
    Chen, Qingqing
    Iwamoto, Yutaro
    Han, Xianhua
    Chen, Yen-Wei
    Tong, Ruofeng
    Wu, Jian
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT VI, 2019, 11769 : 148 - 156
  • [25] Semi-supervised Attentive Mutual-info Generative Adversarial Network for Brain Tumor Segmentation
    Xi, Nan
    2019 INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ), 2019,
  • [26] Generative Adversarial Networks-Based Semi-Supervised Automatic Modulation Recognition for Cognitive Radio Networks
    Li, Mingxuan
    Li, Ou
    Liu, Guangyi
    Zhang, Ce
    SENSORS, 2018, 18 (11)
  • [27] Feature-enhanced adversarial semi-supervised semantic segmentation network for pulmonary embolism annotation
    Cheng, Ting-Wei
    Chua, Yi Wei
    Huang, Ching-Chun
    Chang, Jerry
    Kuo, Chin
    Cheng, Yun-Chien
    HELIYON, 2023, 9 (05)
  • [28] Adversarial Semi-supervised domain adaptation for semantic segmentation: A new role for labeled target samples
    Kechaou, Marwa
    Alaya, Mokhtar Z.
    Herault, Romain
    Gasso, Gilles
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2025, 253
  • [29] A Residual Correction Approach for Semi-supervised Semantic Segmentation
    Li, Haoliang
    Zheng, Huicheng
    PATTERN RECOGNITION AND COMPUTER VISION, PT IV, 2021, 13022 : 90 - 102
  • [30] Revisiting Network Perturbation for Semi-supervised Semantic Segmentation
    Li, Sien
    Wang, Tao
    Hui, Ruizhe
    Liu, Wenxi
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XII, 2025, 15042 : 157 - 171