GAN and DCN Based Multi-step Supervised Learning for Image Semantic Segmentation

被引:7
|
作者
Fang, Jie [1 ,2 ]
Cao, Xiaoqian [3 ]
机构
[1] Chinese Acad Sci, Ctr Opt IMagery Anal & Learning OPTIMAL, Xian Inst Opt & Precis Mech, Xian 710119, Shaanxi, Peoples R China
[2] Univ Chinese Acad Sci, 19A Yuquanlu, Beijing 100049, Peoples R China
[3] Shaanxi Univ Sci & Technol, Coll Elect & Informat Engn, Xian 710021, Shaanxi, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PT II | 2018年 / 11257卷
关键词
cGAN; DCN; Image semantic segmentation; Multi-step supervised learning;
D O I
10.1007/978-3-030-03335-4_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image semantic segmentation contains two sub-tasks, segmenting and labeling. However, the recent fully convolutional network (FCN) based methods often ignore the first sub-task and consider it as a direct labeling one. Even though these methods have achieved competitive performances, they obtained spatially fragmented and disconnected outputs. The reason is that, pixel-level relationships inside the deepest layers become inconsistent since traditional FCNs do not have any explicit pixel grouping mechanism. To address this problem, a multi-step supervised learning method, which contains image-level supervised learning step and pixel-level supervised learning step, is proposed. Specifically, as for the visualized result of image semantic segmentation, it is actually an image-to-image transformation problem, from RGB domain to category label domain. The recent conditional generative adversarial network (cGAN) has achieved significant performance for image-to-image generation task, and the generated image remains good regional connectivity. Therefore, a cGAN supervised by RGB-category label map is used to obtain a coarse segmentation mask, which avoids generating disconnected segmentation results to a certain extent. Furthermore, an interaction information (II) loss term is proposed for cGAN to remain the spatial structure of the segmentation mask. Additionally, dilated convolutional networks (DCNs) have achieved significant performance in object detection field, especially for small objects because of its special receptive field settings. Specific to image semantic segmentation, if each pixel is seen as an object, this task can be transformed to object detection. In this case, combined with the segmentation mask from cGAN, a DCN supervised by the pixel-level label is used to finalize the category recognition of each pixel in the image. The proposed method achieves satisfactory performances on three public and challenging datasets for image semantic segmentation.
引用
收藏
页码:28 / 40
页数:13
相关论文
共 50 条
  • [1] Multi-step medical image segmentation based on reinforcement learning
    Zhiqiang Tian
    Xiangyu Si
    Yaoyue Zheng
    Zhang Chen
    Xiaojian Li
    Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 5011 - 5022
  • [2] Multi-step medical image segmentation based on reinforcement learning
    Tian, Zhiqiang
    Si, Xiangyu
    Zheng, Yaoyue
    Chen, Zhang
    Li, Xiaojian
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 13 (11) : 5011 - 5022
  • [3] Multi-step Segmentation for Prostate MR Image based on Reinforcement Learning
    Si, Xiangyu
    Tian, Zhiqiang
    Li, Xiaojian
    Chen, Zhang
    Li, Gen
    MEDICAL IMAGING 2020: IMAGE-GUIDED PROCEDURES, ROBOTIC INTERVENTIONS, AND MODELING, 2021, 11315
  • [4] Multi-step morph-based image segmentation algorithm
    Wu, Yue
    Zhao, Yu-Ming
    Zhu, Kai
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2004, 33 (06): : 607 - 610
  • [5] A Deeply Supervised Semantic Segmentation Method Based on GAN
    Zhao, Wei
    Wei, Qiyu
    Zeng, Zeng
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 5235 - 5240
  • [6] Image Piece Learning for Weakly Supervised Semantic Segmentation
    Li, Yi
    Guo, Yanqing
    Kao, Yueying
    He, Ran
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 47 (04): : 648 - 659
  • [7] Semantic Segmentation Using a GAN and a Weakly Supervised Method Based on Deep Transfer Learning
    Wen, Shuhuan
    Tian, Wenbo
    Zhang, Hong
    Fan, Shaokang
    Zhou, Nannan
    Li, Xiongfei
    IEEE ACCESS, 2020, 8 : 176480 - 176494
  • [8] Multi-Branch Supervised Learning on Semantic Segmentation
    Chen, Wenxin
    Zhang, Ting
    Zhao, Xing
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 6841 - 6845
  • [9] Weakly Supervised Semantic Segmentation with a Multi-Image Model
    Vezhnevets, Alexander
    Ferrari, Vittorio
    Buhmann, Joachim M.
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 643 - 650
  • [10] MRL-Seg: Overcoming Imbalance in Medical Image Segmentation With Multi-Step Reinforcement Learning
    Yang, Feiyang
    Li, Xiongfei
    Duan, Haoran
    Xu, Feilong
    Huang, Yawen
    Zhang, Xiaoli
    Long, Yang
    Zheng, Yefeng
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (02) : 858 - 869