GAN and DCN Based Multi-step Supervised Learning for Image Semantic Segmentation

Cited by: 7

Authors:

Fang, Jie [1,2]

Cao, Xiaoqian [3]

Affiliations:

[1] Chinese Acad Sci, Ctr Opt IMagery Anal & Learning OPTIMAL, Xian Inst Opt & Precis Mech, Xian 710119, Shaanxi, Peoples R China

[2] Univ Chinese Acad Sci, 19A Yuquanlu, Beijing 100049, Peoples R China

[3] Shaanxi Univ Sci & Technol, Coll Elect & Informat Engn, Xian 710021, Shaanxi, Peoples R China

Source:

PATTERN RECOGNITION AND COMPUTER VISION, PT II | 2018 / Vol. 11257

Keywords:

cGAN; DCN; Image semantic segmentation; Multi-step supervised learning;

DOI:

10.1007/978-3-030-03335-4_3

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Image semantic segmentation comprises two sub-tasks: segmenting and labeling. However, recent fully convolutional network (FCN) based methods often ignore the first sub-task and treat the problem as direct labeling. Although these methods achieve competitive performance, their outputs are spatially fragmented and disconnected. The reason is that pixel-level relationships inside the deepest layers become inconsistent, since traditional FCNs have no explicit pixel grouping mechanism. To address this problem, a multi-step supervised learning method is proposed, consisting of an image-level supervised learning step and a pixel-level supervised learning step. Specifically, when the segmentation result is viewed as a visualized label image, image semantic segmentation is an image-to-image transformation problem, from the RGB domain to the category label domain. The recent conditional generative adversarial network (cGAN) has achieved significant performance on image-to-image generation tasks, and its generated images retain good regional connectivity. Therefore, a cGAN supervised by the RGB-category label map is used to obtain a coarse segmentation mask, which to a certain extent avoids generating disconnected segmentation results. Furthermore, an interaction information (II) loss term is proposed for the cGAN to preserve the spatial structure of the segmentation mask. Additionally, dilated convolutional networks (DCNs) have achieved significant performance in object detection, especially for small objects, because of their special receptive field settings. For image semantic segmentation, if each pixel is regarded as an object, the task can be transformed into object detection. In this case, combined with the segmentation mask from the cGAN, a DCN supervised by the pixel-level labels is used to finalize the category recognition of each pixel in the image. The proposed method achieves satisfactory performance on three public and challenging datasets for image semantic segmentation.
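
The abstract attributes the usefulness of DCNs to their receptive field settings. As a rough illustration of that general property only, the sketch below implements a 1-D dilated convolution in plain Python and compares the receptive field of a plain stack with a dilated stack. The function names, kernel sizes, and dilation rates (1, 2, 4) are assumptions chosen for illustration; they are not taken from the paper and do not reproduce its network.

```python
def dilated_conv1d(signal, kernel, dilation=1):
    """Valid-mode 1-D dilated convolution (cross-correlation form, stride 1)."""
    # Distance between the first and last kernel tap on the input.
    span = (len(kernel) - 1) * dilation
    return [
        sum(w * signal[i + j * dilation] for j, w in enumerate(kernel))
        for i in range(len(signal) - span)
    ]


def receptive_field(layers):
    """Receptive field of stacked stride-1 conv layers.

    `layers` is a list of (kernel_size, dilation) pairs (hypothetical config).
    Each layer widens the receptive field by (kernel_size - 1) * dilation.
    """
    rf = 1
    for kernel_size, dilation in layers:
        rf += (kernel_size - 1) * dilation
    return rf


# Three 3-tap layers: plain versus exponentially growing dilation (assumed rates).
plain = receptive_field([(3, 1), (3, 1), (3, 1)])
dilated = receptive_field([(3, 1), (3, 2), (3, 4)])
print(plain, dilated)
```

With the same number of layers and weights, the dilated stack in this sketch covers a wider input window than the plain one, and it does so without pooling or striding, so the output keeps one prediction per input position.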


Pages: 28-40

Number of pages: 13

