Domain Adaptive Semantic Segmentation Through Structure Enhancement

被引：0

作者：

Lv, Fengmao ^{[1
]}

Lian, Qing ^{[1
]}

Yang, Guowu ^{[1
]}

Lin, Guosheng ^{[2
]}

Pan, Sinno Jialin ^{[2
]}

Duan, Lixin ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Big Data Res Ctr, Chengdu, Peoples R China

[2] Nanyang Technol Univ, Sch Engn & Comp Sci, Singapore, Singapore

来源：

COMPUTER VISION - ECCV 2018 WORKSHOPS, PT II | 2019年 / 11130卷

关键词：

Unsupervised domain adaptation; Semantic segmentation; Deep learning; Transfer learning;

D O I：

10.1007/978-3-030-11012-3_13

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Although fully convolutional networks have recently achieved great advances in semantic segmentation, the performance leaps heavily rely on supervision with pixel-level annotations which are extremely expensive and time-consuming to collect. Training models on synthetic data is a feasible way to relieve the annotation burden. However, the domain shift between synthetic and real images usually lead to poor generalization performance. In this work, we propose an effective method to adapt the segmentation network trained on synthetic images to real scenarios in an unsupervised fashion. To improve the adaptation performance for semantic segmentation, we enhance the structure information of the target images at both the feature level and the output level. Specifically, we enforce the segmentation network to learn a representation that encodes the target images' visual cues through image reconstruction, which is beneficial to the structured prediction of the target images. Further more, we implement adversarial training at the output space of the segmentation network to align the structured prediction of the source and target images based on the similar spatial structure they share. To validate the performance of our method, we conduct comprehensive experiments on the "GTA5 to Cityscapes" dataset which is a standard domain adaptation benchmark for semantic segmentation. The experimental results clearly demonstrate that our method can effectively bridge the synthetic and real image domains and obtain better adaptation performance compared with the existing state-of-the-art methods.

引用

页码：172 / 179

页数：8

共 18 条

[1]

[Anonymous], 2017, P 34 INT C MACHINE L

[2]

[Anonymous], 2017, ICCV

[3]

[Anonymous], 2018, ARXIV180210349

[4]

[Anonymous], 2017, ADV NEURAL INFORM PR

[5]

[Anonymous], 2014, UNSUPERVISED DOMAIN

[6] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[7] No More Discrimination: Cross City Adaptation of Road Scene Segmenters [J].

Chen, Yi-Hsin ;

Chen, Wei-Yu ;

Chen, Yu-Ting ;

Tsai, Bo-Cheng ;

Wang, Yu-Chiang Frank ;

Sun, Min .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2011-2020

[8] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

[9]

Hoffman J., 2016, FCNS WILD PIXELLEVEL

[10]

Hoffman J., 2017, ARXIV171103213

← 1 2 →