Learning From Synthetic Images via Active Pseudo-Labeling

Cited by: 33

Authors:

Song, Liangchen [1,2]

Xu, Yonghao [2,3]

Zhang, Lefei [1,3]

Du, Bo [4]

Zhang, Qian [2]

Wang, Xinggang [5]

Affiliations:

[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China

[2] Horizon Robot Inc, Beijing 100190, Peoples R China

[3] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430072, Peoples R China

[4] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Inst Artificial Intelligence, Sch Comp Sci, Wuhan 430072, Peoples R China

[5] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2020 / Vol. 29

Funding:

National Natural Science Foundation of China;

Keywords:

Task analysis; Data models; Training; Visualization; Adaptation models; Neural networks; Predictive models; Deep learning; domain adaptation; style transfer; pseudo-labeling; semantic segmentation; object detection; DOMAIN ADAPTATION; NETWORKS;

DOI:

10.1109/TIP.2020.2989100

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Synthetic visual data refers to data automatically rendered by mature computer graphics algorithms. With the rapid development of these techniques, we can now collect photo-realistic synthetic images with accurate pixel-level annotations without much effort. However, due to the domain gaps between synthetic data and real data, in terms of not only visual appearance but also label distribution, directly applying models trained on synthetic images to real ones can hardly yield satisfactory performance. Since collecting accurate labels for real images is laborious and time-consuming, developing algorithms that can learn from synthetic images is of great significance. In this paper, we propose a novel framework, namely Active Pseudo-Labeling (APL), to reduce the domain gaps between synthetic images and real images. In the APL framework, we first predict pseudo-labels for the unlabeled real images in the target domain by actively adapting the style of the real images to the source domain. Specifically, the style of real images is adjusted via a novel task-guided generative model, and then pseudo-labels are predicted for these actively adapted images. Lastly, we fine-tune the source-trained model in the pseudo-labeled target domain, which helps to fit the distribution of the real data. Experiments on both semantic segmentation and object detection tasks with several challenging benchmark data sets demonstrate the superiority of our proposed method compared to the existing state-of-the-art approaches.


Pages: 6452-6465

Page count: 14
