Semantic Regularized Class-Conditional GANs for Semi-Supervised Fine-Grained Image Synthesis

被引：8

作者：

Chen, Tianyi ^{[1
]}

Wu, Si ^{[1
]}

Yang, Xuhui ^{[2
]}

Xu, Yong ^{[1
,2
,3
]}

Wong, Hau-San ^{[4
]}

机构：

[1] South China Univ Technol, Sch Comp Sci Engn, Guangzhou 510006, Guangdong, Peoples R China

[2] Peng Cheng Lab, Shenzhen 518055, Guangdong, Peoples R China

[3] Commun & Comp Network Lab Guangdong, Guangzhou 510006, Guangdong, Peoples R China

[4] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2022年 / 24卷

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

Generators; Semantics; Image synthesis; Training; Task analysis; Generative adversarial networks; Data models; Semi-supervised learning; fine-grained image synthesis; generative adversarial networks; semantic regularization;

D O I：

10.1109/TMM.2021.3091859

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Learning effective generative models for natural image synthesis is a promising way to reduce the dependence of deep models on massive training data. This work focuses on Fine-Grained Image Synthesis (FGIS) in the semi-supervised setting where a small number of training instances are labeled. Different from generic image synthesis tasks, the available fine-grained data may be inadequate, and the differences among the object categories are typically subtle. To address these issues, we propose a Semantic Regularized class-conditional Generative Adversarial Network, which is referred to as SReGAN. We incorporate an additional discriminator and classifier into the generator-discriminator minimax game. Competing with two discriminators enforces the generator to model both marginal and class-conditional data distributions, which alleviates the problem of limited training data and labels. However, the discriminators may overlook the class separability. To induce the generator to discover the distinctions between classes, we construct semantically congruent and incongruent pairs in the generation process, and further regularize the generator by encouraging high similarities of congruent pairs, while penalizing that of incongruent ones in the classifier's feature space. We have conducted extensive experiments to verify the capability of SReGAN in generating high-fidelity images on a variety of FGIS benchmarks.

引用

页码：2975 / 2985

页数：11

共 48 条

[1] Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space [J].

Anh Nguyen ;

Clune, Jeff ;

Bengio, Yoshua ;

Dosovitskiy, Alexey ;

Yosinski, Jason .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3510-3520

[2]

[Anonymous], 2016, P 30 C NEUR INF PROC

[3]

[Anonymous], 2015, P INT C LEARNING REP

[4]

[Anonymous], 2017, ADV NEURAL INFORM PR

[5]

Arjovsky M, 2017, PR MACH LEARN RES, V70

[6] CVAE-GAN: Fine-Grained Image Generation through Asymmetric Training [J].

Bao, Jianmin ;

Chen, Dong ;

Wen, Fang ;

Li, Houqiang ;

Hua, Gang .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2764-2773

[7] OneGAN: Simultaneous Unsupervised Learning of Conditional Image Generation, Foreground Segmentation, and Fine-Grained Clustering [J].

Benny, Yaniv ;

Wolf, Lior .

COMPUTER VISION - ECCV 2020, PT XXVI, 2020, 12371 :514-530

[8] FlyMap: Interacting with Maps Projected from a Drone [J].

Brock, Anke M. ;

Chatain, Julia ;

Park, Michelle ;

Fang, Tommy ;

Hachet, Martin ;

Landay, James A. ;

Cauchard, Jessica R. .

PROCEEDINGS PERVASIVE DISPLAYS 2018: THE 7TH ACM INTERNATIONAL SYMPOSIUM ON PERVASIVE DISPLAYS, 2018,

[9]

Chen X, 2016, ADV NEUR IN, V29

[10] Destruction and Construction Learning for Fine-grained Image Recognition [J].

Chen, Yue ;

Bai, Yalong ;

Zhang, Wei ;

Mei, Tao .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5152-5161

← 1 2 3 4 5 →