Conditional GAN for Small Datasets

被引：2

作者：

Hiruta, Komei ^{[1
]}

Saito, Ryusuke ^{[1
]}

Hatakeyama, Taro ^{[1
]}

Hashimoto, Atsushi ^{[1
]}

Kurihara, Satoshi ^{[2
]}

机构：

[1] Keio Univ, Grad Sch Sci & Technol, Yokohama, Kanagawa, Japan

[2] Keio Univ, Fac Sci & Technol, Yokohama, Kanagawa, Japan

来源：

2022 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM) | 2022年

关键词：

Conditional GANs; Deep Generative Model; Manga;

D O I：

10.1109/ISM55400.2022.00062

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Generating high-quality images with Generative Adversarial Networks (GANs) generally requires 100k+ training data. The required data amount is too large when we consider using GANs to support professional art creators; they need to follow the specific art style while interactively controlling the results along with their theme. This research proposes Conditional FastGAN, which adds a condition vector to FastGAN to produce high-quality different domain images even on small datasets. In our experiments, the MUCT Face Database of images consisting of face photos in various orientations and manga face images extracted from Osamu Tezuka's works were used as a small-scale dataset. Fine-tuning with manga face images to a model pre-trained with photo-only face images enabled control of the generated images according to explicit conditions, such as photos and manga, for the same latent variables. In addition, the proposed method improved the FID score by 2.55 from the original FastGAN in the case of manga face generation.

引用

页码：278 / 281

页数：4

共 25 条

[1]

Bird JJ, 2021, arXiv

[2] Generative Adversarial Networks [J].

Goodfellow, Ian ;

Pouget-Abadie, Jean ;

Mirza, Mehdi ;

Xu, Bing ;

Warde-Farley, David ;

Ozair, Sherjil ;

Courville, Aaron ;

Bengio, Yoshua .

COMMUNICATIONS OF THE ACM, 2020, 63 (11) :139-144

[3]

Heusel M, 2017, ADV NEUR IN, V30

[4]

Jin YH, 2017, Arxiv, DOI arXiv:1708.05509

[5] A Style-Based Generator Architecture for Generative Adversarial Networks [J].

Karras, Tero ;

Laine, Samuli ;

Aila, Timo .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4396-4405

[6]

Kim T, 2017, PR MACH LEARN RES, V70

[7]

Liu BC, 2020, AAAI CONF ARTIF INTE, V34, P4836

[8]

Liu BY, 2021, P INT C LEARN REPR

[9] Deep Learning Face Attributes in the Wild [J].

Liu, Ziwei ;

Luo, Ping ;

Wang, Xiaogang ;

Tang, Xiaoou .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :3730-3738

[10]

Milborrow Stephen, 2010, The MUCT Landmarked Face Database, V201

← 1 2 3 →