Cutout with patch-loss augmentation for improving generative adversarial networks against instability

被引：4

作者：

Shi, Mengchen ^{[1
]}

Xie, Fei ^{[1
]}

Yang, Jiquan ^{[1
]}

Zhao, Jing ^{[2
,3
]}

Liu, Xixiang ^{[4
]}

Wang, Fan ^{[5
]}

机构：

[1] Nanjing Normal Univ, Sch Elect & Automat Engn, Xuelin Rd 2, Nanjing 210023, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Coll Automat, Wenyuan Rd 9, Nanjing 210023, Peoples R China

[3] Nanjing Univ Posts & Telecommun, Coll Artificial Intelligence, Wenyuan Rd 9, Nanjing 210023, Peoples R China

[4] Southeast Univ, Coll Instrument Sci & Engn, Four Archway Bldg 2, Nanjing 210096, Peoples R China

[5] Wuhan Univ, Sch Informat Management, Bayi Rd 299, Wuhan 430072, Peoples R China

来源：

COMPUTER VISION AND IMAGE UNDERSTANDING | 2023年 / 234卷

基金：

中国国家自然科学基金;

关键词：

Generative Adversarial Networks; Dataset augmentation; Convolution neural network;

D O I：

10.1016/j.cviu.2023.103761

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Generative adversarial networks heavily rely on large datasets and carefully chosen model parameters to avoid model overfitting or mode collapse. Cutout with patch-loss augmentation, a dataset augmentation designed for generative adversarial networks that applies cutout to both the discriminator and the generator with a patch-loss structure and a new loss function, is proposed as a solution to the issue. It can enhance the performance of generative adversarial networks on full datasets and promote better convergence and stability on limited datasets. Additionally, the tensor value clamp is proposed, accelerating training speed without compromising quality. The proposed method can be successfully used with various generative adversarial networks, according to experiments. The performance of generative adversarial networks trained with full data on CIFAR-10 is matched by our method with only 20% of the training data. Finally, combined with our approach, StyleGAN2-ADA's Frechet Inception Distance (FID) results on the CIFAR-10, LSUN-CAT, and FFHQ-256 datasets can be further enhanced.

引用

页数：9

共 43 条

[1] Arjovsky M., 2017, arXiv, DOI 10.48550/arXiv.1701.04862
[2] Arjovsky M, 2017, Arxiv, DOI [arXiv:1701.07875, 10.48550/arXiv.1701.07875]
[3] Bachman P, 2014, Arxiv, DOI arXiv:1412.4864
[4] Brock A, 2017, Arxiv, DOI arXiv:1609.07093
[5] Brock A, 2019, Arxiv, DOI arXiv:1809.11096
[6] Chen PG, 2024, Arxiv, DOI arXiv:2001.04086
[7] TensorMask: A Foundation for Dense Object Segmentation
Chen, Xinlei
Girshick, Ross
He, Kaiming
Dollar, Piotr
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2061 - 2069
[8] DeVries T, 2017, Arxiv, DOI arXiv:1708.04552
[9] Generative Adversarial Networks
Goodfellow, Ian
Pouget-Abadie, Jean
Mirza, Mehdi
Xu, Bing
Warde-Farley, David
Ozair, Sherjil
Courville, Aaron
Bengio, Yoshua
[J]. COMMUNICATIONS OF THE ACM, 2020, 63 (11) : 139 - 144
[10] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]

← 1 2 3 4 5 →