Cutout with patch-loss augmentation for improving generative adversarial networks against instability

被引:4
作者
Shi, Mengchen [1 ]
Xie, Fei [1 ]
Yang, Jiquan [1 ]
Zhao, Jing [2 ,3 ]
Liu, Xixiang [4 ]
Wang, Fan [5 ]
机构
[1] Nanjing Normal Univ, Sch Elect & Automat Engn, Xuelin Rd 2, Nanjing 210023, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Coll Automat, Wenyuan Rd 9, Nanjing 210023, Peoples R China
[3] Nanjing Univ Posts & Telecommun, Coll Artificial Intelligence, Wenyuan Rd 9, Nanjing 210023, Peoples R China
[4] Southeast Univ, Coll Instrument Sci & Engn, Four Archway Bldg 2, Nanjing 210096, Peoples R China
[5] Wuhan Univ, Sch Informat Management, Bayi Rd 299, Wuhan 430072, Peoples R China
基金
中国国家自然科学基金;
关键词
Generative Adversarial Networks; Dataset augmentation; Convolution neural network;
D O I
10.1016/j.cviu.2023.103761
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generative adversarial networks heavily rely on large datasets and carefully chosen model parameters to avoid model overfitting or mode collapse. Cutout with patch-loss augmentation, a dataset augmentation designed for generative adversarial networks that applies cutout to both the discriminator and the generator with a patch-loss structure and a new loss function, is proposed as a solution to the issue. It can enhance the performance of generative adversarial networks on full datasets and promote better convergence and stability on limited datasets. Additionally, the tensor value clamp is proposed, accelerating training speed without compromising quality. The proposed method can be successfully used with various generative adversarial networks, according to experiments. The performance of generative adversarial networks trained with full data on CIFAR-10 is matched by our method with only 20% of the training data. Finally, combined with our approach, StyleGAN2-ADA's Frechet Inception Distance (FID) results on the CIFAR-10, LSUN-CAT, and FFHQ-256 datasets can be further enhanced.
引用
收藏
页数:9
相关论文
共 43 条
  • [1] Arjovsky M., 2017, arXiv, DOI 10.48550/arXiv.1701.04862
  • [2] Arjovsky M, 2017, Arxiv, DOI [arXiv:1701.07875, 10.48550/arXiv.1701.07875]
  • [3] Bachman P, 2014, Arxiv, DOI arXiv:1412.4864
  • [4] Brock A, 2017, Arxiv, DOI arXiv:1609.07093
  • [5] Brock A, 2019, Arxiv, DOI arXiv:1809.11096
  • [6] Chen PG, 2024, Arxiv, DOI arXiv:2001.04086
  • [7] TensorMask: A Foundation for Dense Object Segmentation
    Chen, Xinlei
    Girshick, Ross
    He, Kaiming
    Dollar, Piotr
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2061 - 2069
  • [8] DeVries T, 2017, Arxiv, DOI arXiv:1708.04552
  • [9] Generative Adversarial Networks
    Goodfellow, Ian
    Pouget-Abadie, Jean
    Mirza, Mehdi
    Xu, Bing
    Warde-Farley, David
    Ozair, Sherjil
    Courville, Aaron
    Bengio, Yoshua
    [J]. COMMUNICATIONS OF THE ACM, 2020, 63 (11) : 139 - 144
  • [10] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]